QUICK REVIEW

[论文解读] Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification

Garima Chhikara, Anurag Sharma|arXiv (Cornell University)|Feb 28, 2024

Ethics and Social Impacts of AI被引用 5

一句话总结

本文研究大型语言模型是否能够通过在上下文学习中，在零-shot 和少-shot 设置下，结合各种公平性概念来实现公平感知的分类。GPT-4 通常在准确性和公平性权衡方面比其他模型表现更好。

ABSTRACT

Employing Large Language Models (LLM) in various downstream applications such as classification is crucial, especially for smaller companies lacking the expertise and resources required for fine-tuning a model. Fairness in LLMs helps ensure inclusivity, equal representation based on factors such as race, gender and promotes responsible AI deployment. As the use of LLMs has become increasingly prevalent, it is essential to assess whether LLMs can generate fair outcomes when subjected to considerations of fairness. In this study, we introduce a framework outlining fairness regulations aligned with various fairness definitions, with each definition being modulated by varying degrees of abstraction. We explore the configuration for in-context learning and the procedure for selecting in-context demonstrations using RAG, while incorporating fairness rules into the process. Experiments conducted with different LLMs indicate that GPT-4 delivers superior results in terms of both accuracy and fairness compared to other models. This work is one of the early attempts to achieve fairness in prediction tasks by utilizing LLMs through in-context learning.

研究动机与目标

评估 LLMs 是否理解并能够在分类任务中应用公平性概念。
在不同公平性标准下比较多种最先进的 LLM（GPT-4、LLaMA-2、Gemini）。
在零-shot 与少-shot 上下文学习设置中分析准确性-公平性权衡。
提供公开可用的预测以推动公平性感知的 LLM 分类的进一步研究。

提出的方法

定义一个包含七种公认公平性概念外加第八种 Generic Fairness 的公平性框架。
使用带有抽象（Rule_A）和详细（Rule_D）公平性提示的上下文学习来引导预测。
评估零-shot 与少-shot 设置，是否使用检索到的上下文演示（RAG），从中选取 20 个相似示例。
采用标准化的提示结构，包括任务描述、测试实例和 fairness pi，并在 UCI Adult 数据集上将输出与真实值进行比较。

实验结果

研究问题

RQ1LLMs 是否能够在不进行微调的情况下理解并应用形式化的公平性概念于分类任务？
RQ2在不同公平性标准下，哪些 LLM（GPT-4、LLaMA-2、Gemini）表现最佳？
RQ3在用于公平导向分类的零-shot 与少-shot 上下文学习中，准确性-公平性权衡如何？
RQ4相较于零-shot 提示，检索增强的上下文学习是否提高了公平性和准确性？
RQ5在 Adult 数据集上，八种定义的公平性指标在不同模型和设置下的表现如何？

主要发现

GPT-4 在若干设置中比竞争模型取得更高的准确性和公平性分数。
通过抽象提示（Rule_A）和详细提示（Rule_D）定义的公平性规则会影响结果，并在各指标上出现可测量的变化。
零-shot 与少-shot 配置在公平性定义和模型不同的情况下产生不同的公平性-准确性权衡。
检索式上下文示例（RAG）用于少-shot 学习，从 46,621 条实例中选择演示。
该研究公开发布了超过 1,000 个测试实例的预测以支持未来研究。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。