QUICK REVIEW

[论文解读] Prediction-Based Decisions and Fairness: A Catalogue of Choices, Assumptions, and Definitions

Shira Mitchell, Eric Potash|arXiv (Cornell University)|Nov 19, 2018

Ethics and Social Impacts of AI参考文献 85被引用 108

一句话总结

一项对预测型决策系统中的公平性选择、假设和定义进行编目，并澄清数据、模型与社会目标如何互动。

ABSTRACT

A recent flurry of research activity has attempted to quantitatively define "fairness" for decisions based on statistical and machine learning (ML) predictions. The rapid growth of this new field has led to wildly inconsistent terminology and notation, presenting a serious challenge for cataloguing and comparing definitions. This paper attempts to bring much-needed order. First, we explicate the various choices and assumptions made---often implicitly---to justify the use of prediction-based decisions. Next, we show how such choices and assumptions can raise concerns about fairness and we present a notationally consistent catalogue of fairness definitions from the ML literature. In doing so, we offer a concise reference for thinking through the choices, assumptions, and fairness considerations of prediction-based decision systems.

研究动机与目标

Explain the social and technical choices that underpin prediction-based decision systems.
Systematize assumptions and decisions that affect fairness in ML, including data, models, and evaluation.
Provide a notationally consistent catalogue of fairness definitions from the ML literature.
Highlight gaps between mathematical formalism and broader social goals in fairness research.

提出的方法

Present a structured taxonomy of the policy design choices that influence fairness (over-arching goals, population, and decision space).
Decompose data bias into statistical bias (sampling and measurement) and societal bias, and discuss their impact on fairness.
Summarize predictive modeling choices (data, model class, covariates) and evaluation assumptions that affect fairness.
Review and organize a catalogue of fairness definitions from the literature, including confusion-matrix based, score-based, and sufficiency-type criteria.
Discuss causal frameworks for fairness and identify tensions and impossibilities among definitions.

实验结果

研究问题

RQ1What choices and assumptions in prediction-based decision systems most influence fairness outcomes?
RQ2How do statistical and societal biases in data interact with model choices to affect fairness across groups?
RQ3What are the major fairness definitions in ML, and how do they relate to decision contexts and evaluation assumptions?
RQ4To what extent do single-threshold and score-based fairness notions align with real-world decision objectives?

主要发现

Fairness in ML requires careful alignment of social goals, population, and decision space with modeling and evaluation choices.
Data bias consists of statistical bias (sampling/measurement) and societal bias, each with distinct implications for fairness.
Multiple, sometimes conflicting, fairness definitions exist (e.g., error-rate balance, predictive parity, calibration) and no single definition is universally applicable.
Single-threshold fairness ties to utility maximization under certain assumptions but depends on scoring quality and chosen utilities.
Evaluation assumptions (no interference, symmetric harms, batch decisions) critically shape perceived fairness and outcomes.
Causal and counterfactual analyses offer additional perspectives on fairness beyond purely statistical definitions.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。