QUICK REVIEW

[논문 리뷰] Counterfactual Fairness

Matt J. Kusner, Joshua R. Loftus|arXiv (Cornell University)|2017. 03. 20.

Ethics and Social Impacts of AI참고 문헌 31인용 수 868

한 줄 요약

대안적 공정성(counterfactual fairness)을 제시하고, 보호 속성이 반사실적 변화 하에서도 예측이 불변하도록 하는 인과 프레임워크와 이러한 공정한 예측기를 학습하는 알고리즘을 제시하며, 로스쿨 합격 예측에 적용한 예시를 보여준다.

ABSTRACT

Machine learning can impact people with legal or ethical consequences when it is used to automate decisions in areas such as insurance, lending, hiring, and predictive policing. In many of these scenarios, previous decisions have been made that are unfairly biased against certain subpopulations, for example those of a particular race, gender, or sexual orientation. Since this past data may be biased, machine learning predictors must account for this to avoid perpetuating or creating discriminatory practices. In this paper, we develop a framework for modeling fairness using tools from causal inference. Our definition of counterfactual fairness captures the intuition that a decision is fair towards an individual if it is the same in (a) the actual world and (b) a counterfactual world where the individual belonged to a different demographic group. We demonstrate our framework on a real-world problem of fair prediction of success in law school.

연구 동기 및 목표

역사적 편향이 있는 고위험 도메인에서 공정성 인식 예측의 필요성을 동기 부여한다.
페어럴의 프레임워크에 기반한 counterfactual fairness의 형식적이고 인과적 정의를 제안한다.
잠재 변수 활용을 통해 counterfactual fairness를 만족하는 예측 모델을 학습하는 알고리즘을 개발한다.
현실 세계에 준하는 로스쿨 합격 예측 사례 연구를 통해 접근법을 설명한다.
실무에서의 인과적 공정성의 설계 선택과 한계에 대해 논의한다.

제안 방법

보호 속성 A, 관찰된 X, 결과 Y를 가진 인과 모델(U,V,F)을 채택한다.
counterfactual fairness를 정의한다: hatY의 분포가 X와 A가 주어진 상태에서 A를 counterfactually 변화시키는 경우(A ← a′)에도 불변일 때 hatY는 공정하다.
학습 알고리즘 FairLearning을 제안한다. 이는 MCMC를 통해 P_M(U | X, A)에서 잠재 변수 U를 샘플링하고 g_theta(U, X_nsucc(A))를 학습하여 예측 손실을 최소화한다.
Prediction-time 추정치를 E[hatY(U*, X_nsucc(A*)) | X*, A*]로 제공한다.
기존의 공정성 개념과의 연계성을 보여주고 순수한 통계적 기준보다 명시적인 인과 모델링의 필요성을 주장한다.

Figure 1: (a), (b) Two causal models for different real-world fair prediction scenarios. See Section 3.1 for discussion. (c) The graph corresponding to a causal model with $A$ being the protected attribute and $Y$ some outcome of interest, with background variables assumed to be independent. (d) Exp

실험 결과

연구 질문

RQ1보호 속성이 관찰된 데이터에 인과적으로 영향을 주는 설정에서 공정성을 일치시키는 예측기를 어떻게 정의할 수 있는가?
RQ2보호 속성의 counterfactual 변화에 대해서도 결과가 불변하면서 예측적 유용성을 유지하는 예측자를 구성할 수 있는가?
RQ3인과 모델과 한정된 데이터가 주어졌을 때 counterfactual을 근사하기 위한 실용적 학습 전략은 무엇인가?
RQ4Counterfactual fairness가 Equal Opportunity 및 Predictive Parity와 같은 다른 공정성 기준과 어떻게 관련되고 잠재적으로 조화될 수 있는가?

주요 결과

모델	RMSE
Full	0.873
Unaware	0.894
Fair K	0.929
Fair Add	0.918

Counterfactual fairness는 보호 속성이 인과 경로를 따라 counterfactual하게 변화하더라도 개인 차원의 공정성 개념을 유지한다.
잠재 변수 U와 A의 비하향(descendants)이 아닌 부분에 기반하여 Counterfactual 기대치를 근사하기 위해 MCMC를 사용하는 FairLearning이라는 알고리즘이 제안되었다.
법대학 합격 예시에서 counterfactually fair 모델은 공정성을 위해 일부 예측 정확도를 포기하며, Level 1–3 접근법은 점진적으로 더 강한 가정과 서로 다른 RMSE 결과를 제공한다.
잠재적으로 공정한 변수(Level 2) 또는 추가 오차(Additive-error) 모델(Level 3)을 사용하면 Full 및 Unaware 기준선 대비 예측 성능이 달라져 공정성과 정확도 간의 절충이 나타난다.
Counterfactual fairness는 Y와 A가 연관될 때 전통적 기준이 충돌하는 공정성 역설을 해결하는 데 도움이 된다.

Figure 2: Left: A causal model for the problem of predicting law school success fairly. Right: Density plots of predicted $\mbox{FYA}_{a}$ and $\mbox{FYA}_{a^{\prime}}$ .

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.