Skip to main content
QUICK REVIEW

[Paper Review] Probably Approximately Metric-Fair Learning

Guy N. Rothblum, Gal Yona|arXiv (Cornell University)|Mar 8, 2018
Machine Learning and Algorithms23 references43 citations
TL;DR

The paper introduces a relaxed PACF framework for metric-fair learning, proving generalization guarantees and providing polynomial-time PACF learners for linear and logistic predictors, while showing hardness for perfect metric-fairness.

ABSTRACT

The seminal work of Dwork {\em et al.} [ITCS 2012] introduced a metric-based notion of individual fairness. Given a task-specific similarity metric, their notion required that every pair of similar individuals should be treated similarly. In the context of machine learning, however, individual fairness does not generalize from a training set to the underlying population. We show that this can lead to computational intractability even for simple fair-learning tasks. With this motivation in mind, we introduce and study a relaxed notion of {\em approximate metric-fairness}: for a random pair of individuals sampled from the population, with all but a small probability of error, if they are similar then they should be treated similarly. We formalize the goal of achieving approximate metric-fairness simultaneously with best-possible accuracy as Probably Approximately Correct and Fair (PACF) Learning. We show that approximate metric-fairness {\em does} generalize, and leverage these generalization guarantees to construct polynomial-time PACF learning algorithms for the classes of linear and logistic predictors.

Motivation & Objective

  • Motivate relaxing perfect metric-fairness to approximate metric-fairness to enable generalization across the population.
  • Develop PACF learning as a framework balancing fairness and best-possible accuracy.
  • Establish generalization bounds for approximate metric-fairness using Rademacher complexity.
  • Provide efficient (polynomial-time) PACF algorithms for linear and logistic predictors.
  • Contrast with hardness results for perfect metric-fairness to justify the relaxed approach.

Proposed method

  • Define approximate metric-fairness (MF) with parameters (α, γ) and failure probability δ.
  • Formulate Probably Approximately Correct and Fair (PACF) learning combining MF with accuracy guarantees relative to the best approximately MF predictor.
  • Prove fairness-generalization bounds using Rademacher complexity to ensure training MF implies population MF.
  • Design polynomial-time relaxed-PACF learners for linear predictors (H_lin) via bounded empirical MF loss and convex MF-violation constraints.
  • Extend to logistic predictors (H_{φ,L}) using kernel-based improper learning and convexification via RHKS with a polynomial kernel; achieve relaxed PACF results.
  • Provide information-theoretic and computational hardness discussions contrasting perfect vs. approximate MF.

Experimental results

Research questions

  • RQ1Can approximate metric-fairness generalize from finite samples to the underlying population?
  • RQ2What are the sample complexity and generalization guarantees for PACF learning under MF constraints?
  • RQ3Can we design efficient (polynomial-time) PACF learners for linear and logistic predictors under MF?
  • RQ4How does relaxing MF (α, γ) affect accuracy guarantees relative to the best approximately MF predictor?
  • RQ5What are the computational barriers of perfect metric-fairness, and does relaxing MF bypass these barriers?

Key findings

  • Approximate MF generalizes: empirical MF on a sample implies MF on the underlying distribution with high probability (Theorem 1.3).
  • Information-theoretic strong PACF learnability is achievable with sample complexity similar to standard PAC learning for MF classes.
  • Polynomial-time relaxed-PACF learners exist for linear predictors (H_lin) with poly-time complexity in 1/ε parameters and sample size.
  • Polynomial-time relaxed-PACF learners exist for logistic predictors (H_{φ,L}) with exponential dependence on the Lipschitz parameter L due to kernel-based embedding.
  • For perfect metric-fairness, some tasks remain hard even with simple predictors, motivating the shift to approximate MF (PACF).
  • H_lin achieves relaxed PACF learnability; H_{φ,L} is learnable in the relaxed sense with kernelization; both maintain competitive accuracy under MF constraints.

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.