QUICK REVIEW

[論文レビュー] An Empirical Analysis of Fairness Notions under Differential Privacy

Anderson Santana de Oliveira, Caelin Kaplan|arXiv (Cornell University)|Feb 6, 2023

Privacy-Preserving Technologies in Data被引用数 8

ひとこと要約

The paper empirically studies how selecting model architectures optimized for DP-SGD affects fairness notions (demographic parity, equalized odds, predictive parity) across real fairness datasets, finding that DP can reduce disparities or negligibly impact fairness while maintaining utility.

ABSTRACT

Recent works have shown that selecting an optimal model architecture suited to the differential privacy setting is necessary to achieve the best possible utility for a given privacy budget using differentially private stochastic gradient descent (DP-SGD)(Tramer and Boneh 2020; Cheng et al. 2022). In light of these findings, we empirically analyse how different fairness notions, belonging to distinct classes of statistical fairness criteria (independence, separation and sufficiency), are impacted when one selects a model architecture suitable for DP-SGD, optimized for utility. Using standard datasets from ML fairness literature, we show using a rigorous experimental protocol, that by selecting the optimal model architecture for DP-SGD, the differences across groups concerning the relevant fairness metrics (demographic parity, equalized odds and predictive parity) more often decrease or are negligibly impacted, compared to the non-private baseline, for which optimal model architecture has also been selected to maximize utility. These findings challenge the understanding that differential privacy will necessarily exacerbate unfairness in deep learning models trained on biased datasets.

研究の動機と目的

DP-SGD が保護属性を含むデータセット全体の公正性概念にどのような影響を与えるかを調査する。
DP対応アーキテクチャの最適化がグループ間の格差を緩和するか、あるいは悪化させるかを評価する。
プライベート訓練と非プライベート訓練の下でユーティリティ（ROC AUC）を最大化しつつ公正性指標を比較する。
複数の保護グループと交差を含むML公正性文学で用いられるデータセットを使用する。

提案手法

DP-SGD を備えた設定可能なフィードフォワードネットワークを構成し、ROC AUC を最大化する総当たりハイパーパラメータ探索を行う。
5-fold クロスバリデーションとホールドアウトテストセット評価を実施する。
Baseline+DP および Best DP Model 設定に対して総プライバシー予算 epsilon ≈ 27 の DP-SGD を適用する。
対立クラスからの3つの公正性概念を評価する：Demographic parity（独立）、Equalized odds（分離）、Predictive parity（十分性）。
保護属性のすべてのサブグループの交差におけるピーク格差を分析する。
設定ごとに平均 ± 標準偏差として10回の訓練実行の結果を報告する。

実験結果

リサーチクエスチョン

RQ1DP-SGD で最適化されたアーキテクチャを選択すると、非プライベートベースラインと比較して公正性の格差が減少するか。
RQ2最適化されたアーキテクチャを持つ DP-SGD モデルは、非プライベートモデルと比べてユーティリティを維持または向上させつつ公正性に対して異なる影響を与えるか。
RQ3複数の実世界データセットにおいて、人口統計的パリティ、等化オッズ、予測的パリティの格差は差分プライバシー下でどう振る舞うか。

主な発見

Dataset	Overall AUC (Baseline)	Overall AUC (Baseline+DP)	Overall AUC (Best DP Model)	AUC difference (Baseline vs Best DP)	Demographic parity difference (Baseline)	Demographic parity difference (Best DP Model)	Equalized odds difference (Baseline)	Equalized odds difference (Best DP Model)	Precision difference (Baseline)	Precision difference (Best DP Model)
ACS Emp.	0.8837 ± 0.0011	0.8110 ± 0.0062	0.8702 ± 0.0013	0.3401 ± 0.0875	0.4383 ± 0.0805	0.3154 ± 0.0359	0.5884 ± 0.1360	0.2874 ± 0.0534	0.5534 ± 0.0998	0.2968 ± 0.0674
ACS Inc.	0.8878 ± 0.0011	0.8155 ± 0.0045	0.8820 ± 0.0008	0.2546 ± 0.0569	0.4223 ± 0.0613	0.2556 ± 0.0490	0.4360 ± 0.0780	0.3756 ± 0.0019	0.3550 ± 0.0518	0.4032 ± 0.0271
LSAC	0.8343 ± 0.0029	0.7755 ± 0.0125	0.7962 ± 0.0077	0.0435 ± 0.0056	0.3064 ± 0.0653	0.1687 ± 0.0151	0.2548 ± 0.0862	0.1975 ± 0.0722	0.1688 ± 0.0485	0.2197 ± 0.0082
Adult	0.9056 ± 0.0011	0.8476 ± 0.0073	0.9005 ± 0.0009	0.1264 ± 0.0249	0.2750 ± 0.0155	0.2375 ± 0.0207	0.7845 ± 0.0492	0.8000 ± 0.0000	0.9400 ± 0.0966	0.8000 ± 0.000
Compas	0.6895 ± 0.0041	0.5349 ± 0.0359	0.6863 ± 0.0030	0.1162 ± 0.0273	0.5101 ± 0.0209	0.3694 ± 0.0230	0.5592 ± 0.0476	0.3726 ± 0.0375	0.3347 ± 0.0749	0.3168 ± 0.0467

Best DP architectures は非プライベートベースラインと比較して、複数の公正性概念の格差を低減することが多い。
DP-SGD は Baseline に比べてユーティリティ（ROC AUC）を維持またはほぼ同等に保ちながら、いくつかのデータセットでグループ間格差を緩和する。
ACS Employment、ACS Income、LSAC、Adult、COMPAS のデータセット全体で、DP 訓練モデルは非プライベートの counterparts より格差の違いが低い、あるいは同等であることが多い。
Baseline+DP（最適化されていない DP）は一般にユーティリティと公正性の両方の指標を低下させる一方で、Best DP Model（アーキテクチャ探索を伴う DP）は公正性の格差を改善する。
格差低減の大幅な改善は ACS Employment および ACS Income データセットで観察され、Adult は DP 下でも格差の低減が見られるが悪化は見られない; COMPAS はパターンが変動して敏感である。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。