Skip to main content
QUICK REVIEW

[논문 리뷰] Asymmetric Shapley values: incorporating causal knowledge into model-agnostic explainability

Christopher Frye, Colin Rowat|arXiv (Cornell University)|2019. 10. 14.
Explainable Artificial Intelligence (XAI)참고 문헌 50인용 수 36
한 줄 요약

비대칭 Shapley values (ASVs)을 도입하여 모델-무관 설명에 인과 지식을 통합하고 대칭성을 완화하여 전체 인과 그래프 없이도 인과 인식, 순서 기반, 공정성 및 특징 선택 분석을 가능하게 한다.

ABSTRACT

Explaining AI systems is fundamental both to the development of high performing models and to the trust placed in them by their users. The Shapley framework for explainability has strength in its general applicability combined with its precise, rigorous foundation: it provides a common, model-agnostic language for AI explainability and uniquely satisfies a set of intuitive mathematical axioms. However, Shapley values are too restrictive in one significant regard: they ignore all causal structure in the data. We introduce a less restrictive framework, Asymmetric Shapley values (ASVs), which are rigorously founded on a set of axioms, applicable to any AI system, and flexible enough to incorporate any causal structure known to be respected by the data. We demonstrate that ASVs can (i) improve model explanations by incorporating causal information, (ii) provide an unambiguous test for unfair discrimination in model predictions, (iii) enable sequentially incremental explanations in time-series models, and (iv) support feature-selection studies without the need for model retraining.

연구 동기 및 목표

  • Motivate the need to incorporate causal structure into model explainability beyond standard Shapley values.
  • Propose a mathematically axiomatic framework (ASVs) that relaxes symmetry to leverage causal information.
  • Demonstrate practical applications: causal-aware explanations, fairness testing via unresolved discrimination, time-series sequential explanations, and feature-selection without retraining.

제안 방법

  • Define Asymmetric Shapley values with respect to a weighting over feature-order permutations w(π).
  • Show that ASVs satisfy efficiency, linearity, and nullity axioms but not symmetry, enabling causal-informed attributions.
  • Use on-manifold value functions to respect data correlations when computing attributions.
  • Present distal and proximate causal-ordering strategies to encode known causal structure into explanations.
  • Demonstrate implementation across four applications with empirical demonstrations on Census data, synthetic admissions data, EEG time-series, and feature-selection scenarios.

실험 결과

연구 질문

  • RQ1How can Shapley-based explanations be extended to incorporate partial or full causal knowledge without requiring a complete causal graph?
  • RQ2Can ASVs detect and quantify causal notions of unfairness (unresolved discrimination) in model predictions?
  • RQ3Do ASVs yield sequence-aware explanations for time-series data and enable sparse, começo-focused attributions?
  • RQ4Can ASVs provide a precise interpretation of feature usefulness for subset-based feature-selection without retraining models?

주요 결과

  • ASVs provide explanations that respect known causal orderings and can attribute model accuracy to causal ancestors before descendants.
  • ASVs can reveal subtle fairness issues by measuring the incremental effect of sensitive attributes after accounting for resolving variables.
  • ASVs produce sparser, time-sequence aware attributions that concentrate importance early in time-series data, unlike standard Shapley values.
  • ASVs can quantify the accuracy achievable using a subset of features, supporting feature-selection without retraining multiple models.
  • Empirical examples show distinct attributions when incorporating distal (root) vs proximate (immediate) causal notions.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.