QUICK REVIEW

[논문 리뷰] Beyond Edge Deletion: A Comprehensive Approach to Counterfactual Explanation in Graph Neural Networks

Matteo De Sanctis, Riccardo De Sanctis|arXiv (Cornell University)|2026. 03. 04.

Advanced Graph Neural Networks인용 수 0

한 줄 요약

XPlore는 간선 삽입/삭제 및 노드 특징 섭동을 허용하여 GNNs에 대한 반사실적 탐색을 넓히고, 기울기와 코사인 유사도에 의해 유효성과 충실도를 향상시킨다.

ABSTRACT

Graph Neural Networks (GNNs) are increasingly adopted across domains such as molecular biology and social network analysis, yet their black-box nature hinders interpretability and trust. This is especially problematic in high-stakes applications, such as predicting molecule toxicity, drug discovery, or guiding financial fraud detections, where transparent explanations are essential. Counterfactual explanations - minimal changes that flip a model's prediction - offer a transparent lens into GNNs' behavior. In this work, we introduce XPlore, a novel technique that significantly broadens the counterfactual search space. It consists of gradient-guided perturbations to adjacency and node feature matrices. Unlike most prior methods, which focus solely on edge deletions, our approach belongs to the growing class of techniques that optimize edge insertions and node-feature perturbations, here jointly performed under a unified gradient-based framework, enabling a richer and more nuanced exploration of counterfactuals. To quantify both structural and semantic fidelity, we introduce a cosine similarity metric for learned graph embeddings that addresses a key limitation of traditional distance-based metrics, and demonstrate that XPlore produces more coherent and minimal counterfactuals. Empirical results on 13 real-world and 5 synthetic benchmarks show up to +56.3% improvement in validity and +52.8% in fidelity over state-of-the-art baselines, while retaining competitive runtime.

연구 동기 및 목표

고위험 도메인에서 GNNs의 설명가능성을 촉진한다.
간선 삭제를 넘어서는 포괄적인 반사실적 설명기를 개발한다.
경사 신호를 활용하여 최소한의 의미적으로 충실한 반사실적을 찾는다.
코사인 유사도 충실도 메트릭을 통해 분포 외 아티팩트를 완화한다.
최첨단 방법들에 비해 경험적으로 더 우수한 유효성 및 충실도를 입증한다.

제안 방법

XPlore를 도입한다, 간선(삽입 및 삭제) 및 노드 특징을 수정할 수 있는 기울기 가이드 섭동 프레임워크.
반사실적 탐색을 부드러운 최적화로 공식화한다. 손실 L(G,G′)=Lpred(G,G′|Φ)+β Ldist(G,G′).
가역 가능한 P 게이트를 통해 인접성(연결 관계)을 섭동하고 자기 루프가 있는 교환된 A/P 방식으로 간선 삽입을 허용한다.
게이팅 또는 연속 변화를 통해 노드 특징 섭동을 가능하게 하여 동일한 기울기 루프에 통합한다.
원래 그래프 임베딩과 반사실적 그래프 임베딩 간의 의미적 충실도를 정량화하기 위해 코사인 유사도 측정치를 사용한다.
수렴 보장과 O(|E| d + n d f) 정도의 복잡성을 가진 효율적이고 확장 가능한 알고리즘을 제공한다.
이진 인접성 섭동을 생성할 때 비분화 가능한 임계화를 역전파하기 위해 straight-through 추정기를 사용한다.

Figure 1 . (left) original graph $G$ predicted mutagenic. (right) Counterfactual $G^{\prime}$ predicted non-mutagenic with highlighted edge additions and removals, as well as potentially node additions to flip the prediction. Note that, although in this example, we illustrate valid chemical properti

실험 결과

연구 질문

RQ1간선 삽입 및 노드 특징 섭동을 허용함으로써 간선 삭제를 넘어 GNNs의 반사실적 설명을 어떻게 생성할 수 있을까?
RQ2구조적 및 특징 공간 전반에 걸친 경사-guided 섭동이 간선 삭제만의 방법에 비해 더 유효하고 충실하며 의미적으로 일관된 반사실적을 만들어내는가?
RQ3코사인 기반 의미 충실도 도입이 그래프 반사실적의 분포 외 아티팩트를 줄이는가?
RQ4다양한 데이터셋에서 XPlore가 최첨단 그래프 반사실적 설명기 기반선과 비교하여 어떤 성능을 보이는가?
RQ5자유 노드 특징 섭동 허용과 게이팅 메커니즘의 차이가 설명 품질에 어떤 영향을 미치는가?

주요 결과

XPlore는 17/18 데이터셋에서 기준선 대비 더 높은 유효성과 충실도를 달성한다.
XPlore는 벤치마크에서 최첨단 기준선 대비 유효성에서 최대 +56.3%, 충실도에서 +52.8% 향상을 보인다.
간선 삽입 및 노드 특징 섭동 허용은 간선 삭제만보다 더 풍부하고 응집력 있는 반사실적을 생성한다.
그래프 임베딩의 코사인 유사도는 의미 충실도와 상관관계가 있어 반사실적의 의미 보존이 더 좋음을 시사한다.
자유 노드 섭동(및 게이팅이 있는 변형)은 유효성과 충실도 측면에서 간선 삭제만 또는 게이팅만 변형보다 우수하다.
XPlore는 실행 시간을 유지하면서 명확히 우수한 설명 가능성 지표를 제공한다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.