QUICK REVIEW

[논문 리뷰] Graph Edit Distance Computation via Graph Neural Networks.

Yunsheng Bai, Hao Ding|arXiv (Cornell University)|2018. 08. 16.

Advanced Graph Neural Networks인용 수 13

한 줄 요약

이 논문은 그래프 에디트 거리(GED)를 효율적이고 정확하게 계산하기 위한 그래프 신경망 기반 접근법인 SimGNN을 제안한다. 학습 가능한 그래프 수준 임베딩과 쌍별 노드 비교 기반 메커니즘을 결합함으로써 SimGNN은 기존 기준 대비 뛰어난 일반화 성능과 상당한 시간 절감을 달성하면서도 실제 데이터셋에서 낮은 오차율을 유지한다.

ABSTRACT

Graph similarity search is among the most important graph-based applications, e.g. finding the chemical compounds that are most similar to a query compound. Graph similarity computation, such as Graph Edit Distance (GED) and Maximum Common Subgraph (MCS), is the core operation of graph similarity search and many other applications, but very costly to compute in practice. Inspired by the recent success of neural network approaches to several graph applications, such as node or graph classification, we propose a novel neural network based approach to address this classic yet challenging graph problem, aiming to alleviate the computational burden while preserving a good performance. The proposed approach, called SimGNN, combines two strategies. First, we design a learnable embedding function that maps every graph into a vector, which provides a global summary of a graph. A novel attention mechanism is proposed to emphasize the important nodes with respect to a specific similarity metric. Second, we design a pairwise node comparison method to supplement the graph-level embeddings with fine-grained node-level information. Our model achieves better generalization on unseen graphs, and in the worst case runs in quadratic time with respect to the number of nodes in two graphs. Taking GED computation as an example, experimental results on three real graph datasets demonstrate the effectiveness and efficiency of our approach. Specifically, our model achieves smaller error rate and great time reduction compared against a series of baselines, including several approximation algorithms on GED computation, and many existing graph neural network based models. To the best of our knowledge, we are among the first to adopt neural networks to explicitly model the similarity between two graphs, and provide a new direction for future research on graph similarity computation and graph similarity search.

연구 동기 및 목표

그래프 유사도 검색 응용 분야에서 정확한 그래프 에디트 거리(GED) 계산의 높은 계산 비용을 해결한다.
기존의 신경망 기반 방법에 비해 새로운 그래프에 대한 일반화 성능을 향상시킨다.
전체 그래프 임베딩과 세밀한 노드 수준의 비교를 결합하여 정확도를 유지하면서 런타임 복잡도를 낮춘다.
기존 GNN 응용 분야의 격차를 메우기 위해 쌍별 그래프 유사도를 명시적으로 모델링하도록 설계된 새로운 신경망 프레임워크를 제공한다.

제안 방법

각 그래프를 전역 구조적 특성을 캡처하는 벡터 표현으로 매핑하는 학습 가능한 임베딩 함수를 설계한다.
유사도 지표에 가장 관련성이 높은 노드를 강조하기 위해 새로운 어텐션 메커니즘을 도입하여 임베딩 품질을 향상시킨다.
그래프 수준의 임베딩에 세밀한 노드 수준의 대응 정보를 보완하기 위해 쌍별 노드 비교 기반 메커니즘을 개발한다.
차별 가능한 손실 함수를 사용하여 GED 값을 예측하기 위해 모델을 엔드 투 엔드로 훈련시켜 효과적인 최적화를 가능하게 한다.
그래프 수준 및 노드 수준의 특징을 조합하여 모델의 표현력과 일반화 능력을 향상시킨다.
노드 수에 대해 이차 시간 복잡도를 달성하여 중간 크기의 그래프에 대해 확장 가능하다.

실험 결과

연구 질문

RQ1높은 정확도와 낮은 계산 비용을 바탕으로 신경망을 효과적으로 훈련시켜 그래프 에디트 거리를 예측할 수 있는가?
RQ2기존의 근사 알고리즘과 GNN 기준 대비 제안된 모델이 새로운 그래프에 대해 얼마나 잘 일반화되는가?
RQ3그래프 수준 및 노드 수준의 정보를 모두 통합할 경우, 얼마나 유사도 예측 성능이 향상되는가?
RQ4정확한 GED 계산에 비해 상당한 속도 향상을 달성하면서도 낮은 오차율을 유지할 수 있는가?

주요 결과

SimGNN은 세 가지 실 그래프 데이터셋에서 여러 근사 알고리즘과 기존의 GNN 기반 모델보다 낮은 오차율을 달성한다.
모델은 새로운 그래프에 대해 강력한 일반화 성능를 보이며, 분포 이탈에 대한 저항력이 있음을 시사한다.
정확한 GED 계산에 비해 런타임이 상당히 감소하였으며, 최악의 경우 노드 수에 대해 이차 복잡도를 가진다.
어텐션 메커니즘이 핵심 노드를 효과적으로 식별하여, 유사도 작업을 위한 그래프 임베딩 품질을 향상시킨다.
그래프 수준 및 쌍별 노드 수준의 특징 조합이 각각의 구성 요소를 별도로 사용할 때보다 더 나은 성능을 낸다.
저자들의 지식에 비추어 볼 때, SimGNN은 신경망을 사용해 쌍별 그래프 유사도를 명시적으로 모델링한 최초의 모델들 중 하나이다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.