QUICK REVIEW

[論文レビュー] Network Analytics for Anti-Money Laundering -- A Systematic Literature Review and Experimental Evaluation

Bruno Deprez, Toon Vanderschueren|arXiv (Cornell University)|May 29, 2024

Crime, Illicit Activities, and Governance被引用数 6

ひとこと要約

97件の論文を横断するAMLのためのネットワーク解析（NA）の体系的文献レビューと、Ellipticデータセット上で著名なNA手法をベンチマークする実験フレームワークを追加し、グラフニューラルネットワークが最良の性能を示す。

ABSTRACT

Money laundering presents a pervasive challenge, burdening society by financing illegal activities. The use of network information is increasingly being explored to effectively combat money laundering, given it involves connected parties. This led to a surge in research on network analytics for anti-money laundering (AML). The literature is, however, fragmented and a comprehensive overview of existing work is missing. This results in limited understanding of the methods to apply and their comparative detection power. This paper presents an extensive and unique literature review, based on 97 papers from Web of Science and Scopus, resulting in a taxonomy following a recently proposed fraud analytics framework. We conclude that most research relies on expert-based rules and manual features, while deep learning methods have been gaining traction. This paper also presents a comprehensive framework to evaluate and compare the performance of prominent methods in a standardized setup. We compare manual feature engineering, random walk-based, and deep learning methods on two publicly available data sets. We conclude that (1) network analytics increases the predictive power, but caution is needed when applying GNNs in the face of class imbalance and network topology, and that (2) care should be taken with synthetic data as this can give overly optimistic results. The open-source implementation facilitates researchers and practitioners to extend this work on proprietary data, promoting a standardised approach for the analysis and evaluation of network analytics for AML.

研究の動機と目的

法定通貨と暗号資産の両方の文脈にわたるAMLのためのネットワーク解析に関する網羅的な体系的レビューを提供する。
AMLタスクを比較するための統一的な実験フレームワークを開発・公開する。
共通の設定の下で従来の特徴量ベース、ランダムウォーク、およびグラフニューラルネットワークアプローチを評価する。
再現性を促進するために再現・拡張のためのオープンソースコードを提供する。

提案手法

Web of ScienceとScopusで97件の論文を特定した体系的文献レビュー
公表データ、学習方法、評価指標、目的、データタイプの5次元を用いた論文の分類
Ellipticデータセットに適用した統一実験フレームワークの構築
手動特徴量エンジニアリング、ランダムウォークベースの手法、深層学習GNNを統一的な設定で実装
再現と拡張を可能にするオープンソースコードリポジトリ

Figure 2: Research Output. The number of papers incorporated in the full query and after filtering per year.

実験結果

リサーチクエスチョン

RQ1文献全体で使用される主要なネットワーク解析手法は何か（監視型、教師なし、視覚化を含む）
RQ2実データ（Ellipticデータセット）上で統一的な実験設定で異なるNA手法はどのように性能を示すか
RQ3NAの比較可能性と再現性に影響を与えるデータと評価実践は何か

主な発見

ネットワーク解析はNAを用いないベースラインと比較してAMLモデルの予測力を向上させる。
Ellipticデータセット上で評価されたNA手法の中でグラフニューラルネットワークが最良の結果を達成。
オープンソースデータ（暗号資産）はデータ間評価を支配しており、暗号資産以外のAML研究では独自データが一般的。
ほとんどの研究は中心性ベースの特徴量に依存しており、AML NA文脈で最新の深層学習を適用する研究は少ない。
評価はデータ入手性、ラベルの不均衡、標準化されたベンチマークの必要性といった課題を指摘している。

(a) The distribution of the learning methods.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。