QUICK REVIEW

[論文レビュー] Early Rug Pull Warning for BSC Meme Tokens via Multi-Granularity Wash-Trading Pattern Profiling

Dingding Cao, Bianbian Jiao|arXiv (Cornell University)|Mar 14, 2026

Financial Markets and Investment Strategies被引用数 0

ひとこと要約

この論文は、マルチグラニュラリティの wash-trading パターンを用いてトークンレベルのリスク特徴を構築し、監督付きモデルを適用して BSC meme トークンの早期 rug-pull 警告を提供するエンドツーエンドのフレームワークを提示する。弱い監督下で Random Forest が Logistic Regression を上回る。

ABSTRACT

The high-frequency issuance and short-cycle speculation of meme tokens in decentralized finance (DeFi) have significantly amplified rug-pull risk. Existing approaches still struggle to provide stable early warning under scarce anomalies, incomplete labels, and limited interpretability. To address this issue, an end-to-end warning framework is proposed for BSC meme tokens, consisting of four stages: dataset construction and labeling, wash-trading pattern feature modeling, risk prediction, and error analysis. Methodologically, 12 token-level behavioral features are constructed based on three wash-trading patterns (Self, Matched, and Circular), unifying transaction-, address-, and flow-level signals into risk vectors. Supervised models are then employed to output warning scores and alert decisions. Under the current setting (7 tokens, 33,242 records), Random Forest outperforms Logistic Regression on core metrics, achieving AUC=0.9098, PR-AUC=0.9185, and F1=0.7429. Ablation results show that trade-level features are the primary performance driver (Delta PR-AUC=-0.1843 when removed), while address-level features provide stable complementary gain (Delta PR-AUC=-0.0573). The model also demonstrates actionable early-warning potential for a subset of samples, with a mean Lead Time (v1) of 3.8133 hours. The error profile (FP=1, FN=8) indicates that the current system is better positioned as a high-precision screener rather than a high-recall automatic alarm engine. The main contributions are threefold: an executable and reproducible rug-pull warning pipeline, empirical validation of multi-granularity wash-trading features under weak supervision, and deployment-oriented evidence through lead-time and error-bound analysis.

研究の動機と目的

高頻度の BSC meme トークンにおける rug-pull リスク検出の課題を弱い監督下で解決する。
wash-trading パターンを実行可能な警告スコアへ結びつける再現可能なパイプラインを開発する。
予測性能を左右する特徴グループ（取引レベル、アドレスレベル、契約レベル）を特定する。
リードタイムや誤差境界といった展開指向の指標を提供し、リスクトリアージを情報提供する。

提案手法

3つの wash-trading パターン（Self、Matched、Circular）から 12 個のトークンレベルの行動特徴を構築する。
取引、アドレス、フローの信号を集約してトークンレベルのリスク特徴ベクトルを作成する。
監督付きモデル（Logistic Regression および Random Forest）を訓練し、警告スコアとリードタイム推定を出力する。
精度、適合率、再現率、F1、AUC、PR-AUC、リードタイムで評価し、PR-AUC を指標の主要なランキング指標として扱う。
取引レベル、アドレスレベル、契約レベルの寄与をアブレーションで評価する。

Figure 1: BSC Meme Token Rug Pull Early-warning Framework. The overall pipeline contains four stages (E1–E4): data construction and labeling, wash-trading pattern profiling, early-warning modeling, and ablation/error analysis.

実験結果

リサーチクエスチョン

RQ1弱い監督下でマルチグラニュラリティの wash-trading 特徴が安定した早期警告信号を提供できるか。
RQ2どの特徴グループが予測性能に最も寄与し、再現率と適合率にどのように影響するか。
RQ3BSC meme トークンの早期警告モデルを展開する際のリードタイムと誤差プロファイルはどうなるか。
RQ4非線形モデルはオンチェーンリスクパターンを捉える点で線形ベースラインとどう比較されるか。

主な発見

Model	Accuracy	Precision	Recall	F1	AUC	PR-AUC	Lead Time (h)
Logistic Regression	0.6500	0.7059	0.5714	0.6316	0.7243	0.7397	3.8133
Random Forest	0.7750	0.9286	0.6190	0.7429	0.9098	0.9185	3.8133

Random Forest は中核指標で Logistic Regression を上回った（AUC 0.9098、PR-AUC 0.9185、F1 0.7429）。
取引レベルの特徴が主要な性能ドライバであり、除去すると PR-AUC が 0.1843 減少。
アドレスレベルの特徴は安定した補完的ゲインを提供し、除去すると PR-AUC が 0.0573 減少。
現在の契約レベル特徴は識別力として限定的であり、除去すると PR-AUC が 0.0077 増加。
Lead Time（v1）はケースの一部で平均 3.8133 時間、中央値は約 1.0331 時間。
誤差プロファイルは FP=1、FN=8 を示し、高精度なスクリーニングを示唆するが高リコール自動化ではない。

Figure 2: Data Collection and Labeling Pipeline. The workflow includes export, token-wise merging, deduplication, normalization, window capping, rule-based labeling, and quality-control checks.

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。