QUICK REVIEW

[論文レビュー] Acyclic Graph Pattern Counting under Local Differential Privacy

Yihua Hu, Kuncan Wang|arXiv (Cornell University)|Mar 20, 2026

Privacy-Preserving Technologies in Data被引用数 0

ひとこと要約

この論文は、エッジローカル差分プライバシーの下で任意の非巡回グラフパターンをカウントする初の一般解を提供する。複数ラウンドのプライバシー保護フレームワークを用い、強い有用性保証を実現。tilde-O(sqrt(N) poly(d(G)))の加法誤差とRRベースのベースラインに比べた通信量の削減を達成。

ABSTRACT

Graph pattern counting serves as a cornerstone of network analysis with extensive real-world applications. Its integration with local differential privacy (LDP) has gained growing attention for protecting sensitive graph information in decentralized settings. However, existing LDP frameworks are largely ad hoc, offering solutions only for specific patterns such as triangles and stars. A general mechanism for counting arbitrary graph patterns, even for the subclass of acyclic patterns, has remained an open problem. To fill this gap, we present the first general solution for counting arbitrary acyclic patterns under LDP. We identify and tackle two fundamental challenges: generalizing pattern construction from distributed data and eliminating node duplication during the construction. To address the first challenge, we propose an LDP-tailored recursive subpattern counting framework that incrementally builds patterns across multiple communication rounds. For the second challenge, we apply a random marking technique that restricts each node to a unique position in the pattern during computation. Our mechanism achieves strong utility guarantees: for any acyclic graph pattern with $k$ edges, we achieve an additive error of $ ilde{O}(\sqrt{N}d(G)^k)$, where $N$ is the number of nodes and $d(G)$ is the maximum degree of the input graph $G$. Experiments on real-world graph datasets across multiple types of acyclic patterns demonstrate that our mechanisms achieve up to $46$-$2600 imes$ improvement in utility and $300$-$650 imes$ reduction in communication cost compared to the baseline methods.

研究の動機と目的

プライバシー保護されたグラフパターンカウントを分散設定で推進するためのエッジ-LDPの動機づけ。
三角形や星を超えた任意の非巡回パターンへと一般化した非巡回パターンカウント。
プライバシー保護技術と結合した再帰的サブパターンカウントフレームワークの開発。
分散LDPカウントにおけるノード重複の排除と通信オーバーヘッドの削減。

提案手法

非巡回パターンカウントを非巡回サブパターンへ還元する再帰的サブパターンカウントフレームワークを導入。
パターンの遠距離インスタンス間でノードの重複を防ぐために色付けに触発されたランダムマーク技術を使用。
公正な推定量とtilde-O(√N d(G)^{k-1})誤差を持つk-ライン走行カウント機構を、k-1ラウンドで実行。
ラン walk-countingソリューションにランダムマークを適用してk-ラインパスカウントへ拡張し、同様の有用性を維持。
kエッジを持つ任意の非巡回パターンへ一般化し、O(M+N)通信とO(N)アナライザコストを維持。
ラウンドをk-1に削減し、最大計算をアナライザへ移すことで効率を最適化し、O(M+N)通信とノード毎の計算O(d(G))を実現。

Figure 1 . Illustration of the 2-line walk counting mechanism $\mathcal{M}_{\mathrm{2\text{-}wk}}$ applied to graph $G$ with privacy budget $\varepsilon=4$ . For simplicity, $\sum$ denotes $\sum_{j\in\mathcal{N}(i)}$ .

実験結果

リサーチクエスチョン

RQ1エッジ-LDP下で、検証可能な有用性保証を伴う任意の非巡回グラフパターンカウントを達成できるか？
RQ2ノードを重複させずに分散・プライバシー保護された方法でパターン構築をどう扱うか？
RQ3エッジ-LDP下で中央DPに近い有用性を達成するための最適なラウンド複雑性と通信コストは？
RQ4ランダムマーク手法は重複を防ぎつつラウンドを跨いで偏りのないカウントを維持できるか？
RQ5実データセットで、異なるグラフ次数を持つkエッジパターンへ提案手法はどの程度スケールするか？

主な発見

k-ライン走行カウント機構は、k-1ラウンドで偏りのない推定量を誤差 Õ(√N d(G)^{k-1})で達成。
走行カウント機構の全体通信コストはO(M+N)で、ノードごとの計算はO(d(G))。
ランダムマークを適用したk-ラインパスカウント拡張は、偏りのない推定量を Õ(√N d(G)^{k})誤差で得、同じO(M+N)通信。
任意のkエッジ非巡回パターンに対しても、機構は同様の Õ(√N d(G)^{k})誤差と同じ通信・計算コストを維持。
経験的結果は、RRベース手法に対して有用性が最大46–2600×、通信コストが300–650×の改善を示す。

Figure 2 . Illustration of the $3$ -line path counting mechanism $\mathcal{M}_{\mathrm{3\text{-}pt}}$ applied to graph $G$ with privacy budget $\varepsilon=1$ . For simplicity, $\sum$ denotes $\sum_{j\in\mathcal{N}(i)}$ . The initialization step occurs before random marking, but is shown afterward f

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。