QUICK REVIEW

[論文レビュー] On the Power of Multiple Anonymous Messages

Badih Ghazi, Noah Golowich|arXiv (Cornell University)|Aug 29, 2019

Privacy-Preserving Technologies in Data参考文献 93被引用数 23

ひとこと要約

本稿は、微分プライバシーのシャッフルモデルにおける頻度推定および関連問題について、根本的な限界と効率的なプロトコルを確立する。複数のメッセージをユーザーが送信できることで、単一メッセージプロトコルと比較して誤差が指数的に小さくなることを証明し、誤差と通信量の両方が多対数的（polylogarithmic）に抑えられ、単一メッセージ設定におけるほぼタイトな下界を提示する。

ABSTRACT

An exciting new development in differential privacy is the shuffled model, in which an anonymous channel enables non-interactive, differentially private protocols with error much smaller than what is possible in the local model, while relying on weaker trust assumptions than in the central model. In this paper, we study basic counting problems in the shuffled model and establish separations between the error that can be achieved in the single-message shuffled model and in the shuffled model with multiple messages per user. For the problem of frequency estimation for $n$ users and a domain of size $B$, we obtain: - A nearly tight lower bound of $ ildeΩ( \min(\sqrt[4]{n}, \sqrt{B}))$ on the error in the single-message shuffled model. This implies that the protocols obtained from the amplification via shuffling work of Erlingsson et al. (SODA 2019) and Balle et al. (Crypto 2019) are essentially optimal for single-message protocols. A key ingredient in the proof is a lower bound on the error of locally-private frequency estimation in the low-privacy (aka high $ε$) regime. - Protocols in the multi-message shuffled model with $poly(\log{B}, \log{n})$ bits of communication per user and $poly\log{B}$ error, which provide an exponential improvement on the error compared to what is possible with single-message algorithms. For the related selection problem on a domain of size $B$, we prove: - A nearly tight lower bound of $Ω(B)$ on the number of users in the single-message shuffled model. This significantly improves on the $Ω(B^{1/17})$ lower bound obtained by Cheu et al. (Eurocrypt 2019), and when combined with their $ ilde{O}(\sqrt{B})$-error multi-message protocol, implies the first separation between single-message and multi-message protocols for this problem.

研究の動機と目的

シャッフルモデルにおける複数の匿名メッセージのパワーを理解すること。
頻度推定および選択問題における単一メッセージプロトコルの誤差に関するタイトな下界を確立すること。
頻度推定、範囲カウント、M-推定といった基本的問題に対して、多対数的誤差と通信量を達成する効率的な多メッセージプロトコルを設計すること。
達成可能な誤差の観点から、単一メッセージプロトコルと多メッセージプロトコルの間に明確な分離を示すこと。
先行研究の制限を克服するために、局所的プライバシーの頻度推定技術を高プライバシー（低ε）の領域へと拡張すること。

提案手法

低プライバシー領域における局所的プライバシー頻度推定のための新規技術を用いて、単一メッセージシャッフルモデルにおける誤差のほぼタイトな下界 $\tilde{\Omega}(\min(\sqrt[4]{n}, \sqrt{B}))$ を導出する。
1ユーザーあたり $\mathrm{poly}(\log B, \log n)$ の通信量と $\mathrm{poly}\log B$ の誤差を達成する、プライベートコイン多メッセージプロトコルを導入する。
効率的なデータ構造とシャッフルを用いて、クエリ時間の小さいパブリックコインプロトコルを構築し、範囲カウントに適応する。
定義域を $B = n$ 個の区間に離散化することで、中央値および分位数のM-推定問題を範囲カウント問題に還元する。
行列機構を用いて、多次元の範囲カウント問題を微分プライバシー保証付きの1次元クエリに還元する。
シャッフルモデルの匿名化機能を活用し、非インタラクティブで微分プライバシーを満たすプロトコルを実現し、ローカルモデルと比較して著しく小さい誤差を達成する。

実験結果

リサーチクエスチョン

RQ1単一メッセージシャッフルモデルにおける頻度推定の誤差の根本的限界は何か？
RQ2シャッフルモデルにおいて1ユーザーあたり複数のメッセージを送信することで、単一メッセージプロトコルと比較して誤差を指数的に小さくできるか？
RQ3単一メッセージシャッフルモデルにおける選択問題の最小ユーザー数は何か？
RQ4中央値および分位数のM-推定問題を、多メッセージシャッフルモデルでどのように効率的に解けるか？
RQ5局所的プライバシー頻度推定のための技術を、高プライバシー領域（高ε）を超えて拡張可能か？

主な発見

単一メッセージシャッフルモデルにおける頻度推定の誤差に対して、ほぼタイトな下界 $\tilde{\Omega}(\min(\sqrt[4]{n}, \sqrt{B}))$ が確立された。
Erlingssonら（SODA 2019）およびBalleら（Crypto 2019）のプロトコルが、単一メッセージ設定において本質的に最適であることが示された。
1ユーザーあたり $\mathrm{poly}(\log B, \log n)$ の通信量と $\mathrm{poly}\log B$ の誤差を達成する多メッセージプロトコルが構築された。
選択問題に関しては、単一メッセージモデルで $\Omega(B)$ 個のユーザーが必要であることを証明し、以前の $\Omega(B^{1/17})$ の下界を著しく改善した。
選択問題において、単一メッセージプロトコルと多メッセージプロトコルの間で、初めて明確な分離が確立された。
離散化による範囲カウントへの還元を経由し、中央値のM-推定に対して、1ユーザーあたり $\mathrm{poly}\log n$ の誤差と通信量を達成する微分プライバシー多メッセージプロトコルが得られた。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。