QUICK REVIEW
[论文解读] Subsampled R\\'enyi Differential Privacy and Analytical Moments Accountant
Yu-Xiang Wang, Borja Balle|arXiv (Cornell University)|Jul 31, 2018
Privacy-Preserving Technologies in Data被引用 95
一句话总结
该论文给出对子采样机制的 Rényi 差分隐私参数的紧上界,使用原始机制的 RDP 和子采样率,并引入一种分析性矩账户。
ABSTRACT
We study the problem of subsampling in differential privacy (DP), a question that is the centerpiece behind many successful differentially private machine learning algorithms. Specifically, we provide a tight upper bound on the R\\'enyi Differential Privacy (RDP) (Mironov, 2017) parameters for algorithms that: (1) subsample the dataset, and then (2) applies a randomized mechanism M to the subsample, in terms of the RDP parameters of M and the subsampling probability parameter. Our results generalize the moments accounting technique, developed by Abadi et al. (2016) for the Gaussian mechanism, to any subsampled RDP mechanism.
研究动机与目标
- Motivate the need to understand privacy amplification under subsampling for Rényi DP (RDP).
- Derive a tight, general bound on the RDP parameters of subsampled mechanisms in terms of the original mechanism and sampling ratio.
- Propose an analytical moments accountant to track privacy parameters across compositions without predefined moment lists.
- Introduce a new ternary Pearson-Vajda divergence concept related to subsampling effects and connect it to RDP.
- Provide practical computational methods to output (epsilon, delta)-DP from RDP bounds and demonstrate improvements via experiments.
提出的方法
- Define subsampling without replacement and quantify privacy amplification for RDP (Theorem 9).
- Prove a tight upper bound on epsilon'(alpha) for M composed with subsample, in terms of epsilon(alpha), gamma, and epsilon(2) (and related terms).
- Show a lower bound (Proposition 11) indicating the bound is tight in general.
- Introduce a data structure for an analytical moments accountant that tracks the CGF K_M(lambda) symbolically and converts to (epsilon, delta)-DP efficiently.
- Discuss special cases including pure DP and Gaussian/Laplace mechanisms and outline asymptotic regimes for alpha and gamma.
实验结果
研究问题
- RQ1How does subsampling affect Rényi DP parameters for a general mechanism M?
- RQ2Can we bound epsilon'(alpha) for M composed with subsample in terms of epsilon(alpha) and the subsampling ratio gamma?
- RQ3Is the derived bound tight, and under what conditions can it be improved or matched by lower bounds?
- RQ4How can we efficiently track privacy parameters across composition using an analytical moments accountant?
- RQ5What are the practical implications and computational considerations for implementing these bounds in privacy-preserving ML pipelines?
主要发现
- A tight bound (Theorem 9) is provided for epsilon'(alpha) of M∘subsample in terms of gamma, alpha, and epsilon(·).
- The bound applies to any RDP mechanism, including Gaussian, Laplace, and exponential-family-based mechanisms.
- There is a phase transition in alpha for the amplification behavior: for small alpha the bound scales as O(alpha gamma^2), while for large alpha it can approach epsilon(alpha) or scale with gamma epsilon(∞).
- A lower bound (Proposition 11) demonstrates the upper bound cannot be improved in general without additional per-instance refinements.
- An analytical moments accountant is proposed, tracking CGFs symbolically to output (epsilon, delta)-DP efficiently without a fixed list of moments.
- The framework supports subsampling before applying M, enabling tighter privacy accounting in private learning settings and other DP applications.
更好的研究,从现在开始
从论文设计到论文写作,大幅缩短您的研究时间。
无需绑定信用卡
本解读由 AI 生成,并经人工编辑审核。