QUICK REVIEW

[论文解读] Remember What You Want to Forget: Algorithms for Machine Unlearning

Ayush Sekhari, Jayadev Acharya|arXiv (Cornell University)|Mar 4, 2021

Privacy-Preserving Technologies in Data参考文献 32被引用 67

一句话总结

本文研究机器学习中的遗忘泛化，提出一种针对凸损失的遗忘算法，最多删除 O(n/d^{1/4}) 个样本，测试性能良好，并显示与差分隐私的分离。

ABSTRACT

We study the problem of unlearning datapoints from a learnt model. The learner first receives a dataset $S$ drawn i.i.d. from an unknown distribution, and outputs a model $\widehat{w}$ that performs well on unseen samples from the same distribution. However, at some point in the future, any training datapoint $z \in S$ can request to be unlearned, thus prompting the learner to modify its output model while still ensuring the same accuracy guarantees. We initiate a rigorous study of generalization in machine unlearning, where the goal is to perform well on previously unseen datapoints. Our focus is on both computational and storage complexity. For the setting of convex losses, we provide an unlearning algorithm that can unlearn up to $O(n/d^{1/4})$ samples, where $d$ is the problem dimension. In comparison, in general, differentially private learning (which implies unlearning) only guarantees deletion of $O(n/d^{1/2})$ samples. This demonstrates a novel separation between differential privacy and machine unlearning.

研究动机与目标

研究在保持对未见数据的泛化能力的前提下，可以从训练数据中遗忘多少样本。
在实际内存约束下，探索存储与计算高效的遗忘方法。
展示机器遗忘与差分隐私在删除数据时的分离。
开发在删除时不需要访问全部训练数据的遗忘算法。

提出的方法

将总体（测试）损失与经验损失进行对比以引出遗忘目标。
引入带有统计量 T(S) 的遗忘框架，使得在不存储所有数据的情况下实现删除。
证明分离结果：基于 DP 的遗忘的删除容量约为 n/d^{1/2}，而我们对凸损失的方法实现约 n/d^{1/4}。
给出一种针对强凸损失的遗忘算法，利用 Hessian 信息和噪声在实现类似 DP 的保证的同时使用更小的噪声。
表明该遗忘算法存储 O(d^2) 的统计量，运行时间为 O(d^{ω})。
通过正则化的归约将从强凸扩展到凸损失。

实验结果

研究问题

RQ1在保持良好测试损失保证的前提下，可以从训练好的模型中删除多少样本？
RQ2存储与计算需求如何影响遗忘算法的设计？
RQ3在凸损失情境下，遗忘是否可以超越基于 DP 的删除容量？
RQ4删除容量、内存使用和计算效率之间的权衡是什么？
RQ5基于样本感知更新的遗忘是否比仅 DP 的方法能获得更好的泛化？

主要发现

对于凸（以及强凸）损失，存在一种遗忘算法，在保持超额风险为 0.01 的同时，最多可删除 m ≥ c · n · sqrt(ε) / (d log(1/δ))^{1/4} 个样本。
所提出的遗忘方法存储基于 Hessian 的统计量，内存为 O(d^2)，并以 O(d^{ω}) 的时间实现遗忘。
用于遗忘的噪声比 DP 噪声小的因子为与 n^2/m^2 成正比，从而提升删除容量。
明确分离：基于 DP 的学习得到的删除容量约为 n/d^{1/2}，而新算法在凸损失下达到约 n/d^{1/4}。
结果适用于强凸和凸损失，删除后学习得到的解保持接近最优的测试损失。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。