QUICK REVIEW

[논문 리뷰] Make up your mind: the price of online queries in differential privacy

Mark Bun, Thomas Steinke|arXiv (Cornell University)|2017. 01. 16.

Privacy-Preserving Technologies in Data인용 수 15

한 줄 요약

이 논문은 온라인, 오프라인, 적응형 세 가지 차별적 프라이버시 모델이 능력 면에서 본질적으로 다름을 입증한다. 오프라인 모델에서는 온라인 모델보다 지수적으로 더 많은 통계적 질의를 정확하게 답변할 수 있으며, 온라인 모델에서는 적응형 모델보다 더 많은 검색 질의를 답변할 수 있음을 보여, 실제로 이러한 모델들이 동일하다는 가정을 도전한다.

ABSTRACT

We consider the problem of answering queries about a sensitive dataset subject to differential privacy. The queries may be chosen adversarially from a larger set Q of allowable queries in one of three ways, which we list in order from easiest to hardest to answer:• Offline: The queries are chosen all at once and the differentially private mechanism answers the queries in a single batch.• Online: The queries are chosen all at once, but the mechanism only receives the queries in a streaming fashion and must answer each query before seeing the next query.• Adaptive: The queries are chosen one at a time and the mechanism must answer each query before the next query is chosen. In particular, each query may depend on the answers given to previous queries.Many differentially private mechanisms are just as efficient in the adaptive model as they are in the offline model. Meanwhile, most lower bounds for differential privacy hold in the offline setting. This suggests that the three models may be equivalent.We prove that these models are all, in fact, distinct. Specifically, we show that there is a family of statistical queries such that exponentially more queries from this family can be answered in the offline model than in the online model. We also exhibit a family of search queries such that exponentially more queries from this family can be answered in the online model than in the adaptive model. We also investigate whether such separations might hold for simple queries like threshold queries over the real line.

연구 동기 및 목표

오프라인, 온라인, 적응형 세 가지 차별적 프라이버시 모델이 정확하게 답변할 수 있는 질의 수 측면에서 상호 동일한지 조사하기.
오프라인 모델에서 효율적인 메커니즘이 적응형 환경에서도 여전히 효율적이라는 일반적인 가정에 도전하기.
질의 수가 지수적으로 다름을 보이는 질의 집합을 구성함으로써 각 모델 간의 공식적 분리를 확립하기.
이러한 분리가 실수선 상의 임계값 질의와 같은 간단한 질의 유형으로까지 확장되는지 탐색하기.

제안 방법

오프라인 모델이 온라인 모델보다 지수적으로 더 많은 질의를 정확하게 답변할 수 있는 통계적 질의 집합을 구성하기.
온라인 모델에서는 적응형 모델보다 훨씬 더 많은 수의 질의를 답변할 수 있는 검색 질의 집합을 설계하기.
정보 이론적 추론을 사용하여 각 모델 하에서 답변 가능한 질의 수의 하한을 설정하기.
적응형 모델에서 각 질의가 이전 답변에 의존할 수 있음을 고려하여 질의의 종속성 구조를 분석함으로써 어려움이 증가함을 보여주기.
질의 복잡도를 지표로 삼아, 세 모델 간의 차별적 프라이버시 메커니즘의 성능을 비교하기.
질의 선택 구조(배치 vs. 스트리밍 vs. 적응형)가 차별적 프라이버시의 유용성-보안 트레이드오프에 본질적으로 영향을 준다는 것을 입증하기.

실험 결과

연구 질문

RQ1오프라인, 온라인, 적응형 차별적 프라이버시 모델은 정확하게 답변할 수 있는 질의 수 측면에서 동일한가?
RQ2오프라인 모델과 온라인 모델 간에 정확하게 답변 가능한 질의 수에 지수적 격차가 존재할 수 있는가?
RQ3일부 질의 집합에 대해 온라인 모델과 적응형 모델 간에 지수적 분리가 존재하는가?
RQ4이러한 분리는 실수선 상의 임계값 질의와 같은 단순한 질의 유형으로도 유지되는가?

주요 결과

오프라인 모델이 온라인 모델보다 지수적으로 더 많은 질의를 정확하게 답변할 수 있는 통계적 질의 집합이 존재한다.
온라인 모델이 적응형 모델보다 지수적으로 더 많은 질의를 정확하게 답변할 수 있는 검색 질의 집합이 존재한다.
오프라인, 온라인, 적응형 세 모델은 차별적 프라이버시 하에서 질의 답변 능력 면에서 본질적으로 다름을 보인다.
결과는 적응형 모델이 온라인 모델보다 엄격히 제약이 있으며, 온라인 모델이 오프라인 모델보다 엄격히 제약가능한 유용성 측면에서 특정 질의 집합에 대해 보여준다.
이러한 분리는 복잡한 질의 집합에 국한되지 않으며, 논문은 임계값 질의와 같은 단순한 질의 유형에 대해 유사한 격차가 존재하는지 조사하지만, 제공된 요약에서는 이러한 질의에 대한 명시적 정량적 결과는 제시하지 않는다.

더 나은 연구,지금 바로 시작하세요

연구 설계부터 논문 작성까지, 연구 시간을 획기적으로 줄여보세요.

카드 등록 없음 · 무료 플랜 제공

이 리뷰는 AI가 만들고, 인간 에디터가 검토했습니다.