QUICK REVIEW

[論文レビュー] You are who you know and how you behave: attribute inference attacks via users' social friends and behaviors

Neil Zhenqiang Gong, Bin Liu|arXiv (Cornell University)|Aug 10, 2016

Privacy-Preserving Technologies in Data参考文献 43被引用数 56

ひとこと要約

本論文は、ユーザーのSNS友人関係と行動記録（例：いいねをしたページやアプリ）を統合して、居住地、職業、関心など、プライベートな属性を推定する新しい属性推定攻撃を提案する。統一されたモデルを用いて社会的要因と行動的要因を統合することで、110万ユーザーの実世界データセットにおいて、ユーザーの都市の推定に57％の成功率を達成し、最も信頼性の高い半数のユーザーに限定して攻撃を行うと、90％を超える精度に上昇する。

ABSTRACT

We propose new privacy attacks to infer attributes (e.g., locations, occupations, and interests) of online social network users. Our attacks leverage seemingly innocent user information that is publicly available in online social networks to infer missing attributes of targeted users. Given the increasing availability of (seemingly innocent) user information online, our results have serious implications for Internet privacy - private attributes can be inferred from users' publicly available data unless we take steps to protect users from such inference attacks. To infer attributes of a targeted user, existing inference attacks leverage either the user's publicly available social friends or the user's behavioral records (e.g., the webpages that the user has liked on Facebook, the apps that the user has reviewed on Google Play), but not both. As we will show, such inference attacks achieve limited success rates. However, the problem becomes qualitatively different if we consider both social friends and behavioral records. To address this challenge, we develop a novel model to integrate social friends and behavioral records and design new attacks based on our model. We theoretically and experimentally demonstrate the effectiveness of our attacks. For instance, we observe that, in a real-world large-scale dataset with 1.1 million users, our attack can correctly infer the cities a user lived in for 57% of the users; via confidence estimation, we are able to increase the attack success rate to over 90% if the attacker selectively attacks a half of the users. Moreover, we show that our attack can correctly infer attributes for significantly more users than previous attacks.

研究の動機と目的

従来の属性推定攻撃が、社会的友人関係や行動記録のいずれかに限定されているという限界を解決すること。
社会的ネットワーク構造とユーザー行動を統合することで、属性推定の正確性が向上するかを調査すること。
社会的要因と行動的要因を効果的に統合できる統一モデルを設計すること。
大規模なSNSデータ上で、このような統合型攻撃の実世界での実現可能性と成功確率を評価すること。

提案手法

ユーザーの社会的友人関係と行動記録（例：いいねをしたページ、アプリ）を統合的に符号化する新しい機械学習モデルを開発し、統一された表現に変換する。
統合された表現を用いて、ターゲットユーザーの居住地、職業、関心といったプライベート属性を予測する。
予測の信頼性が高いユーザーを特定・優先順位付けするための信頼度推定技術を適用し、選択的攻撃を実施する。
110万ユーザーの実世界データセットを活用して攻撃モデルの学習と評価を実施する。
提案手法の統合モデルの性能を、社会的要因または行動的要因のいずれかのみを用いる従来手法と比較する。
正確性やAUCといった標準的な評価指標を用いて、攻撃の成功確率を測定する。

実験結果

リサーチクエスチョン

RQ1社会的友人関係と行動記録を統合することで、単独で使用する場合と比較して、属性推定攻撃の成功率が顕著に向上するか？
RQ2提案されたモデルは、居住都市、職業、関心といったプライベート属性をどの程度効果的に推定できるか？
RQ3信頼度推定を用いることで、予測が比較的容易なユーザーに集中することで、攻撃性能がどの程度向上するか？
RQ4実世界の大規模SNSデータ上で、提案された攻撃は正確性とカバー範囲の両面でどの程度スケーラブルか？

主な発見

提案された攻撃は、110万ユーザーの実世界データセットにおいて、ユーザーの居住都市を正しく推定する成功率が57％に達した。
信頼度推定を活用して、予測の信頼性が高いユーザーに限定して攻撃を行うことで、ユーザーの半数に焦点を当てた場合、成功率が90％を超えるまで上昇した。
従来の社会的友人関係や行動記録のいずれかに依存する手法よりも顕著に優れており、はるかに多くのユーザーに対して属性を推定できた。
社会的要因と行動的要因の統合により、推定性能に質的な改善が見られ、マルチモーダルデータがプライバシー攻撃において価値を持つことが示された。

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。