QUICK REVIEW

[论文解读] Analysis of Location Data Leakage in the Internet Traffic of Android-based Mobile Devices

Nir Sivan, Ron Bitton|arXiv (Cornell University)|Dec 1, 2018

Human Mobility and Location-Based Analysis被引用 1

一句话总结

本研究通过在37天内监控真实用户的网络流量，分析了Android设备中未加密位置数据的泄露情况。提出了一种从明文流量中提取并聚类位置数据的方法，发现超过85%的设备存在位置信息泄露，其中61%的用户兴趣点可通过稀疏的泄露指标推断得出。

ABSTRACT

In recent years we have witnessed a shift towards personalized, context-based applications and services for mobile device users. A key component of many of these services is the ability to infer the current location and predict the future location of users based on location sensors embedded in the devices. Such knowledge enables service providers to present relevant and timely offers to their users and better manage traffic congestion control, thus increasing customer satisfaction and engagement. However, such services suffer from location data leakage which has become one of today's most concerning privacy issues for smartphone users. In this paper we focus specifically on location data that is exposed by Android applications via Internet network traffic in plaintext (i.e., without encryption) without the user's awareness. We present an empirical evaluation, involving the network traffic of real mobile device users, aimed at: (1) measuring the extent of location data leakage in the Internet traffic of Android-based smartphone devices; and (2) understanding the value of this data by inferring users' points of interests (POIs). This was achieved by analyzing the Internet traffic recorded from the smartphones of a group of 71 participants for an average period of 37 days. We also propose a procedure for mining and filtering location data from raw network traffic and utilize geolocation clustering methods to infer users' POIs. The key findings of this research center on the extent of this phenomenon in terms of both ubiquity and severity; we found that over 85\% of devices of users are leaking location data, and the exposure rate of users' POIs, derived from the relatively sparse leakage indicators, is around 61%.

研究动机与目标

测量基于Android的移动设备在未加密互联网流量中位置数据泄露的程度。
通过分析网络流量推断用户兴趣点（POIs），理解此类泄露的隐私影响。
开发一种从原始网络流量中挖掘和过滤位置数据的程序。
评估在真实世界Android使用环境中位置数据泄露的严重性和普遍性。

提出的方法

从71部真实Android智能手机收集平均37天的网络流量，以捕捉真实世界使用模式。
提出一种数据挖掘与过滤程序，从非结构化网络流量中提取与位置相关的信息。
应用地理定位聚类技术，将提取的位置数据分组，推断用户常去的兴趣点（POIs）。
使用明文网络流量识别位置指标，如GPS坐标、IP地址和基于位置的API调用。
通过分析设备间位置数据泄露的频率和一致性，评估兴趣点的暴露率。
训练并应用聚类算法，基于空间接近性将泄露的位置数据点映射为有意义的兴趣点。

实验结果

研究问题

RQ1有多少比例的Android设备通过其互联网流量以明文形式泄露位置数据？
RQ2在稀疏且非结构化的网络流量位置数据中，用户兴趣点（POIs）能在多大程度上被推断？
RQ3在多样化的真实世界Android用户群体中，位置数据泄露的普遍性和严重性如何？
RQ4哪些技术能有效从原始未加密网络流量中提取并聚类位置数据？

主要发现

在所研究的71部Android设备中，超过85%通过其互联网流量以明文形式泄露了位置数据。
基于稀疏泄露指标推断出的用户兴趣点（POIs）暴露率约为61%。
即使用户未意识到此类数据传输，位置数据仍经常以未加密形式传输。
地理定位聚类方法成功地从碎片化且不一致的位置数据点中识别出有意义的兴趣点。
本研究证实，位置数据泄露既广泛存在，又可能具有高度敏感性，即使仅存在少量数据泄露。
即使仅泄露有限的位置元数据，也能准确推断出用户的习惯性位置和行为。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。