Skip to main content
QUICK REVIEW

[论文解读] Inferring Private Personal Attributes of Virtual Reality Users from Head and Hand Motion Data

Vivek Nair, Christian Räck|arXiv (Cornell University)|May 30, 2023
Sexuality, Behavior, and Technology被引用 9
一句话总结

本文显示仅使用基于 Transformer 的模型,通过VR头部与手部运动数据,就能从 Beat Saber 等友好游戏中推断出超过 40 项私人个人属性,揭示多用户 VR 场景中的隐私风险。

ABSTRACT

Motion tracking "telemetry" data lies at the core of nearly all modern virtual reality (VR) and metaverse experiences. While generally presumed innocuous, recent studies have demonstrated that motion data actually has the potential to uniquely identify VR users. In this study, we go a step further, showing that a variety of private user information can be inferred just by analyzing motion data recorded from VR devices. We conducted a large-scale survey of VR users (N=1,006) with dozens of questions ranging from background and demographics to behavioral patterns and health information. We then obtained VR motion samples of each user playing the game "Beat Saber," and attempted to infer their survey responses using just their head and hand motion patterns. Using simple machine learning models, over 40 personal attributes could be accurately and consistently inferred from VR motion data alone. Despite this significant observed leakage, there remains limited awareness of the privacy implications of VR motion data, highlighting the pressing need for privacy-preserving mechanisms in multi-user VR applications.

研究动机与目标

  • 证明在非对抗性游戏环境中,可以从 VR 头部和手部运动数据推断出私人用户属性。
  • 仅通过运动遥测量化哪些属性在统计显著性上可被推断。
  • 提供一个可泛化的机器学习框架,用于从顺序的 VR 运动数据推断二元属性。

提出的方法

  • 在 1,006 名 Beat Saber 玩家中收集运动遥测数据(每帧 21 个特征,来自头部和双手),并附带详细问卷。
  • 将 50 个选定属性转化为二分类任务(例如年龄组、婚姻状况等)。
  • 在 21×1024 帧序列上训练基于 Transformer 的模型,以预测二元属性,使用每类均衡数据集及 Monte Carlo 交叉验证。
  • 通过按序列和按用户的准确率以及二项检验的统计显著性(p<0.01, p<0.05)评估推断成功率。
  • 使用 Adam 优化和 BCE 损失进行 100 轮训练;报告跨属性的聚合结果。

实验结果

研究问题

  • RQ1在非对抗性 VR 游戏中,是否可以仅从头部和手部运动数据推断出 VR 用户的私人属性?
  • RQ2在 Beat Saber 中,哪些属性可以通过运动遥测达到统计显著的推断?
  • RQ3基于 Transformer 的模型在这类顺序运动数据推断任务中的效果如何?

主要发现

属性总数测试数量准确率显著性总数测试数量准确率显著性
StandaloneGrip31,1006,00085.9%<0.0013116091.7%<0.001
Height19,1006,00076.5%<0.0011916086.7%<0.001
Controller33,2006,00081.2%<0.0013326085.0%<0.001
Weight9,8006,00073.6%<0.001986085.0%<0.001
FootSize9,1006,00073.2%<0.001916085.0%<0.001
Country33,3006,00060.3%<0.0013336081.7%<0.001
RhythmGames10,9006,00063.5%<0.0011096080.0%<0.001
Age62,3006,00064.9%<0.0016236078.3%<0.001
TotalPlayTime34,4006,00067.7%<0.0013446078.3%<0.001
Headset65,0006,00066.9%<0.0016506076.7%<0.001
LeftArm10,3006,00065.2%<0.0011036076.7%<0.001
RightArm10,2006,00064.9%<0.0011026075.0%<0.001
Athletics8,7006,00059.1%<0.001876075.0%<0.001
MaritalStatus81,4006,00060.2%<0.0018146073.3%<0.001
EmploymentStatus64,2006,00065.1%<0.0016426071.7%<0.001
AnyRhythmGames83,0006,00054.8%<0.0018306070.0%<0.001
Ethnicity73,9006,00059.7%<0.0017396070.0%<0.001
SteamComputerFormFactor51,3006,00058.5%<0.0015136070.0%<0.001
Footwear36,7006,00060.5%<0.0013676070.0%<0.001
AnyVRRhythmGames83,0008,00056.8%<0.0018308068.8%<0.001
Income76,7008,00055.0%<0.0017678068.8%<0.001
Wingspan16,0008,00059.9%<0.0011608068.8%<0.001
Handedness71,60010,00055.2%<0.00171610066.0%<0.001
HandLength51,0008,00058.5%<0.0015108066.3%<0.001
SubstanceUse69,20010,00055.9%<0.00169210064.0%<0.001
Preparation39,4008,00058.2%<0.0013948065.0%<0.001
LowerBody29,5008,00055.9%<0.0012958065.0%<0.001
Lenses80,9008,00055.3%<0.0018098065.0%<0.001
Languages80,7008,00056.5%<0.0018078065.0%<0.001
SteamOperatingSystemVersion50,8008,00058.4%<0.0015088065.0%<0.001
Music29,6008,00053.6%<0.0012968065.0%<0.001
AnyMentalDisabilities83,00010,00052.6%<0.00183010063.0%<0.001
Sex76,30010,00056.5%<0.00176310063.0%<0.001
AnyPhysicalDisabilities83,00010,00054.5%<0.00183010062.0%<0.001
ReactionTime9,80014,00053.1%<0.0019814060.0%<0.001
AnyMusic83,0008,00055.7%<0.0018308062.5%<0.001
AnyAthletics19,9008,00055.7%<0.0011998061.3%<0.001
EducationalStatus62,2008,00057.1%<0.0016228060.0%<0.001
IPD6,7008,00055.8%<0.001678060.0%<0.001
Dance82,00010,00052.3%<0.00182010059.0%<0.001
PoliticalOrientation33,10010,00053.5%<0.00133110058.0%<0.001
UpperBody47,20010,00052.0%<0.00147210057.0%<0.001
SteamProcessorLogicalCores33,50010,00051.0%<0.00133510056.0%<0.001
HadCOVID83,00010,00054.4%<0.00183010055.0%<0.001
CaffinatedBeverages40,80010,00052.9%<0.00140810055.0%<0.001
RoomArea33,1008,00050.5%p=0.1833318056.3%p=0.157
PhysicalFitness7,80012,00054.2%<0.0017812055.0%p=0.158
SteamProcessorCPUVendor51,60010,00049.2%p=0.95351610053.0%p=0.309
SteamLighthouses5,5008,00048.6%p=0.993558052.5%p=0.369
ColorBlindness79,80010,00050.4%p=0.22779810052.0%p=382
  • 在 50 项属性中,按用户统计显著性高(p<0.01)预测成功的有 33 项,具有中等显著性(p<0.05)的是 8 项。
  • 在按序列的层面上,50 项属性中有 45 项具有高度显著性(p<0.01),1 项具有中度显著性(p<0.05)。
  • 对虚构输入进行的宏观显著性检验在按序列和按用户评估中均得到 p<0.0001,表明总体显著性很强。
  • 结果在真实世界的多样硬件/软件生态中成立,仅使用头部/手部运动数据,且采用最弱的对手模型。

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。