Skip to main content
QUICK REVIEW

[论文解读] CityGuard: Graph-Aware Private Descriptors for Bias-Resilient Identity Search Across Urban Cameras

Rong Fu, Meng, Yibo|arXiv (Cornell University)|Feb 20, 2026
Video Surveillance and Tracking Methods被引用 0
一句话总结

CityGuard 引入具拓扑感知的变换器,结合扩散自适应边距与差分隐私,用于城市规模监控中的隐私保护、偏差鲁棒的跨摄像头身份检索。

ABSTRACT

City-scale person re-identification across distributed cameras must handle severe appearance changes from viewpoint, occlusion, and domain shift while complying with data protection rules that prevent sharing raw imagery. We introduce CityGuard, a topology-aware transformer for privacy-preserving identity retrieval in decentralized surveillance. The framework integrates three components. A dispersion-adaptive metric learner adjusts instance-level margins according to feature spread, increasing intra-class compactness. Spatially conditioned attention injects coarse geometry, such as GPS or deployment floor plans, into graph-based self-attention to enable projectively consistent cross-view alignment using only coarse geometric priors without requiring survey-grade calibration. Differentially private embedding maps are coupled with compact approximate indexes to support secure and cost-efficient deployment. Together these designs produce descriptors robust to viewpoint variation, occlusion, and domain shifts, and they enable a tunable balance between privacy and utility under rigorous differential-privacy accounting. Experiments on Market-1501 and additional public benchmarks, complemented by database-scale retrieval studies, show consistent gains in retrieval precision and query throughput over strong baselines, confirming the practicality of the framework for privacy-critical urban identity matching.

研究动机与目标

  • 在数据保护约束下,驱动隐私保护的城市规模人员再识别(Re-ID)在分布式摄像头间的应用。
  • 解决城市 Re-ID 中的强烈外观变化、视角变动、遮挡和域偏移问题。
  • 将来自摄像头布局的粗略几何先验引入基于图的注意力框架中。
  • 通过嵌入私有化和紧凑索引实现私有、高效的检索,同时控制隐私—效用权衡。

提出的方法

  • 提出在从粗几何先验构建的摄像头图上运行的拓扑感知变换器。
  • 引入扩散感知自适应边距(ACT)损失,根据每个身份特征分布来收紧类内簇。
  • 实现几何条件注意力,将摄像头布局先验注入图自注意力以实现跨视角对齐。
  • 利用时序图网络(TGN)进行时序及跨摄像头运动建模。
  • 应用差分私有嵌入释放,配合标定高斯噪声,并使用紧凑近似索引实现安全检索。
  • 整合传输正则化检索,促进全局、数据库感知的匹配。
Figure 1: Overview of the CityGuard framework for bias-resilient, privacy-preserving identity search. The process begins with Topology-Aware Geometry Encoding , where camera coordinates and rotations are mapped to a spatial adjacency graph. The Geometry-Conditioned Backbone then fuses multi-scale fe
Figure 1: Overview of the CityGuard framework for bias-resilient, privacy-preserving identity search. The process begins with Topology-Aware Geometry Encoding , where camera coordinates and rotations are mapped to a spatial adjacency graph. The Geometry-Conditioned Backbone then fuses multi-scale fe

实验结果

研究问题

  • RQ1在隐私约束下,如何在异构摄像头视角下实现偏见鲁棒的 Re-ID?
  • RQ2几何派生先验能否有效并入基于图的注意力以在不精确标定的情况下提升跨视角对齐?
  • RQ3基于每身份分散度的自适应边距如何影响类内紧凑性与跨摄像头判别性?
  • RQ4差分私有嵌入释放对城市监控环境中的检索准确性和可扩展性有何影响?

主要发现

  • 在结合 ACT 损失、几何条件注意力与 OT 正则化时,CityGuard 在标准基准(如 Market-1501、MARS)上实现了最先进或接近最先进的检索性能。
  • 几何感知注意力提升跨视角对齐和对视点/遮挡的鲁棒性,相较于几何无关基线表现更优。
  • 自适应分散度边距(ACT)在提高类内紧凑性与跨摄像头判别性方面效果显著,尤其与空间先验结合时。
  • 差分私有嵌入释放实现安全、可扩展的索引构建,可控隐私预算下仍维持具有竞争力的检索性能。
  • 对抗性鲁棒性测试显示,在 FGSM 和 PGD-20 攻击下,CityGuard 的 Rank-1/mAP 显著高于若干基线。
  • 消融研究表明 CityGuard 的每个组件(ACT、几何注意力、OT)均贡献性能提升,完整框架提供最强结果。
Figure 2: Camera topology (GPS only): top-down 2D layout of camera nodes with edge thickness encoding the row-stochastic affinity $A_{ij}$ . This panel visualizes affinity derived solely from pairwise GPS distances.
Figure 2: Camera topology (GPS only): top-down 2D layout of camera nodes with edge thickness encoding the row-stochastic affinity $A_{ij}$ . This panel visualizes affinity derived solely from pairwise GPS distances.

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。