QUICK REVIEW

[论文解读] Robust Adversarial Perturbation on Deep Proposal-based Models

Yuezun Li, Daniel Tian|arXiv (Cornell University)|Sep 16, 2018

Adversarial Robustness in Machine Learning参考文献 22被引用 65

一句话总结

本文提出鲁棒对抗扰动（R-AP），用于在深度基于 proposal 的对象检测和实例分割模型中普遍攻击区域提议网络（RPN），在黑盒设置下通过同时干扰标签预测和形状回归来降低性能。

ABSTRACT

Adversarial noises are useful tools to probe the weakness of deep learning based computer vision algorithms. In this paper, we describe a robust adversarial perturbation (R-AP) method to attack deep proposal-based object detectors and instance segmentation algorithms. Our method focuses on attacking the common component in these algorithms, namely Region Proposal Network (RPN), to universally degrade their performance in a black-box fashion. To do so, we design a loss function that combines a label loss and a novel shape loss, and optimize it with respect to image using a gradient based iterative algorithm. Evaluations are performed on the MS COCO 2014 dataset for the adversarial attacking of 6 state-of-the-art object detectors and 2 instance segmentation algorithms. Experimental results demonstrate the efficacy of the proposed method.

研究动机与目标

Motivate study of adversarial vulnerabilities in deep proposal-based models used for object detection and instance segmentation.
Propose a universal attack focusing on Region Proposal Networks (RPN) to degrade downstream predictions without full model access.
Introduce a novel loss combining label disruption and shape regression disturbance to impair RPN performance.
Demonstrate the effectiveness of R-AP against multiple detectors and segmenters on MS COCO 2014.
Highlight potential robustness implications for safety-critical CV applications.

提出的方法

Define a loss L = Llabel + Lshape to generate adversarial perturbations for an input image, while keeping PSNR above a threshold.
Llabel disturbs the probability of positive proposals by reducing their confidence (zj log(sj)).
Lshape disturbs the RPN shape regression by guiding predicted offsets toward large preset targets (τx, τy, τw, τh).
Iteratively update the image by scaled normalized gradient steps pt to minimize L, clipping to valid pixel range and enforcing PSNR ε.
Combine perturbations from multiple RPN architectures to enhance black-box robustness (P = α · sum of p_i).
Experimentally evaluate on MS COCO 2014 across six detectors and two instance segmentation methods to show degradation.

实验结果

研究问题

RQ1Can a universal perturbation targeting RPN degrade a wide range of deep proposal-based detectors and segmenters without model-specific access?
RQ2Does combining label disruption with shape regression disturbance yield stronger degradation than targeting labels alone?
RQ3How does R-AP perform across different RPN backbones and in black-box settings?

主要发现

R-AP significantly degrades several state-of-the-art detectors when perturbations are tailored to their RPN backbones (e.g., Fcns detectors show large drops in mAP at 0.5 and 0.7).
Accumulated multi-RPN perturbations P achieve notable degradation even under black-box conditions (e.g., RFCN and other detectors).
Compared to random Gaussian noise, R-AP produces substantially larger drops in performance as PSNR varies.
Attack effectiveness is demonstrated for instance segmentation with FCIS and Mask R-CNN, with meaningful mAP reductions at 0.5 and 0.7.
The study confirms RPN as a universal vulnerability point in deep proposal-based models, impacting both detection and segmentation pipelines.

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。