Skip to main content
QUICK REVIEW

[论文解读] Category Anchor-Guided Unsupervised Domain Adaptation for Semantic Segmentation

Qiming Zhang, Jing Zhang|arXiv (Cornell University)|Oct 28, 2019
Domain Adaptation and Few-Shot Learning参考文献 46被引用 114
一句话总结

paper introduces a category anchor-guided UDA model (CAG-UDA) for semantic segmentation that uses category-wise anchors to identify active target samples, assign pseudo-labels, and align features across domains, achieving state-of-the-art results on GTA5→Cityscapes and SYNTHIA→Cityscapes.

ABSTRACT

Unsupervised domain adaptation (UDA) aims to enhance the generalization capability of a certain model from a source domain to a target domain. UDA is of particular significance since no extra effort is devoted to annotating target domain samples. However, the different data distributions in the two domains, or \\emph{domain shift/discrepancy}, inevitably compromise the UDA performance. Although there has been a progress in matching the marginal distributions between two domains, the classifier favors the source domain features and makes incorrect predictions on the target domain due to category-agnostic feature alignment. In this paper, we propose a novel category anchor-guided (CAG) UDA model for semantic segmentation, which explicitly enforces category-aware feature alignment to learn shared discriminative features and classifiers simultaneously. First, the category-wise centroids of the source domain features are used as guided anchors to identify the active features in the target domain and also assign them pseudo-labels. Then, we leverage an anchor-based pixel-level distance loss and a discriminative loss to drive the intra-category features closer and the inter-category features further apart, respectively. Finally, we devise a stagewise training mechanism to reduce the error accumulation and adapt the proposed model progressively. Experiments on both the GTA5$\ ightarrow $Cityscapes and SYNTHIA$\ ightarrow $Cityscapes scenarios demonstrate the superiority of our CAG-UDA model over the state-of-the-art methods. The code is available at \\url{https://github.com/RogerZhangzz/CAG_UDA}.

研究动机与目标

  • Address domain shift in unsupervised domain adaptation for semantic segmentation.
  • Propose category anchors to enable explicit category-wise feature alignment and reliable pseudo-labels.
  • Develop a stagewise training mechanism to reduce error accumulation from pseudo-labels.
  • Demonstrate state-of-the-art performance on GTA5→Cityscapes and SYNTHIA→Cityscapes datasets.

提出的方法

  • Compute category-wise centroids (category anchors) from source domain features.
  • Identify active target samples by measuring distance to anchors and assign pseudo-labels.
  • Use an anchor-based pixel-level distance loss to reduce intra-category variance.
  • Use a discriminative loss to increase inter-category variance and refine decision boundaries.
  • Implement a stagewise training procedure to stabilize pseudo-labels and progressive adaptation.

实验结果

研究问题

  • RQ1Can category anchors guide effective unsupervised domain adaptation for semantic segmentation?
  • RQ2Do anchor-based alignment and pseudo-labeling improve target-domain segmentation performance compared to label-free alignment methods?
  • RQ3Does stagewise training mitigate error accumulation from pseudo-labels during UDA?
  • RQ4How does CAG-UDA perform on GTA5→Cityscapes and SYNTHIA→Cityscapes relative to state-of-the-art methods?

主要发现

  • CAG-UDA achieves a mean IoU (mIoU) of 50.2 on GTA5→Cityscapes in Stage 3, surpassing prior SOTA methods.
  • On GTA5→Cityscapes testing, CAG-UDA attains 51.7 mIoU across 19 classes.
  • In SYNTHIA→Cityscapes, CAG-UDA outperforms previous methods for both 13-category (mIoU*) and 16-category (mIoU) benchmarks.
  • Stagewise training with category anchors yields substantial gains over a single-stage approach and warm-up only.
  • Category anchor-based pseudo-labels provide more reliable supervision than probability-based pseudo-labels in ablations.

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。