Skip to main content
QUICK REVIEW

[论文解读] SoftTriple Loss: Deep Metric Learning Without Triplet Sampling

Qi Qian, Lei Shang|arXiv (Cornell University)|Sep 11, 2019
Face recognition and analysis参考文献 28被引用 93
一句话总结

SoftTriple 损失通过为每个类别使用多个中心并采用平滑的相似度来实现无需三元组采样的深度度量学习,从而在细粒度任务中提升嵌入质量。

ABSTRACT

Distance metric learning (DML) is to learn the embeddings where examples from the same class are closer than examples from different classes. It can be cast as an optimization problem with triplet constraints. Due to the vast number of triplet constraints, a sampling strategy is essential for DML. With the tremendous success of deep learning in classifications, it has been applied for DML. When learning embeddings with deep neural networks (DNNs), only a mini-batch of data is available at each iteration. The set of triplet constraints has to be sampled within the mini-batch. Since a mini-batch cannot capture the neighbors in the original set well, it makes the learned embeddings sub-optimal. On the contrary, optimizing SoftMax loss, which is a classification loss, with DNN shows a superior performance in certain DML tasks. It inspires us to investigate the formulation of SoftMax. Our analysis shows that SoftMax loss is equivalent to a smoothed triplet loss where each class has a single center. In real-world data, one class can contain several local clusters rather than a single one, e.g., birds of different poses. Therefore, we propose the SoftTriple loss to extend the SoftMax loss with multiple centers for each class. Compared with conventional deep metric learning algorithms, optimizing SoftTriple loss can learn the embeddings without the sampling phase by mildly increasing the size of the last fully connected layer. Experiments on the benchmark fine-grained data sets demonstrate the effectiveness of the proposed loss function. Code is available at https://github.com/idstcv/SoftTriple

研究动机与目标

  • 鉴于小批量的局限性,推动无需三元组采样的深度度量学习。
  • 将 SoftMax 损失分析为具有单一中心、平滑化的三元组损失。
  • 提出使用多个中心的 SoftTriple 损失以捕捉类内方差。
  • 发展自适应中心策略以保持紧凑的中心集。
  • 展示在细粒度数据集(CUB-2011、Cars196、SOP)上的有效性。

提出的方法

  • 将 SoftMax 损失重新表述为带有类别中心的平滑化三元组损失。
  • 扩展至每个类别的多个中心,并使用带平滑的最大化求和(max-over-centers)定义类别级别的相似性。
  • 引入对中心最大值的熵正则化平滑,以获得类似 SoftHardTriple 的软损失。
  • 给出一种使用 L2,1 正则化来合并相似中心并保持紧凑中心集的自适应方案。
  • 端到端优化 SoftTriple 目标函数,无需进行三元组采样。
  • 在跨数据集的检索和聚类指标上评估嵌入表示。

实验结果

研究问题

  • RQ1Can SoftMax loss be interpreted as a smoothed triplet loss with centers?
  • RQ2Does introducing multiple centers per class better model intra-class variance and improve retrieval/clustering without triplet sampling?
  • RQ3How should the number of centers be chosen and regularized to remain compact while preserving performance?
  • RQ4Is SoftTriple scalable and effective on large fine-grained datasets compared to traditional triplet-based methods?

主要发现

MethodR@1R@2R@4R@8NMI
SemiHard42.655.066.477.255.4
LiftedStruct43.656.668.679.656.5
Clustering48.261.471.881.959.2
Npairs ∗51.063.374.383.260.4
ProxyNCA49.261.967.972.459.5
SoftMax norm57.870.080.187.965.3
SoftTriple60.171.981.288.566.2
  • SoftMax norm already outperforms many metric learning methods, indicating its smoothed triplet interpretation.
  • SoftTriple consistently improves over SoftMax norm by leveraging multiple centers.
  • SoftTriple achieves state-of-the-art or competitive results on CUB-2011, Cars196, and SOP, often surpassing SoftMax norm by several points in R@1.
  • Using a large initial number of centers with L2,1 regularization yields a compact center set and stable performance.
  • SoftTriple with large embeddings generally outperforms comparable methods, e.g., on CUB-2011 and Cars196 tables.
  • The approach eliminates the sampling phase, while maintaining or improving retrieval/clustering performance.

更好的研究,从现在开始

从论文设计到论文写作,大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成,并经人工编辑审核。