QUICK REVIEW

[论文解读] Greedy Policy Search: A Simple Baseline for Learnable Test-Time Augmentation

Dmitry Molchanov, Alexander Lyzhov|arXiv (Cornell University)|Feb 21, 2020

Domain Adaptation and Few-Shot Learning参考文献 32被引用 42

一句话总结

本文介绍了 Greedy Policy Search (GPS)，一种用于学习测试时增强策略的简单方法，可提升预测性能、不确定性估计和鲁棒性，优于传统的 TTA 基线。

ABSTRACT

Test-time data augmentation$-$averaging the predictions of a machine learning model across multiple augmented samples of data$-$is a widely used technique that improves the predictive performance. While many advanced learnable data augmentation techniques have emerged in recent years, they are focused on the training phase. Such techniques are not necessarily optimal for test-time augmentation and can be outperformed by a policy consisting of simple crops and flips. The primary goal of this paper is to demonstrate that test-time augmentation policies can be successfully learned too. We introduce greedy policy search (GPS), a simple but high-performing method for learning a policy of test-time augmentation. We demonstrate that augmentation policies learned with GPS achieve superior predictive performance on image classification problems, provide better in-domain uncertainty estimation, and improve the robustness to domain shift.

研究动机与目标

证明可以学习测试时增强策略以提升性能。
显示（表明）校准的对数似然在 GPS 中是一个关键目标。
在同域精度、不确定性估计和域迁移鲁棒性方面评估 GPS。
研究策略在不同架构和数据集上的可迁移性。

提出的方法

将测试时增强（TTA）策略定义为一组子策略，依次应用图像变换。
使用贪婪策略搜索（GPS）在验证集上迭代地添加使校准对数似然最大化的子策略。
对候选子策略池进行预测的预计算，以实现高效选择。
在温度缩放后使用校准的对数似然（cLL）进行优化，而不是简单准确性或未校准的对数似然。
在多种体系结构下在 CIFAR-10/100 和 ImageNet 上演示 GPS，并评估可迁移性和集成效应。

实验结果

研究问题

RQ1可学习的、对测试时增强进行贪婪搜索是否能带来比传统 TTA 更好的预测性能？
RQ2相较于准确性或未校准的对数似然，校准的对数似然是否是学习 TTA 策略的更优目标？
RQ3GPS 学习得到的策略是否能在不同架构和数据集之间迁移，且是否能在测试时增强集合模型？
RQ4在域移和受污染数据上，GPS 与标准 TTA 方法相比的表现如何？

主要发现

GPS 策略在持续性上超过传统 TTA、重用训练时策略，以及带网格搜索幅度的 RandAugment。
对齐校准对数似然的优化在各模型上为 GPS 带来更高的准确性和更好的校准信号。
在干净数据上训练的策略很好地迁移到不同架构和更大模型，常在域移下超越基线。
GPS 还能进一步提升集成，超越单模型 TTA 的提升。
学习得到的 TTA 策略对污染和域移具有鲁棒性，有时从更极端的增强中受益。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。