QUICK REVIEW

[论文解读] Multi-Source Domain Adaptation for Text Classification via DistanceNet-Bandits

Han Guo, Ramakanth Pasunuru|arXiv (Cornell University)|Jan 13, 2020

Domain Adaptation and Few-Shot Learning被引用 28

一句话总结

本文提出 DistanceNet-Bandits，一种用于文本分类的多源域自适应框架，通过使用基于距离的度量（如 MMD、CORAL、余弦相似度）作为辅助损失函数，对齐源域与目标域的表示。该方法进一步引入多臂赌博机控制器，在训练过程中动态选择最优源域，从而在情感分析基准上实现最先进性能，有效降低域偏移误差，并提升低资源目标域的泛化能力。

ABSTRACT

Domain adaptation performance of a learning algorithm on a target domain is a function of its source domain error and a divergence measure between the data distribution of these two domains. We present a study of various distance-based measures in the context of NLP tasks, that characterize the dissimilarity between domains based on sample estimates. We first conduct analysis experiments to show which of these distance measures can best differentiate samples from same versus different domains, and are correlated with empirical results. Next, we develop a DistanceNet model which uses these distance measures, or a mixture of these distance measures, as an additional loss function to be minimized jointly with the task's loss function, so as to achieve better unsupervised domain adaptation. Finally, we extend this model to a novel DistanceNet-Bandit model, which employs a multi-armed bandit controller to dynamically switch between multiple source domains and allow the model to learn an optimal trajectory and mixture of domains for transfer to the low-resource target domain. We conduct experiments on popular sentiment analysis datasets with several diverse domains and show that our DistanceNet model, as well as its dynamic bandit variant, can outperform competitive baselines in the context of unsupervised domain adaptation.

研究动机与目标

研究并比较各种基于距离的度量在自然语言处理任务中估计域差异性的效果。
开发 DistanceNet 模型，通过联合最小化任务损失和域差异损失，以改善无监督域自适应。
将 DistanceNet 扩展至多源设置，引入多臂赌博机控制器以实现动态源域选择。
在具有多样化领域的现实世界情感分析数据集上，评估距离度量和所提框架的有效性。

提出的方法

该论文评估了五种距离度量：L2、最大均值差异（MMD）、费希尔线性判别（FDA）、余弦相似度和相关性对齐（CORAL）。
DistanceNet 在训练过程中将这些距离度量作为附加损失项，以最小化源域和目标域特征之间的分布差异。
采用距离度量的混合策略，并结合无监督准则，以降低超参数敏感性并提升鲁棒性。
在多源自适应中，多臂赌博机控制器根据性能反馈，在每个训练步骤动态选择最有效的源域。
赌博机策略与分类损失和距离损失端到端联合训练，实现自适应的域调度。
实验在包含五个领域（MR、Books、Apparel、Baby 和 Camera）的多领域情感分析数据集上进行。

实验结果

研究问题

RQ1在自然语言处理任务中，哪些域距离度量最能有效区分来自相同或不同域的样本？
RQ2不同距离度量与实际域自适应性能之间的相关性如何？
RQ3结合多种距离度量是否能超越单一度量基线，进一步提升无监督域自适应性能？
RQ4基于赌博机的动态源域选择控制器是否能优于静态或轮转调度策略，在多源域自适应中表现更优？

主要发现

基于 MMD 的 DistanceNet 模型在跨域设置下的平均性能相比基线提升了一个标准差。
DistanceNet 中距离度量的混合策略实现了最高平均性能，相比基线提升两个标准差。
DistanceNet-Bandit 模型在各个目标域上均表现更优，相比非赌博机基线平均提升 1.1 个百分点，且提升在两个标准差水平上具有显著性。
可视化结果表明，赌博机控制器学习到了上下文相关的域偏好——例如，对 'MR' 目标偏好 'Books'，对 'Baby' 目标偏好 'Apparel'。
即使复用单源实验中的超参数，赌博机策略仍持续优于非赌博机基线，证明了其鲁棒性与适应能力。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。