QUICK REVIEW

[论文解读] Description Based Text Classification with Reinforcement Learning

Duo Chai, Wei Wu|arXiv (Cornell University)|Feb 8, 2020

Topic Modeling参考文献 62被引用 31

一句话总结

本文将文本分类重新表述为一个问答风格的任务：为每个类别附上描述，并通过强化学习自动学习描述，以在单标签、多标签和多方面情感任务上提升性能。

ABSTRACT

The task of text classification is usually divided into two stages: {\it text feature extraction} and {\it classification}. In this standard formalization categories are merely represented as indexes in the label vocabulary, and the model lacks for explicit instructions on what to classify. Inspired by the current trend of formalizing NLP problems as question answering tasks, we propose a new framework for text classification, in which each category label is associated with a category description. Descriptions are generated by hand-crafted templates or using abstractive/extractive models from reinforcement learning. The concatenation of the description and the text is fed to the classifier to decide whether or not the current label should be assigned to the text. The proposed strategy forces the model to attend to the most salient texts with respect to the label, which can be regarded as a hard version of attention, leading to better performances. We observe significant performance boosts over strong baselines on a wide range of text classification tasks including single-label classification, multi-label classification and multi-aspect sentiment analysis.

研究动机与目标

解决传统仅标签的分类在缺乏对要分类内容的明确指引方面的局限。
引入一个问答风格的公式化，其中每个类别配有一个描述。
通过强化学习自动生成类别描述（抽取式或生成式）。
证明基于描述的分类能聚焦于文本中的显著部分并提升在多样任务中的性能。

提出的方法

将分类形式化为将每个类别与自然语言描述 q_y 相关联。
以 BERT 作为骨干网络，并根据设定使用 N-binary 分类器或 N-class 分类器。
提取或生成类别描述；通过强化学习端到端训练描述生成与分类。
对于抽取，使用对令牌 span 的策略来选择文本片段作为描述，并使用分类奖励通过 REINFORCE 进行优化。
对于 abstractive 生成，使用带有 REGS 的 Seq2Seq 生成器为部分生成分配奖励并稳定训练。
三种描述构建策略：template (Tem)，extractive (Ext)，和 abstractive (Abs)。

实验结果

研究问题

RQ1将描述性标签附加到类别上，是否相较于传统的标签嵌入方法能提升文本分类性能？
RQ2通过模板、抽取性文本片段或抽象生成自动生成的类别描述，对单标签、多标签和多方面情感任务的性能有何影响？
RQ3描述质量和生成策略对模型收敛性与数据效率有何影响？

主要发现

模型	AGNews	20news	DBPedia	Yahoo	YelpP	IMDB	备注
描述 (Tem.)	5.2	15.8	0.65	22.1	2.2	5.8	单标签结果，使用模板描述；优于 BERT
描述 (Ext.)	5.0	15.6	0.63	22.0	2.1	5.5	单标签结果，使用抽取式描述；较 Tem 有提升。
描述 (Abs.)	5.1	15.4	0.62	21.8	2.0	5.5	单标签结果，使用 abstractive 描述；在某些指标上表现最佳

基于描述的方法在所测试的任务上始终优于 BERT 基线。
模板描述优于 BERT，抽取式和 abstractive 策略带来进一步提升。
抽取式和 abstractive 策略在多标签和多方面任务上带来更强的提升，在某些数据集（如 BeerAdvocate 和 TripAdvisor）取得显著改善。
较长文本从描述性引导中受益更大，充当一种硬注意力机制。
基于 RL 的描述生成可以用模板初始化并仍能收敛，抽取式方法的收敛速度快于 abstractive。
使用描述相对于基线在 AGNews、20news、DBPedia、Yahoo、YelpP、IMDB、Reuters、AAPD、BeerAdvocate 和 TripAdvisor 数据集上降低了错误率。

更好的研究，从现在开始

从论文设计到论文写作，大幅缩短您的研究时间。

无需绑定信用卡

本解读由 AI 生成，并经人工编辑审核。