QUICK REVIEW

[論文レビュー] Unsupervised Text Style Transfer using Language Models as Discriminators

Zichao Yang, Zhiting Hu|arXiv (Cornell University)|May 30, 2018

Topic Modeling参考文献 44被引用数 47

ひとこと要約

本論文は二値識別器をターゲットドメイン言語モデルに置換し、教師なしのテキストスタイル転送を導く。連続近似を用いたエンドツーエンドの訓練を可能にし、 decipherment、 sentiment modification、および関連翻訳の精度を改善する。

ABSTRACT

Binary classifiers are often employed as discriminators in GAN-based unsupervised style transfer systems to ensure that transferred sentences are similar to sentences in the target domain. One difficulty with this approach is that the error signal provided by the discriminator can be unstable and is sometimes insufficient to train the generator to produce fluent language. In this paper, we propose a new technique that uses a target domain language model as the discriminator, providing richer and more stable token-level feedback during the learning process. We train the generator to minimize the negative log likelihood (NLL) of generated sentences, evaluated by the language model. By using a continuous approximation of discrete sampling under the generator, our model can be trained using back-propagation in an end- to-end fashion. Moreover, our empirical results show that when using a language model as a structured discriminator, it is possible to forgo adversarial steps during training, making the process more stable. We compare our model with previous work using convolutional neural networks (CNNs) as discriminators and show that our approach leads to improved performance on three tasks: word substitution decipherment, sentiment modification, and related language translation.

研究の動機と目的

Motivate unsupervised text style transfer without parallel data.
Disentangle content and style to enable style transfer while preserving content.
Introduce a language-model-based discriminator to provide stable token-level feedback.
Develop a continuous approximation for back-propagation through discrete text generation.
Evaluate on decipherment, sentiment alteration, and related language translation tasks.

提案手法

Model uses an encoder to extract content and a style-conditioned decoder to generate transferred text.
Replace traditional binary discriminators with a target-domain language model that scores sentence likelihood.
Define LM-based loss L_LM with real and transferred sentences to guide generation (Eq. 1–3).
Apply a continuous Gumbel-softmax approximation to enable back-propagation through discrete tokens (Eq. 5).
Two-step training: update language models with real and transferred data, then update the encoder/decoder with reconstruction and LM-based transfer loss.
Explore the role of negative samples and show that omitting adversarial steps improves stability.

実験結果

リサーチクエスチョン

RQ1Can a language model trained on target-domain data provide a more stable and informative signal than a binary classifier for style transfer?
RQ2Does end-to-end training with a continuous token-level feedback mechanism improve fluency and style accuracy across tasks?
RQ3How does LM-based discrimination compare to CNN-based discriminators in decipherment, sentiment modification, and related translation?
RQ4What is the impact of using negative samples in LM-based training on stability and performance?

主な発見

モデル	20%	40%	60%	80%	100%
コピー	64.3	39.1	14.4	2.5	0
Shen et al. 2017 ∗	86.6	77.1	70.1	61.2	50.8
LM	89.0	80.0	74.1	62.9	49.3
LM + adv	89.1	79.6	71.8	63.8	44.2

The LM-based discriminator achieved strong decipherment BLEU scores, e.g., 89.0% at 20% change and 49.3% at 100% change (LM row in Table 1).
LM-based models outperform CNN-discriminator baselines in sentiment modification on accuracy and BLEU with better fluency (e.g., LM: 83.3 Accu, 38.6 BLEU; LM + Classifier: 91.2 Accu, 57.8 BLEU in Table 2).
In sentiment tests, LM-based approaches yielded lower perplexities than several baselines (e.g., PPL_X 30.3 and PPL_Y 42.1 for LM; Table 2).
Related-language translation results show LM-based methods achieving BLEU scores of 81.6 (sr–bs) and 85.5 (tw–cn), outperforming Shen et al. (2017) (Table 4).
Using the LM alone (without adversarial steps) provides more stable training and competitive performance across tasks.
Combining LM with a classifier can further improve sentiment transfer by balancing style modification and fluency (LM + Classifier in Table 2).

より良い研究を、今すぐ始めましょう

論文設計から論文執筆まで、研究時間を劇的に削減しましょう。

クレジットカード登録不要

このレビューはAIが作成し、人間の編集者が確認しました。