QUICK REVIEW

[Paper Review] A Survey of Domain Adaptation for Neural Machine Translation

Chenhui Chu, Rui Wang|arXiv (Cornell University)|Jun 1, 2018

Natural Language Processing Techniques80 references130 citations

TL;DR

A comprehensive survey of domain adaptation techniques for neural machine translation (NMT), organized into data-centric and model-centric approaches, with discussions on real-world applicability and future directions.

ABSTRACT

Neural machine translation (NMT) is a deep learning based approach for machine translation, which yields the state-of-the-art translation performance in scenarios where large-scale parallel corpora are available. Although the high-quality and domain-specific translation is crucial in the real world, domain-specific corpora are usually scarce or nonexistent, and thus vanilla NMT performs poorly in such scenarios. Domain adaptation that leverages both out-of-domain parallel corpora as well as monolingual corpora for in-domain translation, is very important for domain-specific translation. In this paper, we give a comprehensive survey of the state-of-the-art domain adaptation techniques for NMT.

Motivation & Objective

Motivate the need for domain-specific translation in scenarios where domain-specific corpora are scarce.
Survey and categorize domain adaptation techniques for NMT, distinguishing data-centric and model-centric approaches.
Summarize how in-domain monolingual data, synthetic data, and out-of-domain parallel data can be leveraged for NMT domain adaptation.
Discuss practical considerations, real-world deployment challenges, and directions for future research in domain-adaptive NMT.

Proposed method

Categorizes domain adaptation methods for NMT into data-centric and model-centric paradigms.
Within data-centric: reviews use of monolingual data, synthetic parallel data generation, and out-of-domain parallel data with domain tags and oversampling.
Within model-centric: covers training objective adjustments, architecture adaptations (deep fusion, domain discriminators, domain control), and decoding-centric strategies (shallow fusion, ensembling).
Discusses multi-domain training, data selection, and mixed fine-tuning as practical strategies for combining domain data.
Provides discussion of real-world scenarios and guidance on selecting methods based on data availability.

Experimental results

Research questions

RQ1What domain adaptation techniques are effective for neural machine translation in the presence of limited in-domain parallel data?
RQ2How can monolingual in-domain data and synthetic data be leveraged to improve in-domain translation quality in NMT?
RQ3What are the trade-offs between data-centric and model-centric domain adaptation strategies for NMT, and how do they transfer to real-world deployment?
RQ4How should domain information be incorporated (e.g., domain tags, discriminators) to control or improve domain-specific translation in NMT?
RQ5What are promising future directions for applying domain adaptation to state-of-the-art NMT architectures and multilingual settings?

Key findings

The survey identifies two main categories for NMT domain adaptation: data-centric and model-centric approaches.
Data-centric methods include leveraging in-domain monolingual data, generating synthetic parallel data via back-translation, and using out-of-domain data with domain tagging or data selection techniques.
Model-centric methods include training objective adjustments, architecture changes (such as deep fusion and domain discriminators), and decoding strategies (like shallow fusion and lattice-based decoding).
Multi-domain and data selection strategies are discussed, with mixed fine-tuning often providing practical benefits by preserving out-of-domain performance while improving in-domain translation.
The paper highlights the need to adapt domain techniques to state-of-the-art NMT architectures (RNNs, CNNs, Transformer) and discusses future directions like adversarial domain adaptation, domain generation, and multilingual/multi-domain setups.

Better researchstarts right now

From paper design to paper writing, dramatically reduce your research time.

No credit card · Free plan available

This review was created by AI and reviewed by human editors.