A Survey of Domain Adaptation for Machine Translation

DOI Web Site 29 References Open Access
  • Chu Chenhui
    Institute for Datability Science, Osaka University
  • Wang Rui
    National Institute of Information and Communications Technology

Abstract

<p>Neural machine translation (NMT) is a deep learning based approach for machine translation, which outperforms traditional statistical machine translation (SMT) and yields the state-of-the-art translation performance in scenarios where large-scale parallel corpora are available. Although a high-quality and domain-specific translation is crucial in the real world, domain-specific corpora are usually scarce or nonexistent, and thus vanilla NMT performs poorly in such scenarios. Domain adaptation that leverages both out-of-domain parallel corpora as well as monolingual corpora for in-domain translation, is very important for domain-specific translation. In this paper, we give a comprehensive survey of the state-of-the-art domain adaptation techniques for MT. Because of the current dominance of NMT in MT research, we give a brief review of domain adaptation for SMT, but put most of our effort into the survey of domain adaptation for NMT. We hope that this paper will be both a starting point and a source of new ideas for researchers and engineers who are interested in domain adaptation for MT.</p>

Journal

References(29)*help

See more

Related Projects

See more

Details 詳細情報について

Report a problem

Back to top