Length-constrained Neural Machine Translation using Length Prediction and Perturbation into Length-aware Positional Encoding
- Oka Yui (Nara Institute of Science and Technology; currently with NTT Communication Science Laboratories)
- Sudoh Katsuhito (Nara Institute of Science and Technology)
- Nakamura Satoshi (Nara Institute of Science and Technology)
Abstract
Neural machine translation often suffers from under-translation owing to its limited modeling of output sequence lengths. In this study, we propose a novel approach to training a Transformer model with length constraints based on length-aware positional encoding (PE). Because length constraints based on exact target sentence lengths degrade translation performance, we add a random perturbation, drawn uniformly from a certain range, to the length constraint in the PE during training. At inference time, we predict the output length from the input sequence using a length prediction model based on a large-scale pre-trained language model. Experimental results on Japanese-to-English and English-to-Japanese translation show that the proposed perturbation injection improves robustness against length prediction errors, particularly within a certain range.
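As a rough illustration of the approach the abstract describes, below is a minimal sketch (not the authors' released code) of a length-aware sinusoidal positional encoding that encodes the remaining length toward a constraint, with a uniform random perturbation added to that constraint during training. The function name, the `perturb_range` parameter, and the exact encoding formula are illustrative assumptions; the paper's actual formulation may differ.

```python
import math
import random

import torch


def length_aware_positional_encoding(target_len: int, max_len: int, d_model: int,
                                     perturb_range: int = 0) -> torch.Tensor:
    """Sinusoidal PE over the remaining length (target_len - pos) instead of
    the absolute position, so the decoder is aware of how many tokens remain
    before the length constraint is reached.

    With perturb_range > 0, a uniform random offset
    delta ~ U(-perturb_range, perturb_range) is added to the length
    constraint, mimicking the perturbation injection used during training.
    This is a sketch; the paper's exact scheme may differ.
    """
    # Hypothetical perturbation of the length constraint (training only).
    delta = random.randint(-perturb_range, perturb_range) if perturb_range > 0 else 0
    constrained_len = max(1, target_len + delta)

    pe = torch.zeros(max_len, d_model)
    for pos in range(max_len):
        remaining = constrained_len - pos  # length difference, not position
        for k in range(0, d_model, 2):
            angle = remaining / (10000 ** (k / d_model))
            pe[pos, k] = math.sin(angle)
            if k + 1 < d_model:
                pe[pos, k + 1] = math.cos(angle)
    return pe
```

In this sketch, training would call the function per target sentence with `perturb_range > 0` and the reference length; at inference, `perturb_range` would be 0 and `target_len` would be the output length produced by the length prediction model.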
Journal
- Journal of Natural Language Processing 28 (3), 778-801, 2021
- The Association for Natural Language Processing
Details
- CRID: 1390007912125151872
- NII Article ID: 130008088116
- ISSN: 2185-8314, 1340-7619
- Text Lang: en
- Data Source: JaLC, Crossref, CiNii Articles, KAKEN
- Abstract License Flag: Disallowed