Neural Incremental Speech Recognition Toward Real-Time Machine Speech Translation

  • NOVITASARI Sashi
    Augmented Human Communication Lab, Nara Institute of Science and Technology
  • SAKTI Sakriani
    Augmented Human Communication Lab, Nara Institute of Science and Technology RIKEN, Center for Advanced Intelligence Project AIP Japan Advanced Institute of Science and Technology
  • NAKAMURA Satoshi
    Augmented Human Communication Lab, Nara Institute of Science and Technology RIKEN, Center for Advanced Intelligence Project AIP

Abstract

<p>Real-time machine speech translation systems mimic human interpreters and translate incoming speech from a source language to the target language in real-time. Such systems can be achieved by performing low-latency processing in ASR (automatic speech recognition) module before passing the output to MT (machine translation) and TTS (text-to-speech synthesis) modules. Although several studies recently proposed sequence mechanisms for neural incremental ASR (ISR), these frameworks have a more complicated training mechanism than the standard attention-based ASR because they have to decide the incremental step and learn the alignment between speech and text. In this paper, we propose attention-transfer ISR (AT-ISR) that learns the knowledge from attention-based non-incremental ASR for a low delay end-to-end speech recognition. ISR comes with a trade-off between delay and performance, so we investigate how to reduce AT-ISR delay without a significant performance drop. Our experiment shows that AT-ISR achieves a comparable performance to the non-incremental ASR when the incremental recognition begins after the speech utterance reaches 25% of the complete utterance length. Additional experiments to investigate the effect of ISR on translation tasks are also performed. The focus is to find the optimum granularity of the output unit. The results reveal that our end-to-end subword-level ISR resulted in the best translation quality with the lowest WER and the lowest uncovered-word rate.</p>

Journal

References(39)*help

See more

Related Projects

See more

Details 詳細情報について

Report a problem

Back to top