Cross-language Voice Conversion Evaluation Using Bilingual Databases

Bibliographic Information

Other Title
  • 音声合成・変換とその応用

Search this article

Abstract

This paper describes experiments that test an extension of techniques for converting the voice of one speaker to sound like that of another speaker to include cross-language utterances such as would be required for spoken language translation or language training applications. In particular it addresses the issue of evaluation of system performance and compares objective tests using a perceptually-motivated acoustic measure with perceptual tests of voice quality and speaker resemblance. The proposed method uses Japanese and English speech databases from 2 female and 2 male bilingual speakers for training in a system based on a Gaussian mixture model (GMM) and a high quality vocoder. Results indicate that training with cross-language models also produces close acoustic matches between source and target speakers' voices. Perceptual tests revealed little significant difference in the performance of mapping functions trained on single-language and cross-language data pairs.

This paper describes experiments that test an extension of techniques for converting the voice of one speaker to sound like that of another speaker, to include cross-language utterances, such as would be required for spoken language translation or language training applications. In particular, it addresses the issue of evaluation of system performance, and compares objective tests using a perceptually-motivated acoustic measure, with perceptual tests of voice quality and speaker resemblance. The proposed method uses Japanese and English speech databases from 2 female and 2 male bilingual speakers for training in a system based on a Gaussian mixture model (GMM) and a high quality vocoder. Results indicate that training with cross-language models also produces close acoustic matches between source and target speakers' voices. Perceptual tests revealed little significant difference in the performance of mapping functions trained on single-language and cross-language data pairs.

Journal

Citations (3)*help

See more

References(9)*help

See more

Details 詳細情報について

Report a problem

Back to top