Cross-language Voice Conversion Evaluation Using Bilingual Databases
Bibliographic Information
- Other Title
-
- 音声合成・変換とその応用
Search this article
Abstract
This paper describes experiments that test an extension of techniques for converting the voice of one speaker to sound like that of another speaker to include cross-language utterances such as would be required for spoken language translation or language training applications. In particular it addresses the issue of evaluation of system performance and compares objective tests using a perceptually-motivated acoustic measure with perceptual tests of voice quality and speaker resemblance. The proposed method uses Japanese and English speech databases from 2 female and 2 male bilingual speakers for training in a system based on a Gaussian mixture model (GMM) and a high quality vocoder. Results indicate that training with cross-language models also produces close acoustic matches between source and target speakers' voices. Perceptual tests revealed little significant difference in the performance of mapping functions trained on single-language and cross-language data pairs.
This paper describes experiments that test an extension of techniques for converting the voice of one speaker to sound like that of another speaker, to include cross-language utterances, such as would be required for spoken language translation or language training applications. In particular, it addresses the issue of evaluation of system performance, and compares objective tests using a perceptually-motivated acoustic measure, with perceptual tests of voice quality and speaker resemblance. The proposed method uses Japanese and English speech databases from 2 female and 2 male bilingual speakers for training in a system based on a Gaussian mixture model (GMM) and a high quality vocoder. Results indicate that training with cross-language models also produces close acoustic matches between source and target speakers' voices. Perceptual tests revealed little significant difference in the performance of mapping functions trained on single-language and cross-language data pairs.
Journal
-
- 情報処理学会論文誌
-
情報処理学会論文誌 43 (7), 2177-2185, 2002-07-15
情報処理学会
- Tweet
Keywords
Details 詳細情報について
-
- CRID
- 1050001337883962624
-
- NII Article ID
- 110002771199
-
- NII Book ID
- AN00116647
-
- ISSN
- 18827764
- 18827837
- 03875806
-
- HANDLE
- 10061/7777
-
- NDL BIB ID
- 6220705
-
- Text Lang
- en
-
- Article Type
- journal article
-
- Data Source
-
- IRDB
- NDL
- CiNii Articles