HMMに基づく音声合成のための話者補間 Speaker interpolation for HMM-based speech synthesis system

抄録

This paper describes an approach to voice characteristics conversion for an HMM-based text-to-speech synthesis system using speaker interpolation. Although most text-to-speech synthesis systems which synthesize speech by concatenating speech units can synthesize speech with acceptable quality, they still cannot synthesize speech with various voice quality such as speaker individualities and emotions ; In order to control speaker individualities and emotions, therefore, they need a large database, which records speech units with various voice characteristics in synthesis phase. On the other hand, our system synthesize speech with untrained speaker's voice quality by interpolating HMM parameters among some representative speakers' HMM sets. Accordingly, our system can synthesize speech with various voice quality without large database in synthesis phase. An HMM interpolation technique is derived from a probabilistic similarity measure for HMMs, and used to synthesize speech with untrained speaker's voice quality by interpolating HMM parameters among some representative speakers' HMM sets. The results of subjective experiments show that we can gradually change the voice quality of synthesized speech from one's to the other's by changing the interpolation ratio.

収録刊行物

Journal of the Acoustical Society of Japan (E)   [巻号一覧]

Journal of the Acoustical Society of Japan (E) 21(4), 199-206, 2000-07  [この号の目次]

社団法人日本音響学会

参考文献:  18件

参考文献を見るにはログインが必要です。ユーザIDをお持ちでない方は新規登録してください。

被引用文献:  14件

被引用文献を見るにはログインが必要です。ユーザIDをお持ちでない方は新規登録してください。

プレビュー

プレビュー

各種コード

  • NII論文ID(NAID) :
    110003106260
  • NII書誌ID(NCID) :
    AA00256597
  • 本文言語コード :
    ENG
  • 資料種別 :
    ART
  • ISSN :
    03882861
  • NDL 記事登録ID :
    5446106
  • NDL 雑誌分類 :
    ZM35(科学技術--物理学)
  • NDL 請求記号 :
    Z53-X48
  • 収録DB :
    CJP書誌  CJP引用  NDL  NII-ELS