Probabilistic Concatenation Modeling for Corpus-Based Speech Synthesis

SAKAI Shinsuke, KAWAHARA Tatsuya, KAWAI Hisashi

doi:10.1587/transinf.e94.d.2006

Abstract

The measure of the goodness, or inversely the cost, of concatenating synthesis units plays an important role in concatenative speech synthesis. In this paper, we present a probabilistic approach to concatenation modeling in which the goodness of concatenation is measured by the conditional probability of observing the spectral shape of the current candidate unit given the previous unit and the current phonetic context. This conditional probability is modeled by a conditional Gaussian density whose mean vector is a linear transform of the past spectral shape. Decision tree-based parameter tying is performed to achieve robust training that balances model complexity against the amount of training data available. The concatenation models are implemented for a corpus-based speech synthesizer, and the effectiveness of the proposed method was confirmed by an objective evaluation as well as a subjective listening test. We also demonstrate that the proposed method generalizes some popular conventional methods, in that those methods can be derived as special cases of the proposed method.
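The conditional Gaussian described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the function names `concat_cost` and `fit_linear_gaussian`, the bias term `b`, the full covariance matrix, and the least-squares fit are all assumptions made for illustration, and the decision tree-based tying across phonetic contexts is left out (a single context is assumed). The cost is taken as the negative log of a Gaussian density whose mean is `A @ prev_spec + b`.

```python
import numpy as np


def concat_cost(prev_spec, cur_spec, A, b, cov):
    """Negative log of N(cur_spec; A @ prev_spec + b, cov).

    prev_spec, cur_spec: spectral feature vectors at the join (hypothetical names).
    A, b, cov: parameters for one phonetic context (b is an assumed bias term).
    """
    d = cur_spec.shape[0]
    resid = cur_spec - (A @ prev_spec + b)
    _, logdet = np.linalg.slogdet(cov)
    maha = resid @ np.linalg.solve(cov, resid)  # squared Mahalanobis distance
    return 0.5 * (d * np.log(2.0 * np.pi) + logdet + maha)


def fit_linear_gaussian(prev, cur):
    """Maximum-likelihood (A, b, cov) from paired rows of prev and cur.

    prev, cur: arrays of shape (n, d), one row per observed join.
    """
    n = prev.shape[0]
    X = np.hstack([prev, np.ones((n, 1))])       # append 1 to absorb the bias
    W, *_ = np.linalg.lstsq(X, cur, rcond=None)  # W has shape (d + 1, d)
    A, b = W[:-1].T, W[-1]
    resid = cur - X @ W
    cov = resid.T @ resid / n
    return A, b, cov


# Usage on synthetic data (values are made up for the demonstration).
rng = np.random.default_rng(0)
A_true = np.array([[0.9, 0.1, 0.0], [0.0, 0.8, 0.1], [0.1, 0.0, 0.7]])
b_true = np.array([0.5, -0.2, 0.1])
prev = rng.normal(size=(2000, 3))
cur = prev @ A_true.T + b_true + 0.01 * rng.normal(size=(2000, 3))
A_hat, b_hat, cov_hat = fit_linear_gaussian(prev, cur)

# A join that follows the learned dynamics should cost less than one that does not.
x_prev = np.array([1.0, 2.0, 3.0])
good = A_true @ x_prev + b_true
bad = good + 1.0
cost_good = concat_cost(x_prev, good, A_hat, b_hat, cov_hat)
cost_bad = concat_cost(x_prev, bad, A_hat, b_hat, cov_hat)
```

With `A` set to the identity, `b` to zero, and `cov` to the identity, this cost reduces to half the squared Euclidean distance between the two spectral vectors plus a constant. That is a mathematical property of the formula; whether it is one of the conventional methods the paper derives as a special case is an assumption here.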

Journal

IEICE Transactions on Information and Systems

IEICE Transactions on Information and Systems E94-D (10), 2006-2014, 2011

The Institute of Electronics, Information and Communication Engineers

Details

CRID: 1390282679355544448

NII Article ID: 10030193499

NII Book ID: AA10826272

DOI: 10.1587/transinf.e94.d.2006

ISSN: 1745-1361; 0916-8532

Web Site: http://www.jstage.jst.go.jp/article/transinf/E94.D/10/E94.D_10_2006/_pdf

Text Lang: en

Data Source

JaLC
Crossref
CiNii Articles
