識別学習を用いた離散混合分布HMMによる音声認識

小坂, 哲夫, 加藤正治

これまで我々は離散混合分布HMMによる音声認識の検討を行ってきた．離散混合分布HMMでは混合分布やサブベクトル量子化，尤度補償などを行うことにより一般的な離散分布HMMと比較し高い性能を得ることが可能となっている．認識実験の結果雑音音声認識に対して有効であること，および講演音声認識において連続分布HMMを用いたシステムとほぼ同等の性能が得られることを示してきた．本研究では，さらなる性能向上を目指し，離散混合分布の識別学習の有効性を検討する．識別学習としては最大相互情報量（MMI）に基づく手法を用い，日本語話し言葉コーパス（CSJ）で評価を行った．その結果従来法である最尤推定法（ML）や最大事後確率推定法（MAP）と比較し高い性能が得られることが分かった．

Previously, we had investigated speech recognition by using discrete-mixture hidden Markov models (DMHMMs). The DMHMM yields a higher recognition performance than conventional discrete HMM because it uses mixture-density, subvector quantization, likelihood compensation, etc. From the results obtained through speech recognition experiments, the DMHMM-based system showed better performance under noise conditions. Moreover, it showed similar performance to the continuous-mixture HMM-based system in lecture speech recognition. In this paper, we investigate a discriminative learning approach for DMHMMs to further improve their recognition performance. We use the maximum mutual information (MMI) estimation criterion as discriminative learning. The proposed method was evaluated on a large-scale spontaneous speech database, “Corpus of Spontaneous Japanese”. From the results, the MMI estimation showed better performance than conventional approaches such as maximum likelihood estimation or maximum a posteriori estimation.

識別学習を用いた離散混合分布HMMによる音声認識

書誌事項

この論文をさがす

抄録

収録刊行物

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

識別学習を用いた離散混合分布HMMによる音声認識

書誌事項

この論文をさがす

抄録

収録刊行物

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について