Spoken Sentence Recognition Based on HMM-LR with Hybrid Language Modeling
-
- KITA K.
- Faculty of Engineering, Tokushima University
-
- Morimoto Tsuyoshi
- ATR Interpreting Telecommunications Research Laboratories
-
- Ohkura Kazumi
- Information & Communication systems Research Center, SANYO Electric Co., Ltd.
-
- Sagayama Shigeki
- NTT Human Interface Laboratories
-
- Yano Yaneo
- Faculty of Engineering, Tokushima University
この論文をさがす
抄録
This paper describes Japanese spoken sentence recognition using hybrid language modeling, which combines the advantages of both syntactic and stochastic language models. As the baseline system, we adopted the HMM-LR speech recognition system, with which we have already achieved good performance for Japanese phrase recognition tasks. Several improvements have been made to this system aimed at handling continuously spoken sentences. The first improvement is HMM training with continuous utterances as well as word utterances. In previous implementations, HMMs were trained with only word utterances. Continuous utterances are included in the HMM training data because coarticulation effects are much stronger in continuous utterances. The second improvement is the development of a sentential grammar for Japanese. The sentential grammar was created by combining inter-and intra-phrase CFG grammars, which were developed separately. The third improvement is the incorporation of stochastic linguistic knowledge, which includes stochastic CFG and a bigram model of production rules. The system was evaluated using continuously spoken sentences from a conference registration task that included approximately 750 words. We attained a sentence accuracy of 83.9% in the speaker-dependent condition.
収録刊行物
-
- IEICE Trans. Inf. & Syst., D
-
IEICE Trans. Inf. & Syst., D 258-265, 1994
一般社団法人電子情報通信学会
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1570854177488677888
-
- NII論文ID
- 110003219671
-
- NII書誌ID
- AA10826272
-
- ISSN
- 09168532
-
- 本文言語コード
- en
-
- データソース種別
-
- CiNii Articles