Speech Recognition Based on Fusion of Visual and Auditory Information Using Full-Frame Color Image

Abstract

We propose a method that fuses auditory and visual information for accurate speech recognition. For each word, the method computes two probabilities with hidden Markov models (HMMs), one from the auditory input and one from the visual input, and fuses them by linear combination. In addition, we use full-frame color images as the visual information to improve the accuracy of the proposed speech recognition system. In experiments, we compared the proposed method with methods that use only auditory information or only visual information, and confirmed the validity of the proposed method.
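The fusion step described in the abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function names `fuse_scores` and `recognize`, the single mixing parameter `weight`, and the use of log-likelihoods as the per-word HMM scores are all assumptions made for the example. The abstract states only that the two per-word probabilities are combined linearly.

```python
from typing import Dict


def fuse_scores(
    audio: Dict[str, float],
    visual: Dict[str, float],
    weight: float,
) -> Dict[str, float]:
    """Linearly combine per-word scores from two modalities.

    `audio` and `visual` map each candidate word to the score produced by
    that word's HMM (assumed here to be a log-likelihood). `weight` is a
    hypothetical mixing parameter in [0, 1]: 1.0 uses audio only,
    0.0 uses visual only.
    """
    if not 0.0 <= weight <= 1.0:
        raise ValueError("weight must lie in [0, 1]")
    if audio.keys() != visual.keys():
        raise ValueError("both modalities must score the same vocabulary")
    return {
        word: weight * audio[word] + (1.0 - weight) * visual[word]
        for word in audio
    }


def recognize(
    audio: Dict[str, float],
    visual: Dict[str, float],
    weight: float = 0.5,
) -> str:
    """Return the word with the highest fused score."""
    fused = fuse_scores(audio, visual, weight)
    return max(fused, key=fused.get)
```

With made-up scores such as `audio = {"hello": -120.0, "yellow": -118.0}` and `visual = {"hello": -80.0, "yellow": -95.0}`, audio alone slightly prefers "yellow", while an equal weighting lets the visual evidence tip the decision to "hello". In practice the weight would have to be tuned on held-out data; the abstract does not say how it was chosen.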

Details

  • CRID
    1570009752557976192
  • NII Article ID
    110003216080
  • NII Bibliographic ID
    AA10826239
  • ISSN
    09168508
  • Text language code
    en
  • Data source type
    • CiNii Articles
