Large Vocabulary Continuous Speech Recognition:From Laboratory Systems towards Real-World Applications (音声言語によるコミュニケ-ションシステムの実現に向けて(音声認識,合成,対話処理,システム構築の諸問題)論文特集) Large Vocabulary Continuous Speech Recognition: From Laboratory Systems towards Real-World Applications

Search this Article

Abstract

This paper provided an overview of the state-of-the-art in laboratory speaker-independent, large vocabulary continuous speech recognition (LVCSR) systems with a view towards adapting such technology to the requirements of real-world applications. While in speech recognition the principal concern is to transcribe the speech signal as a sequence of words, the same core technology can be applied to domains other than dictation. The main topics addressed are acoustic-phonetic modeling, lexical representation, language modeling, decoding and model adaptation. After a brief summary of experimental results some directions towards usable systems are given. In moving from laboratory systems towards real-world applications, different constraints arise which influence the system design. The application imposes limitations on computational resources, constraints on signal capture, requirements for noise and channel compensation, and rejection capability. The difficulties and costs of adapting existing technology to new languages and applications need to be assessed. Near term applications for LVCSR technology are likely to grow in somewhat limited domains such as spoken language systems for information retrieval and limited domain dictation. Perspectives on some unresolved problems are given, indicating areas for future research.

Journal

  • The Transactions of the Institute of Electronics,Information and Communication Engineers.

    The Transactions of the Institute of Electronics,Information and Communication Engineers. 00079(00012), 2005-2021, 1996-12-25

    The Institute of Electronics, Information and Communication Engineers

References:  100

Cited by:  5

Codes

  • NII Article ID (NAID)
    110003227668
  • NII NACSIS-CAT ID (NCID)
    AN1007132X
  • Text Lang
    ENG
  • Article Type
    Journal Article
  • ISSN
    09151923
  • NDL Article ID
    4106167
  • NDL Source Classification
    ZN33(科学技術--電気工学・電気機械工業--電子工学・電気通信)
  • NDL Call No.
    Z16-1853
  • Data Source
    CJP  CJPref  NDL  NII-ELS 
Page Top