HMM-Based Mask Estimation for a Speech Recognition Front-End Using Computational Auditory Scene Analysis

  • PARK Ji Hun
    Department of Information and Communications, Gwangju Institute of Science and Technology (GIST)
  • YOON Jae Sam
    Department of Information and Communications, Gwangju Institute of Science and Technology (GIST)
  • KIM Hong Kook
    Department of Information and Communications, Gwangju Institute of Science and Technology (GIST)

Abstract

In this paper, we propose a new mask estimation method for computational auditory scene analysis (CASA) of speech using two microphones. The proposed method is based on a hidden Markov model (HMM) in order to incorporate the observation that mask information should be correlated across contiguous analysis frames. In other words, an HMM is used to estimate the mask information represented by the interaural time difference (ITD) and the interaural level difference (ILD) of the two-channel signals, and the estimated mask information is then employed to separate the desired speech from noisy speech. To show the effectiveness of the proposed mask estimation, we compare the performance of the proposed method with that of a Gaussian kernel-based estimation method in terms of speech recognition performance. As a result, the proposed HMM-based mask estimation method provides an average word error rate reduction of 61.4% compared with the Gaussian kernel-based mask estimation method.
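As a rough illustration of the idea summarized above, the sketch below decodes a binary mask for one frequency channel with a two-state HMM (state 1 = target-dominant, state 0 = noise-dominant) whose observations are frame-level ITD/ILD features. This is a minimal sketch, not the authors' implementation: the Gaussian emission model, the transition matrix, and all parameter values are hypothetical stand-ins chosen only to make the example runnable.

# Minimal sketch of HMM-based binary mask estimation from ITD/ILD features.
# Not the authors' system: emission model, transitions, and parameters are
# hypothetical placeholders.
import numpy as np

def viterbi_binary_mask(itd, ild, means, covs, priors, trans):
    """Decode a 0/1 mask for one frequency channel over T frames.

    itd, ild : (T,) per-frame ITD/ILD estimates.
    means    : (2, 2) state-conditional means of the [ITD, ILD] feature.
    covs     : (2, 2, 2) state-conditional covariance matrices.
    priors   : (2,) initial state probabilities.
    trans    : (2, 2) transition probabilities (rows sum to 1).
    """
    X = np.stack([itd, ild], axis=1)          # (T, 2) observations
    T = X.shape[0]
    log_b = np.zeros((T, 2))                  # log Gaussian emission likelihoods
    for s in range(2):
        diff = X - means[s]
        inv = np.linalg.inv(covs[s])
        _, logdet = np.linalg.slogdet(covs[s])
        log_b[:, s] = -0.5 * (np.einsum('ti,ij,tj->t', diff, inv, diff)
                              + logdet + 2 * np.log(2 * np.pi))

    log_trans = np.log(trans)
    delta = np.log(priors) + log_b[0]         # Viterbi scores at t = 0
    psi = np.zeros((T, 2), dtype=int)         # back-pointers
    for t in range(1, T):
        scores = delta[:, None] + log_trans   # scores[prev, curr]
        psi[t] = np.argmax(scores, axis=0)
        delta = scores[psi[t], np.arange(2)] + log_b[t]

    mask = np.zeros(T, dtype=int)             # back-track the best state path
    mask[-1] = int(np.argmax(delta))
    for t in range(T - 2, -1, -1):
        mask[t] = psi[t + 1, mask[t + 1]]
    return mask

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    T = 50
    # Hypothetical parameters: state 1 (target) centered near zero ITD/ILD,
    # state 0 (noise) offset; sticky transitions favor contiguous frames.
    means = np.array([[0.5, 6.0], [0.0, 0.0]])
    covs = np.array([np.eye(2) * 0.2, np.eye(2) * 0.1])
    priors = np.array([0.5, 0.5])
    trans = np.array([[0.9, 0.1], [0.1, 0.9]])
    itd = rng.normal(0.0, 0.3, T)
    ild = rng.normal(0.0, 0.3, T)
    print(viterbi_binary_mask(itd, ild, means, covs, priors, trans))

The sticky transition matrix is what encodes the paper's central point that mask decisions are correlated across contiguous frames; with independent per-frame classification (as in a kernel-based estimator) that temporal smoothing would be absent.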
