Spoken Term Detection Using Phoneme Transition Network from Multiple Speech Recognizers' Outputs

Natori Satoshi, Furuya Yuto, Nishizaki Hiromitsu, Sekiguchi Yoshihiro

doi:10.2197/ipsjjip.21.176

Abstract

Spoken Term Detection (STD) that addresses the out-of-vocabulary (OOV) problem has generated significant interest in the field of spoken document processing. This study describes STD with false detection control using phoneme transition networks (PTNs) derived from the outputs of multiple speech recognizers. PTNs are similar to subword-based confusion networks (CNs), which are originally derived from a single speech recognizer. Because a PTN-formed index is based on the outputs of multiple speech recognizers, it is robust to recognition errors, and it should therefore be more robust in an STD task than a CN-formed index built from a single speech recognition system. Our PTN-formed index was evaluated on a test collection. The experiment showed that the PTN-based approach effectively detected OOV terms and improved the F-measure from 0.370 to 0.639 compared with a baseline approach. Furthermore, we applied two false detection control parameters to the calculation of the detection score: one based on a majority voting scheme, and the other a measure of the ambiguity of the CN. With these parameters, STD performance improved further, to an F-measure of 0.736, compared with 0.639 without them.
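
As a reading aid for the figures quoted in the abstract, the following minimal sketch shows how an F-measure is commonly computed and how the three reported values compare. It assumes the balanced F-measure (harmonic mean of precision and recall); the abstract does not state which variant was used, and the function and variable names here are illustrative, not taken from the paper.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall.

    Assumption: the balanced (beta = 1) variant; the abstract does not
    say which F-measure variant the authors report.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def relative_gain(before: float, after: float) -> float:
    """Relative improvement of `after` over `before`."""
    return (after - before) / before


# F-measure values as reported in the abstract (keys are hypothetical labels).
reported = {
    "baseline": 0.370,
    "ptn": 0.639,
    "ptn_with_control": 0.736,
}

if __name__ == "__main__":
    gain_ptn = relative_gain(reported["baseline"], reported["ptn"])
    gain_ctrl = relative_gain(reported["ptn"], reported["ptn_with_control"])
    print(f"PTN vs. baseline: {gain_ptn:.1%} relative gain")
    print(f"Control parameters vs. plain PTN: {gain_ctrl:.1%} relative gain")
```

The precision and recall behind the reported values are not given in the abstract, so `f_measure` is shown only to make the metric concrete.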

Published in

Journal of Information Processing 21 (2), 176-185, 2013

Information Processing Society of Japan

Details

CRID: 1390001205295997824

NII Article ID: 130003369520; 110009537044

NII Bibliographic ID: AN00116647

ISSN: 18827764; 18826652

DOI: 10.2197/ipsjjip.21.176

Web Site: http://id.nii.ac.jp/1001/00090253/; http://id.nii.ac.jp/1001/00095641/; https://www.jstage.jst.go.jp/article/ipsjjip/21/2/21_176/_pdf

Text language code: en

Data source type

JaLC
IRDB
Crossref
CiNii Articles
KAKEN

Abstract license flag: Not permitted
