音声対話システムにおけるラピッドプロトタイピングを指向したWFSTに基づく言語理解

福林雄一朗, 駒谷, 和範, 中野, 幹生, 船越, 孝太郎, 辻野, 広司, 尾形, 哲也, 奥乃, 博

音声対話システムの開発の初期段階において，言語理解部は，(i) 構築が容易，(ii)様々な表現に対して頑健という2条件を満たす必要がある．本論文では，大量のコーパス収集や人手での詳細な言語理解ルールの記述を行うことなしに，簡単に言語理解部を構築する（ラピッドプロトタイピング）手法について述べる．本手法では，音声認識誤りを含む入力に対して，Weighted Finite State Transducer（WFST）により言語理解結果を出力する．この際の重みは複数種類を定義したうえで，学習データに基づき最適な重みづけを選択する．この重みづけは従来のWFSTを利用した手法に比べて簡単であるため，少ない学習データで動作する．本手法を2つのドメインで評価した結果，本手法では100発話程度の学習で，ベースライン手法より高いコンセプト正解精度が得られた．開発の初期段階にある新たなドメインであっても，この程度の量の発話を集めることは容易であり，本手法は言語理解部のラピッドタイピングに適している．

Language understanding (LU) modules for spoken dialogue systems in an early phase of their development need to be (i) easy to construct, and (ii) robust against various expressions. In this paper, we describe a method for constructing LU modules easily without a large amount of corpus or complicated handcrafted rules. An LU result is selected with Weighted Finite State Transducer from an automatic speech recognition output that may contain speech recognition errors. We designed several weighting schemes. A weighting scheme is determined by using training data. Since these weighting schemes are simpler than conventional methods, our method does not need a large amount of data for determining an optimal scheme. We evaluated our method in two different domains. The results revealed that our method outperformed baseline methods with less than one hundred utterances as training data, which can be reasonably prepared for new domains. This shows that our method is appropriate for a rapid prototyping of LU modules.

音声対話システムにおけるラピッドプロトタイピングを指向したWFSTに基づく言語理解

書誌事項

この論文をさがす

抄録

収録刊行物

被引用文献 (4)*注記

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

音声対話システムにおけるラピッドプロトタイピングを指向したWFSTに基づく言語理解

書誌事項

この論文をさがす

抄録

収録刊行物

被引用文献 (4)*注記

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について