System Paper : Improvements in the Utterance Database for Enhancing System Utterances in Chat-oriented Dialogue Systems

DOI Web Site Web Site 参考文献14件 オープンアクセス

書誌事項

タイトル別名
  • Improvements in the Utterance Database for Enhancing System Utterances in Chat-oriented Dialogue Systems

この論文をさがす

抄録

<p>In our commercial chat-oriented dialogue system, we have been using an utterance database created from a massive amount of predicate-argument structures extracted from the web for generating utterances. However, because the creation of this database involves several automated processes, the database often includes non-sentences (ungrammatical or uninterpretable sentences) and utterances with inappropriate topic information (called off-focus utterances). Additionally, utterances tend to be monotonous and uninformative because they are created from single predicate-argument structures. To resolve these problems, we propose methods for filtering non-sentences by using neural network-based methods and utterances inappropriate for their associated foci by using co-occurrence statistics. To reduce monotony, we also propose a method for concatenating automatically generated utterances so that the utterances can be longer and richer in content. Experimental results indicate that the non-sentence filter can successfully remove non-sentences with an accuracy of 95% and that our focus filter can filter utterances inappropriate for their foci with high recall. We also examine the effectiveness of our filtering methods and concatenation method through an experiment involving human participants. The experimental results indicate that our methods significantly outperform a baseline in terms of understandability and that the concatenation of two utterances leads to higher familiarity and content richness while retaining understandability. </p>

収録刊行物

  • 自然言語処理

    自然言語処理 27 (1), 65-88, 2020-03-15

    一般社団法人 言語処理学会

参考文献 (14)*注記

もっと見る

関連プロジェクト

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ