語の連想関係に基づく文章の典型性の可視化

中林 猛, 湯本 高行, 新居 学, 高橋 豐

近年，Web で情報収集を行うことが多いが，情報の取捨選択に，文章が典型的か否かで判断することが多々ある．そこで我々は，文章の "典型的でない記述の箇所" と "その根拠または補助となる情報" を視覚的に提示する手法を提案する．提案手法は，語の連想関係を Web を用いて抽出し，抽出された連想関係に基づきグラフを構築する手法と，構築されたグラフの形状による分類を行う手法の 2 段階で文の典型性を判定する．また，構築したグラフを典型性の根拠として提示する．実験では，それぞれの手法に用いる数式のしきい値決定実験と，決定されたしきい値による実際の結果の精度算出をそれぞれの手法で独立に行った．Web を用いた語の連想関係抽出実験では，妥当なしきい値は 0.20 と決定し，その時 0.23 の F 値を得た．構築されたグラフの連結成分の大小による分類実験では，妥当なしきい値は 0.40 と決定し，その時偽陽性率が 0.00，偽陰性率が 0.20 の分類精度を得た．Recently, we collect information on the web. Then, we often choose text by typicality of the text. Therefore, we propose a method of visualizing typicality of text. This method finds atypical sentences and the reasons. We extract associative relation between words using web. Then, we make graphs based on the associative relation between words. Finally, we classify the graphs by that their shape, and we judge typicality of sentences. We evaluated our algorithm of extracting associative relation, and the average F-measure was 0.23. We also evaluated our algorithm of classifying graphs of associative relation, and the false positive rate was 0.00 and the false negative rate was 0.20.

語の連想関係に基づく文章の典型性の可視化

書誌事項

この論文をさがす

抄録

収録刊行物

詳細情報詳細情報について

書き出し

問題の指摘

語の連想関係に基づく文章の典型性の可視化

書誌事項

この論文をさがす

抄録

収録刊行物

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について