サポートベクトルマシンを用いた新聞記事からのプロフィール情報抽出

書誌事項

タイトル別名
  • Extraction of Profile Information from Newspaper Articles Using Support Vector Machines
  • サポートベクトルマシン オ モチイタ シンブン キジ カラ ノ プロフィール ジョウホウ チュウシュツ

この論文をさがす

抄録

This paper presents a method for extracting profile information in tabular formats based on existing technologies called named entity extraction and information integration. Named entity extraction enables us to provide elements of tables for profile information. Information integration allows us to unify tables for making the profile information fruitful, though it requires predetermined initial tables. In this paper, we propose a whole system of extracting profile information by bridging the gap between the two technologies. For this purpose we employ a method of grouping named entities for making initial tables. For the extraction and grouping of named entities, we utilize support vector machines. Initial tables are then integrated if these are with the same name. From the experimental results on 7085 newspaper articles, we obtained the results of 53.8% precision with 58.7% recall; Although the proposed method is insufficient as a fully automated information extraction, it provides us a good starting point for extracting profile information.

収録刊行物

被引用文献 (2)*注記

もっと見る

参考文献 (12)*注記

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ