Text Summarization Based on Information Extraction and Categorization Using 5W1H (5W1H情報抽出・分類によるテキスト要約)

Bibliographic Information

Alternative Title
  • Text Summarization based on Information Extraction and Categorization Using 5W1H

Abstract

In office work, understanding the temporal transition and overall situation of an event from diverse sources requires extracting and abstracting information from a large number of documents. This paper proposes two robust methods for generating an extract and an abstract from documents: an episodic extraction method, which generates an extract tracing the temporal transition of an event, and an overall abstraction method, which generates an abstract of a whole document collection for survey purposes. The episodic extraction method retrieves documents containing the 5W1H information (who, when, where, what, why, how, and predicates) that specifies an event and generates an extract of the event's temporal transition. The overall abstraction method abstracts documents by replacing the 5W1H elements in each document with their upper categories in a thesaurus. Applied to 10,000 news articles and 2,500 sales reports, these methods proved effective for office work.
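
The two methods described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the paper: the `Record` type, the toy `THESAURUS`, the sample data, and the functions `episodic_extract` and `overall_abstract` are hypothetical, and the 5W1H slots are assumed to have been extracted already (only a subset of slots is modeled).

```python
from collections import Counter
from dataclasses import dataclass

# Hypothetical toy thesaurus: term -> upper category (not from the paper).
THESAURUS = {
    "Tokyo": "Japan",
    "Osaka": "Japan",
    "laptop": "computer",
    "server": "computer",
}


@dataclass(frozen=True)
class Record:
    """One document reduced to already-extracted 5W1H slots (assumed given)."""
    when: str       # ISO date, so string order equals chronological order
    who: str
    where: str
    what: str
    predicate: str


def episodic_extract(records, **query):
    """Return the records matching every given slot value, ordered by date."""
    hits = [
        r for r in records
        if all(getattr(r, slot) == value for slot, value in query.items())
    ]
    return sorted(hits, key=lambda r: r.when)


def overall_abstract(records, slots=("where", "what")):
    """Replace slot values with their upper category and count the results."""
    counts = Counter()
    for r in records:
        # Terms missing from the thesaurus are kept unchanged.
        key = tuple(THESAURUS.get(getattr(r, s), getattr(r, s)) for s in slots)
        counts[key + (r.predicate,)] += 1
    return counts


# Invented sample data, for illustration only.
RECORDS = [
    Record("1998-03-02", "A Corp", "Osaka", "server", "sell"),
    Record("1998-01-15", "A Corp", "Tokyo", "laptop", "sell"),
    Record("1998-02-10", "B Corp", "Tokyo", "laptop", "announce"),
]

timeline = episodic_extract(RECORDS, who="A Corp")
summary = overall_abstract(RECORDS)
```

In this sketch, `timeline` lists the two "A Corp" records in date order, and `summary` merges the Tokyo laptop sale and the Osaka server sale into a single generalized entry ("Japan", "computer", "sell") with a count of 2. A real system would also need the extraction step and a full thesaurus, both of which are outside this illustration.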

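Published In

  • Journal of Natural Language Processing (自然言語処理)

    Journal of Natural Language Processing 6 (6), 27-44, 1999

    The Association for Natural Language Processing (一般社団法人 言語処理学会)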
