Automatic learning of a concept relation dictionary for a text mining system
-
- Sakurai Shigeaki
- Toshiba Corporation
-
- Ichimura Yumi
- Toshiba Corporation
-
- Suyama Akihiro
- Toshiba Corporation
-
- Orihara Ryohei
- Toshiba Corporation
Bibliographic Information
- Other Title
-
- テキストマイニングシステム向けの構造抽出ルールの自動学習
- テキストマイニング システム ムケ ノ コウゾウ チュウシュツ ルール ノ ジドウ ガクシュウ
Search this article
Abstract
A text mining method using domain-dependent dictionaries can classify text data with various viewpoints. The method uses a key concept dictionary, which stores important words and phrases for domains. Also, the method uses a concept relation dictionary, which is a rule set consisted of their combination. In the method, the knowledge dictionaries are very important and give a strong influence to classification results. However, we have to generate the dictionaries through trial and error. It is difficult to apply the method to many tasks. In this paper, we try to learn a concept relation dictionary automatically. The method extracts key concepts using lexical analysis from text data, generates training examples from the concepts and their classes given by a human expert, and applies the examples to a fuzzy inductive learning algorithm, IDF. Also, the paper shows the method acquires an appropriate rule set by numerical experiments based on 10-fold cross validation and using more than 1, 000 daily business reports.
Journal
-
- IEEJ Transactions on Electronics, Information and Systems
-
IEEJ Transactions on Electronics, Information and Systems 122 (6), 1009-1015, 2002
The Institute of Electrical Engineers of Japan
- Tweet
Details 詳細情報について
-
- CRID
- 1390001204609909760
-
- NII Article ID
- 130006845165
- 10008509127
-
- NII Book ID
- AN10065950
-
- ISSN
- 13488155
- 03854221
-
- NDL BIB ID
- 6174952
-
- Data Source
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- Abstract License Flag
- Disallowed