Text Classification by Combining Different Distance Functions with Weights
-
- Yamada Takahiro
- Graduate School of Engineering, Aichi Institute of Technology
-
- Ishii Naohiro
- Aichi Institute of Technology
-
- Nakashima Toyoshiro
- Sugiyama jogakuen University
Bibliographic Information
- Other Title
-
- 重みを用いた距離関数の結合によるテキスト分類
- オモミ オ モチイタ キョリ カンスウ ノ ケツゴウ ニ ヨル テキスト ブンルイ
Search this article
Abstract
The text classification is an important subject in the data mining. For the text classification, several methods have been developed up to now, as the nearest neighbor analysis, the latent semantic analysis, etc. The k-nearest neighbor (kNN) classification is a well-known simple and effective method for the classification of data in many domains. In the use of the kNN, the distance function is important to measure the distance and the similarity between data. To improve the performance of the classifier by the kNN, a new approach to combine multiple distance functions is proposed here. The weighting factors of elements in the distance function, are computed by GA for the effectiveness of the measurement. Further, an ensemble processing was developed for the improvement of the classification accuracy. Finally, it is shown by experiments that the methods, developed here, are effective in the text classification.
Journal
-
- IEEJ Transactions on Electronics, Information and Systems
-
IEEJ Transactions on Electronics, Information and Systems 127 (12), 2077-2085, 2007
The Institute of Electrical Engineers of Japan
- Tweet
Details 詳細情報について
-
- CRID
- 1390001204605184768
-
- NII Article ID
- 10019978101
-
- NII Book ID
- AN10065950
-
- ISSN
- 13488155
- 03854221
-
- NDL BIB ID
- 9287888
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- NDL
- Crossref
- CiNii Articles
- KAKEN
-
- Abstract License Flag
- Disallowed