A Note on Document Classification with Small Training Data (学習データが少量しかない場合の文書分類に関する一考察)

Yasunari Maeda (前田 康成), Hideki Yoshida (吉田 秀樹), Masakiyo Suzuki (鈴木 正清), Toshiyasu Matsushima (松嶋 敏泰)

doi:10.1541/ieejeiss.131.1459

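Author affiliations

- 前田康成 (Yasunari Maeda): Department of Information Systems Engineering, Kitami Institute of Technology
- 吉田秀樹 (Hideki Yoshida): Department of Information Systems Engineering, Kitami Institute of Technology
- 鈴木正清 (Masakiyo Suzuki): Department of Information Systems Engineering, Kitami Institute of Technology
- 松嶋敏泰 (Toshiyasu Matsushima): Department of Applied Mathematics, Waseda University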

Bibliographic information

Alternative titles

A Note on Document Classification with Small Training Data
ガクシュウデータガショウリョウシカナイバアイノブンショブンルイニカンスルイチコウサツ

Abstract

Document classification is one of the important topics in natural language processing (NLP). Previous research proposed a document classification method that minimizes the error rate with respect to the Bayes criterion. When the training data contain only a small number of documents, however, the accuracy of that method is low. In this research we therefore use estimating data to estimate the prior distributions. When the training data are small, the accuracy obtained with estimating data is higher than that of the previous method; when the training data are large, it is lower. We therefore also propose another technique whose accuracy is higher than that of the previous method when the training data are small and almost the same as that of the previous method when the training data are large.

Published in

IEEJ Transactions on Electronics, Information and Systems (電気学会論文誌Ｃ（電子・情報・システム部門誌）) 131 (8), 1459-1466, 2011

The Institute of Electrical Engineers of Japan (一般社団法人電気学会)

References: 22

Details

CRID: 1390282679584458368
NII Article ID: 10030527175
NII Bibliographic ID: AN10065950
DOI: 10.1541/ieejeiss.131.1459
ISSN: 13488155, 03854221
NDL Bibliographic ID: 11196040

Web Site
- https://kitami-it.repo.nii.ac.jp/records/2000217
- https://ndlsearch.ndl.go.jp/books/R000000004-I11196040
- https://www.jstage.jst.go.jp/article/ieejeiss/131/8/131_8_1459/_pdf

Text language code: ja

Data source types
- JaLC
- IRDB
- NDL
- Crossref
- CiNii Articles

Abstract license flag: use not permitted
