書誌事項
- タイトル別名
-
- Estimation of User's Interests and WWW Retrieval Based on Topic Extraction from HTML Files
- HTML ファイル カラ ノ トピック チュウシュツ ニ モトヅク キョウミ スイテイ ト WWW ケンサク
この論文をさがす
抄録
In recent years, many methods which assist WWW retrieval based on user's interests have been proposed. However, it is generally difficult to estimate user's interests directly from HTML files, since they often contain multiple topics some of which may not interest a user. In this paper, we propose a method of estimating user's interests and WWW retrieval both of which are based on topics extracted from HTML files. The characteristics of the method are as follows: (1) Topics in a HTML file are extracted by identifying repetitive sequence of its HTML tags, (2) User's interests are estimated by clustering topics extracted from HTML files which contain user's interesting portions. (3) The accuracy of estimation of user's interests as well as WWW retrieval is improved by incorporating retrieved topics into the case-base as positive or negative cases which are specified by a user. Experimental results for 151 HTML files show that the method improves the precision in both of estimating user's interests and WWW retrieval, compared with a method without the extraction of topics.
収録刊行物
-
- 電気学会論文誌C(電子・情報・システム部門誌)
-
電気学会論文誌C(電子・情報・システム部門誌) 119 (11), 1316-1322, 1999
一般社団法人 電気学会
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1390001204611414912
-
- NII論文ID
- 130006845942
- 10004438518
-
- NII書誌ID
- AN10065950
-
- ISSN
- 13488155
- 03854221
-
- NDL書誌ID
- 4891557
-
- データソース種別
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- 抄録ライセンスフラグ
- 使用不可