HTMLファイルからのトピック抽出に基づく興味推定とWWW検索

大野 潮満, 前田 英巳子, 黄瀬 浩一, 松本 啓之亮

doi:10.1541/ieejeiss1987.119.11_1316

書誌事項

タイトル別名

Estimation of User's Interests and WWW Retrieval Based on Topic Extraction from HTML Files
HTML ファイルカラノトピックチュウシュツニモトヅクキョウミスイテイト WWW ケンサク

この論文をさがす

抄録

In recent years, many methods which assist WWW retrieval based on user's interests have been proposed. However, it is generally difficult to estimate user's interests directly from HTML files, since they often contain multiple topics some of which may not interest a user. In this paper, we propose a method of estimating user's interests and WWW retrieval both of which are based on topics extracted from HTML files. The characteristics of the method are as follows: (1) Topics in a HTML file are extracted by identifying repetitive sequence of its HTML tags, (2) User's interests are estimated by clustering topics extracted from HTML files which contain user's interesting portions. (3) The accuracy of estimation of user's interests as well as WWW retrieval is improved by incorporating retrieved topics into the case-base as positive or negative cases which are specified by a user. Experimental results for 151 HTML files show that the method improves the precision in both of estimating user's interests and WWW retrieval, compared with a method without the extraction of topics.

収録刊行物

電気学会論文誌Ｃ（電子・情報・システム部門誌）

電気学会論文誌Ｃ（電子・情報・システム部門誌） 119 (11), 1316-1322, 1999

一般社団法人電気学会

キーワード

詳細情報詳細情報について

HTMLファイルからのトピック抽出に基づく興味推定とWWW検索

書誌事項

この論文をさがす

抄録

収録刊行物

被引用文献 (1)*注記

参考文献 (13)*注記

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

HTMLファイルからのトピック抽出に基づく興味推定とWWW検索

書誌事項

この論文をさがす

抄録

収録刊行物

被引用文献 (1)*注記

参考文献 (13)*注記

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について