語の認知度と語間の関係の非典型度に基づくWikipediaからの意外な情報の発見

佃, 洸摂, 大島, 裕明, 山本, 光穂, 岩崎, 弘利, 田中, 克己

本稿ではユーザが与えた1語のクエリに対して，そのクエリに関する意外な情報を発見する手法の提案を行う．提案手法では，クエリに対して意外度の高い関連語を発見し，クエリと意外度の高い関連語を基に意外な情報を発見する．その際，クエリの関連語の中でもクエリとの関係が非典型的であり，かつ認知度が高い関連語ほど意外度が高いという仮説に基づいて関連語の意外度を求める．たとえば提案手法により，“落合博満”というクエリに対して“ガンダム”という関連語の意外度が高いことが分かり，これを基に“落合博満はガンダムマニアである．”という意外な情報を発見できる．提案手法では，Wikipediaから得られる情報のみを用いてクエリに対する意外な情報の発見を行う．実験では75語のクエリを用いて評価を行い，クエリと関連語間の関係の非典型度，および関連語の認知度を考慮することの有効性を示した．

In this paper, we propose methods for discovering unexpected information for a given query. Our method first finds unexpected related terms for the query and then finds unexpected information based on the query and unexpected related terms. We hypothesize that a related term that has both an atypical relationship with a query and a high popularity is unexpected to the query. For example, given the query “Hiromitsu Ochiai,” our method detects that “Gundam” is an unexpected term and retrieves unexpected information: “Hiromitsu Ochiai is a Gundam mania.” Our method finds unexpected information solely based on information in Wikipedia. Experimental results using 75 queries show that these two proposed factors are effective for discovering unexpected information.

語の認知度と語間の関係の非典型度に基づくWikipediaからの意外な情報の発見

書誌事項

この論文をさがす

抄録

収録刊行物

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

語の認知度と語間の関係の非典型度に基づくWikipediaからの意外な情報の発見

書誌事項

この論文をさがす

抄録

収録刊行物

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について