Analysis of the Characteristics and the Efficiency of Information Retrieval Systems by Statistical Method [in Japanese]

Abstract

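Based on the formal-run results of the information retrieval (IR) task at the IREX workshop and on a questionnaire covering all participating IR systems, this paper investigates how average precision and the slope and intercept of a straight line fitted by regression to each recall-precision curve correlate with the techniques used in the systems, and shows how strongly each technique affects system performance. The results show that, for many techniques, there is a trade-off between the precision at recall 0.0 and the amount by which precision decreases, which reveals how difficult it is to choose techniques for a retrieval system. The same correlation analysis was carried out according to whether the NARRATIVE tag was used, showing the effectiveness of the NARRATIVE tag and the size of its effect on system performance. The results show that, when the NARRATIVE tag is used, it is important to select effective techniques that are suited to it.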

In evaluating the effectiveness of information retrieval (IR) and extraction systems, the most common method is to compare two retrieval methods and decide whether one system achieves measurably better results than the other. However, it is difficult for researchers to compare more than two retrieval methods, because there are many participants in the IR task of the IREX workshop. In this paper, we evaluate the characteristics and the effectiveness of the IR systems using a statistical method based on the results of the IR formal run and on questionnaires about the systems. The comparisons address the effects on performance of factors such as indexing, querying, and the retrieval model. The results confirm the effectiveness of this evaluation method, because phrases relate to performance better than words do. In many systems there is a trade-off between the precision value at recall 0.0 and the rate at which precision decreases, and this result indicates the difficulty of choosing techniques for a system. We also evaluate correlations between the efficiency and the characteristics of the systems with both short and long versions of the topics. This evaluation shows that it is important to select effective methods for the long version of the topics.
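The analysis summarized in the abstract, fitting a straight line to each system's recall-precision curve and then correlating the fitted intercept and slope with whether a system uses a given technique, can be sketched roughly as follows. This is a minimal illustration and not the authors' procedure: the system names, the `uses_phrases` indicator, and all precision values are hypothetical, and the choice of the Pearson coefficient is an assumption, since the abstract does not say which correlation measure was used.

```python
from statistics import mean

# The 11 standard recall levels: 0.0, 0.1, ..., 1.0
RECALL = [i / 10 for i in range(11)]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = intercept + slope * x.

    Returns (slope, intercept).
    """
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def pearson(xs, ys):
    """Pearson correlation coefficient (assumed measure, see lead-in).

    Undefined (division by zero) if either input is constant.
    """
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


# Hypothetical systems, NOT IREX data. Each curve is generated from a made-up
# precision at recall 0.0 (p0) and a made-up decrease per unit recall (drop).
SYSTEMS = [
    {"name": "sys_a", "uses_phrases": 1, "p0": 0.8, "drop": 0.6},
    {"name": "sys_b", "uses_phrases": 1, "p0": 0.7, "drop": 0.5},
    {"name": "sys_c", "uses_phrases": 0, "p0": 0.6, "drop": 0.3},
    {"name": "sys_d", "uses_phrases": 0, "p0": 0.5, "drop": 0.2},
]

intercepts, slopes, flags = [], [], []
for system in SYSTEMS:
    precision = [system["p0"] - system["drop"] * r for r in RECALL]
    slope, intercept = fit_line(RECALL, precision)
    slopes.append(slope)
    intercepts.append(intercept)
    flags.append(system["uses_phrases"])

# Correlate technique use with the fitted intercept (precision at recall 0.0)
# and with the fitted slope (rate of precision decrease).
print("technique vs intercept:", pearson(flags, intercepts))
print("technique vs slope:    ", pearson(flags, slopes))
```

In this made-up data the technique raises the intercept but also steepens the decline, which is the kind of trade-off the abstract reports. With a 0/1 indicator the Pearson coefficient is the point-biserial correlation.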

Journal

  • Journal of Natural Language Processing

    Journal of Natural Language Processing 8(1), 85-99, 2001-01-10

    The Association for Natural Language Processing

References:  12

Codes

  • NII Article ID (NAID)
    10008830246
  • NII NACSIS-CAT ID (NCID)
    AN10472659
  • Text Lang
    JPN
  • Article Type
    ART
  • ISSN
    13407619
  • NDL Article ID
    5634354
  • NDL Source Classification
    ZU8 (Bibliography, Libraries, General Yearbooks -- Libraries, Documentation, Archives)
  • NDL Call No.
    Z21-B168
  • Data Source
    CJP  NDL  J-STAGE 