Improvement of the performance of attribute-value extraction from description of exhibits in net auction system (Information Extraction (Theme Session 2))  [in Japanese]

Abstract

To support attribute-based (faceted) search in online auction systems, several methods have been proposed for automatically extracting attribute-value pairs for exhibits from their description documents. In this paper, we study a method for improving the accuracy of that extraction. As a preprocessing step, each sentence in a description is classified, as a binary decision, according to whether or not it describes the exhibit itself. Sentences not directly related to the exhibit, such as shipping charges or introductions to other items, are removed, and attribute-value extraction is performed on the cleaned text. Evaluation experiments confirm that applying this method raises both recall and precision in attribute extraction by several points.
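
The two-stage idea in the abstract (filter sentences first, then extract) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the abstract does not say which classifier, features, or extraction method the paper uses, so a hypothetical keyword rule (`IRRELEVANT_CUES`, `describes_item`) stands in for the binary classifier and a toy `attribute: value` pattern (`extract_pairs`) stands in for the extractor.

```python
import re

# Hypothetical cue phrases for sentences unrelated to the item itself.
# The paper's real classifier is not described in the abstract; this
# keyword rule is only a stand-in for a binary sentence classifier.
IRRELEVANT_CUES = ("shipping", "postage", "payment", "other auctions", "other items")

# Toy extractor pattern: "Attribute: value" with an optional trailing period.
PAIR_PATTERN = re.compile(r"\s*([^:]+?)\s*:\s*(.+?)\.?$")


def split_sentences(text):
    """Split a description into sentences at ., ! or ? followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def describes_item(sentence):
    """Binary decision: True if the sentence is judged to describe the item."""
    lowered = sentence.lower()
    return not any(cue in lowered for cue in IRRELEVANT_CUES)


def extract_pairs(sentences):
    """Extract (attribute, value) pairs from sentences matching the toy pattern."""
    pairs = []
    for sentence in sentences:
        match = PAIR_PATTERN.match(sentence)
        if match:
            pairs.append((match.group(1), match.group(2)))
    return pairs


def extract_with_filter(text):
    """Preprocess by dropping irrelevant sentences, then run the extractor."""
    kept = [s for s in split_sentences(text) if describes_item(s)]
    return extract_pairs(kept)


description = (
    "Brand: Canon. Color: black. "
    "Shipping: 500 yen by post. See my other auctions for lenses."
)
unfiltered = extract_pairs(split_sentences(description))
filtered = extract_with_filter(description)
```

On this made-up description, the unfiltered extractor also picks up the shipping line as if it were an attribute of the item, while the filtered run keeps only the brand and color pairs. That is the kind of spurious pair the preprocessing step is meant to remove.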

Journal

IEICE technical report. Natural language understanding and models of communication

IEICE technical report. Natural language understanding and models of communication 108(141), 67-72, 2008-07-10

The Institute of Electronics, Information and Communication Engineers

References:  6

Codes

  • NII Article ID (NAID) :
    110006967633
  • NII NACSIS-CAT ID (NCID) :
    AN10091225
  • Text Lang :
    JPN
  • Article Type :
    ART
  • ISSN :
    09135685
  • NDL Article ID :
    9603983
  • NDL Source Classification :
    ZN33 (Science and technology -- Electrical engineering and electrical machinery industry -- Electronics and telecommunications)
  • NDL Call No. :
    Z16-940
  • Databases :
    CJP  NDL  NII-ELS 
