Authorship attribution by data compression program (圧縮プログラムを応用した著者推定)

Bibliographic information

Alternative titles
  • アッシュク プログラム オ オウヨウシタ チョシャ スイテイ
  • asshuku puroguramu o oyoshita chosha suitei
  • Authorship attribution by data compression program

Abstract

Benedetto et al. recently confirmed the validity of a method for measuring similarity using data compression software. Despite its potential, this method has not yet been applied to the field of information science. The present study proposes the use of CIR, a modified method that uses an improved ratio of compression, and describes two experiments on authorship attribution using data from modern Japanese literature. The first experiment compares the results of applying CIR and Benedetto's method to test collections of modified data (fixed length) using a procedure similar to that described by Matsuura et al. The second experiment is based on original data (variable length). The first experiment showed an average precision rate of 97.7% for CIR, while Benedetto's method gave a rate of 90.5%. The CIR method proves to be an improvement on the best method described by Matsuura et al. The second experiment confirmed the e
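
The general idea behind compression-based similarity can be illustrated with a minimal sketch. It assumes a Benedetto-style measure: the number of extra compressed bytes needed when an unknown sample is appended to an author's reference text, with the sample attributed to the author whose reference gives the smallest increase. The use of `zlib` and the function names `compressed_size`, `added_cost`, and `attribute` are assumptions made for illustration. This is not the CIR formula, which the abstract does not specify, nor the exact procedure used in the experiments.

```python
import zlib
from typing import Mapping


def compressed_size(data: bytes) -> int:
    """Length in bytes of the zlib-compressed data (maximum compression level)."""
    return len(zlib.compress(data, 9))


def added_cost(reference: bytes, sample: bytes) -> int:
    """Extra compressed bytes needed to encode `sample` appended to `reference`.

    A smaller value suggests the sample shares more repeated patterns
    with the reference text.
    """
    return compressed_size(reference + sample) - compressed_size(reference)


def attribute(sample: str, corpora: Mapping[str, str]) -> str:
    """Return the (hypothetical) author label whose reference text
    yields the smallest added cost for the sample."""
    encoded = sample.encode("utf-8")
    return min(
        corpora,
        key=lambda author: added_cost(corpora[author].encode("utf-8"), encoded),
    )


if __name__ == "__main__":
    # Toy reference texts; real experiments would use far longer literary works.
    corpora = {
        "A": "the quick brown fox jumps over the lazy dog " * 20,
        "B": "lorem ipsum dolor sit amet consectetur adipiscing elit " * 20,
    }
    print(attribute("the quick brown fox jumps over the lazy dog ", corpora))
```

Note that zlib only looks back over a 32 KiB window, so with long reference texts only the tail of the reference influences the result. That limitation is one reason fixed-length and variable-length data may behave differently under this kind of measure.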

Original article