圧縮プログラムを応用した著者推定 [in Japanese] asshuku puroguramu o oyoshita chosha suitei [in Japanese]
Access this Article
Search this Article
Author(s)
Abstract
原著論文
Benedetto et al. recently confirmed the validity of a method for measuring similarity using data compression software. Despite its potential, this method has not yet been applied to the field of information science. The present study proposes the use of CIR, a modified method that uses an improved ratio of compression, and describes two experiments on authorship attribution using data from modern Japanese literature. The first experiment compares the results of applying CIR and Benedetto's method to test collections of modified data (fixed length) using aprocedure similar to that described by Matsuura et al. The second experiment is based on original data (variable length).The first experiment showed an average precision rate of 97.7% for CIR, while Benedetto's method gave a rate of 90.5%. The CIR method proves to be an improvement on the best method described by Matsuura et al. The second experiment confirmed the e
Journal
-
- Library and information science
-
Library and information science (54), 1-18, 2005
三田図書館・情報学会