文章の執筆時期の推定 —— 芥川龍之介の作品を例として —— Estimation of When the Works were Written —— With the Works of Ryunosuke Akutagawa as Examples ——
In this research, as a basis of studies regarding when certain works were written, an estimation was attempted using the works of Ryunosuke Akutagawa. In the experiment, two types of data sets were created from the text with part-of-speech tagging, and a comparative analysis was performed using three methods: Linear Regression, Support Vector Regression, and Random Forest Regression. As a result, when the works were written was estimated with rather high accuracy. The average of absolute value of estimation error and standard deviation was approximately 1.4 years. The order of high accuracy of estimation was Random Forest Regression, Support Vector Regression, and Linear Regression.
行動計量学 36(2), 89-103, 2009