日本史史料読解支援のための候補文字検索

山田, 太造, 井上, 聡, 遠藤, 珠紀, 久留島, 典子

史料を読解して記述内容を活字にする翻刻は歴史学や史料学の研究を進める上で重要な作業の1 つであるが，非常に高度な知識が必要とされる．本研究では，史料読解を支援するため，入力してあるテキストに応じて，次に入力される文字の候補を提示するn-gram モデルを用いた候補文字検索手法を提案する．文字推奨機能の有効性を評価するため実験を行った．その結果，検索結果の上位5 件で0.696，上位20 件で0.822 のヒット率であった．

A decoding to reprint is one of important factors to advance studies of history and historical document, however, its work is needed very high skill and knowledge of history. In this paper, we propose a method of a candidate character search for assisting decode. The search method is based on characteristic n-gram model. Using the method, a user can obtain a set of a candidate character which appears immediately after an entered string. For evaluating the effectiveness of the method, we experimented to hit ranking. As experimental results, hit ratio was 0.696 in case of top-rank 5, and hit ratio is 0.822 in case of top-rank 20.

日本史史料読解支援のための候補文字検索

書誌事項

抄録

収録刊行物

関連プロジェクト

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

日本史史料読解支援のための候補文字検索

書誌事項

抄録

収録刊行物

関連プロジェクト

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について