時系列史料の人機分担構造化：古典籍『武鑑』を参照する江戸情報基盤の構築に向けて

北本, 朝展, 堀井, 洋, 堀井, 美里, 鈴木, 親彦, 山本, 和明

本論文は古典籍「武鑑」を対象として，大規模データを構造化するための全く新しいワークフローを提案する．まず「武鑑」を時間的に連続して変化する「時系列史料」という新しい種類の史料と捉え，そこから生み出される多数のバージョンをソフトウェア工学の観点から解釈し，これを板本書誌学の概念と対応させる．次にバージョン間の差分を検出する方法としてテキストベースと画像ベースのアプローチを比較し，「武鑑」では特に画像ベース差分検出が有効であることを示す．さらに差分検出と差分翻刻を合わせたアプローチを「差読」と呼び，そのためのワークフローを「人機分業」として構築することが「武鑑」の構造化の鍵を握ることを論じる．その最初の成果を「武鑑全集」として2017年11月に公開した．

This paper proposes a new workflow for structuring large-scale data, such as Pre-modern Japanese text “Bukan.” First, we define “Bukan” as a new type of historical sources called “time-series sources” that changes continuously over time, and interpret many versions associated with “Bukan” from the viewpoint of software engineering and make a mapping of those versions to the concepts of bibliography of Japanese old printed books. We then compare text-based and image-based approaches to the detection of difference, and propose a new concept “differential reading” that combines both the detection of difference, and differential transcription, to realize a workflow based on human-machine specialization, which is a key toward structuring “Bukan” The first preliminary result was released as “Bukan Complete Collection” on November 2017.

時系列史料の人機分担構造化：古典籍『武鑑』を参照する江戸情報基盤の構築に向けて

書誌事項

抄録

収録刊行物

キーワード

詳細情報詳細情報について

書き出し

問題の指摘

時系列史料の人機分担構造化：古典籍『武鑑』を参照する江戸情報基盤の構築に向けて

書誌事項

抄録

収録刊行物

キーワード

詳細情報 詳細情報について

書き出し

問題の指摘

参加プロジェクトリスト

詳細情報詳細情報について