日本語地図課題対話コーパスの設計と特徴 [in Japanese] The Design and Statistical Characterization of the Japanese Map Task Dialogue Corpus [in Japanese]
Access this Article
Search this Article
The Japanese Map Task Corpus was created between 1994 and 1998 and contains a collection of 23 hour digital recordings, digitized maps and orthographic transcriptions of 128 dialogues by 64 native Japanese speakers. Map task dialogues are dialogues participated in by two speakers, the instruction giver who has a map with a route and the instruction follower who has a map without a route. The giver verbally instructs the follower to draw a route on his map. The two maps are slightly different so that there may emerge a natural interaction in spite of the fact that the flow of information internal to the task is basically one way. The principle and design of the recordings are described with special reference to the augmentations and improvements to the original HCRC Map Task corpus. Annotations to the orthographic transcriptions are viewed as "tags" that provide the start and end times of utterances, the duration of pauses, non-verbal events and synchronization of overlapping utterances, in a format which provides a view to giving a basis for further tagging in terms of linguistic and discourse phenomena in a interchangeable and sharable manner. Discourse and linguistic phenomena peculiar to spontaneous spoken dialogues, such as overlapping, are analyzed and the method of recording such phenomena in the transcription is discussed and proposed, with an implication for the requirement that one dialogue be represented in one digitized sound file for the preservation and reference of the information on timing. The tags emp1oyed in the corpus also provide an easy way of characterizing it in terms of the number and the duration of utteraI1ces and pauses. The statistical figures thus ob-tained are relatively independent of design factors like kinds of maps, but familiarity does significantly correlate with the duration and number of utterances.