Japanese Universal Dependencies Corpora
-
- Asahara Masayuki
- National Institute for Japanese Language and Linguistics, Japan
-
- Kanayama Hiroshi
- IBM Research - Tokyo, IBM Japan, Ltd.
-
- Miyao Yusuke
- Graduate School of Information Science and Technology, The University of Tokyo
-
- Tanaka Takaaki
- NTT Communication Science Laboratories, Nippon Telegraph and Telephone Corporation
-
- Omura Mai
- National Institute for Japanese Language and Linguistics, Japan
-
- Murawaki Yugo
- Graduate School of Informatics, Kyoto University
-
- Matsumoto Yuji
- Graduate School of Science and Technology, Nara Institute of Science and Technology; Center for Advanced Intelligence Project, RIKEN
Bibliographic Information
- Other Title
-
- Universal Dependencies 日本語コーパス
- Universal Dependencies ニホンゴ コーパス
Abstract
Universal Dependencies (UD) is an international project to develop multilingual dependency treebanks under a uniform annotation scheme, aiming at cross-lingual learning from multilingual corpora and quantitative comparison of languages. As of mid-2018, more than 100 corpora for about 60 languages had been released. This paper describes the definition of annotations for Japanese. We discuss localization issues concerning PoS tags, case-marking dependency labels, and the distinction between phrases and clauses in Japanese. We also present issues with coordination structures, which cannot be represented by dependency tree structures alone, and report the current status of the UD Japanese corpora we have constructed.
Journal
-
- Journal of Natural Language Processing
-
Journal of Natural Language Processing 26 (1), 3-36, 2019-03-15
The Association for Natural Language Processing
Details
-
- CRID
- 1390001288146699648
-
- NII Article ID
- 130007663692
-
- NII Book ID
- AN10472659
-
- ISSN
- 2185-8314
- 1340-7619
-
- NDL BIB ID
- 029580996
-
- Text Lang
- ja
-
- Data Source
-
- JaLC
- NDL
- Crossref
- CiNii Articles
- KAKEN
-
- Abstract License Flag
- Disallowed