Japanese Universal Dependencies Corpora

  • Asahara Masayuki
    National Institute for Japanese Language and Linguistics, Japan
  • Kanayama Hiroshi
    IBM Research - Tokyo, IBM Japan, Ltd.
  • Miyao Yusuke
    Graduate School of Information Science and Technology, The University of Tokyo
  • Tanaka Takaaki
    NTT Communication Science Laboratories, Nippon Telegraph and Telephone Corporation
  • Omura Mai
    National Institute for Japanese Language and Linguistics, Japan
  • Murawaki Yugo
    Graduate School of Informatics, Kyoto University
  • Matsumoto Yuji
    Graduate School of Science and Technology, Nara Institute of Science and Technology; Center for Advanced Intelligence Project, RIKEN

Bibliographic Information

Other Title
  • Universal Dependencies 日本語コーパス

Abstract

Universal Dependencies (UD) is an international project to develop multilingual dependency treebanks under a uniform annotation scheme, aiming at cross-lingual learning from multilingual corpora and quantitative comparison of languages. As of mid-2018, more than 100 corpora for about 60 languages have been released. This paper describes the annotation definitions for Japanese. We discuss localization issues concerning PoS tags, case-marking dependency labels, and the distinction between phrases and clauses in Japanese. We also present issues with coordination structures, which cannot be represented solely by dependency tree structures. Finally, we report the current status of the UD Japanese corpora we have constructed.
