Composite Term Extraction from Japanese Texts
- KOYAMA Teruo
- National Institute of Informatics
Bibliographic Information
- Other Title
- 日本語テキストからの複合語用語抽出
- ニホンゴ テキスト カラ ノ フクゴウゴ ヨウゴ チュウシュツ
Abstract
Terms describe important research concepts in academic documents and are essential for making use of information across research fields. In this paper, the author discusses a method for extracting terms from academic texts using natural language processing techniques. Most Japanese terms take the form of composite words, yet simple methods for extracting composite terms based on current Japanese morpheme classification cannot attain sufficient precision. By considering the internal structure of composite term candidates and their backward and forward connective relations in the texts, most composite terms can be extracted with high precision. The author also discusses the systematization of term candidates based on their nesting relations and their relationships to various research sub-domains.
Journal
- Joho Chishiki Gakkaishi
Joho Chishiki Gakkaishi 19 (4), 306-315, 2010
Japan Society of Information and Knowledge
Details
- CRID
- 1390001204423450368
- NII Article ID
- 10025992156
- NII Book ID
- AN10459774
- ISSN
- 1881-7661
- 0917-1436
- NDL BIB ID
- 10633411
- Data Source
- JaLC
- NDL
- Crossref
- CiNii Articles
- KAKEN
- Abstract License Flag
- Disallowed