-
- Sornlertlamvanich Virach
- Tokyo Institute of Technology, Department of Computer Science
-
- Inui Kentaro
- Tokyo Institute of Technology, Department of Computer Science
-
- Tanaka Hozumi
- Tokyo Institute of Technology, Department of Computer Science
-
- Tokunaga Takenobu
- Tokyo Institute of Technology, Department of Computer Science
-
- Takezawa Toshiyuki
- ATR Interpreting Telecommunications Research Laboratories
この論文をさがす
抄録
This paper shows the empirical results of our probabilistic GLR parser based on a new probabilistic GLR language model (PGLR) against existing models based on the same GLR parsing framework, namely the model proposed by Briscoe and Carroll (B & C), and two-level PCFG or pseudo context-sensitive grammar (PCSG) which is claimed to be a context-sensitive version of PCFG. We evaluate each model in character-based parsing (morphological and syntactic analysis) tasks, in which we have to consider the word segmentation and multiple part-of-speech problems. Parsing a sentence from the morphological level makes the task much more complex because of the increase of parse ambiguity stemming from word segmentation ambiguities and multiple corresponding sequences of parts-of-speech. As a result of the well-founded probabilistic nature of PGLR, the model accurately incorporates probabilities for word prediction, by way of encoding pre-terminal n-gram constraints into LR parsing tables. The PGLR model empirically outperforms the other two models in all measures, on experimentation with the ATR Japanese corpus. To examine the appropriateness of PGLR using an LALR table, we test the PGLR model using both an LALR and CLR table. The results show that parsing with the PGLR model using LALR table returns the best performance in parse accuracy, parsing time and memory space consumption.
収録刊行物
-
- 自然言語処理
-
自然言語処理 6 (3), 3-22, 1999
一般社団法人 言語処理学会
- Tweet
詳細情報
-
- CRID
- 1390282679452256128
-
- NII論文ID
- 130004292078
- 10008828891
-
- NII書誌ID
- AN10472659
-
- ISSN
- 21858314
- 13407619
-
- NDL書誌ID
- 4794686
-
- 本文言語コード
- en
-
- データソース種別
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- 抄録ライセンスフラグ
- 使用不可