Reinventing Part-Of-Speech Tagging

Access this Article

Search this Article

Author(s)

Abstract

Part-of-speech tagging methodology has succeeded, but on problems that may lack real-world application. Redirection of the field is indicated, toward potentially more useful, but harder and more sophisticated tagging tasks: (1) using much more detailed tagsets (semantically <I>and</I> syntactically); (2) testing performance on treebanks reflecting the huge gamut of domains, etc., characterizing real-world applications; (3) understanding the magnitude of the unknown-word and unknown-tag problems, then overcoming them. Tagging results are presented on two versions of a new, highly variegated treebank, featuring tagsets of 2720 and 443 tags, respectively, and utilizing a dictionaryless, decision-tree tagger.

Journal

  • Journal of Natural Language Processing

    Journal of Natural Language Processing 5(1), 3-23, 1998-01-10

    The Association for Natural Language Processing

References:  20

Cited by:  1

Codes

  • NII Article ID (NAID)
    10008827415
  • NII NACSIS-CAT ID (NCID)
    AN10472659
  • Text Lang
    ENG
  • Article Type
    Journal Article
  • ISSN
    13407619
  • Data Source
    CJP  CJPref  J-STAGE 
Page Top