CiNii Books - Exploring newspaper language : using the web to create and investigate a large corpus of modern Norwegian

This book describes new methodological and technological approaches to corpus building and presents recent research based on the Norwegian Newspaper Corpus. This is a large monitor corpus of contemporary Norwegian language, compiled through daily harvesting of web newspapers. The book gives an overview of the corpus and its system architecture, and presents tools used for tasks such as text harvesting, annotation, topic classification and extraction and frequency profiling of new words and phrases. Among the innovative technologies is Corpuscle, a corpus query engine and management system which is flexible enough to handle very large corpora in an efficient way. The individual research contributions based on the corpus explore different aspects of Norwegian, including the occurrence of anglicisms, neologisms and terminology, and the use of metonymy and metaphor in newspaper language. The book also describes an innovative method of applying correspondence analysis and implicational analysis to investigate interdependencies between morphosyntactic variants.

Table of Contents

1. Building a large corpus based on newspapers from the web (by Andersen, Gisle)
2. Part I. Exploiting the web as a corpus - Methods and tools
3. Corpuscle - a new corpus management platform for annotated corpora (by Meurer, Paul)
4. OBT+stat: A combined rule-based and statistical tagger (by Johannessen, Janne Bondi)
5. Exploring corpora through syntactic annotation (by Rosen, Victoria)
6. Collocations and statistical analysis of n-grams: Multiword expressions in newspaper text (by Lyse, Gunn Inger)
7. Automatic topic classification of a large newspaper corpus (by Hagen, Thomas M.)
8. A data-driven approach to anglicism identification in Norwegian (by Losnegaard, Gyri Smordal)
9. Part II. Corpus-based case studies
10. A corpus-based study of the adaptation of English import words in Norwegian (by Andersen, Gisle)
11. Norm clusters in written Norwegian (by Dyvik, Helge)
12. Lexical neography in modern Norwegian (by Fjeld, Ruth Vatvedt)
13. Ash compound frenzy: A case study in the Norwegian Newspaper Corpus (by De Smedt, Koenraad)
14. Financial jargon in a general newspaper corpus (by Kristiansen, Marita)
15. Metonymic extension and vagueness: Schengen and Kyoto in Norwegian newspaper language (by Halverson, Sandra L.)
16. Spatial metaphors in present-day Norwegian newspaper language (by Breivik, Leiv Egil)
17. Doing historical linguistics using contemporary data (by Andersen, Oivin)
19. Subject index

by "Nielsen BookData"

Exploring newspaper language : using the web to create and investigate a large corpus of modern Norwegian

Author(s)

Bibliographic Information

Available at 11 libraries

Search this Book/Journal

Note

Description and Table of Contents

Related Books： 1-1 of 1

Details

Export