Exploring newspaper language : using the web to create and investigate a large corpus of modern Norwegian
Bibliographic Information
(Studies in corpus linguistics, v. 49)
J. Benjamins, c2012
- : hb
Available at 11 libraries
Note
Includes bibliographical references and indexes
Description and Table of Contents
Description
This book describes new methodological and technological approaches to corpus building and presents recent research based on the Norwegian Newspaper Corpus, a large monitor corpus of contemporary Norwegian compiled through daily harvesting of web newspapers. The book gives an overview of the corpus and its system architecture, and presents tools used for tasks such as text harvesting, annotation, topic classification, and the extraction and frequency profiling of new words and phrases. Among the innovative technologies is Corpuscle, a corpus query engine and management system flexible enough to handle very large corpora efficiently. The individual research contributions based on the corpus explore different aspects of Norwegian, including the occurrence of anglicisms, neologisms and terminology, and the use of metonymy and metaphor in newspaper language. The book also describes an innovative method of applying correspondence analysis and implicational analysis to investigate interdependencies between morphosyntactic variants.
Table of Contents
- 1. Building a large corpus based on newspapers from the web (by Andersen, Gisle)
- 2. Part I. Exploiting the web as a corpus - Methods and tools
- 3. Corpuscle - a new corpus management platform for annotated corpora (by Meurer, Paul)
- 4. OBT+stat: A combined rule-based and statistical tagger (by Johannessen, Janne Bondi)
- 5. Exploring corpora through syntactic annotation (by Rosen, Victoria)
- 6. Collocations and statistical analysis of n-grams: Multiword expressions in newspaper text (by Lyse, Gunn Inger)
- 7. Automatic topic classification of a large newspaper corpus (by Hagen, Thomas M.)
- 8. A data-driven approach to anglicism identification in Norwegian (by Losnegaard, Gyri Smordal)
- 9. Part II. Corpus-based case studies
- 10. A corpus-based study of the adaptation of English import words in Norwegian (by Andersen, Gisle)
- 11. Norm clusters in written Norwegian (by Dyvik, Helge)
- 12. Lexical neography in modern Norwegian (by Fjeld, Ruth Vatvedt)
- 13. Ash compound frenzy: A case study in the Norwegian Newspaper Corpus (by De Smedt, Koenraad)
- 14. Financial jargon in a general newspaper corpus (by Kristiansen, Marita)
- 15. Metonymic extension and vagueness: Schengen and Kyoto in Norwegian newspaper language (by Halverson, Sandra L.)
- 16. Spatial metaphors in present-day Norwegian newspaper language (by Breivik, Leiv Egil)
- 17. Doing historical linguistics using contemporary data (by Andersen, Oivin)
- 19. Subject index
by "Nielsen BookData"