Exploring textual data

Author(s)

Bibliographic Information

Exploring textual data

by Ludovic Lebart, André Salem and Lisette Berry

(Text, speech, and language technology, v. 4)

Kluwer Academic, 1998

  • : HB

Available at  / 21 libraries

Search this Book/Journal

Note

Includes bibliographical references (p. [229]-237) and indexes

Description and Table of Contents

Description

Researchers in a number of disciplines deal with large text sets requiring both text management and text analysis. Faced with a large amount of textual data collected in marketing surveys, literary investigations, historical archives and documentary data bases, these researchers require assistance with organizing, describing and comparing texts. Exploring Textual Data demonstrates how exploratory multivariate statistical methods such as correspondence analysis and cluster analysis can be used to help investigate, assimilate and evaluate textual data. The main text does not contain any strictly mathematical demonstrations, making it accessible to a large audience. This book is very user-friendly with proofs abstracted in the appendices. Full definitions of concepts, implementations of procedures and rules for reading and interpreting results are fully explored. A succession of examples is intended to allow the reader to appreciate the variety of actual and potential applications and the complementary processing methods. A glossary of terms is provided.

Table of Contents

Foreword. Introduction. 1. Textual Statistics: Scope and Applications. 2. The Units of Textual Statistics. 3. Correspondence Analysis of Lexical Tables. 4. Cluster Analysis of Words and Texts. 5. Visualization of Textual Data. 6. Characteristic Textual Units, Modal Responses and Modal Texts. 7. Longitudinal Partition, Textual Time Series. 8. Textual Discriminant Analysis. Appendix 1: Singular Value Decomposition and Correspondence Analysis. Appendix 2: Clustering Techniques. Appendix 3: More Details About the Nonparametric Estimation Model. Appendix 4: Search for Repeated Segments in a Corpus. Glossary. References. Author Index. Subject Index. Symbols.

by "Nielsen BookData"

Related Books: 1-1 of 1

Details

Page Top