Exploring data in engineering, the sciences and medicine

書誌事項

Exploring data in engineering, the sciences and medicine

Ronald K. Pearson

Oxford University Press, [2011?]

  • : hardcover

大学図書館所蔵 件 / 2

この図書・雑誌をさがす

注記

Bibliography: p. 745-763

Includes index

内容説明・目次

内容説明

Two recent and ongoing developments have greatly increased both the range of opportunities for exploratory data analysis and the variety of tools to support this type of analysis. First has been the dramatic rise in the number of publicly available datasets available free from the Internet and second has been the similarly dramatic evolution of the Open Source software movement, making powerful analysis packages like R also freely available. The objective of this book is to provide a reasonably thorough introduction to a useful subset of these analysis tools, illustrating what they are, what they do, and when and how they sometimes fail or do something very different than we expect them to. Specific topics covered include descriptive characterizations like summary statistics (mean, median, standard deviation, MAD scale estimate, etc.), graphical techniques like boxplots and nonparametric density estimates, various forms of regression modeling (standard linear regression models, logistic regression, and highly robust techniques like least trimmed squares), and the recognition and treatment of important data anomalies like outliers and missing data. In addition, the book also introduces a variety of dynamic data analysis tools, including autocorrelation analysis, parametric and nonparametric spectrum estimation, and the use of nonlinear data cleaning filters to improve dynamic characterization results. The book assumes familiarity with calculus and linear algebra, but does not assume any prior exposure to probability or statistics. Both simulation-based and real data examples are included and the book is intended either as an introductory textbook for an exploratory data analysis course like ones the author taught at the ETH where some of this material was used, or for self-study. Exercises are included at the end of each chapter and both R code and datasets are available through the associated OUP website.

目次

  • 1. The Art of Analyzing Data
  • 2. Data: Types, Uncertainty and Quality
  • 3. Characterizing Categorical Variables
  • 4. Uncertainty in Real Variables
  • 5. Fitting Straight Lines
  • 6. A Brief Introduction to Estimation Theory
  • 7. Outliers: Distributional Monsters That Lurk in Data
  • 8. Characterizing a Dataset
  • 9. Confidence Intervals and Hypothesis Testing
  • 10. Associations between Variables
  • 11. Regression Models I: Real Data
  • 12. Re-expression: Data Transformations
  • 13. Regression Models II: Mixed Data Types
  • 14. Characterizing Analysis Results
  • 15. Regression Models III: Diagnostics and Refinements
  • 16. Dynamic Data Characterization
  • 17. Linear Data Filters
  • 18. Nonparametric Spectrum Estimation
  • 19. Irregularities in Dynamic Analysis
  • 20. Dealing with Missing Data

「Nielsen BookData」 より

詳細情報

ページトップへ