Machine learning for audio, image and video analysis : theory and applications
Author(s)
Bibliographic Information
Machine learning for audio, image and video analysis : theory and applications
(Advanced information and knowledge processing)
Springer, c2015
2nd ed
- : [hardcover]
Available at 1 libraries
  Aomori
  Iwate
  Miyagi
  Akita
  Yamagata
  Fukushima
  Ibaraki
  Tochigi
  Gunma
  Saitama
  Chiba
  Tokyo
  Kanagawa
  Niigata
  Toyama
  Ishikawa
  Fukui
  Yamanashi
  Nagano
  Gifu
  Shizuoka
  Aichi
  Mie
  Shiga
  Kyoto
  Osaka
  Hyogo
  Nara
  Wakayama
  Tottori
  Shimane
  Okayama
  Hiroshima
  Yamaguchi
  Tokushima
  Kagawa
  Ehime
  Kochi
  Fukuoka
  Saga
  Nagasaki
  Kumamoto
  Oita
  Miyazaki
  Kagoshima
  Okinawa
  Korea
  China
  Thailand
  United Kingdom
  Germany
  Switzerland
  France
  Belgium
  Netherlands
  Sweden
  Norway
  United States of America
Note
Includes bibliographical references and index
Description and Table of Contents
Description
This second edition focuses on audio, image and video data, the three main types of input that machines deal with when interacting with the real world. A set of appendices provides the reader with self-contained introductions to the mathematical background necessary to read the book.
Divided into three main parts, From Perception to Computation introduces methodologies aimed at representing the data in forms suitable for computer processing, especially when it comes to audio and images. Whilst the second part, Machine Learning includes an extensive overview of statistical techniques aimed at addressing three main problems, namely classification (automatically assigning a data sample to one of the classes belonging to a predefined set), clustering (automatically grouping data samples according to the similarity of their properties) and sequence analysis (automatically mapping a sequence of observations into a sequence of human-understandable symbols). The third part Applications shows how the abstract problems defined in the second part underlie technologies capable to perform complex tasks such as the recognition of hand gestures or the transcription of handwritten data.
Machine Learning for Audio, Image and Video Analysis is suitable for students to acquire a solid background in machine learning as well as for practitioners to deepen their knowledge of the state-of-the-art. All application chapters are based on publicly available data and free software packages, thus allowing readers to replicate the experiments.
Table of Contents
Introduction.- Part I: From Perception to Computation.- Audio Acquisition, Representation and Storage.- Image and Video Acquisition, Representation and Storage.- Part II: Machine Learning.- Machine Learning.- Bayesian Theory of Decision.- Clustering Methods.- Foundations of Statistical Learning and Model Selection.- Supervised Neural Networks and Ensemble Methods.- Kernel Methods.- Markovian Models for Sequential Data.- Feature Extraction Methods and Manifold Learning Methods.- Part III: Applications.- Speech and Handwriting Recognition.- Speech and Handwriting Recognition.- Video Segmentation and Keyframe Extraction.- Real-Time Hand Pose Recognition.- Automatic Personality Perception.- Part IV: Appendices.- Appendix A: Statistics.- Appendix B: Signal Processing.- Appendix C: Matrix Algebra.- Appendix D: Mathematical Foundations of Kernel Methods.- Index.
by "Nielsen BookData"