Content-based audio classification and retrieval for audiovisual data parsing

Bibliographic Information

Content-based audio classification and retrieval for audiovisual data parsing

Tong Zhang and C.-C. Jay Kuo

(The Kluwer international series in engineering and computer science, SECS 606)

Kluwer Academic Publishers, c2001

  • : pbk

Available at  / 11 libraries

Search this Book/Journal

Note

Includes bibliographical references (p.[129]-133) and index

Description and Table of Contents

Description

Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing is an up-to-date overview of audio and video content analysis. Included is extensive treatment of audiovisual data segmentation, indexing and retrieval based on multimodal media content analysis, and content-based management of audio data. In addition to the commonly studied audio types such as speech and music, the authors have included hybrid types of sounds that contain more than one kind of audio component such as speech or environmental sound with music in the background. Emphasis is also placed on semantic-level identification and classification of environmental sounds. The authors introduce a new generic audio retrieval system on top of the audio archiving schemes. Both theoretical analysis and implementation issues are presented. The developing MPEG-7 standards are explored. Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing will be especially useful to researchers and graduate level students designing and developing fully functional audiovisual systems for audio/video content parsing of multimedia streams.

Table of Contents

1. Introduction.- 2. Video Content Modeling.- 3. Audio Feature Analysis.- 4. Generic Audio Data Segmentation and Indexing.- 5. Sound Effects Classification and Retrieval.- 6. Image Sequence Analysis.- 7. Experimental Results.- 8. Conclusion and Extensions.- References.

by "Nielsen BookData"

Related Books: 1-1 of 1

Details

Page Top