Content-based audio classification and retrieval for audiovisual data parsing

書誌事項

Content-based audio classification and retrieval for audiovisual data parsing

Tong Zhang and C.-C. Jay Kuo

(The Kluwer international series in engineering and computer science, SECS 606)

Kluwer Academic Publishers, c2001

  • : pbk

大学図書館所蔵 件 / 11

この図書・雑誌をさがす

注記

Includes bibliographical references (p.[129]-133) and index

内容説明・目次

内容説明

Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing is an up-to-date overview of audio and video content analysis. Included is extensive treatment of audiovisual data segmentation, indexing and retrieval based on multimodal media content analysis, and content-based management of audio data. In addition to the commonly studied audio types such as speech and music, the authors have included hybrid types of sounds that contain more than one kind of audio component such as speech or environmental sound with music in the background. Emphasis is also placed on semantic-level identification and classification of environmental sounds. The authors introduce a new generic audio retrieval system on top of the audio archiving schemes. Both theoretical analysis and implementation issues are presented. The developing MPEG-7 standards are explored. Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing will be especially useful to researchers and graduate level students designing and developing fully functional audiovisual systems for audio/video content parsing of multimedia streams.

目次

1. Introduction.- 2. Video Content Modeling.- 3. Audio Feature Analysis.- 4. Generic Audio Data Segmentation and Indexing.- 5. Sound Effects Classification and Retrieval.- 6. Image Sequence Analysis.- 7. Experimental Results.- 8. Conclusion and Extensions.- References.

「Nielsen BookData」 より

関連文献: 1件中  1-1を表示

詳細情報

ページトップへ