-
- ZOTKIN Dmitry
- University of Maryland ATR
-
- TAKAHASHI Kazuhiko
- Yamaguchi University ATR
-
- YOTSUKURA Tatsuo
- Seikei University ATR
-
- MORISHIMA Shigeo
- Seikei University ATR
-
- TETSUTANI Nobuji
- ATR
この論文をさがす
抄録
In this paper, a front end system which uses audio and video information to track the people or other sound sources in the ordinary room has developed. The microphone array is used for determining the spatial location of the sound; the active video camera acquires the image of the area where the sound is detected, detects the people in the image by using skin color and can zoom and track a speaker. Several add-ons to the system include various visualization tools such as on-screen displays of waveforms, correlation plots, spectrum plots, spatial acoustic energy distribution, running time-frequency acoustic energy plots, and the possibility of real-time beamforming with real-time output to the headphones. The system can be used as a front-end for the non-encumbering human-computer interaction by video and audio means.
収録刊行物
-
- 画像電子学会誌
-
画像電子学会誌 30 (4), 452-463, 2001
一般社団法人 画像電子学会
- Tweet
詳細情報 詳細情報について
-
- CRID
- 1390282679587557504
-
- NII論文ID
- 10010070462
-
- NII書誌ID
- AN00041650
-
- ISSN
- 13480316
- 02859831
-
- NDL書誌ID
- 5877832
-
- 本文言語コード
- en
-
- データソース種別
-
- JaLC
- NDL
- CiNii Articles
-
- 抄録ライセンスフラグ
- 使用不可