Audio and Visual Information Integration for Speaker's Localization in Automatic Shooting of Lecture

Bibliographic Information

Other Title
  • 講義自動撮影における話者位置推定のための視聴覚情報の統合

Abstract

Estimating the location of a speaker is useful for automatic video shooting in a lecture room; the captured videos are used for distance learning and lecture archiving systems. To estimate a speaker's location in a wide lecture room, multiple cameras and multiple microphones are used. However, it is difficult to estimate the precise location of a speaker with visual or acoustic sensors alone because of calibration problems, noise, and other interference. We therefore propose a method that integrates audio and visual information about a speaker in the lecture room. Lecturer cells and student cells are introduced as the unit for estimating the speaker's location. We defined 120 cells in a real lecture room and applied our multi-modal method to them. The estimation accuracy achieved by our integration method is sufficient for automatic video shooting of a speaker in a lecture room.
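The abstract does not give the integration rule itself, so the following is only a minimal sketch of one plausible cell-based audio-visual fusion scheme: each cell receives a non-negative score from the acoustic sensors and another from the cameras, the two score vectors are normalized, and a weighted product picks the most likely cell. All function names, the weighting scheme, and the example values are illustrative assumptions, not the paper's actual algorithm.

```python
def normalize(scores):
    """Scale non-negative per-cell scores so they sum to 1."""
    total = sum(scores)
    if total == 0:
        return [1.0 / len(scores)] * len(scores)  # no evidence: uniform
    return [s / total for s in scores]

def fuse(audio_scores, visual_scores, audio_weight=0.5):
    """Combine per-cell audio and visual scores (weighted geometric mean).

    audio_weight is a hypothetical tuning parameter in [0, 1];
    0.5 treats both modalities equally.
    """
    a = normalize(audio_scores)
    v = normalize(visual_scores)
    fused = [(ai ** audio_weight) * (vi ** (1.0 - audio_weight))
             for ai, vi in zip(a, v)]
    return normalize(fused)

def locate_speaker(audio_scores, visual_scores):
    """Return the index of the most likely cell after fusion."""
    fused = fuse(audio_scores, visual_scores)
    return max(range(len(fused)), key=fused.__getitem__)

# Toy example with 6 cells: both modalities favor cell 2,
# but with different noise patterns.
audio = [0.1, 0.2, 0.9, 0.1, 0.0, 0.1]
visual = [0.0, 0.1, 0.8, 0.3, 0.1, 0.0]
print(locate_speaker(audio, visual))  # → 2
```

A multiplicative rule like this suppresses cells that either modality rules out, which matches the motivation in the abstract that neither sensor type alone is reliable enough; an additive (weighted-sum) rule would be the natural alternative when one modality may fail completely.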

Journal

Citations (13)

References (23)

Related Projects
