[Paper] DLF-based Speech Segment Detection and Its Application to Audio Noise Removal for Video Conferences

Sasaki Kazuto, Ogawa Takahiro, Takahashi Sho, Haseyama Miki

doi:10.3169/mta.4.68

Bibliographic Information

Other Title

DLF-based speech segment detection and its application to audio noise removal for video conferences

Abstract

This paper presents a new decision-level fusion (DLF)-based speech segment detection method and its application to audio noise removal for video conferences. The proposed method calculates visual features from the video sequences and audio features from the audio signals obtained in video conferences. The visual features are extracted from the mouth regions of participants, and the audio features are attribution degrees of the speech class. A Support Vector Machine (SVM) classifier is applied to each kind of feature and performs two-class classification into speech and non-speech segments, which realizes speech segment detection. The detection results obtained from the visual and audio features are then combined by DLF based on Supervised Learning from Multiple Experts, which yields the final detection results while taking the accuracy of each detection result into account. Finally, noise information is extracted from the audio signals in the non-speech segments detected by the method and used to realize accurate audio noise removal in the speech segments.
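
The fusion step described in the abstract can be illustrated with a minimal sketch. It is not the paper's method: Supervised Learning from Multiple Experts is replaced here by a simple accuracy-weighted vote, and the SVM classifiers are assumed to have already produced one decision per segment (+1 for speech, -1 for non-speech). The function names, the log-odds weighting, and the per-segment power values are all hypothetical and chosen only for illustration.

```python
import math


def expert_weight(accuracy, eps=1e-6):
    """Log-odds weight (an assumption): more accurate detectors get a larger say."""
    a = min(max(accuracy, eps), 1.0 - eps)
    return math.log(a / (1.0 - a))


def fuse_decisions(visual, audio, visual_acc, audio_acc):
    """Fuse per-segment decisions (+1 speech, -1 non-speech) by weighted vote.

    visual_acc and audio_acc stand in for accuracies measured on held-out data.
    A tie (score of exactly zero) is resolved as non-speech.
    """
    w_v = expert_weight(visual_acc)
    w_a = expert_weight(audio_acc)
    fused = []
    for v, a in zip(visual, audio):
        score = w_v * v + w_a * a
        fused.append(1 if score > 0 else -1)
    return fused


def mean_noise_power(segment_powers, fused):
    """Average the power of segments labelled non-speech as a crude noise estimate."""
    noise = [p for p, d in zip(segment_powers, fused) if d == -1]
    return sum(noise) / len(noise) if noise else 0.0


# Hypothetical decisions from the visual and audio detectors for four segments.
visual = [1, -1, 1, -1]
audio = [1, 1, -1, -1]
fused = fuse_decisions(visual, audio, visual_acc=0.7, audio_acc=0.9)

# Hypothetical per-segment signal powers; only non-speech segments contribute.
noise_level = mean_noise_power([4.0, 5.0, 1.0, 3.0], fused)
```

In this sketch the audio detector is assumed to be the more accurate one, so it wins whenever the two detectors disagree. The resulting noise estimate could then feed a noise removal step applied to the speech segments, which the abstract mentions but this sketch does not implement.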

Journal

ITE Transactions on Media Technology and Applications 4 (1), 68-77, 2016

The Institute of Image Information and Television Engineers

Details

CRID: 1390001205423918464

NII Article ID: 130005117878

DOI: 10.3169/mta.4.68

ISSN: 2186-7364

Web Site: https://www.jstage.jst.go.jp/article/mta/4/1/4_68/_pdf

Text Lang: en

Data Source

JaLC
Crossref
CiNii Articles
KAKEN

Abstract License Flag: Disallowed