Speaker verification system robust to speaking style variation using multiple kernel learning based on conditional entropy minimization

小川 哲司, 日野 英逸, 村田 昇, 小林 哲則

話者内変動に頑健な話者照合システムについて検討を行った．発話スタイルや発話時期の違いなどの影響で，同一話者の音声であっても音響的な変動が生じる．このような音響変動は，一般的に話者照合システムの性能を劣化させることが知られている．この問題を解決するため，条件付きエントロピー最小化という，同一クラスのデータを密集させ，かつ異なるクラスのデータを互いに遠ざける性質を持つ最適化基準を用いてマルチカーネル学習を行い，話者照合システムを構築することを試みた．話者照合実験の結果，提案システムは，従来のマージン最大化に基づき構築したシステムと比較して，発話スタイル変動に起因する話者クラス内での音響特徴変動に対して頑健な性能を与えた．We developed a new speaker verification system that is robust to intra-speaker variation. There is a strong likelihood that intra-speaker variations will occur due to changes in speaking styles, the periods when an individual speaks, and so on. It is well known that such variation generally degrades the performance of speaker verification systems. To solve this problem, we applied multiple kernel learning based on conditional entropy minimization, which impose the data to be compactly aggregated for each class and ensure that the different classes were far apart from each other, to speaker verification. Experimental results showed that the proposed speaker verification system achieved a robust performance to intra-speaker variation derived from changes in the speaking styles compared to the conventional maximum margin-based system.

Speaker verification system robust to speaking style variation using multiple kernel learning based on conditional entropy minimization

Bibliographic Information

Search this article

Abstract

Journal

Related Projects

Details 詳細情報について

Export

Report a problem

Speaker verification system robust to speaking style variation using multiple kernel learning based on conditional entropy minimization

Bibliographic Information

Search this article

Abstract

Journal

Related Projects

Details 詳細情報について

Export

Report a problem

Project list