Speaker Recognition Using Sparse Probabilistic Linear Discriminant Analysis

  • YANG Hai
    Key Laboratory of Speech Acoustics and Content Understanding, Chinese Academy of Sciences
  • XU Yunfei
    Key Laboratory of Speech Acoustics and Content Understanding, Chinese Academy of Sciences
  • ZHAO Qinwei
    Key Laboratory of Speech Acoustics and Content Understanding, Chinese Academy of Sciences
  • ZHOU Ruohua
    Key Laboratory of Speech Acoustics and Content Understanding, Chinese Academy of Sciences
  • YAN Yonghong
    Key Laboratory of Speech Acoustics and Content Understanding, Chinese Academy of Sciences

抄録

Sparse representation has been studied within the field of signal processing as a means of providing a compact form of signal representation. This paper introduces a sparse representation based framework named Sparse Probabilistic Linear Discriminant Analysis in speaker recognition. In this latent variable model, probabilistic linear discriminant analysis is modified to obtain an algorithm for learning overcomplete sparse representations by replacing the Gaussian prior on the factors with Laplace prior that encourages sparseness. For a given speaker signal, the dictionary obtained from this model has good representational power while supporting optimal discrimination of the classes. An expectation-maximization algorithm is derived to train the model with a variational approximation to a range of heavy-tailed distributions whose limit is the Laplace. The variational approximation is also used to compute the likelihood ratio score of all trials of speakers. This approach performed well on the core-extended conditions of the NIST 2010 Speaker Recognition Evaluation, and is competitive compared to the Gaussian Probabilistic Linear Discriminant Analysis, in terms of normalized Decision Cost Function and Equal Error Rate.

収録刊行物

参考文献 (11)*注記

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ