画像処理を用いた周波数領域での混合母音音声の分離

書誌事項

タイトル別名
  • Sound Source Separation in the Frequency Domain with Image Processing
  • ガゾウ ショリ オ モチイタ シュウハスウ リョウイキ デ ノ コンゴウ ボイン オンセイ ノ ブンリ

この論文をさがす

抄録

We propose a new method for extracting separately each of the sounds from the mixture of two speech sounds, which are uttered concurrently. First the mixture is transformed into a sound spectrogram which is thereafter treated as an image. Exploiting image processing techniques, the onsets and offsets of the fre-quency components of each speech sound are detected. Then the harmonic structure of each speech sound is extracted by tracing each onset through the corresponding offset and relating each of them to one another in the frequency domain. A set of band-pass filters are designed reflecting the extracted harmonic structure. Each speech sound is extracted by applying the set of band-pass filters to the mixture. Experiments were conducted with the mixture of a male speech sound and a female speech sound both consisting of Japanese vowels. The evaluation results demonstrated that the separation was done reasonably well with the proposed method.

収録刊行物

被引用文献 (4)*注記

もっと見る

参考文献 (36)*注記

もっと見る

詳細情報 詳細情報について

問題の指摘

ページトップへ