書誌事項
- タイトル別名
-
- Sound Source Separation in the Frequency Domain with Image Processing
- ガゾウ ショリ オ モチイタ シュウハスウ リョウイキ デ ノ コンゴウ ボイン オンセイ ノ ブンリ
この論文をさがす
抄録
We propose a new method for extracting separately each of the sounds from the mixture of two speech sounds, which are uttered concurrently. First the mixture is transformed into a sound spectrogram which is thereafter treated as an image. Exploiting image processing techniques, the onsets and offsets of the fre-quency components of each speech sound are detected. Then the harmonic structure of each speech sound is extracted by tracing each onset through the corresponding offset and relating each of them to one another in the frequency domain. A set of band-pass filters are designed reflecting the extracted harmonic structure. Each speech sound is extracted by applying the set of band-pass filters to the mixture. Experiments were conducted with the mixture of a male speech sound and a female speech sound both consisting of Japanese vowels. The evaluation results demonstrated that the separation was done reasonably well with the proposed method.
収録刊行物
-
- 電気学会論文誌C(電子・情報・システム部門誌)
-
電気学会論文誌C(電子・情報・システム部門誌) 121 (12), 1866-1874, 2001
一般社団法人 電気学会
- Tweet
キーワード
詳細情報 詳細情報について
-
- CRID
- 1390282679587156736
-
- NII論文ID
- 130006845485
- 10007451148
-
- NII書誌ID
- AN10065950
-
- ISSN
- 13488155
- 03854221
- http://id.crossref.org/issn/03854221
-
- NDL書誌ID
- 5994437
-
- データソース種別
-
- JaLC
- NDL
- Crossref
- CiNii Articles
-
- 抄録ライセンスフラグ
- 使用不可