Contribution of modulation spectral features on the perception of vocal-emotion using noise-vocoded speech
-
- Zhu Zhi
- Japan Advanced Institute of Science and Technology
-
- Miyauchi Ryota
- Japan Advanced Institute of Science and Technology
-
- Araki Yukiko
- Kanazawa University
-
- Unoki Masashi
- Japan Advanced Institute of Science and Technology
この論文をさがす
抄録
Previous studies on noise-vocoded speech showed that the temporal modulation cues provided by the temporal envelope play an important role in the perception of vocal emotion. However, the exact role that the temporal envelope and its modulation components play in the perceptual processing of vocal emotion is still unknown. To clarify the exact features that the temporal envelope contributes to the perception of vocal emotion, a method based on the mechanism of modulation frequency analysis in the auditory system is necessary. In this study, auditory-based modulation spectral features were used to account for the perceptual data collected from vocal-emotion recognition experiments using noise-vocoded speech. An auditory-based modulation filterbank was used to calculate the modulation spectrogram of noise-vocoded speech stimuli, and ten types of modulation spectral features were then extracted from the modulation spectrograms. The results showed that there were high similarities between modulation spectral features and the perceptual data of vocal-emotion recognition experiments. It was shown that the modulation spectral features are useful for accounting for the perceptual processing of vocal emotion with noise-vocoded speech.
収録刊行物
-
- Acoustical Science and Technology
-
Acoustical Science and Technology 39 (6), 379-386, 2018-11-01
一般社団法人 日本音響学会
- Tweet
キーワード
詳細情報 詳細情報について
-
- CRID
- 1390845713015741312
-
- NII論文ID
- 40021703797
-
- NII書誌ID
- AA11501808
-
- ISSN
- 13475177
- 03694232
- 13463969
-
- NDL書誌ID
- 029306604
-
- 本文言語コード
- en
-
- データソース種別
-
- JaLC
- NDL
- Crossref
- CiNii Articles
- KAKEN
-
- 抄録ライセンスフラグ
- 使用不可