
A computational model for assessment of speech intelligibility in informational masking

Abstract: Existing auditory computational models for evaluating speech intelligibility account only for energetic masking; the effect of informational masking is rarely described in these models. This study aimed to develop a computational model that incorporates the mechanism of informational masking. Several psychoacoustic experiments were conducted to test the effect of informational masking on speech intelligibility by manipulating the number of masking talkers, the speech rate, and the similarity of the F0 contours of the target and the masker. The results showed that the speech reception threshold for the target increased as the F0 contour of the masker became more similar to that of the target, suggesting that difficulty in segregating the target harmonics from the masker harmonics may underlie the informational masking effect. Based on these findings, a new auditory computational model, named the harmonic extraction (HF) model, was built by introducing the auditory function of harmonic extraction into the traditional speech intelligibility index (SII) model. The predictions of the HF model are highly consistent with the experimental results.
Source: Frontiers of Electrical and Electronic Engineering in China (中国电气与电子工程前沿(英文版)), CSCD, 2012, Issue 1, pp. 107-115 (9 pages)
Keywords: auditory computational model, speech intelligibility, informational masking, F0 contour, harmonic extraction
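
For readers unfamiliar with the speech intelligibility index (SII) that the abstract names as the baseline, the following is a minimal sketch of a simplified SII-style calculation: each frequency band's signal-to-noise ratio is mapped to an audibility value between 0 and 1, and the values are combined using band-importance weights. It is an illustration under stated assumptions, not the authors' HF model: the harmonic extraction stage is not specified in the abstract and is not reproduced here, and level-distortion and spread-of-masking corrections are omitted. The function names, the four bands, and all numeric values are hypothetical.

```python
def band_audibility(snr_db):
    """Map a band SNR (dB) to audibility in [0, 1].

    Simplified SII rule: -15 dB or lower gives 0, +15 dB or higher gives 1,
    with linear interpolation in between.
    """
    return min(1.0, max(0.0, (snr_db + 15.0) / 30.0))


def simplified_sii(speech_db, masker_db, importance):
    """Importance-weighted sum of band audibilities (simplified SII).

    speech_db, masker_db: per-band levels in dB (same band layout).
    importance: per-band importance weights; normalized here to sum to 1.
    """
    if not (len(speech_db) == len(masker_db) == len(importance)):
        raise ValueError("all inputs must have one value per band")
    total = sum(importance)
    if total <= 0:
        raise ValueError("importance weights must sum to a positive value")
    return sum(
        (w / total) * band_audibility(s - m)
        for w, s, m in zip(importance, speech_db, masker_db)
    )


# Hypothetical four-band example (values chosen only for illustration).
speech = [60.0, 58.0, 52.0, 45.0]   # target speech level per band, dB
masker = [55.0, 58.0, 60.0, 70.0]   # masker level per band, dB
weights = [0.2, 0.3, 0.3, 0.2]      # assumed band-importance weights

sii = simplified_sii(speech, masker, weights)
print(f"simplified SII = {sii:.3f}")
```

Because this calculation depends only on band energies, it yields the same value for any two maskers with the same long-term spectrum, whatever their F0 contours. That is the limitation the abstract attributes to energetic-masking models.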

