期刊文献+

汉语音段反转言语的可懂度研究 被引量:3

Speech intelligibility of Chinese time-reversed speech
原文传递
导出
摘要 实验研究了帧长对汉语音段反转言语可懂度的影响。实验结果表明,帧长在64 ms以下,汉语音段反转言语具有较高的可懂度;帧长在64~203 ms之间,可懂度随帧长的增加逐渐降低;帧长在203 ms以上,可懂度为0。在帧长8 ms时,汉语的声调失真导致可懂度下降。原始语音信号和音段反转言语的调制谱的分析表明,调制谱失真大小和可懂度密切相关。因此,用原始语音信号和音段反转言语的窄带包络间的归一化相关值可以衡量调制谱失真大小,基于语音的语言传输指数法计算的客观值和实验结果显著相关(r=0.876,p<0.01)。研究表明,语言可懂度与窄带包络有关,音段反转言语的可懂度和保留原始语音信号的窄带包络密切相关。 This study investigated speech intelligibility of Chinese time-reversed speech in a psychoacoustic experiment with different frame lengths of time reversal window. The test of speech intelligibility showed that the intelligibility was high when the frame length was below 64 ms, the intelligibility reduced gradually when the frame length was from 64 to 203 ms, and the intelligibility nearly got to zero when the frame length was above 203 ms. The intelligibility with the frame length 8 ms reduced due to the tonal distortion. The modulation spectra of the original speech and the corresponding time-reversed speech were analyzed and it showed that the intelligibility was correlated with modulation spectra distortion. Therefore, the modulation spectra distortion was conducted by normalizing correlation between the narrow-band envelopes of the original speech and the corresponding time-reversed speech. The objective values were calculated by the speech-based speech transmission index method and it showed that the objective values were highly correlated with the test of speech intelligibility (r = 0.876, p 〈 0.01). The study demonstrates that speech intelligibility is related to narrow-band envelopes and the preservation of narrow-band envelopes is correlated with the intelligibility of time-reversed speech.
出处 《声学学报》 EI CSCD 北大核心 2012年第6期659-666,共8页 Acta Acustica
基金 国家自然科学基金资助项目(11004217 11074279 11174317)
  • 相关文献

参考文献18

  • 1Yon S, Tanter M, Fink M. Sound focusing in rooms: The time-reversal approach. J. Acoust. Soc. Am., 2003; 113(3): 1533--1543.
  • 2Ma D, Yang J. Optimal time-reversal focusing by an iter- ative least-squares method. Jpn. J. Appl. Phys., 2009; 48(7): 07GD08.
  • 3Catheline S, Fink M, Quieffin N, Ing R K. Acoustic source localization model using in-skull reverberation and time re- versal. Appl. Phys. Left., 2007; 90(6): 063902.
  • 4Rhebergen K S, Versfeld N J, Dreschler W A. Release from informational masking by time reversal of native and non- native interfering speech. J. Acoust. Soc. Am., 2005; 118(3): 1274--1277.
  • 5Stickney G S, Zeng F G, Litovsky R, Assmann P. Cochlear implant speech recognition with speech maskers. J. Acoust.Soc. Am., 2004; 116(2): 1081-1091.
  • 6Saberi K, Perrott D R. Cognitive restoration of reversed speech. Nature, 1999; 398(6730): 760--760.
  • 7Nguyen D Q, Gan W S, Khong A W H. Time-reversal ap- proach to the stereophonic acoustic echo cancellation prob- lem. IEEE Trans. Audio Speech Lang. Process., 2011; 19(2): 385--395.
  • 8Jiang B, Liebl A, Leistner P, Yang J. Sound masking per- formance of time-reversed masker processed from the target speech. Acta Acust. united Ac., 2012; 98(1): 135--141.
  • 9Kang J. Comparison of speech intelligibility between En- glish and Chinese. J. Acoust. Soc. Am., 1998; 103(2): 1213--1216.
  • 10杨琳,张建平,颜永红.单通道语音增强算法对汉语语音可懂度影响的研究[J].声学学报,2010,35(2):248-253. 被引量:17

二级参考文献10

  • 1王晶,傅丰林,张运伟.语音增强算法综述[J].声学与电子工程,2005(1):22-26. 被引量:22
  • 2张家禄 齐士钤 宋美珍 等.汉语声调在言语可懂度中的重要作用.声学学报,1981,7:237-237.
  • 3Song Myung-Suk, Lee Chang-Heon, Kang Hong-Goo. Performance analysis of various single channel speech enhancement algorithms for automatic speech recognition. Inter- speech2006, 1451-1454, Pittsburgh, Pennsylvania.
  • 4Hu Guoning, Wang DeLiang. Monaural speech segregation based on pitch tracking and amplitude modulation. IEEE Trans. Neural Networks, 2004; 15(5): 1135-1150.
  • 5Hu Yi, Loizou P C. A comparative intelligibility study of single-microphone noise reduction algorithms. J. Acoust. Soc. Am., 2007; 122(3): 1777-1786.
  • 6Hu Yi, Loizou P C. Subjective evaluation and comparison of speech enhancement algorithms. Speech Communication, 2007; 49:588-601.
  • 7Kang Jian. Comparison of speech intelligibility between English and Chinese. J. Acoust. Soc. Am., 1998; 103(2): 1213-1216.
  • 8Loizou P C. Speech enhancement: Theory and practice. CRC Press, 2007.
  • 9Kong Y Y, Zeng F G. Temporal and spectral cues in Mandarin tone recogntion. J. Acoust. Soc. Am., 2006; 120(5): 2830-2840.
  • 10郑成诗,李晓东,陈佳路,田静.自适应平滑周期图语音增强研究[J].声学学报,2007,32(5):461-467. 被引量:4

共引文献27

同被引文献17

  • 1王晶,傅丰林,张运伟.语音增强算法综述[J].声学与电子工程,2005(1):22-26. 被引量:22
  • 2赵毅,尹雪飞,陈克安.一种新的基于倒谱的共振峰频率检测算法[J].应用声学,2010,29(6):416-424. 被引量:9
  • 3易克初.语音信号处理[M].北京:国防工业出版社,2002..
  • 4KELLERMANN W.A self-steering digital microphone array[C]//IEEE International Conference on Acoustics,Speech and Signal Processing.1991:3581-3584.
  • 5GRIFFITHS L J,JIM C W.An alternative approach to linearly constrained adaptive beamforming[J].IEEE Transactions on Antennas and Propagation,1981,30(1):27-34.
  • 6ZELINSKI R.A microphone array with adaptive post-filtering for noise reduction in reverberant rooms[C]//IEEE International Conference on Acoustics,Speech and Signal Processing.1988,5:2578-2581.
  • 7OSAMU Hoshuyama,AKIHIKO Sugiyama,AKIHIRO Hirano.A robust adaptive beamformer for microphone arrays with a blocking matrix using constrained adaptive filters[J].IEEE Transactions on Signal Processing,1999,47(10):2677-2684.
  • 8CHONG K S,GWEE B H,CHANG J S.A 16-channel low-power non-uniform spaced filter bank core for digital hearing aids[J].IEEE Transactions on Circuits and Systems:Express Briefs,2006,53(10):853-858.
  • 9邹采荣,陈国明,赵力.Speech enhancement based on leakage constraints DF-GSC[J].Journal of Southeast University(English Edition),2007,23(4):507-511. 被引量:1
  • 10王青云,赵力,乔杰,邹采荣.符合人耳听觉特征的数字助听器子带响度补偿[J].应用科学学报,2008,26(6):580-585. 被引量:3

引证文献3

二级引证文献4

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部