Speech intelligibility of Chinese time-reversed speech (汉语音段反转言语的可懂度研究)

Cited by: 3

Abstract: This study examined, in a psychoacoustic experiment, how the frame length of the time-reversal window affects the intelligibility of Chinese time-reversed speech. Intelligibility was high for frame lengths below 64 ms, decreased gradually as the frame length increased from 64 to 203 ms, and fell to nearly zero above 203 ms. At a frame length of 8 ms, intelligibility dropped because of distortion of the Chinese tones. Analysis of the modulation spectra of the original speech and the corresponding time-reversed speech showed that intelligibility is closely related to the degree of modulation-spectrum distortion. That distortion can therefore be quantified by the normalized correlation between the narrow-band envelopes of the original speech and of the time-reversed speech. Objective values computed with the speech-based speech transmission index method correlated significantly with the experimental results (r = 0.876, p < 0.01). The results indicate that speech intelligibility depends on the narrow-band envelopes, and that the intelligibility of time-reversed speech is closely tied to how well the narrow-band envelopes of the original speech are preserved.
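The frame-wise reversal described in the abstract can be illustrated with a minimal sketch. It assumes the signal is cut into consecutive, non-overlapping frames of a fixed length and that each frame is played backwards; the function names `reverse_segments` and `normalized_correlation`, the handling of the final partial frame, and the zero-lag form of the correlation are assumptions made for illustration, not details taken from the paper. Extraction of the narrow-band envelopes (filter bank plus envelope detection) is omitted.

```python
import math


def reverse_segments(samples, sample_rate_hz, frame_ms):
    """Reverse each consecutive frame of `frame_ms` milliseconds in time.

    Assumption: frames do not overlap, and a shorter final frame is
    reversed as-is.
    """
    frame_len = max(1, round(sample_rate_hz * frame_ms / 1000))
    out = []
    for start in range(0, len(samples), frame_len):
        frame = samples[start:start + frame_len]
        out.extend(frame[::-1])  # time-reverse this frame only
    return out


def normalized_correlation(x, y):
    """Zero-lag normalized correlation of two equal-length envelopes.

    Assumption: this is one common definition; the paper's exact
    formula is not given in the abstract.
    """
    if len(x) != len(y):
        raise ValueError("envelopes must have the same length")
    num = sum(a * b for a, b in zip(x, y))
    den = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return num / den if den else 0.0


# Toy example: 7 samples at 1 kHz with a 3 ms frame (3 samples per frame).
toy = list(range(7))
print(reverse_segments(toy, 1000, 3))  # [2, 1, 0, 5, 4, 3, 6]
```

With this scheme, applying the same reversal twice restores the original signal, and a longer frame moves each sample further from its original position, which is consistent with the reported loss of intelligibility at long frame lengths.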

Authors: JIANG Bin (蒋斌), KUANG Zheng (匡正), WU Ming (吴鸣), YANG Jun (杨军)

Affiliation: Key Laboratory of Noise and Vibration Research, Institute of Acoustics, Chinese Academy of Sciences

Source: Acta Acustica (《声学学报》), indexed in EI, CSCD, and the Peking University Core Journals list; 2012, No. 6, pp. 659-666 (8 pages)

Funding: National Natural Science Foundation of China (grants 11004217, 11074279, 11174317)

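Keywords: speech intelligibility; speech signal; reversal; modulation spectrum; spectral distortion; speech transmission; frame length; experiment; acoustics; modulation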

Classification: TN912.3 [Electronics and Telecommunications: Communication and Information Systems]


