
Experimental study on phase perception in speech (语音中相位的听觉感知实验研究). Cited by: 7
Abstract: Human hearing is relatively insensitive to phase in speech signals, so phase distortion is often ignored in speech processing and coding. In practice, however, speech quality degrades noticeably once phase distortion becomes large enough, and a high-quality vocoder cannot disregard the phase of the spectral components. This paper uses subjective listening tests to study how the short-time Fourier transform (STFT) phase spectrum of speech affects auditory perception. The tests lead to three main conclusions: (1) if the original phase information is discarded completely, the reconstructed speech is very noisy and sounds highly unnatural; (2) discarding the phase in either the low-frequency band or the high-frequency band produces a perceptible difference from the original speech; (3) when the phase quantization step size is smaller than π/7, listeners cannot distinguish the reconstructed speech from the original speech.
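The two manipulations named in the abstract, discarding the phase and quantizing it with a fixed step, can be illustrated on a single frame. The sketch below is not the authors' procedure. The naive DFT, the function names, the round-to-nearest quantization rule, and the synthetic two-tone test frame are all assumptions made for illustration. An actual experiment would process windowed, overlapping STFT frames of recorded speech and judge the result by listening.

```python
import cmath
import math


def dft(frame):
    """Naive O(N^2) discrete Fourier transform of one real-valued frame."""
    n = len(frame)
    return [
        sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame))
        for k in range(n)
    ]


def idft(spectrum):
    """Inverse DFT; returns the real part of each reconstructed sample."""
    n = len(spectrum)
    return [
        sum(c * cmath.exp(2j * math.pi * k * t / n) for k, c in enumerate(spectrum)).real / n
        for t in range(n)
    ]


def discard_phase(spectrum):
    """Keep each bin's magnitude and set its phase to zero."""
    return [complex(abs(c), 0.0) for c in spectrum]


def quantize_phase(spectrum, step):
    """Round each bin's phase to the nearest multiple of `step` (radians).

    Magnitudes are left untouched. Rounding to the nearest multiple is an
    assumed quantizer, chosen only for illustration.
    """
    quantized = []
    for c in spectrum:
        magnitude, phase = cmath.polar(c)
        quantized.append(cmath.rect(magnitude, step * round(phase / step)))
    return quantized


def rms_error(a, b):
    """Root-mean-square difference between two equal-length sequences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)) / len(a))


def make_test_frame(n=64):
    """Hypothetical test frame: two sinusoids with different starting phases."""
    return [
        math.cos(2 * math.pi * 3 * t / n + 0.4)
        + 0.5 * math.cos(2 * math.pi * 7 * t / n + 1.1)
        for t in range(n)
    ]


if __name__ == "__main__":
    frame = make_test_frame()
    spectrum = dft(frame)
    coarse = idft(quantize_phase(spectrum, math.pi / 7))
    no_phase = idft(discard_phase(spectrum))
    print("RMS error, phase step pi/7:", rms_error(frame, coarse))
    print("RMS error, phase discarded:", rms_error(frame, no_phase))
```

The RMS error printed here is only a waveform-domain measure. It says nothing about audibility, which the paper assesses through subjective listening tests.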
Source: Acta Acustica (《声学学报》), 2003, No. 1, pp. 7-11 (5 pages). Indexed in EI, CSCD, and the Peking University Core Journals list.
Keywords: phase; auditory perception; experimental study; speech signal processing; coding; speech quality; vocoder; speech spectrum; acoustic distortion; signal reconstruction; speech coding

References (6)

  • 1. Plomp R, Steeneken H J. Effect of phase on the timbre of complex tones. J. Acoust. Soc. Am., 1969; 46(2): 409-421.
  • 2. Levitt H. Transformed up-down methods in psychoacoustics. J. Acoust. Soc. Am., 1971; 49(2): 467-477.
  • 3. Patterson R D. A pulse ribbon model of monaural phase perception. J. Acoust. Soc. Am., 1987; 82(5): 1560-1586.
  • 4. Pobloth H, Kleijn W B. On phase perception in speech. In: Proc. of ICASSP, Phoenix, 1999; 1: 29-32.
  • 5. Kim D. Perceptual phase redundancy in speech. In: Proc. of ICASSP, Istanbul, 2000; 2: 1383-1386.
  • 6. Painter T, Spanias A. Perceptual coding of digital audio. Proc. IEEE, 2000; 88(4): 451-513.

