
Algorithm of Speech Feature Extraction in Facial Speech Animation (人脸语音动画中语音特征参数提取算法研究)

Cited by: 1
Abstract: Facial speech animation is an active research topic in virtual reality, and speech feature extraction is the prerequisite and key step for speech-synchronized animation. To obtain more robust speech feature parameters, this paper builds on wavelet transform theory, borrows the extraction procedure used for MFCC features, and applies a feature differencing algorithm that captures the dynamic characteristics of speech. The result is a speech feature extraction method based on the discrete wavelet transform (DWTMFCC), which is then combined with prosodic parameters that reflect the emotional characteristics of speech. Speaker recognition experiments using a VQ model trained with the LBG (Linde-Buzo-Gray) algorithm show that the combined feature parameters achieve a higher recognition rate.
Authors: Lin Rui (林睿), Fan Yangyu (樊养余)
Source: Modern Electronics Technique (《现代电子技术》), 2011, No. 6, pp. 74-77 (4 pages)
Funding: National "863" High-Tech Research and Development Program of China (2007AA01Z324)
Keywords: facial speech animation; speech feature extraction; wavelet transform; dynamic feature; combined feature parameters
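
The abstract names two generic building blocks that can be illustrated without the full paper: difference (delta) features for speech dynamics, and a VQ codebook trained with the LBG algorithm for speaker recognition. The sketch below is a minimal illustration under stated assumptions, not the authors' implementation. It does not attempt DWTMFCC itself, whose details the abstract does not give. The delta formula is the commonly used regression form, and all function names and parameters (`delta`, `lbg`, `identify`, `n`, `eps`, `tol`) are hypothetical.

```python
import math

def delta(frames, n=2):
    """Delta features by the common regression formula (an assumption here).

    frames: list of equal-length feature vectors, one per frame.
    Edge frames are handled by repeating the first/last frame.
    """
    denom = 2 * sum(k * k for k in range(1, n + 1))
    last = len(frames) - 1
    out = []
    for t in range(len(frames)):
        vec = []
        for d in range(len(frames[0])):
            acc = 0.0
            for k in range(1, n + 1):
                ahead = frames[min(t + k, last)][d]
                behind = frames[max(t - k, 0)][d]
                acc += k * (ahead - behind)
            vec.append(acc / denom)
        out.append(vec)
    return out

def sq_dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))

def centroid(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    return [sum(col) / len(vectors) for col in zip(*vectors)]

def avg_distortion(vectors, codebook):
    """Mean squared distance from each vector to its nearest codeword."""
    total = sum(min(sq_dist(v, cw) for cw in codebook) for v in vectors)
    return total / len(vectors)

def lbg(vectors, size, eps=0.01, tol=1e-6, max_iter=100):
    """Train a VQ codebook by LBG splitting; size should be a power of two."""
    codebook = [centroid(vectors)]
    while len(codebook) < size:
        # Split every codeword into a pair, using an additive perturbation
        # so that an all-zero codeword still splits.
        codebook = [[c + s * eps for c in cw]
                    for cw in codebook for s in (1, -1)]
        prev = math.inf
        for _ in range(max_iter):
            # Assign each vector to its nearest codeword.
            cells = [[] for _ in codebook]
            for v in vectors:
                i = min(range(len(codebook)),
                        key=lambda j: sq_dist(v, codebook[j]))
                cells[i].append(v)
            # Move each codeword to its cell centroid; keep it if the cell is empty.
            codebook = [centroid(cell) if cell else cw
                        for cell, cw in zip(cells, codebook)]
            dist = avg_distortion(vectors, codebook)
            if prev - dist < tol:
                break
            prev = dist
    return codebook

def identify(vectors, codebooks):
    """Return the speaker label whose codebook gives the lowest distortion."""
    return min(codebooks, key=lambda name: avg_distortion(vectors, codebooks[name]))
```

In a pipeline of the kind the abstract describes, each speaker's training frames (static features concatenated with their deltas and any prosodic parameters) would be passed to `lbg` to build one codebook per speaker, and `identify` would pick the speaker whose codebook quantizes a test utterance with the least average distortion.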
