期刊文献+

基于Volterra级数预测的音频频带扩展 被引量:2

Audio Bandwidth Extension Based on Volterra Series
在线阅读 下载PDF
导出
摘要 本文采用非线性分析方法,基于Volterra级数提出了一种宽带音频信号的频带扩展方法,并利用高斯混合模型(Gaussian Mixture Model,GMM)和码本映射技术对扩展后的音频信号进行了谱包络和能量增益调整.实验表明,所提算法的性能要好于已有的非线性频带扩展算法,当用本文的方法替代ITU-T G.722.1C编码器中的噪声填充技术时,在24kbps得到了提升的超宽带音频质量. In this paper,a bandwidth extension algorithm of wideband audio signal is proposed based on Volterra series with the nonlinear analysis method. The Gaussian mixture model and codebook mapping algorithms are used to adjust the specmtm enve- lope and energy gain of the extended audio signal separately.Test results indicate that the proposed method outperforms the existing nonlinear algorithms. When the noise-filling method used in ITU-T G. 722.1C super-wideband audio codec is replaced by the pro- posed algorithm,the super-wideband audio quality is improved at 24 kbps.
出处 《电子学报》 EI CAS CSCD 北大核心 2012年第12期2501-2506,共6页 Acta Electronica Sinica
基金 国家自然科学基金(No.61072089 No.60872027) 北京市自然科学基金(No.4082006) 北京市属高等学校人才强教计划资助 北京工业大学第十届研究生科技基金(No.yk-j 2012-7001)
关键词 频带扩展 VOLTERRA级数 高斯混合模型 码本映射 bandwidth extension Volterra series Gaussian mixture model codebook mapping
  • 相关文献

参考文献11

  • 1王海燕;卢山.非线性时间序列分析及其应用[M]北京:科学出版社,200610-12.
  • 2岳毅宏,韩文秀,张伟波.基于关联度的混沌序列局域加权线性回归预测法[J].中国电机工程学报,2004,24(11):17-20. 被引量:26
  • 3V J Mathews. Adaptive polynomial filters[J].IEEE Signal Processing Magazine,1991,(03):10-26.doi:10.1109/79.127998.
  • 4Xiao-ke Xu,Xiao-ming Liu,Xiao-nan Chen. The Cao method for determining the minimum embedding dimension of sea clutter[A].Proceedings of 2006 CIE International Conference on Radar[A].Shanghai:IEEE Press,2006.77-80.
  • 5沙永涛;鲍长春;贾懋坤.一种基于重构八度音的音频信号高频重建方法[J]信号处理,2009(8A):139-142.
  • 6张勇,胡瑞敏.基于高斯混合模型的语音带宽扩展算法的研究[J].声学学报,2009,34(5):471-480. 被引量:7
  • 7韩敏.混沌时间序列预测理论和方法[M]北京:中国水利水电出版社,2007155-160.
  • 8鲍长春.数字语音编码原理[M]西安:西安电子科技大学出版社,2007.
  • 9Holger Kantz,Thomas Schreiber. Nonlinear Time Series Analysis[M].Britain:Cambridge University Press,2004.42-51.
  • 10Yong-tao Sha,Chang-chun Bao,Mao-shen Jia,Xin Liu. High frequoncy reconstruction of audio signal based on chaotic prediction theory[A].Dallas,Texas,USA:IEEE Press,2010.381-384.

二级参考文献26

  • 1俞一彪,王朔中.基于互信息匹配模型的说话人识别[J].声学学报,2004,29(5):462-466. 被引量:8
  • 2郎玥,赵胜辉,匡镜明.基于矢量量化的语音信号频带扩展[J].北京理工大学学报,2005,25(3):260-264. 被引量:4
  • 3党辰,戴葵,王苏峰,刘芸,王志英.高频重建技术SBR的研究与实现[J].电子学报,2004,32(F12):189-191. 被引量:2
  • 4俞一彪,王朔中.文本无关说话人识别的全特征矢量集模型及互信息评估方法[J].声学学报,2005,30(6):536-541. 被引量:7
  • 5Jax P, Vary P. Bandwidth extension of speech signals: a catalyst for the introduction of wideband speech coding. IEEE Communications Magazines, 2006; 44(5): 106--111.
  • 6Geiser B, Jax P. Bandwidth extension for hierarchical speech and audio coding in ITU-T rec. G.729.1. IEEE Transactions on Audio, Speech and Language Processing, 2007; 15(8): 2496--2509.
  • 7Dar Ghulam Raza, Cheung-Fat Chan. Enhancing quality of celp coded speech via wideband extension by using voic- ing GMM interpolation and HNM re-synthesis. Proceeding of IEEE International Conference on Acoustics, Speech~ Signal Processing. 2002; 4:1241--1244.
  • 8Nakatoh Y, Tuushima M, Norimatsu T. Generation of broadband speech from narrowband speech using piecewise linear mapping. In Proceeding of EUROSPEECH, 1997; 9: 1643--1646.
  • 9Enbom N, Klenijn W B. Bandwidth expansion of speech based on vector quantization of the reel frequency cepstral coefficients. IEEE Workshop on Speech Coding Proceedings, 1999; 2:171--173.
  • 10Park K Y, Kim H S. Narrowband to wideband conversion of speech using GMM based transformation. Proceeding of IEEE International Conference on Acoustics, Speech, Signal Processing, 2000; 4:1843--1846.

共引文献31

同被引文献7

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部