期刊文献+

基于压缩感知的线谱对参数降维量化算法 被引量:1

Dimension Reduction Quantization of LSP Parameters Based on Compressed Sensing
在线阅读 下载PDF
导出
摘要 为实现高质量的极低速语音编码,提出一种基于压缩感知理论的线谱对(LSP)参数降维量化算法。编码端利用压缩感知理论对超帧LSP高维矢量进行降维处理,将原始LSP参数投影到低维空间,得到低维测量值,然后采用分裂矢量量化算法对测量值进行量化;解码端以量化后的测量值为已知条件,利用正交匹配追踪算法重构出原始LSP高维矢量。实验结果表明,本算法相对低速语音编码中的矩阵量化方案,平均谱失真降低了0.23dB,相对基于DCT变换的降维量化方案,平均谱失真降低了0.13dB。这种先降维再量化的思想可以大幅减少编码所需的比特数及码本存储复杂度,有效降低语音编码速率,并且合成语音可懂度、自然度较高,音质虽有所失真,但基本上感觉不到明显的听觉质量下降。 To achieve good reconstruction speech quality in very low bit rate speech codecs,an efficient dimension reduction quantization scheme for linear spectrum pair(LSP) parameters was proposed based on compressed sensing.In the encoder,the LSP parameters extracted from consecutive speech frames are shaped into a high dimensional vector,and then the dimension of the vector is reduced by CS to produce a measurement vector,the measurements are quantized using the split vector quantizer.In the decoder,according to the quantized measurements,the original LSP vector is reconstructed by the orthogonal matching pursuit method.Experimental results show that the scheme is more efficient than that of conventional matrix quantization scheme and DCT based dimension reduction quantization scheme,the average spectral distortion reduction of up to 0.23dB and 0.13dB is achieved respectively.Informal subjective listening test shows that the reconstructed speech has moderate intelligibility and naturalness,it is observed that the degradation in speech quality is tolerable and with low codebook storage requirements.
出处 《信号处理》 CSCD 北大核心 2011年第4期563-568,共6页 Journal of Signal Processing
基金 国家自然科学基金资助(No.61072042 No.60572095)
关键词 低速语音编码 线谱对 压缩感知 矢量量化 Low bit rate speech coding Line spectrum pair Compressed sensing Vector quantization
  • 相关文献

参考文献17

  • 1Gwenael G, Francois C, Bertrand R, et al. New NATO STANAG narrow band voice coder at 600bits/s [ C ]. In Proc. IEEE Int. Conf. Acousic Speech Signal Processing,Toulouse,France,2006:689-692.
  • 2Paliwal K K, Atal B S. Efficient vector quantization of LPC parameters at 24bits/frame [ J ]. IEEE Trans on speeeh and audio processing, 1993,1 (1) :3-14.
  • 3Leblanc W P, Bhattacharya B, Mahmoud S A, et al. Efficient search and design procedures for robust multi-stage VQ of LPC parameters for 4 Kb/s speech coding [ J ].IEEE Trans on speech and audio processing, 1993,1 (4) : 373 -385.
  • 4ZOU Xia, ZHANG Xiong-wei. Efficient coding of LSF parameters using multi-mode predictive multistage matrix quantization[ C ]. IEEE lnteruational Conference on Signal Processing, Beijing, China,2008:542-545.
  • 5Subasingha S, Murthi M N, Andersen S V. On gram kalman predictive coding of lsfs for packet loss [ C ]. In Proc. IEEE Int. Conf. Acousic Speech Signal Processing, Taibei, China, 2009 : 4105 - 4108.
  • 6ozaydin S, Baykal B. Matrix quantization and mixed excitation based linear predictive speech coding at very low bit rates[ J ]. Speech Communication ,2003,41:381-392.
  • 7Xydeas C S, Papanastasiou C. Split matrix quantization of LPC parameters [ J ]. IEEE Trans on speech and audio processing, 1999,7 ( 2 ) : 113-125.
  • 8ZOU Xia, ZHANG Xiong-wei, ZHANG Ya-fei. A 300bps speech coding algorithm based on multi-mode matrix quantization[ CI. IEEE International Conference on Wireless Communictions and Signal Processing, Nanjing, China ,2009 : 1-4.
  • 9Donoho D L. Compressed sensing[J]. IEEE Trans on information theory, 2006,52(4) : 1289-1306.
  • 10Sreenivas T V, Kleijn W B. Compressive sensing for sparsely excited speech signals [ C ]. In Proc. IEEE Int. Conf. Acousic Speech Signal Processing, Taibei, China, 2009:4125-4128.

二级参考文献124

  • 1赵铭,崔慧娟,唐昆,杜文.谱包络参数的平滑算法[J].清华大学学报(自然科学版),2005,45(4):448-451. 被引量:5
  • 2张春梅,尹忠科,肖明霞.基于冗余字典的信号超完备表示与稀疏分解[J].科学通报,2006,51(6):628-633. 被引量:70
  • 3丛键,张知易.一种600bps极低速率语音编码算法[J].电子与信息学报,2007,29(2):429-433. 被引量:7
  • 4Paliwal K K, Kleijn K B. Quantization of LPC Parameters[J]. IEEE Trans. Speech Audio Processing, 1995:433-466.
  • 5赵铭.超低速率语音编码技术与算法研究[D].北京:清华大学,2003.
  • 6Ozaydin S, Baykal B. Multi stage matrix quantization for very low bit rate speech coding[J]. IEEE Workshop on Signal Processing, 2001:372-375.
  • 7Suhramaniam h D, Rao B D. PDF optimized parametric vector quantization of speech line spectral frequencies [J]. IEEE Trans Speech Audio Processing, 2003, 11(2):130-142.
  • 8R Baraniuk.A lecture on compressive sensing[J].IEEE Signal Processing Magazine,2007,24(4):118-121.
  • 9Guangming Shi,Jie Lin,Xuyang Chen,Fei Qi,Danhua Liu and Li Zhang.UWB echo signal detection with ultra low rate sampling based on compressed sensing[J].IEEE Trans.On Circuits and Systems-Ⅱ:Express Briefs,2008,55(4):379-383.
  • 10Cand,S E J.Ridgelets:theory and applications[I)].Stanford.Stanford University.1998.

共引文献775

同被引文献13

  • 1Tokuda K,Masuko T,Hiroi J,et al.A very low bit rate speech coder using HMM-based speech recognitiorn/synthesis techniques[C]//Acoustics,Speech and Signal Processing,1998.Proceedings of the 1998 IEEE International Conference on.IEEE,1998,2:609-612.
  • 2Crosmer J,Bamwell Ⅲ T.A low bit rate segment vocoder based on line spectrum pairs[C]//Acoustics,Speech,and Signal Processing,IEEE International Conference on ICASSP'85.IEEE,1985,10:240-243.
  • 3Wang T,Koishida K,Cuperman V,et al.A 1200 bps speech coder based on MELP[C]//Acoustics,Speech,and Signal Processing,2000.ICASSP'00.Proceedings.2000 IEEE International Conference on.IEEE,2000,3:1375-1378.
  • 4Guilmin G,Capman F,Ravera B,et al.New NATO STANAG narrow band voice coder at 600 bits/s[C]//Acoustics,Speech and Signal Processing,2006.ICASSP 2006Proceedings.2006 IEEE International Conference on.IEEE,2006.
  • 5Zou X,Wen C,Zhang X,et al.An improved 600bps speech coding based on joint quantization of pitch and gain shape[C]//Communication Technology (ICCT),2010 12th IEEE International Conference on.IEEE,2010:1303-1306.
  • 6Wu C,Jiang H,Li B.An improved MELP speech coder[C]//Information Technology and Computer Science,2009.ITCS 2009.International Conference on.IEEE,2009,2:130-133.
  • 7Unver E,Villette S,Kondoz A.Joint quantisation strategies for low bit-rate sinusoidal coding[J].Signal Processing,IET,2010,4(5):548-559.
  • 8Boucheron L E,Leon P L D,Sandoval S.Hybrid scalar/ vector quantization of mel-frequency cepstral coefficients for low bit-rate coding of speech[C]//Data Compression Conference (DCC),2011.IEEE,2011:103-112.
  • 9ITU-T.Federal information processing standards publication (MELP),specifications for the analog to digital conversion of voice by 2400 bit/second mixed excitation linear prediction,Draft June 1997.
  • 10Yanxia L,Jiawei Y,Ye L.One effective method to design LBG initial codebook[C]//Intelligent Computation Technology and Automation (ICICTA),2011 International Conference on.IEEE,2011,2:628-631.

引证文献1

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部