基于压缩感知的线谱对参数降维量化算法被引量：1

Dimension Reduction Quantization of LSP Parameters Based on Compressed Sensing

下载PDF

导出

摘要为实现高质量的极低速语音编码,提出一种基于压缩感知理论的线谱对(LSP)参数降维量化算法。编码端利用压缩感知理论对超帧LSP高维矢量进行降维处理,将原始LSP参数投影到低维空间,得到低维测量值,然后采用分裂矢量量化算法对测量值进行量化;解码端以量化后的测量值为已知条件,利用正交匹配追踪算法重构出原始LSP高维矢量。实验结果表明,本算法相对低速语音编码中的矩阵量化方案,平均谱失真降低了0.23dB,相对基于DCT变换的降维量化方案,平均谱失真降低了0.13dB。这种先降维再量化的思想可以大幅减少编码所需的比特数及码本存储复杂度,有效降低语音编码速率,并且合成语音可懂度、自然度较高,音质虽有所失真,但基本上感觉不到明显的听觉质量下降。 To achieve good reconstruction speech quality in very low bit rate speech codecs,an efficient dimension reduction quantization scheme for linear spectrum pair（LSP） parameters was proposed based on compressed sensing.In the encoder,the LSP parameters extracted from consecutive speech frames are shaped into a high dimensional vector,and then the dimension of the vector is reduced by CS to produce a measurement vector,the measurements are quantized using the split vector quantizer.In the decoder,according to the quantized measurements,the original LSP vector is reconstructed by the orthogonal matching pursuit method.Experimental results show that the scheme is more efficient than that of conventional matrix quantization scheme and DCT based dimension reduction quantization scheme,the average spectral distortion reduction of up to 0.23dB and 0.13dB is achieved respectively.Informal subjective listening test shows that the reconstructed speech has moderate intelligibility and naturalness,it is observed that the degradation in speech quality is tolerable and with low codebook storage requirements.

作者肖强陈亮朱涛黄建军

机构地区解放军理工大学通信工程学院解放军理工大学指挥自动化学院

出处《信号处理》 CSCD 北大核心 2011年第4期563-568,共6页 Journal of Signal Processing

基金国家自然科学基金资助(No.61072042 No.60572095)

关键词低速语音编码线谱对压缩感知矢量量化 Low bit rate speech coding Line spectrum pair Compressed sensing Vector quantization

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献17

1Gwenael G, Francois C, Bertrand R, et al. New NATO STANAG narrow band voice coder at 600bits/s [ C ]. In Proc. IEEE Int. Conf. Acousic Speech Signal Processing,Toulouse,France,2006:689-692.
2Paliwal K K, Atal B S. Efficient vector quantization of LPC parameters at 24bits/frame [ J ]. IEEE Trans on speeeh and audio processing, 1993,1 (1) :3-14.
3Leblanc W P, Bhattacharya B, Mahmoud S A, et al. Efficient search and design procedures for robust multi-stage VQ of LPC parameters for 4 Kb/s speech coding [ J ].IEEE Trans on speech and audio processing, 1993,1 (4) : 373 -385.
4ZOU Xia, ZHANG Xiong-wei. Efficient coding of LSF parameters using multi-mode predictive multistage matrix quantization[ C ]. IEEE lnteruational Conference on Signal Processing, Beijing, China,2008:542-545.
5Subasingha S, Murthi M N, Andersen S V. On gram kalman predictive coding of lsfs for packet loss [ C ]. In Proc. IEEE Int. Conf. Acousic Speech Signal Processing, Taibei, China, 2009 : 4105 - 4108.
6ozaydin S, Baykal B. Matrix quantization and mixed excitation based linear predictive speech coding at very low bit rates[ J ]. Speech Communication ,2003,41:381-392.
7Xydeas C S, Papanastasiou C. Split matrix quantization of LPC parameters [ J ]. IEEE Trans on speech and audio processing, 1999,7 ( 2 ) : 113-125.
8ZOU Xia, ZHANG Xiong-wei, ZHANG Ya-fei. A 300bps speech coding algorithm based on multi-mode matrix quantization[ CI. IEEE International Conference on Wireless Communictions and Signal Processing, Nanjing, China ,2009 : 1-4.
9Donoho D L. Compressed sensing[J]. IEEE Trans on information theory, 2006,52(4) : 1289-1306.
10Sreenivas T V, Kleijn W B. Compressive sensing for sparsely excited speech signals [ C ]. In Proc. IEEE Int. Conf. Acousic Speech Signal Processing, Taibei, China, 2009:4125-4128.

二级参考文献124

1赵铭,崔慧娟,唐昆,杜文.谱包络参数的平滑算法[J].清华大学学报（自然科学版）,2005,45(4):448-451. 被引量：5
2张春梅,尹忠科,肖明霞.基于冗余字典的信号超完备表示与稀疏分解[J].科学通报,2006,51(6):628-633. 被引量：70
3丛键,张知易.一种600bps极低速率语音编码算法[J].电子与信息学报,2007,29(2):429-433. 被引量：7
4Paliwal K K, Kleijn K B. Quantization of LPC Parameters[J]. IEEE Trans. Speech Audio Processing, 1995:433-466.
5赵铭.超低速率语音编码技术与算法研究[D].北京:清华大学,2003.
6Ozaydin S, Baykal B. Multi stage matrix quantization for very low bit rate speech coding[J]. IEEE Workshop on Signal Processing, 2001:372-375.
7Suhramaniam h D, Rao B D. PDF optimized parametric vector quantization of speech line spectral frequencies [J]. IEEE Trans Speech Audio Processing, 2003, 11(2):130-142.
8R Baraniuk.A lecture on compressive sensing[J].IEEE Signal Processing Magazine,2007,24(4):118-121.
9Guangming Shi,Jie Lin,Xuyang Chen,Fei Qi,Danhua Liu and Li Zhang.UWB echo signal detection with ultra low rate sampling based on compressed sensing[J].IEEE Trans.On Circuits and Systems-Ⅱ:Express Briefs,2008,55(4):379-383.
10Cand,S E J.Ridgelets:theory and applications[I)].Stanford.Stanford University.1998.

共引文献775

1颜上取,汤昊,刘备,张含,钱盛友.基于压缩感知的HIFU回波信号降噪研究[J].电子测量与仪器学报,2020,32(11):19-25. 被引量：12
2涂云轩,冯玉田,应凯杰,高萌.基于高倍特征残差网络的压缩感知图像重构[J].电子测量技术,2020(7):113-118.
3刘树鹏,张君宇,许熙巍,董菁,李晓东.京津冀地区历史文化空间格局及熵变分析[J].干旱区资源与环境,2023,37(7):84-93. 被引量：3
4计振兴,孔繁锵.基于谱间线性滤波的高光谱图像压缩感知[J].光子学报,2012,41(1):82-86. 被引量：12
5樊甫华,尹学忠.IR-UWB信号随机性压缩采样和重建方法[J].数据采集与处理,2012,27(S2):291-297.
6陈思思,李跃华,陈昆.基于压缩感知的毫米波一维距离像算法研究[J].微波学报,2012,28(S1):257-259. 被引量：1
7孙子璇,易荣华.基于小波变换的正交匹配追踪算法及其应用[J].计算机科学,2012,39(S3):273-275. 被引量：5
8孔勐,陈明生,张忠祥,张量,吴先良.基于压缩感知结合HFSS软件求解目标单站RCS问题[J].微波学报,2015,31(3):7-10. 被引量：5
9岑翼刚,陈晓方,岑丽辉,陈世明.基于单层小波变换的压缩感知图像处理[J].通信学报,2010,31(S1):52-55. 被引量：51
10范晓维,刘哲,刘灿.分块可压缩传感的图像重构模型[J].计算机工程与应用,2009,45(29):153-155. 被引量：7

同被引文献13

1Tokuda K,Masuko T,Hiroi J,et al.A very low bit rate speech coder using HMM-based speech recognitiorn/synthesis techniques[C]//Acoustics,Speech and Signal Processing,1998.Proceedings of the 1998 IEEE International Conference on.IEEE,1998,2:609-612.
2Crosmer J,Bamwell Ⅲ T.A low bit rate segment vocoder based on line spectrum pairs[C]//Acoustics,Speech,and Signal Processing,IEEE International Conference on ICASSP'85.IEEE,1985,10:240-243.
3Wang T,Koishida K,Cuperman V,et al.A 1200 bps speech coder based on MELP[C]//Acoustics,Speech,and Signal Processing,2000.ICASSP'00.Proceedings.2000 IEEE International Conference on.IEEE,2000,3:1375-1378.
4Guilmin G,Capman F,Ravera B,et al.New NATO STANAG narrow band voice coder at 600 bits/s[C]//Acoustics,Speech and Signal Processing,2006.ICASSP 2006Proceedings.2006 IEEE International Conference on.IEEE,2006.
5Zou X,Wen C,Zhang X,et al.An improved 600bps speech coding based on joint quantization of pitch and gain shape[C]//Communication Technology (ICCT),2010 12th IEEE International Conference on.IEEE,2010:1303-1306.
6Wu C,Jiang H,Li B.An improved MELP speech coder[C]//Information Technology and Computer Science,2009.ITCS 2009.International Conference on.IEEE,2009,2:130-133.
7Unver E,Villette S,Kondoz A.Joint quantisation strategies for low bit-rate sinusoidal coding[J].Signal Processing,IET,2010,4(5):548-559.
8Boucheron L E,Leon P L D,Sandoval S.Hybrid scalar/ vector quantization of mel-frequency cepstral coefficients for low bit-rate coding of speech[C]//Data Compression Conference (DCC),2011.IEEE,2011:103-112.
9ITU-T.Federal information processing standards publication (MELP),specifications for the analog to digital conversion of voice by 2400 bit/second mixed excitation linear prediction,Draft June 1997.
10Yanxia L,Jiawei Y,Ye L.One effective method to design LBG initial codebook[C]//Intelligent Computation Technology and Automation (ICICTA),2011 International Conference on.IEEE,2011,2:628-631.

引证文献1

1刘斌,陶建华,莫福源.面向窄带通信的极低速率语音编码算法研究[J].信号处理,2013,29(9):1134-1141. 被引量：2

二级引证文献2

1杨超,贺一君,任建存,宋家康,刘云飞.码本均衡矢量编码算法[J].现代电子技术,2016,39(13):38-40. 被引量：8
2李凌云,陈奕钊,王国法,蒋剑伟,周品臣,谢臣.基于混合语音压缩编码技术的综合通信业务系统设计[J].广东通信技术,2024,44(8):63-69.

1肖强,陈亮,朱涛,黄建军.基于准KLT域的线谱对参数压缩感知量化研究[J].电子与信息学报,2011,33(9):2062-2067. 被引量：2
2施宏,陆洲,吴海燕.一种用于UHF频段卫星通信的多业务、多信令接口单元的实现[J].无线电通信技术,1999,25(2):54-56.
3王俊改.语音的多带激励编码算法简介[J].无线电通信技术,1995,21(3):65-72. 被引量：1
4王卫锋,张秀彬,王世新,刘旭涛,汤亮.基于线谱区域量化技术的低速语音编码[J].微型电脑应用,2001,17(4):50-51.
5于增贵.低速语音编码的最新成果[J].通信技术,1996,29(4):55-64. 被引量：1
6李蓉.低速语音编码的通信终端实现以及传输方案[J].移动通信,2004,0(S2):129-131.
7蒋海霞,成立新,陈显治.语音增强技术在低速语音编码中的应用[J].解放军理工大学学报（自然科学版）,2000,1(2):33-37. 被引量：1
8肖强,陈亮.改进的能量参数预测多级矢量量化算法[J].军事通信技术,2010,31(2):11-14.
9陈亮,张雄伟.一种600b/s甚低速率声码器的研究[J].信号处理,2002,18(5):403-409. 被引量：2
10刘永庆,沈兰荪.码激励线性预测及其在中低速语音编码中的应用[J].北京工业大学学报,1996,22(4):103-109.

信号处理

2011年第4期

浏览历史

内容加载中请稍等...

基于压缩感知的线谱对参数降维量化算法被引量：1

参考文献17

二级参考文献124

共引文献775

同被引文献13

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于压缩感知的线谱对参数降维量化算法 被引量：1

参考文献17

二级参考文献124

共引文献775

同被引文献13

引证文献1

二级引证文献2

相关作者

相关机构

相关主题

浏览历史

基于压缩感知的线谱对参数降维量化算法被引量：1