基于Gaussian混合模型的LSF参数量化方法被引量：2

Quantization of LSF parameters using a Gaussian mixture model

导出

摘要为了高效率量化线谱频率(linear spectrumfrequency,LSF)参数,提出了基于G auss ian混合模型(G auss ian m ix ture m ode l,GMM)的LSF量化算法。假设LSF矢量属于GMM中的某一个G auss ian分布,用G auss ian分布随机矢量的量化方法对LSF矢量进行了量化。利用准确的G auss ian分布变量量化误差,得到了G auss ian分布矢量的比特分配方法。应用G auss ian分布随机变量的非均匀量化方法量化每一维LSF参数。最后给出了分裂矢量量化、基于概率密度函数(probab ility dens ityfunction,PDF)量化方法和该算法的性能对比。该无记忆LSF量化算法在21 b/帧可以达到透明量化,比传统Sp litVQ节省3 b。 An efficient linear spectrum frequency （LSF） parameter quantization scheme was developed based on the Gaussian mixture model （GMM）. The algorithm assumes that the LSF parameter has a GMM Gaussian distribution so that the LSF vector can be quantized using a random Gaussian distribution vector quantization. The bits of the LSF parameters are allocated according to the precise quantization error of the Gaussian distribution. Each dimension of the LSF parameter is quantized using a non-uniform scalar quantization of the Gaussian distribution variable. Comparison of the method with the Split-VQ and PDF VQ methods shows that the LSF parameters could be transparent quantized at 21 b/frame by the memoryless quantizer, which is 3b less than the conventional Split-VQ method.

作者赵永刚唐昆崔慧娟

机构地区清华大学电子工程系微波与数字通信技术国家重点实验室

出处《清华大学学报（自然科学版）》 EI CAS CSCD 北大核心 2006年第10期1727-1730,共4页 Journal of Tsinghua University(Science and Technology)

基金国家自然科学基金资助项目(60272020)

关键词语音编码矢量量化 Gaussian混合模型线谱频率 speech coding vector quantization Gaussian mixture model （GMM） linear spectrum frequency （LSF）

分类号 TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献9

1Gray R M,Neuhoff D L.Quantization[J].IEEE Trans Inform Theory,1998,44:2325-2383.
2Gersho A,Gray R M.Vector Quantization and Signal Compression[M].New York:Wiley,1994.
3Subramaniam A D,Rao B D.PDF optimized parametric vector quantization of speech line spectral frequencies[J].IEEE Trans Speech Audio Processing,2003,11(2):130-142.
4Reynolds D A,Rose R C.Robust text-independent speaker identification using Gaussian mixture speaker models[J].IEEE Trans Speech Audio Processing,1995,3(1):72-83.
5Hedelin P,Skoglund J.Vector quantization based on Gaussian mixture models[J].IEEE Transactions on Speech and Audio Processing,2000,8:385-401.
6Dempster A,Laird N,Rubin D.Maximum likelihood from incomplete data via the EM algorithm[J].J R Statist Soc,1977,39:1-38.
7Huang J,Schultheiss P M.Block quantization of correlated Gaussian random variables[J].IEEE Trans Commun,1963,11(3):289296.
8Max J.Quantizing for minimum distortion[J].IRE Trans On Information Theory,1960,6:7-12.
9Paliwal K K,Atal B S.Efficient vector quantization of LPC parameters at 24 bits/frame[J].IEEE Transactions on Speech and Audio Processing,1993,1(1):3-14.

同被引文献20

1ETSI EN 301 704 V7.2.1 Adaptive Multi-Rate(AMR)Speech Transcoding[S]. 2000.
2ITU-T G.729:Coding of Speech at 8kbit/s Using Conjugate Structure Algebraic Code Excited Linear Prediction(CS-ACELP)[S]. 1996.
3ITU-T G.729A: Educed Complexity 8kbit/s CS-ACELP Speech Codec[S]. 1996.
4OTA Y, SUZUKI M, TSUCHINAGA Y, et al. Speech coding translation for IP and 3G mobile integrated network[A]. IEEE International Conference on Communications[C]. New York: IEEE Press, 2002. 114-118.
5GHENANIA M, LAMBL1N C. Low-cost smart transcoding algorithm between ITU-T G.729 (8kbit/s) and 3GPPNB-AMR (12.2kbit/s)[A]. European Signal Processing Conference[C]. Vienna: EUSIPCO Press, 2004, (3): 1681-1684.
6吴金池.语音辩识系统之研究[D].台湾国立中央大学,2003.9-17.
7KAIN A B. High Resolution Voice Transformation[D]. Oregon Health and Science University, 2001.36-54.
8康永国,双志伟,陶建华等.高斯混合模型和码本映射相结合的语音转换算法[A].第八届全国人机语音通讯学术会议[c].2005.293-297.
9ITU-T P.800.1 :Mean Opinion Score(MOS) Terminology[S]. 2003.
10ITU-T P.862.1: Mapping Function for Transforming P.862 Raw Result Scores to MOS-LQO[S]. 2003.

引证文献2

1赵永刚,唐昆,崔慧娟.预测自适应Gauss混合模型线谱频率的量化[J].清华大学学报（自然科学版）,2007,47(4):530-533.
2刘张宇,鲍长春,邱建伟,徐昊.基于GMM的AMR-NB与G.729A之间的LSP参数转码方法[J].通信学报,2010,31(2):44-50. 被引量：1

二级引证文献1

1李凌云,李肖克,陈奕钊,王国法,王辉.基于IP包拆分重组技术的混合语音压缩编码算法研究[J].电子技术应用,2025,51(2):70-74.

1李凤莲,张雪英,王子中,李红春.码书分类重排矢量量化方法及其应用[J].清华大学学报（自然科学版）,2013,53(6):893-897. 被引量：3
2鲍长春,卓力,王永会.LSF参数的模拟退火法连接分裂矢量量化[J].电子学报,2001,29(1):127-129. 被引量：1
3张晓洲,黄德智,蔡莲红.考虑帧间动态特征的音色变换算法[J].清华大学学报（自然科学版）,2006,46(10):1767-1770. 被引量：1
4李燕诚,崔慧娟,唐昆.基于似然比测试的语音激活检测算法[J].计算机工程,2009,35(10):214-216. 被引量：5
5舒若,李世宝,潘辛.SVAC音频编码的特征参数量化器改进[J].信息技术,2014,38(6):50-54.
6王军,张连海,屈丹.一种针对ISF参数的量化算法[J].通信技术,2009,42(10):204-206.
7杨弢,罗春,杨军.基于矢量随机生成和断言的LCD控制器的验证[J].电子工程师,2006,32(3):4-6. 被引量：1
8李海婷,鲍长春.宽带ISF参数的非等系数帧间预测分裂矢量量化方法[J].电子学报,2008,36(6):1214-1217. 被引量：1
9刘付娥,薄拾,葛宁,周祖成.低阶链路容量调整机制协议仿真研究[J].光通信研究,2006(5):13-15.
10崔慧娟,郑海生,江灏,王田,唐昆.实时低速率语音压缩编码的算法研究[J].清华大学学报（自然科学版）,1997,37(10):21-24.

清华大学学报（自然科学版）

2006年第10期

浏览历史

内容加载中请稍等...

基于Gaussian混合模型的LSF参数量化方法被引量：2

参考文献9

同被引文献20

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于Gaussian混合模型的LSF参数量化方法 被引量：2

参考文献9

同被引文献20

引证文献2

二级引证文献1

相关作者

相关机构

相关主题

浏览历史

基于Gaussian混合模型的LSF参数量化方法被引量：2