期刊文献+

改进加权线性预测倒谱的复合参数说话人识别

SPEAKER RECOGNITION USING COMPOSITE PARAMETERS WITH WEIGHTING FUNCTION IMPROVED LINEAR PREDICTION CEPSTRUM
在线阅读 下载PDF
导出
摘要 说话人识别和确认是信号处理中研究的热点之一,但有关文献表明识别效率并不是很高,而且训练和识别的语音要求都比较长,距离实际应用还有一定差距。分析了说话人识别中有关参数的选取对识别结果的影响,采用线性预测倒谱和基音参数共同作为识别参数,并采用矢量量化,改进了线性预测倒谱距离的加权函数,提供了与文本无关的说话人识别系统。最后给出了实验结果和有关分析,在低噪声时识别正确率可达99%以上,在高噪声时也能达到98%以上的正确率。 Speaker recognition and identification is one of the research hot topics in signal processing.But the related documents indicate that its recognising efficiency has limitations,and long speech is required for training and recognition,there is still certain distance apart from the practical application.In this article we analyse the influence of selecting relevant parameters in speaker recognition on the outcome of recognition,and provide a speaker recognition system independent to the text which uses linear prediction(LP) cepstrum and pitch parameter as the joint recognition parameters,and quantises vectors by the vector quantization(VQ),improves the weighting function of LP cepstrum distance.The experimental results and relevant analysis are given in the last part of the paper.In low noise environment the recognition correct rate approaches 99% or higher,and that is also higher than 98% in condition of high-noise.
出处 《计算机应用与软件》 CSCD 2011年第2期242-245,共4页 Computer Applications and Software
基金 大学生创新实验计划项目(091048936)
关键词 说话人识别 线性预测倒谱 基音 矢量量化 Speaker recognition LP cepstrum Pitch Vector quantization
  • 相关文献

参考文献9

  • 1Phu Chien Nguyen,Masato Akagi,Tu Bao Ho.A Promising Approach to VQ_Based Spesker Recognition[C]//2003 IEEE International Conference on Acoustics,Speech,and Signal Processing,Procedings Volume Ⅰ of Ⅵ Speech Processing Ⅰ.2003:184-187.
  • 2M.A.EL-Gamal,M.F.ABU El-Yazeed,EL M M H.Ayadi.Enhancing the Performance of Ganssian Mixture Model-Based Text Independent Speaker Recognition[J].International Journal of Speech Technology,2005,8:93-103.
  • 3Limin Xu,Zhenmin Tang.Speaker Identification Using Multi-Step Clustering Algorithm with Transformation-Based GMM[J].Automatic Control and Computer Science,2007,41:224-231.
  • 4Marcos Faundez-Zamuy.A Combination Between VQ Covariance Matrices for Speaker Recognition[C]//The 2001 IEEE International Conference on Acoustics,Speech,and Signal Processing(ICASSP2001),vol.I:Speech Processing 1,Utah,USA,2001:453-456.
  • 5Andrens Stolcke,Sachin S Kajarekar,Luciana Ferrer.Speaker Recognition with Session Variability Normalization Based on MLLR Adaptation Transforms[J].IEEE Transaction on Audio,Speech and Language Processing,2007,15(7):1987-1998.
  • 6Robert M Nickel.Saehin P Oswal,Ananth N Iyer.Robust Speaker Verification with Principal Pitch Components[J].International Journal of Speech Technology,2005,8(4):323-339.
  • 7亢明,汪成亮,陈娟娟.基于动态阈值失量量化的说话人识别[J].计算机应用,2009,29(1):146-148. 被引量:4
  • 8陈明义,周昆湘,余伶俐.一种基于VQ的说话人确认的阈值设计方法[J].计算机工程与应用,2007,43(13):117-119. 被引量:1
  • 9俞一彪,袁冬梅,薛峰.一种适于说话人识别的非线性频率尺度变换[J].声学学报,2008,33(5):450-455. 被引量:14

二级参考文献18

共引文献16

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部