Probability-Weighted Average Algorithm for Mel-Frequency Filter-Bank Vector Reconstruction
(Chinese title: 基于概率加权平均的Mel子带特征重建算法)

Cited by: 1


Abstract: This paper proposes a probability-weighted average (PWA) algorithm for reconstructing Mel-frequency filter-bank (Mel sub-band) feature vectors. The algorithm takes the probability-weighted average of the K best reconstruction results as the estimate of the speech feature components masked by additive noise. Experimental results show that the PWA algorithm lowers the reconstruction error, reduces abrupt changes between frames, and strengthens the continuity between neighboring Mel filter-bank vectors, and thereby significantly improves the robustness of automatic speech recognition (ASR) systems against additive noise.
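The averaging step named in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how candidate reconstructions or their probabilities are obtained, so the function name `pwa_reconstruct`, its inputs, and the renormalization of the K selected probabilities are all hypothetical choices made for this example.

```python
from typing import List, Sequence


def pwa_reconstruct(candidates: Sequence[Sequence[float]],
                    probs: Sequence[float],
                    k: int) -> List[float]:
    """Probability-weighted average of the k most probable candidates.

    candidates[i] is one candidate reconstruction of the masked components
    (hypothetical input, e.g. one per model state or cluster), and probs[i]
    is its non-negative probability or likelihood score.
    """
    if len(candidates) != len(probs) or not candidates:
        raise ValueError("need one probability per candidate")
    if k < 1:
        raise ValueError("k must be at least 1")
    # Indices of the k candidates with the highest probability.
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    total = sum(probs[i] for i in order)
    if total <= 0.0:
        raise ValueError("selected probabilities must sum to a positive value")
    dim = len(candidates[0])
    # Renormalize the selected probabilities so the weights sum to 1
    # (an assumption of this sketch), then average component by component.
    return [sum(probs[i] / total * candidates[i][d] for i in order)
            for d in range(dim)]


# Three candidate values for two masked sub-band components.
cands = [[1.0, 2.0], [3.0, 4.0], [10.0, 10.0]]
p = [0.75, 0.25, 0.0]
estimate = pwa_reconstruct(cands, p, k=2)
print(estimate)
```

With `k=1` the sketch reduces to picking the single most probable candidate. Averaging over several candidates instead is one plausible reason for the smoother frame-to-frame behavior the abstract reports, since the estimate no longer jumps whenever the top-ranked candidate changes.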

Authors: Luo Yu (罗宇), Du Limin (杜利民)

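Affiliation: Speech Interaction Technology Research Center, Institute of Acoustics, Chinese Academy of Sciences (中科院声学所语音交互技术研究中心)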

Source: Acta Electronica Sinica (《电子学报》), 2004, No. 10, pp. 1738-1741 (4 pages). Indexed in: EI, CAS, CSCD, Peking University Core Journals.

Funding: National 973 Key Basic Research Development Program (No. G1998030505)

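Keywords: sub-band; additive noise; speech features; data reconstruction; speech recognition system; weighted average; masking; probability; reconstruction algorithm; component; missing-feature method; Algorithms; Data processing; Estimation; Probability; Spurious signal noise; Vectors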

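Classification: TN912.34 [Electronics and Telecommunications: Communication and Information Systems]; TN919.3 [Electronics and Telecommunications: Communication and Information Systems]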



