期刊文献+

结合主分量分析与DOA估计的语音盲分离

Blind speech separation combining with principal component analysis and DOA estimation
在线阅读 下载PDF
导出
摘要 在欠定语音盲分离中,W-分离正交性假设通常使问题简化,但这种简化是以降低分离性能为代价。在语音信号满足近似W-分离正交性的假设下,提出利用主分量分析(PCA)检测只有一个源信号存在的时频点,检测出的时频点均满足W-分离正交性,因此提高了混合矩阵的估计精度。通过从混合矩阵中估计源信号的波达方向,可以较好地解决置换模糊问题。仿真结果表明,提出的方法与经典的DUET方法相比具有更优的性能,平均信干比提高了2.77dB。 The assumption of W-disjoint orthogonality (W-DO) can simplify the problem of blind separation for under-determined mixed speech signals at the cost of decreasing the separation performance. A method based on principal component analysis (PCA) is proposed to detect the timefrequency cells where only one source exists, under the assumption of approximate W-DO of speech signal. All the detected time frequency cells satisfy the W-DO, so that the estimation precision of the mixing matrix is improved. The direction of arrival (DOA) of sources is estimated from the mixing matrix and is exploited to solve the permutation ambiguity problem. Simulation results demonstrate that the proposed method outperforms the typical DUET method, the average signal to interference (SIR) is improved by 2.77dB.
出处 《声学技术》 CSCD 2009年第5期624-628,共5页 Technical Acoustics
基金 国家自然科学基金(60672157 60672158) 重庆市自然科学基金(CSTC2005BB4219)
关键词 语音盲分离 主分量分析 波达方向 混合矩阵估计 blind speech separation PCA DOA mixing matrix estimation
  • 相关文献

参考文献9

  • 1Hyvarinen A, Karhunen J, Oja E. Independent component analy sis[M]. New York ,John Wiley & Sons, 2001.
  • 2Makino S, Lee T W, Sawada H. Blind speech separation[M]. Springer Netherlands, 2007: 47-78.
  • 3苗浩,李晓东,田静.一种用于语音增强的频域盲信号分离算法[J].声学技术,2007,26(3):431-434. 被引量:4
  • 4Yilmaz O, Rickard S. Blind separation of speech mixtures via time-frequency masking[J]. Signal Processing, IEEE Transactions on, 2004, 52(7): 1830-1847.
  • 5Rickard S, Yilmaz Z. On the approximate W-disjoint orthogonality of speech[A]. ICASSP '02[C]. 2002, 1: 529-532.
  • 6孙即祥.现代模式识别[M].长沙:国防科技大学出版社,2001.
  • 7徐舜,陈绍荣,刘郁林.基于非线性时频掩蔽的语音盲分离方法[J].声学学报,2007,32(4):375-381. 被引量:9
  • 8Bofill P, Zibulevsky M. Underdetermined blind source separation using sparse representations[J]. Signal Processing, 2001, 81(11): 2353-2362.
  • 9Theis F J, Langa E W, Puntonet C G. A geometric algorithm for overcomplete linear ICA[J]. Neurocomputing, 2004, 56: 381-398.

二级参考文献19

  • 1谢志文,尹俊勋,饶丹.空间掩蔽效应的实验研究[J].声学学报,2006,31(4):363-369. 被引量:10
  • 2Siow Yong Low,and Sven Nordholm.Convolutive Blind Source Separation With Post-Processing[J].IEEE Trans.Speech,Audio Processing,2004,12(5):539-547.
  • 3S.Ikeda,N.Murata.A method of ICA in time-frequency domain[c],in Int.Conf.ICA and Signal Separation,Aussois,France.1999,1:365-371.
  • 4F.Asano,S.Ikeda,M.Ogawa,H.Asoh,N.Kitawaki.Combined Approach of Array Processing and Independence Component Analysis for Blind Separation of Acoustic Signals[J].IEEE trans.Speech,Audio Processing,2003,11(5):204-215.
  • 5A.J.Bell,T.J.Sejnowski.An information-maximization approach to blind separation and blind deconvolution[J].Neural Computation,1995,7(6):1129-1159.
  • 6Hiroshi Sawada,Ryo Mukai.A Robust Approach To The Permutation Problem Of Frequency-Domain Blind Source Separation[J].IEEE trans.Speech,Audio Processing,2004,12(9):530-538.
  • 7Kostas Kokkinakis,Asoke K.Nandi.Multichannel Blind Deconvolution for Source Separation in Convolutive Mixtures of Speec[J].IEEE trans.Speech,Audio Processing,2006,14(1):200-212.
  • 8Sawada H,Mukai R et al.Real-time blind extraction of dominant target sources from many background interference sources.In:Proc.IWAENC2005,2005:73-76.
  • 9Araki S,Mukai R et al.The fundamental limitation of frequency domain blind source separation for convolutive mixtures of speech.IEEE Trans.Speech Audio Processing,2003; 11(2):109-116.
  • 10Haykin S.Unsupervised adaptive filtering,volume 1:blind source separation.John Wiley & Sons Canada,Ltd.2000.

共引文献35

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部