期刊文献+

Hilbert边际能量谱在语音情感识别中的应用 被引量:5

Application of Hilbert marginal energy spectrum in speech emotion recognition
在线阅读 下载PDF
导出
摘要 情感特征的提取是语音情感识别的重要方面。由于传统信号处理方法的局限,使得提取的传统声学特征特别是频域特征并不准确,不能很好地表征语音的情感特性,因而对情感识别率不高。利用希尔伯特黄变换(HHT)对情感语音进行处理,得到情感语音的希尔伯特边际能量谱;通过对不同情感语音的边际能量谱基于Mel尺度的比较分析,提出了一组新的情感特征:Mel频率边际能量系数(MFEC)、Mel频率子带频谱质心(MSSC)、Mel频率子带频谱平坦度(MSSF);利用支持向量机(SVM)对5种情感语音即悲伤、高兴、厌倦、愤怒和平静进行了识别。实验结果表明,通过该方法提取的新的情感特征具有较好的识别效果。 Emotional feature extraction plays an important role in speech emotion recognition. Due to the limitations of traditional signal processing methods, traditional phonetic features, especially in terms of frequency domain features, are unable to reflect precisely phonetic emotional characteristic, which leads to a low emotion recognition rate. This paper proposes a new method. Firstly, Hilbert-Huang Transform(HHT)is used in order to process speech signal, thus to obtain Hilbert marginal energy spectrum. Then, a comparison and relative analysis based on Mel-scale is carried out, afterwards a new array of emotional features are obtained, which consists of Mel-Frequency Marginal Energy Coefficient(MFEC), Mel-frequency Sub-band Spectral Centroid(MSSC)and Mel-frequency Sub-band Spectral Flatness(MSSF). Finally, the five kinds of speech emotion namely sadness, happiness, boredom, anger and neutral are recognized by using the Support Vector Machine(SVM). The experimental results show that the new emotional features extracted by this method have better recognition performance.
出处 《计算机工程与应用》 CSCD 2014年第7期203-207,共5页 Computer Engineering and Applications
基金 国家自然科学基金(No.61170199)
关键词 Mel尺度 Hilbert边际能量谱 边际能量谱特征 情感识别 Mel-scale Hilbert marginal energy spectrum marginal energy spectrum feature emotion recognition
  • 相关文献

参考文献14

  • 1Murray I R, Arnott J L.Toward the simulation of emotion in synthetic speech:a review of the literature on human vocal emotion[J].Journal of the Acoustical Society of America, 1993,93(2) : 1097-1108.
  • 2Scherer K R.Adding the affective dimension: a new look in speech analysis and synthesis[C]//Proceedings of Inter-national Conference on Spoken Language Processing.Phila- delphia:IEEE, 1996: 1808-1811.
  • 3Ververidis D, Kotropoulos C. Automatic speech classifi- cation to five emotional stales based on gender informa- tion[C]//Proceeding of EUSIPCO 2004 Conference,2003: 341-344.
  • 4Iliou T, Anagnostopoulos C N.Statistical evaluation of speech features for emotion recognition[C]//Proceedings of the 4th International Conference on Digital Telecommu- nications, Colmar, France, July 2009: 121-126.
  • 5Ling H, Margaret L.Time-frequency feature extraction frdm spectrograms and wavelet packets with application to auto- matic stress and emotion classification in speech[C]//Pro- ceedings of the IEEE ICICS.[S.1.]:IEEE,2009.
  • 6Huang N E.The empirical mode decomposition and the Hilbert spectrum for nonlinear and nonstationary time series analysis[J]. Proceedings of the Royal Society A, 1998,454 : 903-995.
  • 7蔡剑华,胡惟文,王先春.基于边际谱的功率谱估计方法[J].核电子学与探测技术,2011,31(9):1062-1066. 被引量:7
  • 8Gao H, Chen S G, Su G C.Emotion classification of man- darin speech based on TEO nonlinear features[C]//Pro- ceedings of Eighth ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, Qingdao, China, July, 2007 : 394-398.
  • 9Berlin data base of emotional speech[EB/OL].[2012-05-01]. http://pascal.kgw.tu-berlin.de/emodb/index- 1280.html.
  • 10Huang R, Ma C.Towards a speaker-independent real-time affect detection system[C]//Proc of the 18th Int Conf on Pattern Recognition(ICPR' 06),2006 : 1204-1207.

二级参考文献18

共引文献11

同被引文献62

引证文献5

二级引证文献29

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部