基于UCR训练集重构的真实语音情感识别

Real emotion recognition for training data restructuring based on utterance concatenation and resampling

下载PDF

导出

摘要真实语音情感识别是使人机交互更加友好的重要手段,但是训练数据稀缺为这一领域带来很多挑战。为了减小这一阻碍,提出了语句串接与重采样(UCR)方法,以便高效利用存在的训练数据。UCR方法是将原始音频样本按照情感类型进行串接,形成一个长的音频流,以一个固定粒度对其随机乱序,然后将其切割,并通过多次重采样操作来增加支持向量机(SVM)的训练样本数。实验基于一个从访谈节目中录制的真实语音情感库。实验结果表明,在统一背景模型-高斯混合模型-支持向量机(UBM—GMM—SVM)识别框架中这种训练集重构的方法错误率降低近33.10%。 Real emotion recognition can be an important means to make human-computer interaction more friendly,yet insufficient training data pose many challenges for this speech-related field.In this paper,a method to help reduce this barrier is proposed by effectively utilizing existing training data—namely,utterance concatenation and resampling（UCR）.It involves concatenation of audio files of the same emotion into a long stream,and then segmenting the stream;randomly permuting chunks of that stream;and even increasing the number of all supervectors for SVM by resampling several times.Experiments are made based on the interview speech emotion database,recorded from actual television interviews.Evaluation results show that the error rate reduction can reach 33.10% by restructuring the training data of UBM-GMM-SVM systems.

作者戴明洋杨大利徐明星

机构地区北京信息科技大学计算机学院普适计算教育部重点实验室清华信息科学与技术国家实验室(筹) 清华大学计算机科学与技术系

出处《北京信息科技大学学报（自然科学版）》 2012年第2期63-67,共5页 Journal of Beijing Information Science and Technology University

基金北京市属市管高等学校人才强教计划资助项目(PHR201007131)

关键词语音情感识别高斯混合模型超向量 UBM-GMM-SVM speech emotion recognition GMM supervector UBM-GMM-SVM

分类号 TP391 [自动化与计算机技术—计算机应用技术]

引文网络
相关文献

参考文献6

1Hausmann R,Chi M.Can a computer interfacesupport self-explaining?[J].The InternationalJournal of Cognitive Technology,2002,7(1):4-14.
2Dai Mingyang,Yang Dali,Xu Mingxing,et al.Research on the training speech selection for realemotion recognition[C] ∥National Conference onMan-Machine Speech Communication.西安:NCMMSC常设机构、西北工业大学,2011:61.
3Man-Wai Mak,Wei Rao.Utterance partitioningwith acoustic vector resampling for GMM-SVMspeaker verification[J].Speech Communication,2011,53(1):119-130.
4Liu Minghui,Dai Beiqian,Xie Yanlu,et al.Improved GMM-UBM/SVM for speakerverification[C] ∥ICASSP.Toulouse[s.n.] ,2006:I-925-I-928.
5Hu H,Xu M X,Wu W.GMM supervector basedSVM with spectral features for speech emotionrecognition[C] ∥Proc IEEE ICASSP.Honolulu:Hawai’i Conversition Center,2007:413-416.
6Reynolds,D A,Quatieri,et al.Speakerverification using adapted Gaussian mixturemodels[J].Digital Signal Process,2000:19-41.

1行业动态[J].中国教育技术装备,2011(11):133-135.
2张瑞华,贾智平,程合友.基于非均匀分簇和最小能耗的无线传感网络路由算法[J].上海交通大学学报,2012,46(11):1774-1778. 被引量：12
3邢玉娟,谭萍.基于稀疏表示分类的说话人识别算法及其在智能考勤系统中的应用[J].工业仪表与自动化装置,2016(2):84-87. 被引量：1
4申雪琴.内部排序算法的性能分析与探讨[J].河西学院学报,2011,27(5):50-54.
5潘美玲,胡昌海,张明明,朱斌.音乐情感自动分类器研究[J].浙江树人大学学报（自然科学版）,2011,11(4):6-10.
6邢玉娟,李明.NAP序列核函数在话者识别中的应用[J].计算机工程,2010,36(8):194-196. 被引量：2
7樊爱京.UCRP—一种能量有效的WSN非均匀分簇路由协议[J].微电子学与计算机,2011,28(2):65-68.
8曹清华,王亮.一种改进的基于小波变换的基音周期提取算法[J].科技资讯,2011,9(9):243-244.
9胡海波,傅鹂,向宏,周元,刘晓艳.基于贝叶斯算法与高斯混和模型的语者确认研究[J].计算机工程与应用,2007,43(29):225-227.
10姜丽红,徐博艺,应骊珠.cnXML电子商务中注册服务机制研究[J].合肥工业大学学报（自然科学版）,2004,27(2):195-198.

北京信息科技大学学报（自然科学版）

2012年第2期

浏览历史

内容加载中请稍等...

基于UCR训练集重构的真实语音情感识别

参考文献6

相关作者

相关机构

相关主题

浏览历史