期刊文献+

基于模糊多类支持向量机的语音质量客观评价 被引量:3

Objective Speech Quality Evaluation Based on Fuzzy Multi-Class Support Vector Machine
在线阅读 下载PDF
导出
摘要 提出了采用模糊有向图支持向量机(FDGSVM)对基于输出的多语言语音样本进行语音质量评价的一种新方法.将多个可进行两类分类的模糊支持向量机组织成具有惟一根节点的有向图结构,得到多类分类器FDGSVM;提取待测语音信号的Mel倒谱系数并将其作为特征向量,再通过FDGSVM将特征向量映射到非线性划分的主观平均意见评分(MOS)区间,映射值即为输出的语音质量的客观评价结果.实验结果表明,所提算法获得的评测结果与主观MOS评价之间的相关度,在闭集测试时可达0.91,在开集测试时可达0.88. A novel approach to output-based speech quality evaluation using fuzzy directed graph support vector machine (FDGSVM) is proposed. Binary classifiers of fuzzy support vector machine are organized into a structure of directed graph with a unique root to form the multi-classifier FDGSVM. Mel cepstrum coefficients are extracted from multi-lingual speech samples and regarded as eigenvectors, and mapped to different non-linear partition of subjective mean opinion score (MOS) through FDGSVM. The mapped scores are the out-based speech quality evaluation results. Experimental results show that the correlation between the result of the proposed approach and MOS is up to 0. 91 in close-set test and 0.88 in open set test.
出处 《西安交通大学学报》 EI CAS CSCD 北大核心 2006年第2期199-202,共4页 Journal of Xi'an Jiaotong University
基金 国家高技术研究发展计划资助项目(2003AA148010)
关键词 模糊有向图 支持向量机 MEL倒谱系数 语音质量 客观评价 fuzzy directed graph support vector machine Mel-cepstrum coefficient speech quality objective evaluation
  • 相关文献

参考文献9

  • 1ITU-T Rec.G.107-1998,The E-model:a computational model for use in transmission planning [S].
  • 2Takahashi A.Opinion model for estimating conversational quality of VOIP [A].International Conference on Acoustics,Speech,and Signal Processing [C].Montreal,Canada:IEEE,2004.1072-1075.
  • 3Kim D S,Tarraf A.Perceptual model for non-intrusive speech quality assessment [A].International Conference on Acoustics,Speech,and Signal Processing [C].Montreal,Canada:IEEE,2004.1060-1063.
  • 4Davies S B,Mermelstein P.Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences [J].IEEE Trans Acoustics,Speech,and Signal Processing,1980,28 (4):357-366.
  • 5Vapnik V N.Statistical learning theory [M].New York:Wiley,1998.
  • 6Lin Chunfu,Wang Shengde.Fuzzy support vector machines [J].IEEE Trans Neural Networks,2002,13(2):464-471.
  • 7Hsu Chihwei,Lin Chihjen.A comparison of methods for multiclass support vector machines [J].IEEE Trans on Neural Networks,2002,13(2):415-425.
  • 8Mozer M,Jordan M,Petsche T.Advances in neural information processing systems [M].Cambridge,USA:MIT Press,2000.547-553.
  • 9ITU-T P 23-1998,ITU-T coded-speech database [S].

同被引文献18

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部