期刊文献+

语谱图二次傅里叶变换特定人二字汉语词汇识别 被引量:2

Recognition of specific two-word Chinese vocabulary by applying Fourier transform twice to the spectrogram
在线阅读 下载PDF
导出
摘要 以语音信号的语谱图作为处理对象,提出了基于语谱图二次傅里叶变换对特定人二字词汇识别的方法.首先对语谱图二次傅里叶变换频域图的图像意义以及相应的语音特性表征进行了详细剖析;然后对语谱图频域图像进行二进宽度行投影,将投影值作为语音识别特征值,以支持向量机为分类器,进行特定人二字词汇语音整体识别.采用1 000个语音样本进行了仿真实验.结果表明,该方法正确识别率可达到92.4%,为汉语词汇整体识别提供了新的思路. This paper illustrates a method to recognize specific two-word Chinese vocabulary by analyzing speech signals using a spectrogram after Fourier transform is applied to it twice. First, we analyze the spectrogram in the frequency domain and its corresponding voice characteristics in detail after applying Fourier transform twice. Then, binary width zoning projection is carried out in the frequency domain. The projection value is treated as the characteristic value of semantic recognition feature and the support vector machine (SVM)is considered as the classifier for recognizing the semantics of specific two-word Chinese vocabulary. A total of 1000 voice samples were used in the simulation. The results using this method show a remarkable recognition rate of 92.4 %. The proposed method provides a new way for vocabulary recognition.
出处 《东北师大学报(自然科学版)》 CAS CSCD 北大核心 2017年第2期95-100,共6页 Journal of Northeast Normal University(Natural Science Edition)
基金 国家自然科学基金资助项目(61471111)
关键词 语谱图 二次傅里叶变换 支持向量机 二进宽度行投影 spectrogram fourier transform twice support vector machine (SVM) binary widthzoning projection
  • 相关文献

参考文献5

二级参考文献37

  • 1陈振标,徐波.基于子带能量特征的最优化语音端点检测算法研究[J].声学学报,2005,30(2):171-176. 被引量:22
  • 2Foote J. An Overview of Audio Information Retrieval [D]. Singapore: National University of Singapore, 1997.
  • 3LIANG Wei, ZHANG Shuwu, XU Bo. A histogram algorithm for fast audio retrieval [C]// Proceedings of the 6t International Conference on Music Information Retrieval. London, UK, 2005:586-589.
  • 4LU Lie, ZHANG Hongjiang, JIANG Hao. Content analysis for audio classification and segmentation[J]. IEEE Transaction on Speech and Auido Processing, 2002, 10(7) : 504 - 516.
  • 5Kashino K, Kurozumi T, Murase H. A quick search method for audio and video signals based on histogram pruning [J]. IEEE Transaction on Multimedia, 2003, 5(3) : 348 - 357.
  • 6Viola P, Jones M. Rapid object detection using a boosted cascade of simple features [C]// Proceedings of Computer Vision and Pattern Recognition. Hawaii, USA, 2001:511 -518.
  • 7Haitsma J, Kalker T. A highly robust audio fingerprinting system [C]// Proceedings of International Symposium on Music Information Retrieval. Paris, France, 2002 : 107-115.
  • 8Freund Y, Schapire R. Experiments with a new boosting algorithm [C]// Proceedings of International Conference on Machine Learning. Bari, Italy, 1996:148-156.
  • 9Gionis A, Indyk P, Motwani R. Similarity search in high dimensions via hashing [C]// Proceedings of International Conference on Very Large Databases. Edinburgh, Scotland, 1999:518-529.
  • 10Fischler M, Bolles R. Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography[J]. Communications of the ACM, 1981, 24(6) : 381 - 395.

共引文献41

同被引文献14

引证文献2

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部