
Sequential Cluster Method and Its Application on Neural Network Based Speech Recognition

Cited by: 2
Abstract: This paper proposes a new network structure, called the sequential cluster network, which performs feature extraction on speech signals and solves the time alignment problem in artificial neural network (ANN) based speech recognition. The sequential cluster network extracts a fixed number of feature vectors from the feature-vector sequence of the input speech signal and feeds them to an ANN classifier for recognition. Compared with other ANN-based speech recognition methods, using this network as a front end shortens the training and recognition time of the back-end ANN classifier and simplifies the classifier's structure while maintaining a high recognition rate. A word recognition system was built on this method and tested on two sets of English words. The experimental results show that the proposed method outperforms the conventional hidden Markov model (HMM) method and some other ANN methods.
Source: Journal of Circuits and Systems (《电路与系统学报》), CSCD, 2000, No. 2, pp. 99-103 (5 pages)
Keywords: neural network, speech recognition, sequential clustering, time warping algorithm
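
The abstract says that a fixed number of feature vectors is extracted from a variable-length feature-vector sequence, but it does not state the clustering criterion or the search procedure. The sketch below is therefore only an illustration under stated assumptions, not the paper's network: it assumes that clusters are contiguous in time, that the criterion is the total within-segment squared error, and that the optimum is found by dynamic programming. The function name `sequential_cluster` and its parameters are hypothetical.

```python
from typing import List, Sequence

Vector = Sequence[float]


def sequential_cluster(frames: List[Vector], k: int) -> List[List[float]]:
    """Split `frames` into k contiguous segments that minimise the total
    within-segment squared error; return the k segment means in time order.

    Assumption: this criterion and this search are illustrative choices,
    not details taken from the paper.
    """
    t = len(frames)
    if not 1 <= k <= t:
        raise ValueError("need 1 <= k <= number of frames")
    dim = len(frames[0])

    # Prefix sums make the squared error of any segment cheap to evaluate.
    prefix = [[0.0] * dim]
    prefix_sq = [0.0]
    for x in frames:
        prefix.append([p + v for p, v in zip(prefix[-1], x)])
        prefix_sq.append(prefix_sq[-1] + sum(v * v for v in x))

    def cost(i: int, j: int) -> float:
        """Squared error of frames[i:j] around their mean (j exclusive)."""
        seg_sum = [b - a for a, b in zip(prefix[i], prefix[j])]
        return (prefix_sq[j] - prefix_sq[i]) - sum(v * v for v in seg_sum) / (j - i)

    inf = float("inf")
    # best[m][j]: minimum error when frames[:j] are split into m segments.
    best = [[inf] * (t + 1) for _ in range(k + 1)]
    back = [[0] * (t + 1) for _ in range(k + 1)]
    best[0][0] = 0.0
    for m in range(1, k + 1):
        for j in range(m, t + 1):
            for i in range(m - 1, j):
                c = best[m - 1][i] + cost(i, j)
                if c < best[m][j]:
                    best[m][j] = c
                    back[m][j] = i

    # Trace the segment boundaries back from the last frame.
    bounds = [t]
    j = t
    for m in range(k, 0, -1):
        j = back[m][j]
        bounds.append(j)
    bounds.reverse()

    # One mean vector per segment: a fixed-size input for a classifier.
    return [
        [(b - a) / (end - start) for a, b in zip(prefix[start], prefix[end])]
        for start, end in zip(bounds, bounds[1:])
    ]
```

Whatever the number of input frames, the result has exactly `k` vectors, which is the property that would let a classifier with a fixed-size input layer handle utterances of different durations. The search as written takes on the order of k·T² segment evaluations for T frames.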

