Phonological Attribute Detection Method Based on Long-term Features (original title: 基于长时性特征的音位属性检测方法)

Abstract: This paper proposes a phonological attribute detection method based on long-term information, implemented with a two-level hierarchy of Time-Delay Neural Networks (TDNNs). The low-level TDNN detects phonological attributes from short-term features. The high-level TDNN takes the low-level detection results as input and fuses information over a longer time span, exploiting the temporal relations in the speech signal. Experimental results show that, compared with detection from short-term features alone, introducing long-term features raises the phonological attribute detection rate by about 3%. When the phonological attribute posterior probabilities are used as observation features in a Hidden Markov Model (HMM)-based phoneme recognition system, the long-term features improve recognition results by about 1.7%.
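The two-level structure described in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' implementation: the layer sizes, the context offsets, the single sigmoid layer per level, and the random (untrained) weights are all assumptions, and the names `splice`, `make_layer`, `sigmoid_layer` and `detect_attributes` are hypothetical. It shows only the data flow: a low-level detector over a short window of acoustic frames, followed by a high-level detector over a wider window of the low-level posteriors.

```python
import math
import random


def splice(frames, offsets):
    """Stack each frame with its neighbours at the given time offsets (edges are clamped)."""
    last = len(frames) - 1
    spliced = []
    for t in range(len(frames)):
        window = []
        for d in offsets:
            window.extend(frames[min(max(t + d, 0), last)])
        spliced.append(window)
    return spliced


def make_layer(n_in, n_out, rng):
    """Random (untrained) weights and zero biases for one affine layer."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def sigmoid_layer(inputs, layer):
    """Apply the same affine + sigmoid transform to every frame (weights shared over time)."""
    weights, biases = layer
    return [
        [1.0 / (1.0 + math.exp(-(sum(w * x for w, x in zip(row, frame)) + b)))
         for row, b in zip(weights, biases)]
        for frame in inputs
    ]


def detect_attributes(features, low, high, low_offsets, high_offsets):
    """Low level: short context on acoustic features. High level: long context on low-level posteriors."""
    low_post = sigmoid_layer(splice(features, low_offsets), low)
    high_post = sigmoid_layer(splice(low_post, high_offsets), high)
    return low_post, high_post


rng = random.Random(0)
N_FRAMES, FEAT_DIM, N_ATTR = 50, 13, 8      # hypothetical sizes
LOW_OFFSETS = [-2, -1, 0, 1, 2]             # hypothetical short-term context
HIGH_OFFSETS = [-10, -5, 0, 5, 10]          # hypothetical long-term context

# Random stand-in for real acoustic features
features = [[rng.gauss(0.0, 1.0) for _ in range(FEAT_DIM)] for _ in range(N_FRAMES)]
low = make_layer(FEAT_DIM * len(LOW_OFFSETS), N_ATTR, rng)
high = make_layer(N_ATTR * len(HIGH_OFFSETS), N_ATTR, rng)
low_post, high_post = detect_attributes(features, low, high, LOW_OFFSETS, HIGH_OFFSETS)
print(len(high_post), len(high_post[0]))    # one score vector per frame
```

With these assumed offsets, each high-level output depends on input frames from t-12 to t+12, whereas each low-level output sees only t-2 to t+2. That widening of the temporal window is the sense in which the second level adds long-term information. In a phoneme recognizer of the kind the abstract mentions, the per-frame `high_post` vectors would take the place of the usual acoustic observation features.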

Authors: Xu Youliang (许友亮), Zhang Lianhai (张连海), Qu Dan (屈丹), Niu Tong (牛铜)

Affiliation: School of Information Engineering, PLA Information Engineering University (解放军信息工程大学信息工程学院)

Source: Computer Engineering (《计算机工程》), indexed in CAS and CSCD, 2012, No. 11, pp. 160-162, 166 (4 pages in total)

Funding: National Natural Science Foundation of China (Grant No. 61175017)

Keywords: phonological attribute; long-term features; hierarchical structure; Artificial Neural Network (ANN); Hidden Markov Model (HMM); phoneme classification

Classification code: TP391.4 [Automation and Computer Technology: Computer Application Technology]

References (10)

[1] Dusan S, Rabiner L R. On Integrating Insights from Human Speech Perception into Automatic Speech Recognition[C]//Proc. of Conference on International Speech Communication Association. Lisbon, Portugal: [s. n.], 2005: 1233-1236.
[2] Chen I F, Wang Hsin-Min. An Investigation of Phonological Feature Systems Used in Detection-based ASR[C]//Proc. of Conference on Chinese Spoken Language Processing. Kunming, China: [s. n.], 2008: 1-4.
[3] King S, Taylor P. Detection of Phonological Features in Continuous Speech Recognition Using Neural Networks[J]. Computer Speech and Language, 2000, 14(4): 333-353.
[4] Rajamanohar M, Fosler L E. An Evaluation of Hierarchical Articulatory Feature Detectors[C]//Proc. of IEEE Workshop on Automatic Speech Recognition and Understanding. San Juan, Puerto Rico: IEEE Press, 2005: 349-354.
[5] Chen I F, Wang Hsin-Min. Articulatory Feature Asynchrony Analysis and Compensation in Detection-based ASR[C]//Proc. of Conference on International Speech Communication Association. Lisbon, Portugal: [s. n.], 2009: 3059-3062.
[6] Chen B, Zhu Qifeng, Morgan N. Learning Long-term Temporal Features in LVCSR Using Neural Networks[C]//Proc. of Conference on Spoken Language Processing. Jeju, Korea: [s. n.], 2005: 1233-1236.
[7] Li Chenchong, Dong Bin, Pan Fuping, Zeng Xingwen, Yan Yonghong. Recognition of Confusable Phonemes in Mandarin Chinese[J]. Computer Engineering, 2009, 35(23): 201-203. (in Chinese)
[8] Ganapathy S, Thomas S, Hermansky H. Comparison of Modulation Features for Phoneme Recognition[C]//Proc. of IEEE International Conference on Acoustics Speech and Signal Processing. Dallas, USA: IEEE Press, 2010: 5038-5041.
[9] Ketabdar H, Bourlard H. Hierarchical Integration of Phonetic and Lexical Knowledge in Phone Posterior Estimation[C]//Proc. of IEEE International Conference on Acoustics Speech and Signal Processing. Las Vegas, USA: IEEE Press, 2008: 4065-4068.
[10] Le V B, Lamel L, Gauvain J L. Multi-style MLP Features for BN Transcription[C]//Proc. of IEEE International Conference on Acoustics Speech and Signal Processing. Dallas, USA: IEEE Press, 2010: 4866-4869.
