期刊文献+

一种改进的基于时域参数的语音切分算法 被引量:3

An Improved Speech Detection Algorithm Based on Time-domain Parameter
在线阅读 下载PDF
导出
摘要 本文探讨了基于时域的语音切分算法,在前人研究的基础上,提出一种改进算法——自适应、前后搜索和检测短时脉冲噪音算法。该算法主要利用语音信号的短时参数,采用统计的方法定出切分所需要的阈值;根据背景音和静音过零率的不同,进一步搜索符合要求的静音帧;同时滤去短时脉冲噪音。实验证明,该算法准确卑很高,有很好的鲁棒性,允许误差在60 ms 的范围内,对于原始语音切分错误率为5.04%;在信噪比(SNR)大于等于2 dB 的情况下,对带噪语音的切分错误率为10%~20%。 This paper researches on speech detection algodthrn based on time domain, and describes an adaptive, both forwards and backwards search, detecting short-term pulse noise algorithm. This algorithm uses a variety of features including the frame amplitude and zero crossing rate to calculate threshold using statistical method. And it searches much further for the unvoiced frame according to the ZCR( zero crossing rate ), which is differ from unvoiced frame to background frame. This algorithm also detects impulse noise that last little. Experimental results show that this im- provement has good performance, even in noisy condition. Testing the original speech, the error rate is 5.04%, and in noisy environment with a SNR of beyond 2 dB, the rate is around 80-90%.
作者 林帆 徐明星
出处 《计算机科学》 CSCD 北大核心 2006年第4期164-167,共4页 Computer Science
基金 国家自然重点基金资助 基金号:60433030
关键词 语音切分 短时参数 自适应 前后搜索 检测短时脉冲噪首 Speech detection, Short-term variety, Adaptive, Search forwards and backwards, Detecting short-term pulse noise
  • 相关文献

参考文献10

  • 1Rabiner L R, Sambur M R. An Algorithm for Determining the Endpoint of Isolated utterances. Bell Syst. Tech. J., 1975, 54(2),297~315
  • 2张继勇,sp.cs.tsinghua.edu.cn,郑方,sp.cs.tsinghua.edu.cn,杜术,sp.cs.tsinghua.edu.cn,宋战江,sp.cs.tsinghua.edu.cn,徐明星,sp.cs.tsinghua.edu.cn.连续汉语语音识别中基于归并的音节切分自动机[J].软件学报,1999,10(11):1212-1215. 被引量:10
  • 3Burileanu D, Pascalin L, Burileanu C, et al. An Adaptive and Fast Speech Detection Algorithm. In: Sojka P, Kopeck I, Pala K, et al eds. Text, Speech and Dialogue: Third International Workshop, TSD 2000. Brno, Czech Republic: Springer-Verlag GmbH, 2000. 177~182
  • 4Beritelli F, Casale S, Cavallaro A. A robust voice activity detector for wireless communications using soft computing. IEEE J Sel Areas Commun, 1998,16(9) : 1818~ 1829
  • 5Beritelli F, Casale S, et al. Performance evaluation and comparison of G. 729/AMR/fuzzy voice activity detectors. IEEE Signal Process Lett, 2002,9(3) :85~88
  • 6张文军,谢剑英,李聪.基于贝叶斯方法的鲁棒语音切分[J].数据采集与处理,2002,17(3):260-264. 被引量:2
  • 7Wu Bing-Fei, Wang Kun-Ching. Robust Endpoint Detection Algorithm Based on the Adaptive Band Partitioning Spectral Entropy in Adverse Environments. IEEE Transactions on Speech and Audio Processing, 2005,13 (5)
  • 8Li K, Swamy M N S, Ahmad M O, An Improved Voice Activity Detection Using Higher Order Statistics, IEEE Transactions on Speech and Audio Processing, 2005,13(5)
  • 9贾卓燕,申瑞民.一种利用声音特性快速切分英文单词音节的算法[J].计算机仿真,2005,22(2):86-88. 被引量:1
  • 10何致远,胡起秀,徐光.说话人识别中语音切分算法的研究[J].计算机工程与应用,2003,39(6):55-58. 被引量:4

二级参考文献20

  • 1郑方 吴文虎 等.CDCPM及其在语音识别中的应用[J].软件学报,1996,7(10):69-75.
  • 2郑方 王承发 等.一个语文转换文本编辑器的实现.第5届全国人机语音通讯学术会议(NCMMSC'98)会议论文集[M].哈尔滨:哈尔滨工业大学出版社,1998.280-285.
  • 3[1]Sevendsen T, Soong F K. On the automatic segmentation of speech signals[C]. In: Proc ICASSP′87 (Vol.1, Dallas, Texas), 1987.77~80
  • 4[2]Mak M W, Allen W G. Spectral transitivity functions for speech segmentation in noise[J]. Acoustic Letters, 1993,16(10):228~234
  • 5[3]Rabiner L R. A tutorial on hidden Markov models and selected applications in speech recognition[J]. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1989,77(2):267~295
  • 6[4]Leung H, Zue V. A procedure for automatic alignment of phonetic transcriptions with continuous speech[C]. In: Proc ICASSP′84 (San Diego, California), 1984.271~274
  • 7[5]van Hernert J. Automatic segmentation of speech[J]. IEEE Trans on Signal Processing, 1991,39(4):1008~1012
  • 8[6]Brugnara F,Falavigna D,Omologo M.Automatic segmentation and labeling of speech based on hidden Markov models[J]. Speech Communication, 1993,12(4):357~370
  • 9[7]Young S J. HTK: hidden Markov model toolkit V1.4[M]. Engineering Department of Cambridge University Cambridge, England,1992
  • 10[8]Varga A, Steeneken H. Assessment for automaticspeech recognition: II.NOISEX-92: A database and an experiment to study the effect of additive noise on speech recognition system[J]. Speech Communication, 1993, 12:247~251

共引文献12

同被引文献13

引证文献3

二级引证文献6

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部