期刊文献+

基于动态规划的低延时基音提取算法 被引量:6

Low-delay pitch tracking algorithm based on dynamic programming
原文传递
导出
摘要 基音周期是语音信号的一个重要参数。为了在低延时条件下准确地提取语音基音周期参数,提出了一种基于动态规划的单路径搜索算法。该算法引入了清浊音信息和帧间相对能量对相邻帧代价函数进行加权;使用了一种更加有伸缩性的方式对基音变化幅度进行控制,以代替基音变化的硬性限制;专门针对基音突变和自身错误的情况,对过去帧影响进行限制,在保持基音跟踪性能和允许基音突变之间寻找一个平衡点。最后,使用Keele数据库进行的测试表明,该算法在只有一帧延时的情况下,严重错误率比传统算法下降2.32%。 Pitch is one of the most important parameters for describing speech characteristics. A one-way pitch tracking algorithm based on dynamic programming was developed to extract the correct pitch within a limited delay. The Unvolced/Voiced (U/V) and power variety information was utilized to weight the penalty of consecutive speech segments. A flexible method was used instead of hard constraints, to keep the pitch contour continuity. The pitch variation penalty between the previous frame and the current one was restricted to attenuate the influence of the previous frame, since abrupt change in the pitch contour occasionally occur and the algorithm sometimes makes mistakes. Simulations based on the Keele database show that the gross error reduced by 2.32% compared to the traditional algorithm with only a one frame delay.
出处 《清华大学学报(自然科学版)》 EI CAS CSCD 北大核心 2008年第10期1586-1588,共3页 Journal of Tsinghua University(Science and Technology)
基金 国家自然科学基金资助项目(60572081)
关键词 语音编码 低延时 基音周期提取 动态规划 speech coding low delay pitch tracking dynamic programming
  • 相关文献

参考文献6

  • 1Rabiner L, Cheng M. A comparative performance study of several pitch detection algorithms[J].IEEE Tram On Acoustics, Speech, and Signal Processing, 1976, 24(5): 399 - 418.
  • 2Secrest B, Doddington G. Postprocessing techniques for voice pitch trackers[C]//International Conf On Acoustics, Speech, and Signal Processing. Paris: IEEE, 1982: 172- 175.
  • 3Ney H. A dynamic programming technique for nonlinear smoothing[C]// International Conf On Acoustics, Speech, and Signal Processing. Atlanta: IEEE, 1981: 62-65.
  • 4Plante F, Meyer G F. A pitch extraction reference database [C]// European Conf on Speech Communication and Technology. Madrid, 1995:837 - 840.
  • 5Kondoz A M. Digital Speech[M]. England: John Wiley& Sons Ltd, 2004.
  • 6刘建,郑方,吴文虎.基于幅度差平方和函数的基音周期提取算法[J].清华大学学报(自然科学版),2006,46(1):74-77. 被引量:22

二级参考文献7

  • 1Ross M,Shaffer H,Cohen A,et al.Average magnitude difference function pitch extractor[J].IEEE Trans on Acoustics,Speech,and Signal Processing,1974,22(5):353-362.
  • 2Cheveign A D,Kawahara H.YIN:A fundamental frequency estimator for speech and music[J].J Acoust Soc Am,2002,111(4):1917-1930.
  • 3Paul B.Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound[A].Proc Institute of Phonetic Sciences 17[C].Amsterdam:UVA,1993.97-110.
  • 4Secrest B,Doddington G.An integrated pitch tracking algorithm for speech systems[A].Proc ICASSP[C].Boston,MA:IEEE,1983.1352-1355.
  • 5杨行俊 迟惠生.语音信号数字处理[M].北京:电子工业出版社,1995..
  • 6顾良,刘润生.高性能汉语语音基音周期估计[J].电子学报,1999,27(1):8-11. 被引量:19
  • 7张文耀,许刚,王裕国.循环AMDF及其语音基音周期估计算法[J].电子学报,2003,31(6):886-890. 被引量:40

共引文献21

同被引文献41

  • 1刘建,郑方,吴文虎.基于幅度差平方和函数的基音周期提取算法[J].清华大学学报(自然科学版),2006,46(1):74-77. 被引量:22
  • 2张康杰,赵欢,饶居华.基于LV-AMDF的自适应基音检测算法[J].计算机应用,2007,27(7):1674-1676. 被引量:7
  • 3Ney H.A dynamic programming technique for nonlinear smoothing[C]//International Conf on Acoustics,Speech,and Signal Processing.Atlanta,USA:IEEE,1981:62-65.
  • 4Kumar K,Jain J.Speech pitch shifting using complex continuous wavelet transform[C]//Annual IEEE India Conference.New Delhi,India,2006:1-4.
  • 5Shelby G A,Cooper C M,Adhami R R A.Wavelet-based speech pitch detector for tone languages[C]//IEEE-SP International Symposium on Time-Frequency and Time-Scale Analysis.Beijing,China,1994:596-599.
  • 6GAO Yanhua,ZHENG Guoqiang.Speech pitch period detection algorithm based on wavelet transform and spacial correlation function[C]//Electrical and International Conference on Control Engineering.Jinan,China,2010:5613-5616.
  • 7Hasan K,Shahnaz C,Fatath S A.Determination of pitch of noisy speech using dominant harmonic frequency[C]//IEEE-SP International Symposium on Circuits and Systems(ISCAS03).Bangkok,Thailand,2003,2:556-559.
  • 8Gu Y H.HMM-based noisy speech pitch contour estimation[C]//IEEE International Conference on Acoustics,Speech,and Signal Processing.New York,USA,1992,2:21-24.
  • 9HUANG Dongyan,LIN Weisi,Rahardja S.Speech pitch detection in noisy environment using multi-rate adaptive lossless FIR filters[C]//International Symposium on Circuits and Systems.Vancouver,Canada,2004,3:429-432.
  • 10Plante F,Meyer G F.A pitch extraction reference database[C]//European Conference on Speech Communication and Technology.Madrid,Spain,1995:837-840.

引证文献6

二级引证文献12

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部