期刊文献+

基于FFT-ACF和候选值估计的基音周期提取方法 被引量:2

A novel pitch tracking method based on FFT-ACF and estimation of pitch candidates
在线阅读 下载PDF
导出
摘要 利用FFT-ACF算法进行基音周期候选值估计,减少在语音基音周期提取中常见的倍频和半频错误,提出针对候选值的多重后处理算法.后处理过程:首先运用峰值筛选法进行初选,接着利用一次均值法将语音分为不同的音高段,再使用二次均值法为每个音高段确定合适的频率范围,最后精确提取出基音周期.实验结果表明,基音周期后处理算法有效,在音乐哼唱识别应用中收到良好效果. To reduce the halving and doubling errors in pitch tracking, the FFT-ACF algorithm was used to estimate the candidates of pitch, and a new multi-post-processing algorithm to process the candidates of pitch was proposed. Firstly, the peek-selecting method was used to detect the right candidates of pitch. Secondly, the first-mean method was used to divide the singing speech into different pitch segment. Thirdly, the second-mean method was used to obtain the optimal frequency range for every pitch segment. As a result, the precise pitch is determined from speech signal. Experiments show that the proposed multi-post-processing algorithm outperforms other algorithms, demonstrating desirable performance in query by singing system.
出处 《深圳大学学报(理工版)》 EI CAS 北大核心 2007年第4期388-392,共5页 Journal of Shenzhen University(Science and Engineering)
基金 深圳市科技计划资助项目(QK200601)
关键词 语音信号处理 基音周期提取 FFT-ACF 后处理算法 候选值估计 Speech processing pitch tracking FFT-ACF post-processing algorithm estimation of pitch candidates
  • 相关文献

参考文献10

二级参考文献52

  • 1杨行峻 迟惠生.语音信号数学处理[M].北京:电子工业出版社,1995.8-21.
  • 2A.V奥本海姆 黄建国等(译).离散时间信号处理[M].北京:科学出版社,1998..
  • 3杨行逡 迟惠生 等.语音信号数字处理[M].北京:电子工业出版社,1995..
  • 4Wolfgang Hess. Pitch Determination of Speech Signals [ M ]. New York: Springer-Verlag, 1983.
  • 5Ross M J, et al. Average magnitude difference function pitch extractor[J]. IEEE Trans on Acoustics, Speech, and Signal Processing, 1974,22(5) :353 - 362.
  • 6Thomas W Parsons. Voice and Speech Processing [ M]. New York:Mc-Graw-Hill, 1986.
  • 7Ross M,Shaffer H,Cohen A,et al.Average magnitude difference function pitch extractor[J].IEEE Trans on Acoustics,Speech,and Signal Processing,1974,22(5):353-362.
  • 8Cheveign A D,Kawahara H.YIN:A fundamental frequency estimator for speech and music[J].J Acoust Soc Am,2002,111(4):1917-1930.
  • 9Paul B.Accurate short-term analysis of the fundamental frequency and the harmonics-to-noise ratio of a sampled sound[A].Proc Institute of Phonetic Sciences 17[C].Amsterdam:UVA,1993.97-110.
  • 10Secrest B,Doddington G.An integrated pitch tracking algorithm for speech systems[A].Proc ICASSP[C].Boston,MA:IEEE,1983.1352-1355.

共引文献124

同被引文献8

  • 1张静,朱悦心.采用人声输入的网络音乐检索系统[J].微电子学与计算机,2006,23(5):173-178. 被引量:4
  • 2李明,颜永红.一种基于哼唱的音乐检索方法[A]第八届全国人机语音通讯学术会议论文集,2005.
  • 3Rabiner L,Juang B H.Fundamentals of Speech Recognition. . 1993
  • 4Viterbi AJ.Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory . 1967
  • 5Alpha architecture reference manual . 1992
  • 6李明,颜永红.一种基于哼唱的音乐检索方法[A].第八届全国人机语音通讯学术会议论文集.2005
  • 7L.R. Rabiner,B.H. Juang.Fundamentals of Speech Recognition[]..1993
  • 8刘志强.基于连续隐马尔可夫模型的旋律检索算法研究[J].中小企业管理与科技,2011(34):324-325. 被引量:2

引证文献2

二级引证文献2

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部