
A street environmental sound event detection method based on improved log-Mel feature (cited by: 2)
Abstract: To address the poor noise robustness of existing environmental sound event recognition methods and the scarcity of street environmental sound data, this paper proposes an environmental sound event recognition method that combines an improved log-Mel spectrogram feature with a convolutional neural network (CNN). The method builds a three-dimensional feature from the log-Mel spectrogram and its first- and second-order difference (delta) coefficients and classifies it with a CNN, which improves the noise robustness of street environmental sound event recognition. A large amount of street environmental sound data was collected with a self-designed environmental sound acquisition device. Experimental results show that the proposed method outperforms conventional recognition methods on different environmental sound datasets. In addition, a street environmental sound detection system is built on this method. In real-world tests, the system achieved a recall, precision, and F1-measure of 94%, 87%, and 90.5%, respectively, outperforming detection systems based on conventional recognition methods and verifying the feasibility of the proposed system.
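The three-dimensional feature described in the abstract can be illustrated with a minimal sketch. It assumes a log-Mel spectrogram is already available as a list of frames, and it uses the common regression formula for difference coefficients with a window of two frames on each side and edge frames repeated. The abstract does not state the paper's actual delta formula, window width, or Mel settings, so the function names `delta` and `stack_three_channels`, the `width` parameter, and the toy input are all hypothetical.

```python
def delta(frames, width=2):
    """Regression-based difference coefficients along the time axis.

    frames: list of T frames, each a list of n_mels floats.
    width:  number of neighbouring frames used on each side (assumed value).
    Frames beyond the edges are replaced by the first or last frame.
    """
    n_frames = len(frames)
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = []
    for t in range(n_frames):
        row = []
        for k in range(len(frames[t])):
            acc = 0.0
            for n in range(1, width + 1):
                ahead = frames[min(t + n, n_frames - 1)][k]
                behind = frames[max(t - n, 0)][k]
                acc += n * (ahead - behind)
            row.append(acc / denom)
        out.append(row)
    return out


def stack_three_channels(log_mel):
    """Return [log-Mel, first-order delta, second-order delta].

    The result has shape (3, T, n_mels) as nested lists, i.e. one
    "image" with three channels that a CNN could take as input.
    """
    d1 = delta(log_mel)   # first-order difference coefficients
    d2 = delta(d1)        # second-order: delta of the delta
    return [log_mel, d1, d2]


# Toy input (not real audio): 10 frames, 4 Mel bands, values rising
# by 1.0 per frame, so the interior first-order delta is exactly 1.
toy_log_mel = [[float(t)] * 4 for t in range(10)]
features = stack_three_channels(toy_log_mel)
```

On the toy ramp, the first-order channel is 1 in the interior frames and the second-order channel is 0 there, which is what a constant slope should produce. Real use would replace `toy_log_mel` with a log-Mel spectrogram computed from recorded audio.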
Authors: ZHANG Liujun; WANG Mei; LUO Liyan (Provincial Ministry of Education Key Laboratory of Cognitive Radio and Signal Processing, Guilin University of Electronic Technology, Guilin 541004, China; College of Information Science and Engineering, Guilin University of Technology, Guilin 541006, China)
Source: Journal of Guilin University of Electronic Technology, 2020, No. 5, pp. 411-417 (7 pages)
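Funding: National Natural Science Foundation of China (61771151); Guangxi Key Research and Development Program (2017AB08072); Guangxi Natural Science Foundation (2016GXNSFBA38014); China Postdoctoral Science Foundation (2016M602921XB); Innovation Project of Guangxi Graduate Education (YCSW2019139); Innovation Project of Guilin University of Electronic Technology Graduate Education (2019YCXS038).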
Keywords: environmental sound event recognition; convolutional neural network; detection system
