3 articles found
Improving sound event detection through enhanced feature extraction and attention mechanisms
1
Authors: Dongping ZHANG, Siyi WU, Zhanhong LU, Zhehao ZHANG, Haimiao HU, Jiabin YU. Frontiers of Computer Science, 2025, No. 10, pp. 143-145 (3 pages)
1 Introduction. Sound event detection (SED) aims to identify and locate specific sound event categories and their corresponding timestamps within continuous audio streams. To overcome the limitations posed by the scarcity of strongly labeled training data, researchers have increasingly turned to semi-supervised learning (SSL) [1], which leverages unlabeled data to augment training and improve detection performance. Among many SSL methods [2-4].
Keywords: sound event detection semi supervised learning feature extraction sound event detection sed aims identify locate specific sound event categories augment training unlabeled data attention mechanisms
Dynamic prompting class distribution optimization for semi-supervised sound event detection
2
Authors: Lijian GAO, Qing ZHU, Yaxin SHEN, Qirong MAO, Yongzhao ZHAN. Frontiers of Information Technology & Electronic Engineering, 2025, No. 4, pp. 556-567 (12 pages)
Semi-supervised sound event detection (SSED) tasks typically leverage a large amount of unlabeled and synthetic data to facilitate model generalization during training, reducing overfitting on a limited set of labeled data. However, the generalization training process often encounters challenges from noisy interference introduced by pseudo-labels or domain knowledge gaps. To alleviate noisy interference in class distribution learning, we propose an efficient semi-supervised class distribution learning method through dynamic prompt tuning, named prompting class distribution optimization (PADO). Specifically, when modeling real labeled data, PADO dynamically incorporates independent learnable prompt tokens to explore prior knowledge about the true distribution. The prior knowledge then serves as prompt information, dynamically interacting with the posterior noisy-class distribution information. In this way, PADO achieves class distribution optimization while maintaining model generalization, leading to a significant improvement in the efficiency of class distribution learning. Compared with state-of-the-art methods on the SSED datasets from the DCASE 2019, 2020, and 2021 challenges, PADO achieves significant performance improvements. Furthermore, it is readily extendable to other benchmark models.
Keywords: Prompt tuning; Class distribution learning; Semi-supervised learning; Sound event detection
Sound event localization and detection based on deep learning
3
Authors: ZHAO Dada, DING Kai, QI Xiaogang, CHEN Yu, FENG Hailin. Journal of Systems Engineering and Electronics (SCIE, CSCD), 2024, No. 2, pp. 294-301 (8 pages)
Acoustic source localization (ASL) and sound event detection (SED) are two widely pursued independent research fields. In recent years, in order to achieve a more complete spatial and temporal representation of the sound field, sound event localization and detection (SELD) has become a very active research topic. This paper presents a deep learning-based algorithm for localizing and detecting multiple overlapping sound events in three-dimensional space. The Log-Mel spectrum and the generalized cross-correlation spectrum are joined together in the channel dimension as input features. After training, a neural network classifies and regresses these features in parallel to obtain sound recognition and localization results, respectively. A channel attention mechanism is also introduced in the network to selectively enhance the features containing essential information and suppress the useless features. Finally, a thorough comparison confirms the efficiency and effectiveness of the proposed SELD algorithm. Field experiments show that the proposed algorithm is robust to reverberation and the environment and can achieve higher recognition and localization accuracy compared with the baseline method.
Keywords: sound event localization and detection (SELD); deep learning; convolutional recursive neural network (CRNN); channel attention mechanism
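The input pipeline described in the abstract above (cross-correlation features joined with Log-Mel features in the channel dimension, followed by channel attention) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The PHAT weighting, the squeeze-and-excitation style gate, the array shapes (4 microphones, 6 microphone pairs), and the function names `gcc_phat`, `stack_features`, and `channel_attention` are all assumptions made for illustration; the abstract does not specify any of them.

```python
import numpy as np


def gcc_phat(x, y, n_lags):
    """Generalized cross-correlation with PHAT weighting (assumed variant).

    x, y: equal-length 1-D frames from two microphones.
    Returns the n_lags central lags, with zero lag at index n_lags // 2.
    """
    n = len(x)
    X = np.fft.rfft(x, n=n)
    Y = np.fft.rfft(y, n=n)
    cross = X * np.conj(Y)
    cross /= np.abs(cross) + 1e-12  # PHAT: discard magnitude, keep phase
    cc = np.fft.fftshift(np.fft.irfft(cross, n=n))  # zero lag moves to n // 2
    start = n // 2 - n_lags // 2
    return cc[start:start + n_lags]


def stack_features(log_mel, gcc):
    """Join (C1, T, F) and (C2, T, F) feature maps along the channel axis."""
    return np.concatenate([log_mel, gcc], axis=0)


def channel_attention(features, w1, w2):
    """Squeeze-and-excitation style gate (an assumed form of channel attention).

    features: (C, T, F); w1: (C // r, C); w2: (C, C // r).
    Returns the rescaled features and the per-channel gate values.
    """
    squeezed = features.mean(axis=(1, 2))            # global average pool, (C,)
    hidden = np.maximum(w1 @ squeezed, 0.0)          # ReLU bottleneck
    gate = 1.0 / (1.0 + np.exp(-(w2 @ hidden)))      # sigmoid, values in (0, 1)
    return features * gate[:, None, None], gate


# Hypothetical shapes: 4 Log-Mel channels and 6 GCC channels, 10 frames, 64 bins.
rng = np.random.default_rng(0)
log_mel = rng.standard_normal((4, 10, 64))
gcc = rng.standard_normal((6, 10, 64))
stacked = stack_features(log_mel, gcc)               # shape (10, 10, 64)
w1 = rng.standard_normal((5, 10)) * 0.1              # random stand-in weights
w2 = rng.standard_normal((10, 5)) * 0.1
attended, gate = channel_attention(stacked, w1, w2)
```

In a trained network the gate weights would be learned, so channels carrying useful information would receive gate values near 1 and uninformative channels values near 0. The random weights here only demonstrate the data flow.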