In low-altitude air traffic management, non-cooperation targets are the greatest threat to security of low-flying aircraft. Among various aviation fatalities, flying bird is the main factor with the highest risk and d...In low-altitude air traffic management, non-cooperation targets are the greatest threat to security of low-flying aircraft. Among various aviation fatalities, flying bird is the main factor with the highest risk and directs economic losses amounted to nearly 10 billion US dollars each year.Therefore, Flying Bird Detection(FBD) has attracted considerable attention in low-altitude air traffic management. In this paper, we propose a skeleton based FBD method via describing bird motion information with a set of key poses. To overcome the variability of birds, the skeleton feature is selected as a relatively fixed and common characteristic for the pose appearance of flying bird. Based on the geometric topology among some key parts of bird body, a set of key poses can be described by some extracted skeleton features, which are used to represent the bird motion information. Aimed at robustly handling with the pose variations, multiple pose-specific classifiers are individually trained to learn the representative poses of the flying bird. At the detection stage,the flying bird skeleton features are combined with extracted key-pose sets to perform the flying bird classification task from each image. Afterwards, the key-frame pose-change set and the consistency of the classification results from sequent images are employed to validate the final detection results.Experiments on flying bird datasets demonstrate the effectiveness and efficiency of the proposed method.展开更多
Passive acoustic monitoring(PAM)technology is increasingly becoming one of the mainstream methods for bird monitoring.However,detecting bird audio within complex natural acoustic environments using PAM devices remains...Passive acoustic monitoring(PAM)technology is increasingly becoming one of the mainstream methods for bird monitoring.However,detecting bird audio within complex natural acoustic environments using PAM devices remains a significant challenge.To enhance the accuracy(ACC)of bird audio detection(BAD)and reduce both false negatives and false positives,this study proposes a BAD method based on a Dual-Feature Enhancement Fusion Model(DFEFM).This method incorporates per-channel energy normalization(PCEN)to suppress noise in the input audio and utilizes mel-frequency cepstral coefficients(MFCC)and frequency correlation matrices(FCM)as input features.It achieves deep feature-level fusion of MFCC and FCM on the channel dimension through two independent multi-layer convolutional network branches,and further integrates Spatial and Channel Synergistic Attention(SCSA)and Multi-Head Attention(MHA)modules to enhance the fusion effect of the aforementioned two deep features.Experimental results on the DCASE2018 BAD dataset show that our proposed method achieved an ACC of 91.4%and an AUC value of 0.963,with false negative and false positive rates of 11.36%and 7.40%,respectively,surpassing existing methods.The method also demonstrated detection ACC above 92%and AUC values above 0.987 on datasets from three sites of different natural scenes in Beijing.Testing on the NVIDIA Jetson Nano indicated that the method achieved an ACC of 89.48%when processing an average of 10 s of audio,with a response time of only 0.557 s,showing excellent processing efficiency.This study provides an effective method for filtering non-bird vocalization audio in bird vocalization monitoring devices,which helps to save edge storage and information transmission costs,and has significant application value for wild bird monitoring and ecological research.展开更多
基金co-supported by the National Key Research and Development Program of China (No. 2016YFB1200100)National Natural Science Foundation of China (Nos. 61521091, 91538204 and 61425014)
文摘In low-altitude air traffic management, non-cooperation targets are the greatest threat to security of low-flying aircraft. Among various aviation fatalities, flying bird is the main factor with the highest risk and directs economic losses amounted to nearly 10 billion US dollars each year.Therefore, Flying Bird Detection(FBD) has attracted considerable attention in low-altitude air traffic management. In this paper, we propose a skeleton based FBD method via describing bird motion information with a set of key poses. To overcome the variability of birds, the skeleton feature is selected as a relatively fixed and common characteristic for the pose appearance of flying bird. Based on the geometric topology among some key parts of bird body, a set of key poses can be described by some extracted skeleton features, which are used to represent the bird motion information. Aimed at robustly handling with the pose variations, multiple pose-specific classifiers are individually trained to learn the representative poses of the flying bird. At the detection stage,the flying bird skeleton features are combined with extracted key-pose sets to perform the flying bird classification task from each image. Afterwards, the key-frame pose-change set and the consistency of the classification results from sequent images are employed to validate the final detection results.Experiments on flying bird datasets demonstrate the effectiveness and efficiency of the proposed method.
基金supported by the Beijing Natural Science Foundation(5252014)the National Natural Science Foundation of China(62303063)。
文摘Passive acoustic monitoring(PAM)technology is increasingly becoming one of the mainstream methods for bird monitoring.However,detecting bird audio within complex natural acoustic environments using PAM devices remains a significant challenge.To enhance the accuracy(ACC)of bird audio detection(BAD)and reduce both false negatives and false positives,this study proposes a BAD method based on a Dual-Feature Enhancement Fusion Model(DFEFM).This method incorporates per-channel energy normalization(PCEN)to suppress noise in the input audio and utilizes mel-frequency cepstral coefficients(MFCC)and frequency correlation matrices(FCM)as input features.It achieves deep feature-level fusion of MFCC and FCM on the channel dimension through two independent multi-layer convolutional network branches,and further integrates Spatial and Channel Synergistic Attention(SCSA)and Multi-Head Attention(MHA)modules to enhance the fusion effect of the aforementioned two deep features.Experimental results on the DCASE2018 BAD dataset show that our proposed method achieved an ACC of 91.4%and an AUC value of 0.963,with false negative and false positive rates of 11.36%and 7.40%,respectively,surpassing existing methods.The method also demonstrated detection ACC above 92%and AUC values above 0.987 on datasets from three sites of different natural scenes in Beijing.Testing on the NVIDIA Jetson Nano indicated that the method achieved an ACC of 89.48%when processing an average of 10 s of audio,with a response time of only 0.557 s,showing excellent processing efficiency.This study provides an effective method for filtering non-bird vocalization audio in bird vocalization monitoring devices,which helps to save edge storage and information transmission costs,and has significant application value for wild bird monitoring and ecological research.