期刊文献+
共找到6篇文章
< 1 >
每页显示 20 50 100
Hybrid receptive field network for small object detection on drone view
1
作者 Zhaodong CHEN Hongbing JI +2 位作者 Yongquan ZHANG Wenke LIU Zhigang ZHU 《Chinese Journal of Aeronautics》 2025年第2期322-338,共17页
Drone-based small object detection is of great significance in practical applications such as military actions, disaster rescue, transportation, etc. However, the severe scale differences in objects captured by drones... Drone-based small object detection is of great significance in practical applications such as military actions, disaster rescue, transportation, etc. However, the severe scale differences in objects captured by drones and lack of detail information for small-scale objects make drone-based small object detection a formidable challenge. To address these issues, we first develop a mathematical model to explore how changing receptive fields impacts the polynomial fitting results. Subsequently, based on the obtained conclusions, we propose a simple but effective Hybrid Receptive Field Network (HRFNet), whose modules include Hybrid Feature Augmentation (HFA), Hybrid Feature Pyramid (HFP) and Dual Scale Head (DSH). Specifically, HFA employs parallel dilated convolution kernels of different sizes to extend shallow features with different receptive fields, committed to improving the multi-scale adaptability of the network;HFP enhances the perception of small objects by capturing contextual information across layers, while DSH reconstructs the original prediction head utilizing a set of high-resolution features and ultrahigh-resolution features. In addition, in order to train HRFNet, the corresponding dual-scale loss function is designed. Finally, comprehensive evaluation results on public benchmarks such as VisDrone-DET and TinyPerson demonstrate the robustness of the proposed method. Most impressively, the proposed HRFNet achieves a mAP of 51.0 on VisDrone-DET with 29.3 M parameters, which outperforms the extant state-of-the-art detectors. HRFNet also performs excellently in complex scenarios captured by drones, achieving the best performance on the CS-Drone dataset we built. 展开更多
关键词 Drone remote sensing Object detection on drone view Small object detector Hybrid receptive field feature pyramid network feature augmentation Multi-scale object detection
原文传递
Attentive Neighborhood Feature Augmentation for Semi-supervised Learning
2
作者 Qi Liu Jing Li +1 位作者 Xianmin Wang Wenpeng Zhao 《Intelligent Automation & Soft Computing》 SCIE 2023年第8期1753-1771,共19页
Recent state-of-the-art semi-supervised learning(SSL)methods usually use data augmentations as core components.Such methods,however,are limited to simple transformations such as the augmentations under the instance’s... Recent state-of-the-art semi-supervised learning(SSL)methods usually use data augmentations as core components.Such methods,however,are limited to simple transformations such as the augmentations under the instance’s naive representations or the augmentations under the instance’s semantic representations.To tackle this problem,we offer a unique insight into data augmentations and propose a novel data-augmentation-based semi-supervised learning method,called Attentive Neighborhood Feature Aug-mentation(ANFA).The motivation of our method lies in the observation that the relationship between the given feature and its neighborhood may contribute to constructing more reliable transformations for the data,and further facilitating the classifier to distinguish the ambiguous features from the low-dense regions.Specially,we first project the labeled and unlabeled data points into an embedding space and then construct a neighbor graph that serves as a similarity measure based on the similar representations in the embedding space.Then,we employ an attention mechanism to transform the target features into augmented ones based on the neighbor graph.Finally,we formulate a novel semi-supervised loss by encouraging the predictions of the interpolations of augmented features to be consistent with the corresponding interpolations of the predictions of the target features.We carried out exper-iments on SVHN and CIFAR-10 benchmark datasets and the experimental results demonstrate that our method outperforms the state-of-the-art methods when the number of labeled examples is limited. 展开更多
关键词 Semi-supervised learning attention mechanism feature augmentation consistency regularization
在线阅读 下载PDF
Pavement Cracks Coupled With Shadows:A New Shadow-Crack Dataset and A Shadow-Removal-Oriented Crack Detection Approach 被引量:3
3
作者 Lili Fan Shen Li +3 位作者 Ying Li Bai Li Dongpu Cao Fei-Yue Wang 《IEEE/CAA Journal of Automatica Sinica》 SCIE EI CSCD 2023年第7期1593-1607,共15页
Automatic pavement crack detection is a critical task for maintaining the pavement stability and driving safety.The task is challenging because the shadows on the pavement may have similar intensity with the crack,whi... Automatic pavement crack detection is a critical task for maintaining the pavement stability and driving safety.The task is challenging because the shadows on the pavement may have similar intensity with the crack,which interfere with the crack detection performance.Till to the present,there still lacks efficient algorithm models and training datasets to deal with the interference brought by the shadows.To fill in the gap,we made several contributions as follows.First,we proposed a new pavement shadow and crack dataset,which contains a variety of shadow and pavement pixel size combinations.It also covers all common cracks(linear cracks and network cracks),placing higher demands on crack detection methods.Second,we designed a two-step shadow-removal-oriented crack detection approach:SROCD,which improves the performance of the algorithm by first removing the shadow and then detecting it.In addition to shadows,the method can cope with other noise disturbances.Third,we explored the mechanism of how shadows affect crack detection.Based on this mechanism,we propose a data augmentation method based on the difference in brightness values,which can adapt to brightness changes caused by seasonal and weather changes.Finally,we introduced a residual feature augmentation algorithm to detect small cracks that can predict sudden disasters,and the algorithm improves the performance of the model overall.We compare our method with the state-of-the-art methods on existing pavement crack datasets and the shadow-crack dataset,and the experimental results demonstrate the superiority of our method. 展开更多
关键词 Automatic pavement crack detection data augmentation compensation deep learning residual feature augmentation shadow removal shadow-crack dataset
在线阅读 下载PDF
Enhancing lithofacies machine learning predictions with gamma-ray attributes for boreholes with limited diversity of recorded well logs 被引量:2
4
作者 David A.Wood 《Artificial Intelligence in Geosciences》 2021年第1期148-164,共17页
Derivative and volatility attributes can be usefully calculated from recorded gamma ray(GR)data to enhance lithofacies classification in wellbores penetrating multiple lithologies.Such attributes extract information a... Derivative and volatility attributes can be usefully calculated from recorded gamma ray(GR)data to enhance lithofacies classification in wellbores penetrating multiple lithologies.Such attributes extract information about the log curve shape that cannot be readily discerned from the recorded well log data.A logged wellbore section for which 8911 data records are available for the three recorded logs(GR,sonic(DT)and bulk density(PB))is evaluated.That section demonstrates the value of the GR attributes for machine learning(ML)lithofacies predictions.Five feature selection configurations are considered.The 9-var configuration including GR,DT,PB and six GR attributes,and the 7-var configuration of GR and the six GR attributes,provide the most accurate and reproducible lithofacies predictions.The other three feature configurations evaluated do not include the GR attributes but just one to three of the recorded log features.The results of seven ML models and two regression models reveal that K-nearest neighbor(KNN),random forest(RF)and extreme gradient boosting(XGB)are the best performing models.They generate between 14 and 23 misclassification from 8911 data records for the 9-var model.Multi-layer perceptron(MLP)and support vector classification(SVC)do not perform well with the 7-var model which lacks the PB feature displaying the highest correlation with facies class.Annotated confusion matrices reveal that KNN,RF and XGB models can effectively distinguish all facies classes for the 9-var and 7-var configurations(that includes the GR attributes),whereas none of the models can achieve that outcome for the 3-var configuration(that excludes the GR attributes).Accurately distinguishing lithofacies using well-log data in sedimentary sections is an important objective in applied geoscience.The straightforward,GR-attribute method proposed works to improve confidence in ML-lithofacies classifications based on limited recorded well-log data. 展开更多
关键词 Rolling average derivatives Log-curve volatility Lithofacies log characteristics Confusion analysis Gamma-ray attributes Well-log feature augmentation
在线阅读 下载PDF
Multi-dimensional Classification via Selective Feature Augmentation 被引量:6
5
作者 Bin-Bin Jia Min-Ling Zhang 《Machine Intelligence Research》 EI CSCD 2022年第1期38-51,共14页
In multi-dimensional classification(MDC), the semantics of objects are characterized by multiple class spaces from different dimensions. Most MDC approaches try to explicitly model the dependencies among class spaces ... In multi-dimensional classification(MDC), the semantics of objects are characterized by multiple class spaces from different dimensions. Most MDC approaches try to explicitly model the dependencies among class spaces in output space. In contrast, the recently proposed feature augmentation strategy, which aims at manipulating feature space, has also been shown to be an effective solution for MDC. However, existing feature augmentation approaches only focus on designing holistic augmented features to be appended with the original features, while better generalization performance could be achieved by exploiting multiple kinds of augmented features.In this paper, we propose the selective feature augmentation strategy that focuses on synergizing multiple kinds of augmented features.Specifically, by assuming that only part of the augmented features is pertinent and useful for each dimension′s model induction, we derive a classification model which can fully utilize the original features while conduct feature selection for the augmented features. To validate the effectiveness of the proposed strategy, we generate three kinds of simple augmented features based on standard k NN, weighted k NN, and maximum margin techniques, respectively. Comparative studies show that the proposed strategy achieves superior performance against both state-of-the-art MDC approaches and its degenerated versions with either kind of augmented features. 展开更多
关键词 Machine learning multi-dimensional classification feature augmentation feature selection class dependencies
原文传递
STATNet:One-stage coal-gangue detector based on deep learning algorithm for real industrial application
6
作者 Kefei Zhang Teng Wang +4 位作者 Xiaolin Yang Liang Xu Jesse Thé Zhongchao Tan Hesheng Yu 《Energy and AI》 EI 2024年第3期113-125,共13页
Coal-gangue object detection has attracted substantial attention because it is the core of realizing vision-based intelligent and green coal separation. However, most existing studies have been focused on laboratory d... Coal-gangue object detection has attracted substantial attention because it is the core of realizing vision-based intelligent and green coal separation. However, most existing studies have been focused on laboratory datasets and prioritized model lightweight. This makes the coal-gangue object detection challenging to adapt to the complex and harsh scenes of real production environments. Therefore, our project collected and labeled image datasets of coal and gangue under real production conditions from a coal preparation plant. We then designed a one-stage object model, named STATNet, following the “backbone-neck-head” architecture with the aim of enhancing the detection accuracy under industrial coal preparation scenarios. The proposed model utilizes Swin Transformer as backbone module to extract multi-scale features, improved path augmentation feature pyramid network (iPAFPN) as neck module to enrich feature fusion, and task-aligned head (TAH) as head module to mitigate conflicts and misalignments between classification and localization tasks. Experimental results on a real-world industrial dataset demonstrate that the proposed STATNet model achieves an impressive AP50 of 89.27 %, significantly surpassing several state-of-the-art baseline models by 2.02 % to 5.58 %. Additionally, it exhibits stronger robustness in resisting image corruption and perturbation. These findings demonstrate its promising prospects in practical coal and gangue separation applications. 展开更多
关键词 Coal-gangue detection Swin transformer Impr oved path augmentation feature pyranid network Task-aligned head Object detection
在线阅读 下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部