期刊文献+
共找到943篇文章
< 1 2 48 >
每页显示 20 50 100
Rectification of magnetic force tracker using neural network inaugmented reality system 被引量:2
1
作者 丰艳 陈一民 《Journal of Shanghai University(English Edition)》 CAS 2006年第5期431-435,共5页
Nonlinear errors always exist in data obtained from tracker in augmented reality (AR), which badly influence the effect of AR. This paper proposes to rectify the errors using BP neural network. As BP neural network ... Nonlinear errors always exist in data obtained from tracker in augmented reality (AR), which badly influence the effect of AR. This paper proposes to rectify the errors using BP neural network. As BP neural network is prone to getting into local extrema and convergence is slow, genetic algorithm is employed to optimize the initial weights and threshold of neural network. This paper discusses how to set the crucial parameters in the algorithm. Experimental results show that the method ensures that the neural network achieves global convergence quickly and correctly. Tracking precision of AR system is improved after the tracker is rectified, and the third dimension of AR system is enhanced. 展开更多
关键词 BP neural network genetic algorithm augmented reality(AR) magnetic force tracker rectification.
在线阅读 下载PDF
A solution framework for the experimental data shortage problem of lithium-ion batteries:Generative adversarial network-based data augmentation for battery state estimation 被引量:1
2
作者 Jinghua Sun Ankun Gu Josef Kainz 《Journal of Energy Chemistry》 2025年第4期476-497,共22页
In order to address the widespread data shortage problem in battery research,this paper proposes a generative adversarial network model that combines it with deep convolutional networks,the Wasserstein distance,and th... In order to address the widespread data shortage problem in battery research,this paper proposes a generative adversarial network model that combines it with deep convolutional networks,the Wasserstein distance,and the gradient penalty to achieve data augmentation.To lower the threshold for implementing the proposed method,transfer learning is further introduced.The W-DC-GAN-GP-TL framework is thereby formed.This framework is evaluated on 3 different publicly available datasets to judge the quality of generated data.Through visual comparisons and the examination of two visualization methods(probability density function(PDF)and principal component analysis(PCA)),it is demonstrated that the generated data is hard to distinguish from the real data.The application of generated data for training a battery state model using transfer learning is further evaluated.Specifically,Bi-GRU-based and Transformer-based methods are implemented on 2 separate datasets for estimating state of health(SOH)and state of charge(SOC),respectively.The results indicate that the proposed framework demonstrates satisfactory performance in different scenarios:for the data replacement scenario,where real data are removed and replaced with generated data,the state estimator accuracy decreases only slightly;for the data enhancement scenario,the estimator accuracy is further improved.The estimation accuracy of SOH and SOC is as low as 0.69%and 0.58%root mean square error(RMSE)after applying the proposed framework.This framework provides a reliable method for enriching battery measurement data.It is a generalized framework capable of generating a variety of time series data. 展开更多
关键词 Lithium-ion battery Generative adversarial network Data augmentation State of health State of charge Data shortage
在线阅读 下载PDF
Experiments on image data augmentation techniques for geological rock type classification with convolutional neural networks 被引量:2
3
作者 Afshin Tatar Manouchehr Haghighi Abbas Zeinijahromi 《Journal of Rock Mechanics and Geotechnical Engineering》 2025年第1期106-125,共20页
The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and hist... The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and historical context,DL offers a powerful complement by enhancing the speed,objectivity,and precision of the classification process.This research explores the significance of image data augmentation techniques in optimizing the performance of convolutional neural networks(CNNs)for geological image analysis,particularly in the classification of igneous,metamorphic,and sedimentary rock types from rock thin section(RTS)images.This study primarily focuses on classic image augmentation techniques and evaluates their impact on model accuracy and precision.Results demonstrate that augmentation techniques like Equalize significantly enhance the model's classification capabilities,achieving an F1-Score of 0.9869 for igneous rocks,0.9884 for metamorphic rocks,and 0.9929 for sedimentary rocks,representing improvements compared to the baseline original results.Moreover,the weighted average F1-Score across all classes and techniques is 0.9886,indicating an enhancement.Conversely,methods like Distort lead to decreased accuracy and F1-Score,with an F1-Score of 0.949 for igneous rocks,0.954 for metamorphic rocks,and 0.9416 for sedimentary rocks,exacerbating the performance compared to the baseline.The study underscores the practicality of image data augmentation in geological image classification and advocates for the adoption of DL methods in this domain for automation and improved results.The findings of this study can benefit various fields,including remote sensing,mineral exploration,and environmental monitoring,by enhancing the accuracy of geological image analysis both for scientific research and industrial applications. 展开更多
关键词 Deep learning(DL) Image analysis Image data augmentation Convolutional neural networks(CNNs) Geological image analysis Rock classification Rock thin section(RTS)images
在线阅读 下载PDF
Predicting Concrete Strength Using Data Augmentation Coupled with Multiple Optimizers in Feedforward Neural Networks
4
作者 Sandeerah Choudhary Qaisar Abbas +3 位作者 Tallha Akram Irshad Qureshi Mutlaq B.Aldajani Hammad Salahuddin 《Computer Modeling in Engineering & Sciences》 2025年第11期1755-1787,共33页
The increasing demand for sustainable construction practices has led to growing interest in recycled aggregate concrete(RAC)as an eco-friendly alternative to conventional concrete.However,predicting its compressive st... The increasing demand for sustainable construction practices has led to growing interest in recycled aggregate concrete(RAC)as an eco-friendly alternative to conventional concrete.However,predicting its compressive strength remains a challenge due to the variability in recycled materials and mix design parameters.This study presents a robust machine learning framework for predicting the compressive strength of recycled aggregate concrete using feedforward neural networks(FFNN),Random Forest(RF),and XGBoost.A literature-derived dataset of 502 samples was enriched via interpolation-based data augmentation and modeled using five distinct optimization techniques within MATLAB’s Neural Net Fitting module:Bayesian Regularization,Levenberg-Marquardt,and three conjugate gradient variants—Powell/Beale Restarts,Fletcher-Powell,and Polak-Ribiere.Hyperparameter tuning,dropout regularization,and early stopping were employed to enhance generalization.Comparative analysis revealed that FFNN outperformed RF and XGBoost,achieving an R2 of 0.9669.To ensure interpretability,accumulated local effects(ALE)along with partial dependence plots(PDP)were utilized.This revealed trends consistent with the pre-existent domain knowledge.This allows estimation of strength from the properties of the mix without extensive lab testing,permitting designers to track the performance and sustainability trends in concrete mix designs while promoting responsible construction and demolition waste utilization. 展开更多
关键词 Feedforward neural networks recycled aggregates compressive strength prediction optimization techniques data augmentation grid search
在线阅读 下载PDF
Visualization and Management Platform with Augmented Reality for Wireless Sensor Networks
5
作者 Kenya Sato Naoya Sakamoto Hideki Shimada 《Wireless Sensor Network》 2015年第1期1-11,共11页
Recently a ubiquitous sensor network which collects our environmental information gets increasingly popular, a visualization application is necessary for users to manage complicated wireless networks, however, these a... Recently a ubiquitous sensor network which collects our environmental information gets increasingly popular, a visualization application is necessary for users to manage complicated wireless networks, however, these applications are developed individually for wireless communication standard or a type of wireless device. Therefore, users are forced to adopt and use the application individually according to the target of the wireless network. In this paper, we propose a visualization platform for wireless network environments using augmented reality technology, and evaluate the effectiveness of the platform. From the result of the evaluation, we have confirmed the proposed platform has availability for visualization and management of wireless networks. 展开更多
关键词 network MANAGEMENT VISUALIZATION augmented REALITY
暂未订购
An Augmentation Method for Small-Sample Imbalanced Industrial IoT Detection Data
6
作者 SU Zhilong SHEN Zhidong SUN Hui 《Wuhan University Journal of Natural Sciences》 2026年第1期25-34,共10页
IoT devices are highly vulnerable to cyberattacks due to their widespread,distributed nature and limited security features.Intrusion detection can counter these threats,but class imbalance between normal and abnormal ... IoT devices are highly vulnerable to cyberattacks due to their widespread,distributed nature and limited security features.Intrusion detection can counter these threats,but class imbalance between normal and abnormal traffic often degrades model performance.We propose a novel multi-generator adversarial data augmentation method that blends the strengths of TMG-GAN(Tabular Multi-Generator Generative Adversarial Network)and R3GAN(Re-GAN).Our approach uses multiple class-specific generators to create diverse,high-quality synthetic samples,improving training stability and minority-class detection.A dual-branch discriminator-classifier enhances authenticity and class prediction,while feature similarity and decoupling techniques ensure clear class separation.Experiments on TON-IoT and Edge-IIoTset datasets show our method outperforms existing techniques like hybrid sampling,SNGAN(Spectral Normalization GAN),and TMG-GAN,achieving higher detection accuracy and better minority-class recall for imbalanced IoT intrusion detection. 展开更多
关键词 Internet of Things(IoT) intrusion detection system generative adversarial networks class imbalance data augmentation
原文传递
Enhanced sparse RCNN for transmission line bolt defect detection via text-to-image data augmentation and quality filtering
7
作者 Chen Zhenyu Yan Huaguang +2 位作者 Du Jianguang Xue Meng Zhao Shuai 《High Technology Letters》 2026年第1期11-20,共10页
To address the issue of inconsistent image quality and data scarcity in bolt defect detection for transmission lines,this paper proposes an improved sparse region-based convolutional neural network(RCNN) based detecti... To address the issue of inconsistent image quality and data scarcity in bolt defect detection for transmission lines,this paper proposes an improved sparse region-based convolutional neural network(RCNN) based detection framework integrating image quality evaluation and text-to-image data augmentation.First,a HyperNetwork-based image quality assessment module is introduced to filter low-quality inspection images in terms of clarity and structural integrity,resulting in a high-quality training dataset.Second,a text-to-image diffusion model is utilized for sample augmentation.By designing text prompts that describe various bolt defect types under diverse lighting and viewing conditions,the model automatically generates realistic synthetic samples.The generated images are further filtered using a combination of quality and perceptual similarity metrics to ensure consistency with the real data distribution.Building upon the sparse RCNN baseline,a dynamic label assignment mechanism and a random decision path detection head are incorporated to enhance bounding box matching and prediction accuracy.Experimental results demonstrate that the proposed method significantly improves detection accuracy(mAP@0.5) over the original sparse RCNN while maintaining low computational cost,enabling more efficient and intelligent inspection of transmission line components. 展开更多
关键词 sparse region-based convolutional neural network Hypernetwork image quality assessment text-to-image generation data augmentation bolt defect detection transmission line inspection
在线阅读 下载PDF
Image Augmentation-Based Food Recognition with Convolutional Neural Networks 被引量:6
8
作者 Lili Pan Jiaohua Qin +3 位作者 Hao Chen Xuyu Xiang Cong Li Ran Chen 《Computers, Materials & Continua》 SCIE EI 2019年第4期297-313,共17页
Image retrieval for food ingredients is important work,tremendously tiring,uninteresting,and expensive.Computer vision systems have extraordinary advancements in image retrieval with CNNs skills.But it is not feasible... Image retrieval for food ingredients is important work,tremendously tiring,uninteresting,and expensive.Computer vision systems have extraordinary advancements in image retrieval with CNNs skills.But it is not feasible for small-size food datasets using convolutional neural networks directly.In this study,a novel image retrieval approach is presented for small and medium-scale food datasets,which both augments images utilizing image transformation techniques to enlarge the size of datasets,and promotes the average accuracy of food recognition with state-of-the-art deep learning technologies.First,typical image transformation techniques are used to augment food images.Then transfer learning technology based on deep learning is applied to extract image features.Finally,a food recognition algorithm is leveraged on extracted deepfeature vectors.The presented image-retrieval architecture is analyzed based on a smallscale food dataset which is composed of forty-one categories of food ingredients and one hundred pictures for each category.Extensive experimental results demonstrate the advantages of image-augmentation architecture for small and medium datasets using deep learning.The novel approach combines image augmentation,ResNet feature vectors,and SMO classification,and shows its superiority for food detection of small/medium-scale datasets with comprehensive experiments. 展开更多
关键词 Image augmentation small-scale dataset deep feature deep learning convolutional neural network
在线阅读 下载PDF
Effect of Data Augmentation of Renal Lesion Image by Nine-layer Convolutional Neural Network in Kidney CT 被引量:1
9
作者 Liying Wang Zhiqiang Xu Shuihua Wang 《Computer Modeling in Engineering & Sciences》 SCIE EI 2020年第9期1001-1015,共15页
Artificial Intelligence(AI)becomes one hotspot in the field of the medical images analysis and provides rather promising solution.Although some research has been explored in smart diagnosis for the common diseases of ... Artificial Intelligence(AI)becomes one hotspot in the field of the medical images analysis and provides rather promising solution.Although some research has been explored in smart diagnosis for the common diseases of urinary system,some problems remain unsolved completely A nine-layer Convolutional Neural Network(CNN)is proposed in this paper to classify the renal Computed Tomography(CT)images.Four group of comparative experiments prove the structure of this CNN is optimal and can achieve good performance with average accuracy about 92.07±1.67%.Although our renal CT data is not very large,we do augment the training data by affine,translating,rotating and scaling geometric transformation and gamma,noise transformation in color space.Experimental results validate the Data Augmentation(DA)on training data can improve the performance of our proposed CNN compared to without DA with the average accuracy about 0.85%.This proposed algorithm gives a promising solution to help clinical doctors automatically recognize the abnormal images faster than manual judgment and more accurately than previous methods. 展开更多
关键词 Artificial intelligence convolutional neural network data augmentation renal lesion computed tomography image
在线阅读 下载PDF
Terrain Simplification Research in Augmented Scene Modeling
10
作者 Ming Delie You Kefei Tian Jinwen Liu Jian State Key Laboratory for Image Processing and Intelligent Control, Institute for Pattern Recognition and Artificial Intelligence , Huazhong University of Science and Technology , Wuhan 430074 Wang Guangju 《Journal of China University of Geosciences》 SCIE CSCD 2001年第4期351-355,共5页
Augmented reality is the merging of synthetic sensory information into a user's perception of a real environment. As one of the most important tasks in augmented scene modeling, terrain simplification research has... Augmented reality is the merging of synthetic sensory information into a user's perception of a real environment. As one of the most important tasks in augmented scene modeling, terrain simplification research has gained more and more attention. In this paper, we mainly focus on point selection problem in terrain simplification using triangulated irregular network. Based on the analysis and comparison of traditional importance measures for each input point, we put forward a new importance measure based on local entropy. The results demonstrate that the local entropy criterion has a better performance than any traditional methods. In addition, it can effectively conquer the 'short-sight' problem associated with the traditional methods. 展开更多
关键词 augmented reality scene modeling terrain simplification triangulated irregular network importance measures local entropy.
在线阅读 下载PDF
Image segmentation of exfoliated two-dimensional materials by generative adversarial network-based data augmentation
11
作者 程晓昱 解晨雪 +6 位作者 刘宇伦 白瑞雪 肖南海 任琰博 张喜林 马惠 蒋崇云 《Chinese Physics B》 SCIE EI CAS CSCD 2024年第3期112-117,共6页
Mechanically cleaved two-dimensional materials are random in size and thickness.Recognizing atomically thin flakes by human experts is inefficient and unsuitable for scalable production.Deep learning algorithms have b... Mechanically cleaved two-dimensional materials are random in size and thickness.Recognizing atomically thin flakes by human experts is inefficient and unsuitable for scalable production.Deep learning algorithms have been adopted as an alternative,nevertheless a major challenge is a lack of sufficient actual training images.Here we report the generation of synthetic two-dimensional materials images using StyleGAN3 to complement the dataset.DeepLabv3Plus network is trained with the synthetic images which reduces overfitting and improves recognition accuracy to over 90%.A semi-supervisory technique for labeling images is introduced to reduce manual efforts.The sharper edges recognized by this method facilitate material stacking with precise edge alignment,which benefits exploring novel properties of layered-material devices that crucially depend on the interlayer twist-angle.This feasible and efficient method allows for the rapid and high-quality manufacturing of atomically thin materials and devices. 展开更多
关键词 two-dimensional materials deep learning data augmentation generating adversarial networks
原文传递
Importance of Adaptive Photometric Augmentation for Different Convolutional Neural Network
12
作者 Saraswathi Sivamani Sun Il Chon +2 位作者 Do Yeon Choi Dong Hoon Lee Ji Hwan Park 《Computers, Materials & Continua》 SCIE EI 2022年第9期4433-4452,共20页
Existing segmentation and augmentation techniques on convolutional neural network(CNN)has produced remarkable progress in object detection.However,the nominal accuracy and performance might be downturned with the phot... Existing segmentation and augmentation techniques on convolutional neural network(CNN)has produced remarkable progress in object detection.However,the nominal accuracy and performance might be downturned with the photometric variation of images that are directly ignored in the training process,along with the context of the individual CNN algorithm.In this paper,we investigate the effect of a photometric variation like brightness and sharpness on different CNN.We observe that random augmentation of images weakens the performance unless the augmentation combines the weak limits of photometric variation.Our approach has been justified by the experimental result obtained from the PASCAL VOC 2007 dataset,with object detection CNN algorithms such as YOLOv3(You Only Look Once),Faster R-CNN(Region-based CNN),and SSD(Single Shot Multibox Detector).Each CNN model shows performance loss for varying sharpness and brightness,ranging between−80%to 80%.It was further shown that compared to random augmentation,the augmented dataset with weak photometric changes delivered high performance,but the photometric augmentation range differs for each model.Concurrently,we discuss some research questions that benefit the direction of the study.The results prove the importance of adaptive augmentation for individual CNN model,subjecting towards the robustness of object detection. 展开更多
关键词 Object detection photometric variation adaptive augmentation convolutional neural network
在线阅读 下载PDF
Data Augmentation and Deep Neuro-fuzzy Network for Student Performance Prediction with MapReduce Framework
13
作者 Amlan Jyoti Baruah Siddhartha Baruah 《International Journal of Automation and computing》 EI CSCD 2021年第6期981-992,共12页
The main aim of an educational institute is to offer high-quality education to students. The system to achieve better quality in the educational system is to find the knowledge from educational data and to discover th... The main aim of an educational institute is to offer high-quality education to students. The system to achieve better quality in the educational system is to find the knowledge from educational data and to discover the attributes that manipulate the performance of students. Student performance prediction is a major issue in education and training, specifically in the educational data mining system. This research presents the student performance prediction approach with the MapReduce framework based on the proposed fractional competitive multi-verse optimization-based deep neuro-fuzzy network. The proposed fractional competitive multi-verse optimization-based deep neuro-fuzzy network is derived by integrating fractional calculus with competitive multi-verse optimization. The MapReduce framework is designed with the mapper and the reducer phase to perform the student performance prediction mechanism with the deep learning classifier. The input data is partitioned at the mapper phase to perform the data transformation process, and thereby the features are selected using the distance measure. The selected unique features are employed for the data segmentation process, and thereafter the prediction strategy is accomplished at the reducer phase by the deep neuro-fuzzy network classifier. The proposed method obtained the performance in terms of mean square error, root mean square error and mean absolute error with the values of 0.338 3, 0.581 7, and 0.391 5, respectively. 展开更多
关键词 Educational data mining(EDA) MapReduce framework deep neuro-fuzzy network student performance data augmentation
原文传递
Data Augmentation Using Contour Image for Convolutional Neural Network
14
作者 Seung-Yeon Hwang Jeong-Joon Kim 《Computers, Materials & Continua》 SCIE EI 2023年第6期4669-4680,共12页
With the development of artificial intelligence-related technologies such as deep learning,various organizations,including the government,are making various efforts to generate and manage big data for use in artificia... With the development of artificial intelligence-related technologies such as deep learning,various organizations,including the government,are making various efforts to generate and manage big data for use in artificial intelligence.However,it is difficult to acquire big data due to various social problems and restrictions such as personal information leakage.There are many problems in introducing technology in fields that do not have enough training data necessary to apply deep learning technology.Therefore,this study proposes a mixed contour data augmentation technique,which is a data augmentation technique using contour images,to solve a problem caused by a lack of data.ResNet,a famous convolutional neural network(CNN)architecture,and CIFAR-10,a benchmark data set,are used for experimental performance evaluation to prove the superiority of the proposed method.And to prove that high performance improvement can be achieved even with a small training dataset,the ratio of the training dataset was divided into 70%,50%,and 30%for comparative analysis.As a result of applying the mixed contour data augmentation technique,it was possible to achieve a classification accuracy improvement of up to 4.64%and high accuracy even with a small amount of data set.In addition,it is expected that the mixed contour data augmentation technique can be applied in various fields by proving the excellence of the proposed data augmentation technique using benchmark datasets. 展开更多
关键词 Data augmentation image classification deep learning convolutional neural network mixed contour image benchmark dataset
在线阅读 下载PDF
Feature-Based Augmentation in Sarcasm Detection Using Reverse Generative Adversarial Network
15
作者 Derwin Suhartono Alif Tri Handoyo Franz Adeta Junior 《Computers, Materials & Continua》 SCIE EI 2023年第12期3637-3657,共21页
Sarcasm detection in text data is an increasingly vital area of research due to the prevalence of sarcastic content in online communication.This study addresses challenges associated with small datasets and class imba... Sarcasm detection in text data is an increasingly vital area of research due to the prevalence of sarcastic content in online communication.This study addresses challenges associated with small datasets and class imbalances in sarcasm detection by employing comprehensive data pre-processing and Generative Adversial Network(GAN)based augmentation on diverse datasets,including iSarcasm,SemEval-18,and Ghosh.This research offers a novel pipeline for augmenting sarcasm data with Reverse Generative Adversarial Network(RGAN).The proposed RGAN method works by inverting labels between original and synthetic data during the training process.This inversion of labels provides feedback to the generator for generating high-quality data closely resembling the original distribution.Notably,the proposed RGAN model exhibits performance on par with standard GAN,showcasing its robust efficacy in augmenting text data.The exploration of various datasets highlights the nuanced impact of augmentation on model performance,with cautionary insights into maintaining a delicate balance between synthetic and original data.The methodological framework encompasses comprehensive data pre-processing and GAN-based augmentation,with a meticulous comparison against Natural Language Processing Augmentation(NLPAug)as an alternative augmentation technique.Overall,the F1-score of our proposed technique outperforms that of the synonym replacement augmentation technique using NLPAug.The increase in F1-score in experiments using RGAN ranged from 0.066%to 1.054%,and the use of standard GAN resulted in a 2.88%increase in F1-score.The proposed RGAN model outperformed the NLPAug method and demonstrated comparable performance to standard GAN,emphasizing its efficacy in text data augmentation. 展开更多
关键词 Data augmentation Generative Adversarial network(GAN) Reverse GAN(RGAN) sarcasm detection
在线阅读 下载PDF
Synergy Model of Artificial Intelligence and Augmented Reality in the Processes of Exploitation of Energy Systems
16
作者 Tomasz Wojcicki 《Journal of Energy and Power Engineering》 2016年第2期102-108,共7页
The article presents a fragment of research and development, which objective was to develop technical tools and methodology to improve exploitation processes of energy systems. The author's model includes synergy of ... The article presents a fragment of research and development, which objective was to develop technical tools and methodology to improve exploitation processes of energy systems. The author's model includes synergy of artificial intelligence and augmented reality. This solution, which combines modem technologies in order to improve the activities related to the continuity of energy supply, and reduce costs associated with the time needed to carry out exploitation activities and employment of qualified staff, is presented. This paper presents both theoretical foundations as well as the development of technical systems. The characteristics of exploitation processes of energy systems and possible technical conditions, as well as factors characterizing them, are discussed. The physical and software structures of the system and individual modules, as well as dependencies connecting them are demonstrated. The dependencies between physical and logical elements during the exploitation processes of energy systems, that determine decisions related to the evaluation of technical states and related activities are described. The advantages and limitations of the developed model which connects methods of data processing and analysis, interactive visualization processes and possible areas of application are as well discussed in detailed. 展开更多
关键词 Energy system EXPLOITATION artificial intelligence augmented reality artificial neural networks.
在线阅读 下载PDF
Hybrid receptive field network for small object detection on drone view 被引量:1
17
作者 Zhaodong CHEN Hongbing JI +2 位作者 Yongquan ZHANG Wenke LIU Zhigang ZHU 《Chinese Journal of Aeronautics》 2025年第2期322-338,共17页
Drone-based small object detection is of great significance in practical applications such as military actions, disaster rescue, transportation, etc. However, the severe scale differences in objects captured by drones... Drone-based small object detection is of great significance in practical applications such as military actions, disaster rescue, transportation, etc. However, the severe scale differences in objects captured by drones and lack of detail information for small-scale objects make drone-based small object detection a formidable challenge. To address these issues, we first develop a mathematical model to explore how changing receptive fields impacts the polynomial fitting results. Subsequently, based on the obtained conclusions, we propose a simple but effective Hybrid Receptive Field Network (HRFNet), whose modules include Hybrid Feature Augmentation (HFA), Hybrid Feature Pyramid (HFP) and Dual Scale Head (DSH). Specifically, HFA employs parallel dilated convolution kernels of different sizes to extend shallow features with different receptive fields, committed to improving the multi-scale adaptability of the network;HFP enhances the perception of small objects by capturing contextual information across layers, while DSH reconstructs the original prediction head utilizing a set of high-resolution features and ultrahigh-resolution features. In addition, in order to train HRFNet, the corresponding dual-scale loss function is designed. Finally, comprehensive evaluation results on public benchmarks such as VisDrone-DET and TinyPerson demonstrate the robustness of the proposed method. Most impressively, the proposed HRFNet achieves a mAP of 51.0 on VisDrone-DET with 29.3 M parameters, which outperforms the extant state-of-the-art detectors. HRFNet also performs excellently in complex scenarios captured by drones, achieving the best performance on the CS-Drone dataset we built. 展开更多
关键词 Drone remote sensing Object detection on drone view Small object detector Hybrid receptive field Feature pyramid network Feature augmentation Multi-scale object detection
原文传递
Advancing predictive accuracy of shallow landslide using strategic data augmentation
18
作者 Hongzhi Qiu Xiaoqing Chen +4 位作者 Peng Feng Renchao Wang Wang Hu Liping Zhang Alessandro Pasuto 《Journal of Rock Mechanics and Geotechnical Engineering》 2025年第7期4273-4287,共15页
Rainfall-induced shallow landslides pose one of significant geological hazards,necessitating precise monitoring and prediction for effective disaster mitigation.Most studies on landslide prediction have focused on opt... Rainfall-induced shallow landslides pose one of significant geological hazards,necessitating precise monitoring and prediction for effective disaster mitigation.Most studies on landslide prediction have focused on optimizing machine learning(ML)algorithms,very limited attention has been paid to enhancing data quality for improved predictive performance.This study employs strategic data augmentation(DA)techniques to enhance the accuracy of shallow landslide prediction.Using five DA methods including singular spectrum analysis(SSA),moving averages(MA),wavelet denoising(WD),variational mode decomposition(VMD),and linear interpolation(LI),we utilize strategies such as smoothing,denoising,trend decomposition,and synthetic data generation to improve the training dataset.Four machine learning algorithms,i.e.artificial neural network(ANN),recurrent neural network(RNN),one-dimensional convolutional neural network(CNN1D),and long short-term memory(LSTM),are used to forecast landslide displacement.The case study of a landslide in southwest China shows the effectiveness of our approach in predicting landslide displacements,despite the inherent limitations of the monitoring dataset.VMD proves the most effective for smoothing and denoising,improving R^(2),RMSE,and MAPE by 172.16%,71.82%,and 98.9%,respectively.SSA addresses missing data,while LI is effective with limited data samples,improving metrics by 21.6%,52.59%,and 47.87%,respectively.This study demonstrates the potential of DA techniques to mitigate the impact of data defects on landslide prediction accuracy,with implications for similar cases. 展开更多
关键词 Shallow landslide Data augmentation Machine learning Neural network Deformation prediction
在线阅读 下载PDF
GAN-based data augmentation of time series for fault diagnosis in railway track
19
作者 Héctor A.Fernández-Bobadilla Yahya Bouchikhi Ullrich Martin 《Railway Engineering Science》 2025年第4期642-683,共42页
Supervised learning classification has arisen as a powerful tool to perform data-driven fault diagnosis in dynamical systems,achieving astonishing results.This approach assumes the availability of extensive,diverse an... Supervised learning classification has arisen as a powerful tool to perform data-driven fault diagnosis in dynamical systems,achieving astonishing results.This approach assumes the availability of extensive,diverse and labeled data corpora for train-ing.However,in some applications it may be difficult or not feasible to obtain a large and balanced dataset including enough representative instances of the fault behaviors of interest.This fact leads to the issues of data scarcity and class imbalance,greatly affecting the performance of supervised learning classifiers.Datasets from railway systems are usually both,scarce and imbalanced,turning supervised learning-based fault diagnosis into a highly challenging task.This article addresses time-series data augmentation for fault diagnosis purposes and presents two application cases in the context of railway track.The case studies employ generative adversarial networks(GAN)schemes to produce realistic synthetic samples of geometrical and structural track defects.The goal is to generate samples that enhance fault diagnosis performance;therefore,major attention was paid not only in the generation process,but also in the synthesis quality assessment,to guarantee the suitability of the samples for training of supervised learning classification models.In the first application,a convolutional classifier achieved a test accuracy of 87.5%for the train on synthetic,test on real(TSTR)scenario,while,in the second application,a fully-connected classifier achieved 96.18%in test accuracy for TSTR.The results indicate that the proposed augmentation approach produces samples having equivalent statistical characteristics and leading to a similar classification behavior as real data. 展开更多
关键词 Data augmentation Time series Generative adversarial networks Fault diagnosis Predictive maintenance Railway systems
在线阅读 下载PDF
Optimization of convolutional neural networks for predicting water pollutants using spectral data in the middle and lower reaches of the Yangtze River Basin,China
20
作者 ZHANG Guohao LI Song +3 位作者 WANG Cailing WANG Hongwei YU Tao DAI Xiaoxu 《Journal of Mountain Science》 2025年第8期2851-2869,共19页
Developing an accurate and efficient comprehensive water quality prediction model and its assessment method is crucial for the prevention and control of water pollution.Deep learning(DL),as one of the most promising t... Developing an accurate and efficient comprehensive water quality prediction model and its assessment method is crucial for the prevention and control of water pollution.Deep learning(DL),as one of the most promising technologies today,plays a crucial role in the effective assessment of water body health,which is essential for water resource management.This study models using both the original dataset and a dataset augmented with Generative Adversarial Networks(GAN).It integrates optimization algorithms(OA)with Convolutional Neural Networks(CNN)to propose a comprehensive water quality model evaluation method aiming at identifying the optimal models for different pollutants.Specifically,after preprocessing the spectral dataset,data augmentation was conducted to obtain two datasets.Then,six new models were developed on these datasets using particle swarm optimization(PSO),genetic algorithm(GA),and simulated annealing(SA)combined with CNN to simulate and forecast the concentrations of three water pollutants:Chemical Oxygen Demand(COD),Total Nitrogen(TN),and Total Phosphorus(TP).Finally,seven model evaluation methods,including uncertainty analysis,were used to evaluate the constructed models and select the optimal models for the three pollutants.The evaluation results indicate that the GPSCNN model performed best in predicting COD and TP concentrations,while the GGACNN model excelled in TN concentration prediction.Compared to existing technologies,the proposed models and evaluation methods provide a more comprehensive and rapid approach to water body prediction and assessment,offering new insights and methods for water pollution prevention and control. 展开更多
关键词 Water pollutants Convolutional neural networks Data augmentation Optimization algorithms Model evaluation methods Deep Learning
原文传递
上一页 1 2 48 下一页 到第
使用帮助 返回顶部