To address the issues of insufficient and imbalanced data samples in proton exchange membrane fuel cell(PEMFC)performance degradation prediction,this study proposes a data augmentation-based model to predict PEMFC per...To address the issues of insufficient and imbalanced data samples in proton exchange membrane fuel cell(PEMFC)performance degradation prediction,this study proposes a data augmentation-based model to predict PEMFC performance degradation.Firstly,an improved generative adversarial network(IGAN)with adaptive gradient penalty coefficient is proposed to address the problems of excessively fast gradient descent and insufficient diversity of generated samples.Then,the IGANis used to generate datawith a distribution analogous to real data,therebymitigating the insufficiency and imbalance of original PEMFC samples and providing the predictionmodel with training data rich in feature information.Finally,a convolutional neural network-bidirectional long short-termmemory(CNN-BiLSTM)model is adopted to predict PEMFC performance degradation.Experimental results show that the data generated by the proposed IGAN exhibits higher quality than that generated by the original GAN,and can fully characterize and enrich the original data’s features.Using the augmented data,the prediction accuracy of the CNN-BiLSTM model is significantly improved,rendering it applicable to tasks of predicting PEMFC performance degradation.展开更多
Legal case classification involves the categorization of legal documents into predefined categories,which facilitates legal information retrieval and case management.However,real-world legal datasets often suffer from...Legal case classification involves the categorization of legal documents into predefined categories,which facilitates legal information retrieval and case management.However,real-world legal datasets often suffer from class imbalances due to the uneven distribution of case types across legal domains.This leads to biased model performance,in the form of high accuracy for overrepresented categories and underperformance for minority classes.To address this issue,in this study,we propose a data augmentation method that masks unimportant terms within a document selectively while preserving key terms fromthe perspective of the legal domain.This approach enhances data diversity and improves the generalization capability of conventional models.Our experiments demonstrate consistent improvements achieved by the proposed augmentation strategy in terms of accuracy and F1 score across all models,validating the effectiveness of the proposed method in legal case classification.展开更多
The Global Precipitation Measurement(GPM)dual-frequency precipitation radar(DPR)products(Version 07A)are employed for a rigorous comparative analysis with ground-based operational weather radar(GR)networks.The reflect...The Global Precipitation Measurement(GPM)dual-frequency precipitation radar(DPR)products(Version 07A)are employed for a rigorous comparative analysis with ground-based operational weather radar(GR)networks.The reflectivity observed by GPM Ku PR is compared quantitatively against GR networks from CINRAD of China and NEXRAD of the United States,and the volume matching method is used for spatial matching.Additionally,a novel frequency correction method for all phases as well as precipitation types is used to correct the GPM Ku PR radar frequency to the GR frequency.A total of 20 GRs(including 10 from CINRAD and 10 from NEXRAD)are included in this comparative analysis.The results indicate that,compared with CINRAD matched data,NEXRAD exhibits larger biases in reflectivity when compared with the frequency-corrected Ku PR.The root-mean-square difference for CINRAD is calculated at 2.38 d B,whereas for NEXRAD it is 3.23 d B.The mean bias of CINRAD matched data is-0.16 d B,while the mean bias of NEXRAD is-2.10 d B.The mean standard deviation of bias for CINRAD is 2.15 d B,while for NEXRAD it is 2.29 d B.This study effectively assesses weather radar data in both the United States and China,which is crucial for improving the overall consistency of global precipitation estimates.展开更多
Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in ord...Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation.展开更多
Large-scale point cloud datasets form the basis for training various deep learning networks and achieving high-quality network processing tasks.Due to the diversity and robustness constraints of the data,data augmenta...Large-scale point cloud datasets form the basis for training various deep learning networks and achieving high-quality network processing tasks.Due to the diversity and robustness constraints of the data,data augmentation(DA)methods are utilised to expand dataset diversity and scale.However,due to the complex and distinct characteristics of LiDAR point cloud data from different platforms(such as missile-borne and vehicular LiDAR data),directly applying traditional 2D visual domain DA methods to 3D data can lead to networks trained using this approach not robustly achieving the corresponding tasks.To address this issue,the present study explores DA for missile-borne LiDAR point cloud using a Monte Carlo(MC)simulation method that closely resembles practical application.Firstly,the model of multi-sensor imaging system is established,taking into account the joint errors arising from the platform itself and the relative motion during the imaging process.A distortion simulation method based on MC simulation for augmenting missile-borne LiDAR point cloud data is proposed,underpinned by an analysis of combined errors between different modal sensors,achieving high-quality augmentation of point cloud data.The effectiveness of the proposed method in addressing imaging system errors and distortion simulation is validated using the imaging scene dataset constructed in this paper.Comparative experiments between the proposed point cloud DA algorithm and the current state-of-the-art algorithms in point cloud detection and single object tracking tasks demonstrate that the proposed method can improve the network performance obtained from unaugmented datasets by over 17.3%and 17.9%,surpassing SOTA performance of current point cloud DA algorithms.展开更多
In order to address the widespread data shortage problem in battery research,this paper proposes a generative adversarial network model that combines it with deep convolutional networks,the Wasserstein distance,and th...In order to address the widespread data shortage problem in battery research,this paper proposes a generative adversarial network model that combines it with deep convolutional networks,the Wasserstein distance,and the gradient penalty to achieve data augmentation.To lower the threshold for implementing the proposed method,transfer learning is further introduced.The W-DC-GAN-GP-TL framework is thereby formed.This framework is evaluated on 3 different publicly available datasets to judge the quality of generated data.Through visual comparisons and the examination of two visualization methods(probability density function(PDF)and principal component analysis(PCA)),it is demonstrated that the generated data is hard to distinguish from the real data.The application of generated data for training a battery state model using transfer learning is further evaluated.Specifically,Bi-GRU-based and Transformer-based methods are implemented on 2 separate datasets for estimating state of health(SOH)and state of charge(SOC),respectively.The results indicate that the proposed framework demonstrates satisfactory performance in different scenarios:for the data replacement scenario,where real data are removed and replaced with generated data,the state estimator accuracy decreases only slightly;for the data enhancement scenario,the estimator accuracy is further improved.The estimation accuracy of SOH and SOC is as low as 0.69%and 0.58%root mean square error(RMSE)after applying the proposed framework.This framework provides a reliable method for enriching battery measurement data.It is a generalized framework capable of generating a variety of time series data.展开更多
Peri-implant keratinized mucosa(PIKM)augmentation refers to surgical procedures aimed at increasing the width of PIKM.Consensus reports emphasize the necessity of maintaining a minimum width of PIKM to ensure long-ter...Peri-implant keratinized mucosa(PIKM)augmentation refers to surgical procedures aimed at increasing the width of PIKM.Consensus reports emphasize the necessity of maintaining a minimum width of PIKM to ensure long-term peri-implant health.Currently,several surgical techniques have been validated for their effectiveness in increasing PIKM.However,the selection and application of PIKM augmentation methods may present challenges for dental practitioners due to heterogeneity in surgical techniques,variations in clinical scenarios,and anatomical differences.Therefore,clear guidelines and considerations for PIKM augmentation are needed.This expert consensus focuses on the commonly employed surgical techniques for PIKM augmentation and the factors influencing their selection at second-stage surgery.It aims to establish a standardized framework for assessing,planning,and executing PIKM augmentation procedures,with the goal of offering evidence-based guidance to enhance the predictability and success of PIKM augmentation.展开更多
The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and hist...The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and historical context,DL offers a powerful complement by enhancing the speed,objectivity,and precision of the classification process.This research explores the significance of image data augmentation techniques in optimizing the performance of convolutional neural networks(CNNs)for geological image analysis,particularly in the classification of igneous,metamorphic,and sedimentary rock types from rock thin section(RTS)images.This study primarily focuses on classic image augmentation techniques and evaluates their impact on model accuracy and precision.Results demonstrate that augmentation techniques like Equalize significantly enhance the model's classification capabilities,achieving an F1-Score of 0.9869 for igneous rocks,0.9884 for metamorphic rocks,and 0.9929 for sedimentary rocks,representing improvements compared to the baseline original results.Moreover,the weighted average F1-Score across all classes and techniques is 0.9886,indicating an enhancement.Conversely,methods like Distort lead to decreased accuracy and F1-Score,with an F1-Score of 0.949 for igneous rocks,0.954 for metamorphic rocks,and 0.9416 for sedimentary rocks,exacerbating the performance compared to the baseline.The study underscores the practicality of image data augmentation in geological image classification and advocates for the adoption of DL methods in this domain for automation and improved results.The findings of this study can benefit various fields,including remote sensing,mineral exploration,and environmental monitoring,by enhancing the accuracy of geological image analysis both for scientific research and industrial applications.展开更多
Cassava is the most widely distributed food crop in Central Africa. Chikwangue, also known as kwanga in the Republic of Congo, is a starchy fermented cassava product that is a staple food in the country. This work aim...Cassava is the most widely distributed food crop in Central Africa. Chikwangue, also known as kwanga in the Republic of Congo, is a starchy fermented cassava product that is a staple food in the country. This work aims to determine the composition of bioactive compounds in chikwangue, including biosurfactant-like molecules and proteins content. Antibacterial activities were investigated through the preliminary emulsification index of chikwangue and fermented paste. Antibacterial assay, 16S rRNA, cytK, hblD, nheB and entFM PCR amplifications, DNA sequence analysis, NCBI homology analysis, and phylogenic tree were performed using NGPhylogeny. fr and iTOL (interactive of live). Fermented cassava paste and chikwangue contain biosurfactants with an emulsification index of 50%. The total protein concentration in fermented cassava paste was 4 g/ml and the chikwangue was 2.5 g/mL Further sequence analysis showed that isolates shared a homology of up to 99.9% with Bacillus cereus PQ432941.1, B. licheniformis PQ432758.1, B. altitudinis PQ432754.1, B. subtilis PQ432759.1, B. mojavensis PQ432755.1, B. tequilensis MT994788.1, B. subtilis MT994789.1, Paenibacillus polymyxa PQ452544.1, B. velezensis PQ452545.1, B. thuringiensis PQ432763.1, B. pumilus PQ432762.1, B. subtilis MT994787.1, B. mycoides PQ432890.1, B. thuringiensis PQ432766.1, B. subtilis PQ432757.1 and B. amyloliquefaciens PQ432756.1. Importantly, the emulsification index (E24) ranged from 60 to 100% and the crude biosurfactant for the Bacillus strains mentioned above could easily inhibit the growth for pathogen Gram-negative bacteria (S. enterica, S. flexneri, E. coli, Klebsiella sp. and P. aeruginosa) with diameters ranging from 2.3 ± 0.1 cm to 5.5 ± 0.4 cm. On the other hand, the diameters of Gram-positive pathogenic bacteria (B. cereus and S. aureus) varied between 1.5 ± 0.5 cm and 4.0 ± 0.2 cm. These findings involve the promise purpose of Bacillus isolated from retted cassava, and this study systematically uncovered the biodiversity and distribution characteristics of retted paste cassava and chikwangue.展开更多
Accurate cloud classification plays a crucial role in aviation safety,climate monitoring,and localized weather forecasting.Current research has been focusing on machine learning techniques,particularly deep learning b...Accurate cloud classification plays a crucial role in aviation safety,climate monitoring,and localized weather forecasting.Current research has been focusing on machine learning techniques,particularly deep learning based model,for the types identification.However,traditional approaches such as convolutional neural networks(CNNs)encounter difficulties in capturing global contextual information.In addition,they are computationally expensive,which restricts their usability in resource-limited environments.To tackle these issues,we present the Cloud Vision Transformer(CloudViT),a lightweight model that integrates CNNs with Transformers.The integration enables an effective balance between local and global feature extraction.To be specific,CloudViT comprises two innovative modules:Feature Extraction(E_Module)and Downsampling(D_Module).These modules are able to significantly reduce the number of model parameters and computational complexity while maintaining translation invariance and enhancing contextual comprehension.Overall,the CloudViT includes 0.93×10^(6)parameters,which decreases more than ten times compared to the SOTA(State-of-the-Art)model CloudNet.Comprehensive evaluations conducted on the HBMCD and SWIMCAT datasets showcase the outstanding performance of CloudViT.It achieves classification accuracies of 98.45%and 100%,respectively.Moreover,the efficiency and scalability of CloudViT make it an ideal candidate for deployment inmobile cloud observation systems,enabling real-time cloud image classification.The proposed hybrid architecture of CloudViT offers a promising approach for advancing ground-based cloud image classification.It holds significant potential for both optimizing performance and facilitating practical deployment scenarios.展开更多
An improved cycle-consistent generative adversarial network(CycleGAN) method for defect data augmentation based on feature fusion and self attention residual module is proposed to address the insufficiency of defect s...An improved cycle-consistent generative adversarial network(CycleGAN) method for defect data augmentation based on feature fusion and self attention residual module is proposed to address the insufficiency of defect sample data for light guide plate(LGP) in production,as well as the problem of minor defects.Two optimizations are made to the generator of CycleGAN:fusion of low resolution features obtained from partial up-sampling and down-sampling with high-resolution features,combination of self attention mechanism with residual network structure to replace the original residual module.Qualitative and quantitative experiments were conducted to compare different data augmentation methods,and the results show that the defect images of the LGP generated by the improved network were more realistic,and the accuracy of the you only look once version 5(YOLOv5) detection network for the LGP was improved by 5.6%,proving the effectiveness and accuracy of the proposed method.展开更多
High-quality data is essential for the success of data-driven learning tasks.The characteristics,precision,and completeness of the datasets critically determine the reliability,interpretability,and effectiveness of su...High-quality data is essential for the success of data-driven learning tasks.The characteristics,precision,and completeness of the datasets critically determine the reliability,interpretability,and effectiveness of subsequent analyzes and applications,such as fault detection,predictive maintenance,and process optimization.However,for many industrial processes,obtaining sufficient high-quality data remains a significant challenge due to high costs,safety concerns,and practical constraints.To overcome these challenges,data augmentation has emerged as a rapidly growing research area,attracting considerable attention across both academia and industry.By expanding datasets,data augmentation techniques improve greater generalization and more robust performance in actual applications.This paper provides a comprehensive,multi-perspective review of data augmentation methods for industrial processes.For clarity and organization,existing studies are systematically grouped into four categories:small sample with low dimension,small sample with high dimension,large sample with low dimension,and large sample with high dimension.Within this framework,the review examines current research from both methodological and application-oriented perspectives,highlighting main methods,advantages,and limitations.By synthesizing these findings,this review offers a structured overview for scholars and practitioners,serving as a valuable reference for newcomers and experienced researchers seeking to explore and advance data augmentation techniques in industrial processes.展开更多
Space target imaging simulation technology is an important tool for space target detection and identification,with advantages that include high flexibility and low cost.However,existing space target imaging simulation...Space target imaging simulation technology is an important tool for space target detection and identification,with advantages that include high flexibility and low cost.However,existing space target imaging simulation technologies are mostly based on target magnitudes for simulations,making it difficult to meet image simulation requirements for different signal-to-noise ratio(SNR)needs.Therefore,design of a simulation method that generates target image sequences with various SNRs based on the optical detection system parameters will be important for faint space target detection research.Addressing the SNR calculation issue in optical observation systems,this paper proposes a ground-based detection image SNR calculation method using the optical system parameters.This method calculates the SNR of an observed image precisely using radiative transfer theory,the optical system parameters,and the observation environment parameters.An SNR-based target sequence image simulation method for ground-based detection scenarios is proposed.This method calculates the imaging SNR using the optical system parameters and establishes a model for conversion between the target’s apparent magnitude and image grayscale values,thereby enabling generation of target sequence simulation images with corresponding SNRs for different system parameters.Experiments show that the SNR obtained using this calculation method has an average calculation error of<1 dB when compared with the theoretical SNR of the actual optical system.Additionally,the simulation images generated by the imaging simulation method show high consistency with real images,which meets the requirements of faint space target detection algorithm research and provides reliable data support for development of related technologies.展开更多
Breast augmentation with implants is a popular cosmetic surgery that enhances breast volume and contour through various placement planes.In this review,we examine the impact of subglandular,subpectoral,and subfascial ...Breast augmentation with implants is a popular cosmetic surgery that enhances breast volume and contour through various placement planes.In this review,we examine the impact of subglandular,subpectoral,and subfascial implant planes on postoperative outcomes and complication rates.Subglandular placement offers simplicity but is associated with higher risks of capsular contracture,hematoma,and rippling in patients with low tissue coverage.The subpectoral plane,widely adopted for its natural appearance and reduced capsular contracture risk,may cause dynamic deformity due to muscle contraction.Although technically challenging,the subfascial plane combines the benefits of soft tissue support and reduced implant displacement.We highlight the importance of choosing an optimal implant plane tailored to each patient’s anatomical and aesthetic needs to enhance surgical outcomes and minimize complications.Further research is needed to validate long-term efficacy,particularly for subfascial placement.展开更多
Objective:Although bariatric surgeries are widely performed around the world,patients frequently experience the recurrence of pre-existing gastroesophageal reflux disease(GERD)symptoms or develop new symptoms,some of ...Objective:Although bariatric surgeries are widely performed around the world,patients frequently experience the recurrence of pre-existing gastroesophageal reflux disease(GERD)symptoms or develop new symptoms,some of which are resistant to medical treatment.This study investigates the effect and outcome of magnetic sphincter augmentation(MSA),a minimally invasive treatment for GERD,in this population.Methods:A thorough search of the PubMed,Cochrane,Scopus,Web of Science,and Google Scholar databases from inception until June 6,2024 was performed to retrieve relevant studies that evaluated the effects of MSA on the GERD health-related quality of life(GERD-HRQL)score and the reduction in proton pump inhibitor(PPI)use in patients who underwent bariatric surgery.The“meta”package in RStudio version 2023.12.0 t 369 was used.Results:A total of eight studies were included in the systematic review and seven studies were included in the meta-analysis.MSA significantly reduced the GERD-HRQL score(MD?27.55[95%CI:30.99 to24.11],p<0.01)and PPI use(RR?0.23[95%CI:0.16 to 0.33],p<0.01).Conclusion:MSA is a viable treatment option for patients with GERD symptoms who undergo bariatric surgery.This approach showed promising results in terms of reducing the GERD-HRQL score and reducing the use of PPI.展开更多
Bird species classification is not only a challenging topic in artificial intelligence but also a domain closely related to environmental protection and ecological research.Additionally,performing edge computing on lo...Bird species classification is not only a challenging topic in artificial intelligence but also a domain closely related to environmental protection and ecological research.Additionally,performing edge computing on low-level devices using small neural networks can be an important research direction.In this paper,we use the EfficientNetV2B0 model for bird species classification,applying transfer learning on a dataset of 525 bird species.We also employ the BiRefNet model to remove backgrounds from images in the training set.The generated background-removed images are mixed with the original training set as a form of data augmentation.We aim for these background-removed images to help the model focus on key features,and by combining data augmentation with transfer learning,we trained a highly accurate and efficient bird species classification model.The training process is divided into a transfer learning stage and a fine-tuning stage.In the transfer learning stage,only the newly added custom layers are trained;while in the fine-tuning stage,all pre-trained layers except for the batch normalization layers are fine-tuned.According to the experimental results,the proposed model not only has an advantage in size compared to other models but also outperforms them in various metrics.The training results show that the proposed model achieved an accuracy of 99.54%and a precision of 99.62%,demonstrating that it achieves both lightweight design and high accuracy.To confirm the credibility of the results,we use heatmaps to interpret the model.The heatmaps show that our model can clearly highlight the image feature area.In addition,we also perform the 10-fold cross-validation on the model to verify its credibility.Finally,this paper proposes a model with low training cost and high accuracy,making it suitable for deployment on edge computing devices to provide lighter and more convenient services.展开更多
Imbalanced data distribution stands as the primary cause of performance deterioration in the majority of supervised classification algorithms.The current publicly available weld defect datasets are very limited,and th...Imbalanced data distribution stands as the primary cause of performance deterioration in the majority of supervised classification algorithms.The current publicly available weld defect datasets are very limited,and the samples of various defects are seriously imbalanced.The paper proposes an improved deep convolution generative adversarial network(DCGAN)to balance the weld defect dataset.To solve the problem of poor diversity in the samples generated by the traditional DCGAN,a C-Res unit is constructed,which integrates the convolutional block attention module(CBAM)into the residual block.The transposed convolution in the DCGAN's generator is replaced with the constructed C-Res unit to enhance the attention to image details and improve the stability and learning efficiency of the model.The Pixelshuffle module is added into the generator as the upsampling module to solve the problem that the C-Res unit can't up-sample like the transposed convolution.CBAM is added into the DCGAN's discriminator to further enhance the discriminator's ability to judge the quality of the generated sample.To validate the effectiveness of the improved DCGAN,comparison experiments are carried out.The weld defect dataset is balanced by DCGAN and improved DCGAN,respectively,and then YOLOv8s-cls is used to classify the weld defect sample based on the original dataset,the dataset balanced by the DCGAN,and the dataset balanced by the improved DCGAN,respectively.Among the nine F1 scores of the nine types of samples,seven of them are higher than those of YOLOv8s-cls trained with the original dataset,and six of them are higher than those of YOLOv8s-cls trained with the dataset balanced by the traditional DCGAN.The experiments reveal that the weld defect dataset balanced with improved DCGAN can enhance the performance of the supervised classification model,and is helpful to realize automation of weld defect detection.展开更多
Rainfall-induced shallow landslides pose one of significant geological hazards,necessitating precise monitoring and prediction for effective disaster mitigation.Most studies on landslide prediction have focused on opt...Rainfall-induced shallow landslides pose one of significant geological hazards,necessitating precise monitoring and prediction for effective disaster mitigation.Most studies on landslide prediction have focused on optimizing machine learning(ML)algorithms,very limited attention has been paid to enhancing data quality for improved predictive performance.This study employs strategic data augmentation(DA)techniques to enhance the accuracy of shallow landslide prediction.Using five DA methods including singular spectrum analysis(SSA),moving averages(MA),wavelet denoising(WD),variational mode decomposition(VMD),and linear interpolation(LI),we utilize strategies such as smoothing,denoising,trend decomposition,and synthetic data generation to improve the training dataset.Four machine learning algorithms,i.e.artificial neural network(ANN),recurrent neural network(RNN),one-dimensional convolutional neural network(CNN1D),and long short-term memory(LSTM),are used to forecast landslide displacement.The case study of a landslide in southwest China shows the effectiveness of our approach in predicting landslide displacements,despite the inherent limitations of the monitoring dataset.VMD proves the most effective for smoothing and denoising,improving R^(2),RMSE,and MAPE by 172.16%,71.82%,and 98.9%,respectively.SSA addresses missing data,while LI is effective with limited data samples,improving metrics by 21.6%,52.59%,and 47.87%,respectively.This study demonstrates the potential of DA techniques to mitigate the impact of data defects on landslide prediction accuracy,with implications for similar cases.展开更多
Hematoxylin and Eosin(H&E)images,popularly used in the field of digital pathology,often pose challenges due to their limited color richness,hindering the differentiation of subtle cell features crucial for accurat...Hematoxylin and Eosin(H&E)images,popularly used in the field of digital pathology,often pose challenges due to their limited color richness,hindering the differentiation of subtle cell features crucial for accurate classification.Enhancing the visibility of these elusive cell features helps train robust deep-learning models.However,the selection and application of image processing techniques for such enhancement have not been systematically explored in the research community.To address this challenge,we introduce Salient Features Guided Augmentation(SFGA),an approach that strategically integrates machine learning and image processing.SFGA utilizes machine learning algorithms to identify crucial features within cell images,subsequently mapping these features to appropriate image processing techniques to enhance training images.By emphasizing salient features and aligning them with corresponding image processing methods,SFGA is designed to enhance the discriminating power of deep learning models in cell classification tasks.Our research undertakes a series of experiments,each exploring the performance of different datasets and data enhancement techniques in classifying cell types,highlighting the significance of data quality and enhancement in mitigating overfitting and distinguishing cell characteristics.Specifically,SFGA focuses on identifying tumor cells from tissue for extranodal extension detection,with the SFGA-enhanced dataset showing notable advantages in accuracy.We conducted a preliminary study of five experiments,among which the accuracy of the pleomorphism experiment improved significantly from 50.81%to 95.15%.The accuracy of the other four experiments also increased,with improvements ranging from 3 to 43 percentage points.Our preliminary study shows the possibilities to enhance the diagnostic accuracy of deep learning models and proposes a systematic approach that could enhance cancer diagnosis,contributing as a first step in using SFGA in medical image enhancement.展开更多
基金supported by the Jiangsu Engineering Research Center of the Key Technology for Intelligent Manufacturing Equipment and the Suqian Key Laboratory of Intelligent Manufacturing(Grant No.M202108).
文摘To address the issues of insufficient and imbalanced data samples in proton exchange membrane fuel cell(PEMFC)performance degradation prediction,this study proposes a data augmentation-based model to predict PEMFC performance degradation.Firstly,an improved generative adversarial network(IGAN)with adaptive gradient penalty coefficient is proposed to address the problems of excessively fast gradient descent and insufficient diversity of generated samples.Then,the IGANis used to generate datawith a distribution analogous to real data,therebymitigating the insufficiency and imbalance of original PEMFC samples and providing the predictionmodel with training data rich in feature information.Finally,a convolutional neural network-bidirectional long short-termmemory(CNN-BiLSTM)model is adopted to predict PEMFC performance degradation.Experimental results show that the data generated by the proposed IGAN exhibits higher quality than that generated by the original GAN,and can fully characterize and enrich the original data’s features.Using the augmented data,the prediction accuracy of the CNN-BiLSTM model is significantly improved,rendering it applicable to tasks of predicting PEMFC performance degradation.
基金supported by the Institute of Information&Communications Technology Planning&Evaluation(IITP)grant funded by the Korea government(MSIT)[RS-2021-II211341,Artificial Intelligence Graduate School Program(Chung-Ang University)],and by the Chung-Ang University Graduate Research Scholarship in 2024.
文摘Legal case classification involves the categorization of legal documents into predefined categories,which facilitates legal information retrieval and case management.However,real-world legal datasets often suffer from class imbalances due to the uneven distribution of case types across legal domains.This leads to biased model performance,in the form of high accuracy for overrepresented categories and underperformance for minority classes.To address this issue,in this study,we propose a data augmentation method that masks unimportant terms within a document selectively while preserving key terms fromthe perspective of the legal domain.This approach enhances data diversity and improves the generalization capability of conventional models.Our experiments demonstrate consistent improvements achieved by the proposed augmentation strategy in terms of accuracy and F1 score across all models,validating the effectiveness of the proposed method in legal case classification.
基金funded by the National Key Research and Development Program of China(Grant No.2023YFB3907500)the National Natural Science Foundation(Grant No.42330602)the“Fengyun Satellite Remote Sensing Product Validation and Verification”Youth Innovation Team of the China Meteorological Administration(Grant No.CMA2023QN12)。
文摘The Global Precipitation Measurement(GPM)dual-frequency precipitation radar(DPR)products(Version 07A)are employed for a rigorous comparative analysis with ground-based operational weather radar(GR)networks.The reflectivity observed by GPM Ku PR is compared quantitatively against GR networks from CINRAD of China and NEXRAD of the United States,and the volume matching method is used for spatial matching.Additionally,a novel frequency correction method for all phases as well as precipitation types is used to correct the GPM Ku PR radar frequency to the GR frequency.A total of 20 GRs(including 10 from CINRAD and 10 from NEXRAD)are included in this comparative analysis.The results indicate that,compared with CINRAD matched data,NEXRAD exhibits larger biases in reflectivity when compared with the frequency-corrected Ku PR.The root-mean-square difference for CINRAD is calculated at 2.38 d B,whereas for NEXRAD it is 3.23 d B.The mean bias of CINRAD matched data is-0.16 d B,while the mean bias of NEXRAD is-2.10 d B.The mean standard deviation of bias for CINRAD is 2.15 d B,while for NEXRAD it is 2.29 d B.This study effectively assesses weather radar data in both the United States and China,which is crucial for improving the overall consistency of global precipitation estimates.
基金Natural Science Foundation of Zhejiang Province,Grant/Award Number:LY23F020025Science and Technology Commissioner Program of Huzhou,Grant/Award Number:2023GZ42Sichuan Provincial Science and Technology Support Program,Grant/Award Numbers:2023ZHCG0005,2023ZHCG0008。
文摘Data augmentation plays an important role in training deep neural model by expanding the size and diversity of the dataset.Initially,data augmentation mainly involved some simple transformations of images.Later,in order to increase the diversity and complexity of data,more advanced methods appeared and evolved to sophisticated generative models.However,these methods required a mass of computation of training or searching.In this paper,a novel training-free method that utilises the Pre-Trained Segment Anything Model(SAM)model as a data augmentation tool(PTSAM-DA)is proposed to generate the augmented annotations for images.Without the need for training,it obtains prompt boxes from the original annotations and then feeds the boxes to the pre-trained SAM to generate diverse and improved annotations.In this way,annotations are augmented more ingenious than simple manipulations without incurring huge computation for training a data augmentation model.Multiple comparative experiments on three datasets are conducted,including an in-house dataset,ADE20K and COCO2017.On this in-house dataset,namely Agricultural Plot Segmentation Dataset,maximum improvements of 3.77%and 8.92%are gained in two mainstream metrics,mIoU and mAcc,respectively.Consequently,large vision models like SAM are proven to be promising not only in image segmentation but also in data augmentation.
基金Postgraduate Innovation Top notch Talent Training Project of Hunan Province,Grant/Award Number:CX20220045Scientific Research Project of National University of Defense Technology,Grant/Award Number:22-ZZCX-07+2 种基金New Era Education Quality Project of Anhui Province,Grant/Award Number:2023cxcysj194National Natural Science Foundation of China,Grant/Award Numbers:62201597,62205372,1210456foundation of Hefei Comprehensive National Science Center,Grant/Award Number:KY23C502。
文摘Large-scale point cloud datasets form the basis for training various deep learning networks and achieving high-quality network processing tasks.Due to the diversity and robustness constraints of the data,data augmentation(DA)methods are utilised to expand dataset diversity and scale.However,due to the complex and distinct characteristics of LiDAR point cloud data from different platforms(such as missile-borne and vehicular LiDAR data),directly applying traditional 2D visual domain DA methods to 3D data can lead to networks trained using this approach not robustly achieving the corresponding tasks.To address this issue,the present study explores DA for missile-borne LiDAR point cloud using a Monte Carlo(MC)simulation method that closely resembles practical application.Firstly,the model of multi-sensor imaging system is established,taking into account the joint errors arising from the platform itself and the relative motion during the imaging process.A distortion simulation method based on MC simulation for augmenting missile-borne LiDAR point cloud data is proposed,underpinned by an analysis of combined errors between different modal sensors,achieving high-quality augmentation of point cloud data.The effectiveness of the proposed method in addressing imaging system errors and distortion simulation is validated using the imaging scene dataset constructed in this paper.Comparative experiments between the proposed point cloud DA algorithm and the current state-of-the-art algorithms in point cloud detection and single object tracking tasks demonstrate that the proposed method can improve the network performance obtained from unaugmented datasets by over 17.3%and 17.9%,surpassing SOTA performance of current point cloud DA algorithms.
基金funded by the Bavarian State Ministry of Science,Research and Art(Grant number:H.2-F1116.WE/52/2)。
文摘In order to address the widespread data shortage problem in battery research,this paper proposes a generative adversarial network model that combines it with deep convolutional networks,the Wasserstein distance,and the gradient penalty to achieve data augmentation.To lower the threshold for implementing the proposed method,transfer learning is further introduced.The W-DC-GAN-GP-TL framework is thereby formed.This framework is evaluated on 3 different publicly available datasets to judge the quality of generated data.Through visual comparisons and the examination of two visualization methods(probability density function(PDF)and principal component analysis(PCA)),it is demonstrated that the generated data is hard to distinguish from the real data.The application of generated data for training a battery state model using transfer learning is further evaluated.Specifically,Bi-GRU-based and Transformer-based methods are implemented on 2 separate datasets for estimating state of health(SOH)and state of charge(SOC),respectively.The results indicate that the proposed framework demonstrates satisfactory performance in different scenarios:for the data replacement scenario,where real data are removed and replaced with generated data,the state estimator accuracy decreases only slightly;for the data enhancement scenario,the estimator accuracy is further improved.The estimation accuracy of SOH and SOC is as low as 0.69%and 0.58%root mean square error(RMSE)after applying the proposed framework.This framework provides a reliable method for enriching battery measurement data.It is a generalized framework capable of generating a variety of time series data.
基金supported by the Natural Science Foundation of Sichuan Province(grant number:25NSFSC0265).
文摘Peri-implant keratinized mucosa(PIKM)augmentation refers to surgical procedures aimed at increasing the width of PIKM.Consensus reports emphasize the necessity of maintaining a minimum width of PIKM to ensure long-term peri-implant health.Currently,several surgical techniques have been validated for their effectiveness in increasing PIKM.However,the selection and application of PIKM augmentation methods may present challenges for dental practitioners due to heterogeneity in surgical techniques,variations in clinical scenarios,and anatomical differences.Therefore,clear guidelines and considerations for PIKM augmentation are needed.This expert consensus focuses on the commonly employed surgical techniques for PIKM augmentation and the factors influencing their selection at second-stage surgery.It aims to establish a standardized framework for assessing,planning,and executing PIKM augmentation procedures,with the goal of offering evidence-based guidance to enhance the predictability and success of PIKM augmentation.
文摘The integration of image analysis through deep learning(DL)into rock classification represents a significant leap forward in geological research.While traditional methods remain invaluable for their expertise and historical context,DL offers a powerful complement by enhancing the speed,objectivity,and precision of the classification process.This research explores the significance of image data augmentation techniques in optimizing the performance of convolutional neural networks(CNNs)for geological image analysis,particularly in the classification of igneous,metamorphic,and sedimentary rock types from rock thin section(RTS)images.This study primarily focuses on classic image augmentation techniques and evaluates their impact on model accuracy and precision.Results demonstrate that augmentation techniques like Equalize significantly enhance the model's classification capabilities,achieving an F1-Score of 0.9869 for igneous rocks,0.9884 for metamorphic rocks,and 0.9929 for sedimentary rocks,representing improvements compared to the baseline original results.Moreover,the weighted average F1-Score across all classes and techniques is 0.9886,indicating an enhancement.Conversely,methods like Distort lead to decreased accuracy and F1-Score,with an F1-Score of 0.949 for igneous rocks,0.954 for metamorphic rocks,and 0.9416 for sedimentary rocks,exacerbating the performance compared to the baseline.The study underscores the practicality of image data augmentation in geological image classification and advocates for the adoption of DL methods in this domain for automation and improved results.The findings of this study can benefit various fields,including remote sensing,mineral exploration,and environmental monitoring,by enhancing the accuracy of geological image analysis both for scientific research and industrial applications.
文摘Cassava is the most widely distributed food crop in Central Africa. Chikwangue, also known as kwanga in the Republic of Congo, is a starchy fermented cassava product that is a staple food in the country. This work aims to determine the composition of bioactive compounds in chikwangue, including biosurfactant-like molecules and proteins content. Antibacterial activities were investigated through the preliminary emulsification index of chikwangue and fermented paste. Antibacterial assay, 16S rRNA, cytK, hblD, nheB and entFM PCR amplifications, DNA sequence analysis, NCBI homology analysis, and phylogenic tree were performed using NGPhylogeny. fr and iTOL (interactive of live). Fermented cassava paste and chikwangue contain biosurfactants with an emulsification index of 50%. The total protein concentration in fermented cassava paste was 4 g/ml and the chikwangue was 2.5 g/mL Further sequence analysis showed that isolates shared a homology of up to 99.9% with Bacillus cereus PQ432941.1, B. licheniformis PQ432758.1, B. altitudinis PQ432754.1, B. subtilis PQ432759.1, B. mojavensis PQ432755.1, B. tequilensis MT994788.1, B. subtilis MT994789.1, Paenibacillus polymyxa PQ452544.1, B. velezensis PQ452545.1, B. thuringiensis PQ432763.1, B. pumilus PQ432762.1, B. subtilis MT994787.1, B. mycoides PQ432890.1, B. thuringiensis PQ432766.1, B. subtilis PQ432757.1 and B. amyloliquefaciens PQ432756.1. Importantly, the emulsification index (E24) ranged from 60 to 100% and the crude biosurfactant for the Bacillus strains mentioned above could easily inhibit the growth for pathogen Gram-negative bacteria (S. enterica, S. flexneri, E. coli, Klebsiella sp. and P. aeruginosa) with diameters ranging from 2.3 ± 0.1 cm to 5.5 ± 0.4 cm. On the other hand, the diameters of Gram-positive pathogenic bacteria (B. cereus and S. aureus) varied between 1.5 ± 0.5 cm and 4.0 ± 0.2 cm. These findings involve the promise purpose of Bacillus isolated from retted cassava, and this study systematically uncovered the biodiversity and distribution characteristics of retted paste cassava and chikwangue.
基金funded by Innovation and Development Special Project of China Meteorological Administration(CXFZ2022J038,CXFZ2024J035)Sichuan Science and Technology Program(No.2023YFQ0072)+1 种基金Key Laboratory of Smart Earth(No.KF2023YB03-07)Automatic Software Generation and Intelligent Service Key Laboratory of Sichuan Province(CUIT-SAG202210).
文摘Accurate cloud classification plays a crucial role in aviation safety,climate monitoring,and localized weather forecasting.Current research has been focusing on machine learning techniques,particularly deep learning based model,for the types identification.However,traditional approaches such as convolutional neural networks(CNNs)encounter difficulties in capturing global contextual information.In addition,they are computationally expensive,which restricts their usability in resource-limited environments.To tackle these issues,we present the Cloud Vision Transformer(CloudViT),a lightweight model that integrates CNNs with Transformers.The integration enables an effective balance between local and global feature extraction.To be specific,CloudViT comprises two innovative modules:Feature Extraction(E_Module)and Downsampling(D_Module).These modules are able to significantly reduce the number of model parameters and computational complexity while maintaining translation invariance and enhancing contextual comprehension.Overall,the CloudViT includes 0.93×10^(6)parameters,which decreases more than ten times compared to the SOTA(State-of-the-Art)model CloudNet.Comprehensive evaluations conducted on the HBMCD and SWIMCAT datasets showcase the outstanding performance of CloudViT.It achieves classification accuracies of 98.45%and 100%,respectively.Moreover,the efficiency and scalability of CloudViT make it an ideal candidate for deployment inmobile cloud observation systems,enabling real-time cloud image classification.The proposed hybrid architecture of CloudViT offers a promising approach for advancing ground-based cloud image classification.It holds significant potential for both optimizing performance and facilitating practical deployment scenarios.
基金supported by the Jiangsu Province IUR Cooperation Project (No.BY2021258)the Wuxi Science and Technology Development Fund Project (No.G20212028)。
文摘An improved cycle-consistent generative adversarial network(CycleGAN) method for defect data augmentation based on feature fusion and self attention residual module is proposed to address the insufficiency of defect sample data for light guide plate(LGP) in production,as well as the problem of minor defects.Two optimizations are made to the generator of CycleGAN:fusion of low resolution features obtained from partial up-sampling and down-sampling with high-resolution features,combination of self attention mechanism with residual network structure to replace the original residual module.Qualitative and quantitative experiments were conducted to compare different data augmentation methods,and the results show that the defect images of the LGP generated by the improved network were more realistic,and the accuracy of the you only look once version 5(YOLOv5) detection network for the LGP was improved by 5.6%,proving the effectiveness and accuracy of the proposed method.
基金supported by the Postdoctoral Fellowship Program(Grade B)of China(GZB20250435)the National Natural Science Foundation of China(62403270).
文摘High-quality data is essential for the success of data-driven learning tasks.The characteristics,precision,and completeness of the datasets critically determine the reliability,interpretability,and effectiveness of subsequent analyzes and applications,such as fault detection,predictive maintenance,and process optimization.However,for many industrial processes,obtaining sufficient high-quality data remains a significant challenge due to high costs,safety concerns,and practical constraints.To overcome these challenges,data augmentation has emerged as a rapidly growing research area,attracting considerable attention across both academia and industry.By expanding datasets,data augmentation techniques improve greater generalization and more robust performance in actual applications.This paper provides a comprehensive,multi-perspective review of data augmentation methods for industrial processes.For clarity and organization,existing studies are systematically grouped into four categories:small sample with low dimension,small sample with high dimension,large sample with low dimension,and large sample with high dimension.Within this framework,the review examines current research from both methodological and application-oriented perspectives,highlighting main methods,advantages,and limitations.By synthesizing these findings,this review offers a structured overview for scholars and practitioners,serving as a valuable reference for newcomers and experienced researchers seeking to explore and advance data augmentation techniques in industrial processes.
基金supported by Open Fund of National Key Laboratory of Deep Space Exploration(NKDSEL2024014)by Civil Aerospace Pre-research Project of State Administration of Science,Technology and Industry for National Defence,PRC(D040103).
文摘Space target imaging simulation technology is an important tool for space target detection and identification,with advantages that include high flexibility and low cost.However,existing space target imaging simulation technologies are mostly based on target magnitudes for simulations,making it difficult to meet image simulation requirements for different signal-to-noise ratio(SNR)needs.Therefore,design of a simulation method that generates target image sequences with various SNRs based on the optical detection system parameters will be important for faint space target detection research.Addressing the SNR calculation issue in optical observation systems,this paper proposes a ground-based detection image SNR calculation method using the optical system parameters.This method calculates the SNR of an observed image precisely using radiative transfer theory,the optical system parameters,and the observation environment parameters.An SNR-based target sequence image simulation method for ground-based detection scenarios is proposed.This method calculates the imaging SNR using the optical system parameters and establishes a model for conversion between the target’s apparent magnitude and image grayscale values,thereby enabling generation of target sequence simulation images with corresponding SNRs for different system parameters.Experiments show that the SNR obtained using this calculation method has an average calculation error of<1 dB when compared with the theoretical SNR of the actual optical system.Additionally,the simulation images generated by the imaging simulation method show high consistency with real images,which meets the requirements of faint space target detection algorithm research and provides reliable data support for development of related technologies.
基金supported by the Zhejiang Provincial Natural Science Foundation of China(grant no.LQ22H150005).
文摘Breast augmentation with implants is a popular cosmetic surgery that enhances breast volume and contour through various placement planes.In this review,we examine the impact of subglandular,subpectoral,and subfascial implant planes on postoperative outcomes and complication rates.Subglandular placement offers simplicity but is associated with higher risks of capsular contracture,hematoma,and rippling in patients with low tissue coverage.The subpectoral plane,widely adopted for its natural appearance and reduced capsular contracture risk,may cause dynamic deformity due to muscle contraction.Although technically challenging,the subfascial plane combines the benefits of soft tissue support and reduced implant displacement.We highlight the importance of choosing an optimal implant plane tailored to each patient’s anatomical and aesthetic needs to enhance surgical outcomes and minimize complications.Further research is needed to validate long-term efficacy,particularly for subfascial placement.
文摘Objective:Although bariatric surgeries are widely performed around the world,patients frequently experience the recurrence of pre-existing gastroesophageal reflux disease(GERD)symptoms or develop new symptoms,some of which are resistant to medical treatment.This study investigates the effect and outcome of magnetic sphincter augmentation(MSA),a minimally invasive treatment for GERD,in this population.Methods:A thorough search of the PubMed,Cochrane,Scopus,Web of Science,and Google Scholar databases from inception until June 6,2024 was performed to retrieve relevant studies that evaluated the effects of MSA on the GERD health-related quality of life(GERD-HRQL)score and the reduction in proton pump inhibitor(PPI)use in patients who underwent bariatric surgery.The“meta”package in RStudio version 2023.12.0 t 369 was used.Results:A total of eight studies were included in the systematic review and seven studies were included in the meta-analysis.MSA significantly reduced the GERD-HRQL score(MD?27.55[95%CI:30.99 to24.11],p<0.01)and PPI use(RR?0.23[95%CI:0.16 to 0.33],p<0.01).Conclusion:MSA is a viable treatment option for patients with GERD symptoms who undergo bariatric surgery.This approach showed promising results in terms of reducing the GERD-HRQL score and reducing the use of PPI.
文摘Bird species classification is not only a challenging topic in artificial intelligence but also a domain closely related to environmental protection and ecological research.Additionally,performing edge computing on low-level devices using small neural networks can be an important research direction.In this paper,we use the EfficientNetV2B0 model for bird species classification,applying transfer learning on a dataset of 525 bird species.We also employ the BiRefNet model to remove backgrounds from images in the training set.The generated background-removed images are mixed with the original training set as a form of data augmentation.We aim for these background-removed images to help the model focus on key features,and by combining data augmentation with transfer learning,we trained a highly accurate and efficient bird species classification model.The training process is divided into a transfer learning stage and a fine-tuning stage.In the transfer learning stage,only the newly added custom layers are trained;while in the fine-tuning stage,all pre-trained layers except for the batch normalization layers are fine-tuned.According to the experimental results,the proposed model not only has an advantage in size compared to other models but also outperforms them in various metrics.The training results show that the proposed model achieved an accuracy of 99.54%and a precision of 99.62%,demonstrating that it achieves both lightweight design and high accuracy.To confirm the credibility of the results,we use heatmaps to interpret the model.The heatmaps show that our model can clearly highlight the image feature area.In addition,we also perform the 10-fold cross-validation on the model to verify its credibility.Finally,this paper proposes a model with low training cost and high accuracy,making it suitable for deployment on edge computing devices to provide lighter and more convenient services.
基金supported by Postgraduate Research&Practice Innovation Program of Jiangsu Province(Grant No.KYCX24_4084)external funding。
文摘Imbalanced data distribution stands as the primary cause of performance deterioration in the majority of supervised classification algorithms.The current publicly available weld defect datasets are very limited,and the samples of various defects are seriously imbalanced.The paper proposes an improved deep convolution generative adversarial network(DCGAN)to balance the weld defect dataset.To solve the problem of poor diversity in the samples generated by the traditional DCGAN,a C-Res unit is constructed,which integrates the convolutional block attention module(CBAM)into the residual block.The transposed convolution in the DCGAN's generator is replaced with the constructed C-Res unit to enhance the attention to image details and improve the stability and learning efficiency of the model.The Pixelshuffle module is added into the generator as the upsampling module to solve the problem that the C-Res unit can't up-sample like the transposed convolution.CBAM is added into the DCGAN's discriminator to further enhance the discriminator's ability to judge the quality of the generated sample.To validate the effectiveness of the improved DCGAN,comparison experiments are carried out.The weld defect dataset is balanced by DCGAN and improved DCGAN,respectively,and then YOLOv8s-cls is used to classify the weld defect sample based on the original dataset,the dataset balanced by the DCGAN,and the dataset balanced by the improved DCGAN,respectively.Among the nine F1 scores of the nine types of samples,seven of them are higher than those of YOLOv8s-cls trained with the original dataset,and six of them are higher than those of YOLOv8s-cls trained with the dataset balanced by the traditional DCGAN.The experiments reveal that the weld defect dataset balanced with improved DCGAN can enhance the performance of the supervised classification model,and is helpful to realize automation of weld defect detection.
基金supported by the National Natural Science Foundation of China(Grant No.42101089)Sichuan Science and Technology Program(2022YFS0586)the Open Fund of Key Laboratory of Mountain Hazards and Earth Surface Processes Chinese Academy of Sciences.
文摘Rainfall-induced shallow landslides pose one of significant geological hazards,necessitating precise monitoring and prediction for effective disaster mitigation.Most studies on landslide prediction have focused on optimizing machine learning(ML)algorithms,very limited attention has been paid to enhancing data quality for improved predictive performance.This study employs strategic data augmentation(DA)techniques to enhance the accuracy of shallow landslide prediction.Using five DA methods including singular spectrum analysis(SSA),moving averages(MA),wavelet denoising(WD),variational mode decomposition(VMD),and linear interpolation(LI),we utilize strategies such as smoothing,denoising,trend decomposition,and synthetic data generation to improve the training dataset.Four machine learning algorithms,i.e.artificial neural network(ANN),recurrent neural network(RNN),one-dimensional convolutional neural network(CNN1D),and long short-term memory(LSTM),are used to forecast landslide displacement.The case study of a landslide in southwest China shows the effectiveness of our approach in predicting landslide displacements,despite the inherent limitations of the monitoring dataset.VMD proves the most effective for smoothing and denoising,improving R^(2),RMSE,and MAPE by 172.16%,71.82%,and 98.9%,respectively.SSA addresses missing data,while LI is effective with limited data samples,improving metrics by 21.6%,52.59%,and 47.87%,respectively.This study demonstrates the potential of DA techniques to mitigate the impact of data defects on landslide prediction accuracy,with implications for similar cases.
基金supported by grants fromthe North China University of Technology Research Start-Up Fund(11005136024XN147-14)and(110051360024XN151-97)Guangzhou Development Zone Science and Technology Project(2023GH02)+4 种基金the National Key R&D Program of China(2021YFE0201100 and 2022YFA1103401 to Juntao Gao)National Natural Science Foundation of China(981890991 to Juntao Gao)Beijing Municipal Natural Science Foundation(Z200021 to Juntao Gao)CAS Interdisciplinary Innovation Team(JCTD-2020-04 to Juntao Gao)0032/2022/A,by Macao FDCT,and MYRG2022-00271-FST.
文摘Hematoxylin and Eosin(H&E)images,popularly used in the field of digital pathology,often pose challenges due to their limited color richness,hindering the differentiation of subtle cell features crucial for accurate classification.Enhancing the visibility of these elusive cell features helps train robust deep-learning models.However,the selection and application of image processing techniques for such enhancement have not been systematically explored in the research community.To address this challenge,we introduce Salient Features Guided Augmentation(SFGA),an approach that strategically integrates machine learning and image processing.SFGA utilizes machine learning algorithms to identify crucial features within cell images,subsequently mapping these features to appropriate image processing techniques to enhance training images.By emphasizing salient features and aligning them with corresponding image processing methods,SFGA is designed to enhance the discriminating power of deep learning models in cell classification tasks.Our research undertakes a series of experiments,each exploring the performance of different datasets and data enhancement techniques in classifying cell types,highlighting the significance of data quality and enhancement in mitigating overfitting and distinguishing cell characteristics.Specifically,SFGA focuses on identifying tumor cells from tissue for extranodal extension detection,with the SFGA-enhanced dataset showing notable advantages in accuracy.We conducted a preliminary study of five experiments,among which the accuracy of the pleomorphism experiment improved significantly from 50.81%to 95.15%.The accuracy of the other four experiments also increased,with improvements ranging from 3 to 43 percentage points.Our preliminary study shows the possibilities to enhance the diagnostic accuracy of deep learning models and proposes a systematic approach that could enhance cancer diagnosis,contributing as a first step in using SFGA in medical image enhancement.