Differentiating between regular and abnormal noises in machine-generated sounds is a crucial but difficult problem.For accurate audio signal classification,suitable and efficient techniques are needed,particularly mac...Differentiating between regular and abnormal noises in machine-generated sounds is a crucial but difficult problem.For accurate audio signal classification,suitable and efficient techniques are needed,particularly machine learning approaches for automated classification.Due to the dynamic and diverse representative characteristics of audio data,the probability of achieving high classification accuracy is relatively low and requires further research efforts.This study proposes an ensemble model based on the LeNet and hierarchical attention mechanism(HAM)models with MFCC features to enhance the models’capacity to handle bias.Additionally,CNNs,bidirectional LSTM(BiLSTM),CRNN,LSTM,capsule network model(CNM),attention mechanism(AM),gated recurrent unit(GRU),ResNet,EfficientNet,and HAM models are implemented for performance comparison.Experiments involving the DCASE2020 dataset reveal that the proposed approach works better than the others,achieving an impressive 99.13%accuracy and 99.56%k-fold cross-validation accuracy.Comparison with state-of-the-art studies further validates this performance.The study’s findings highlight the potential of the proposed approach for accurate fault detection in vehicles,particularly involving the use of acoustic data.展开更多
Defect detection in printed circuit boards(PCB)remains challenging due to the difficulty of identifying small-scale defects,the inefficiency of conventional approaches,and the interference from complex backgrounds.To ...Defect detection in printed circuit boards(PCB)remains challenging due to the difficulty of identifying small-scale defects,the inefficiency of conventional approaches,and the interference from complex backgrounds.To address these issues,this paper proposes SIM-Net,an enhanced detection framework derived from YOLOv11.The model integrates SPDConv to preserve fine-grained features for small object detection,introduces a novel convolutional partial attention module(C2PAM)to suppress redundant background information and highlight salient regions,and employs a multi-scale fusion network(MFN)with a multi-grain contextual module(MGCT)to strengthen contextual representation and accelerate inference.Experimental evaluations demonstrate that SIM-Net achieves 92.4%mAP,92%accuracy,and 89.4%recall with an inference speed of 75.1 FPS,outperforming existing state-of-the-art methods.These results confirm the robustness and real-time applicability of SIM-Net for PCB defect inspection.展开更多
To address the challenges of high-precision optical surface defect detection,we propose a novel design for a wide-field and broadband light field camera in this work.The proposed system can achieve a 50°field of ...To address the challenges of high-precision optical surface defect detection,we propose a novel design for a wide-field and broadband light field camera in this work.The proposed system can achieve a 50°field of view and operates at both visible and near-infrared wavelengths.Using the principles of light field imaging,the proposed design enables 3D reconstruction of optical surfaces,thus enabling vertical surface height measurements with enhanced accuracy.Using Zemax-based simulations,we evaluate the system’s modulation transfer function,its optical aberrations,and its tolerance to shape variations through Zernike coefficient adjustments.The results demonstrate that this camera can achieve the required spatial resolution while also maintaining high imaging quality and thus offers a promising solution for advanced optical surface defect inspection.展开更多
In printed circuit board(PCB)manufacturing,surface defects can significantly affect product quality.To address the performance degradation,high false detection rates,and missed detections caused by complex backgrounds...In printed circuit board(PCB)manufacturing,surface defects can significantly affect product quality.To address the performance degradation,high false detection rates,and missed detections caused by complex backgrounds in current intelligent inspection algorithms,this paper proposes CG-YOLOv8,a lightweight and improved model based on YOLOv8n for PCB surface defect detection.The proposed method optimizes the network architecture and compresses parameters to reduce model complexity while maintaining high detection accuracy,thereby enhancing the capability of identifying diverse defects under complex conditions.Specifically,a cascaded multi-receptive field(CMRF)module is adopted to replace the SPPF module in the backbone to improve feature perception,and an inverted residual mobile block(IRMB)is integrated into the C2f module to further enhance performance.Additionally,conventional convolution layers are replaced with GSConv to reduce computational cost,and a lightweight Convolutional Block Attention Module based Convolution(CBAMConv)module is introduced after Grouped Spatial Convolution(GSConv)to preserve accuracy through attention mechanisms.The detection head is also optimized by removing medium and large-scale detection layers,thereby enhancing the model’s ability to detect small-scale defects and further reducing complexity.Experimental results show that,compared to the original YOLOv8n,the proposed CG-YOLOv8 reduces parameter count by 53.9%,improves mAP@0.5 by 2.2%,and increases precision and recall by 2.0%and 1.8%,respectively.These improvements demonstrate that CG-YOLOv8 offers an efficient and lightweight solution for PCB surface defect detection.展开更多
High-performance lattice structures produced through powder bed fusion-laser beam exhibit high specific strength and energy absorption capabilities.However,a significant deviation exists between the mechanical propert...High-performance lattice structures produced through powder bed fusion-laser beam exhibit high specific strength and energy absorption capabilities.However,a significant deviation exists between the mechanical properties,service life of lattice structures,and design expectations.This deviation arises from the intense interaction between the laser and powder,which leads to the formation of numerous defects within the lattice structure.To address these issues,this paper proposes a high-performance defect detection model for metal lattice structures based on YOLOv4,called YOLO-Lattice(YOLO-L).The main objectives of this paper are as follows:(1)utilize computed tomography to construct datasets of the diamond lattice and body-centered cubic lattice structures;(2)in the backbone network of YOLOv4,employ deformable convolution to enhance the feature extraction capability of the model for small-scale defects;(3)adopt a dual-attention mechanism to suppress invalid feature information and amplify the distinction between defect and background regions;and(4)implement a channel pruning strategy to eliminate channels carrying less feature information,thereby improving the inference speed of the model.The experimental results on the diamond lattice structure dataset demonstrate that the mean average precision of the YOLO-L model increased from 96.98% to 98.8%(with an intersection over union of 0.5),and the inference speed decreased from 51.3 ms to 32.5 ms when compared to YOLOv4.Thus,the YOLO-L model can be effectively used to detect defects in metal lattice structures.展开更多
An experimental platform accompanying with the improved Roberts algorithm has been developed to achieve accurate and real-time edge detection of surface defects on heavy rails.Detection results of scratching defects s...An experimental platform accompanying with the improved Roberts algorithm has been developed to achieve accurate and real-time edge detection of surface defects on heavy rails.Detection results of scratching defects show that the improved Roberts operator can attain accurate positioning to defect contour and get complete edge information.Meanwhile,a decreasing amount of interference noises as well as more precise characteristic parameters of the extracted defects can also be confirmed for the improved algorithm.Furthermore,the BP neural network adopted for defects classification with the improved Roberts operator can obtain the target training precision with 98 iterative steps and time of 2s while that of traditional Roberts operator is 118 steps and 4s.Finally,an enhanced defects identification rate of 13.33%has also been confirmed after the Roberts operator is improved.The proposed detecting platform will be positive in producing high-quality heavy rails and guaranteeing the national transportation safety.展开更多
With the advent of Industry 4.0,marked by a surge in intelligent manufacturing,advanced sensors embedded in smart factories now enable extensive data collection on equipment operation.The analysis of such data is pivo...With the advent of Industry 4.0,marked by a surge in intelligent manufacturing,advanced sensors embedded in smart factories now enable extensive data collection on equipment operation.The analysis of such data is pivotal for ensuring production safety,a critical factor in monitoring the health status of manufacturing apparatus.Conventional defect detection techniques,typically limited to specific scenarios,often require manual feature extraction,leading to inefficiencies and limited versatility in the overall process.Our research presents an intelligent defect detection methodology that leverages deep learning techniques to automate feature extraction and defect localization processes.Our proposed approach encompasses a suite of components:the high-level feature learning block(HLFLB),the multi-scale feature learning block(MSFLB),and a dynamic adaptive fusion block(DAFB),working in tandem to extract meticulously and synergistically aggregate defect-related characteristics across various scales and hierarchical levels.We have conducted validation of the proposed method using datasets derived from gearbox and bearing assessments.The empirical outcomes underscore the superior defect detection capability of our approach.It demonstrates consistently high performance across diverse datasets and possesses the accuracy required to categorize defects,taking into account their specific locations and the extent of damage,proving the method’s effectiveness and reliability in identifying defects in industrial components.展开更多
Railway turnouts often develop defects such as chipping,cracks,and wear during use.If not detected and addressed promptly,these defects can pose significant risks to train operation safety and passenger security.Despi...Railway turnouts often develop defects such as chipping,cracks,and wear during use.If not detected and addressed promptly,these defects can pose significant risks to train operation safety and passenger security.Despite advances in defect detection technologies,research specifically targeting railway turnout defects remains limited.To address this gap,we collected images from railway inspectors and constructed a dataset of railway turnout defects in complex environments.To enhance detection accuracy,we propose an improved YOLOv8 model named YOLO-VSS-SOUP-Inner-CIoU(YOLO-VSI).The model employs a state-space model(SSM)to enhance the C2f module in the YOLOv8 backbone,proposed the C2f-VSS module to better capture long-range dependencies and contextual features,thus improving feature extraction in complex environments.In the network’s neck layer,we integrate SPDConv and Omni-Kernel Network(OKM)modules to improve the original PAFPN(Path Aggregation Feature Pyramid Network)structure,and proposed the Small Object Upgrade Pyramid(SOUP)structure to enhance small object detection capabilities.Additionally,the Inner-CIoU loss function with a scale factor is applied to further enhance the model’s detection capabilities.Compared to the baseline model,YOLO-VSI demonstrates a 3.5%improvement in average precision on our railway turnout dataset,showcasing increased accuracy and robustness.Experiments on the public NEU-DET dataset reveal a 2.3%increase in average precision over the baseline,indicating that YOLO-VSI has good generalization capabilities.展开更多
Current you only look once(YOLO)-based algorithm model is facing the challenge of overwhelming parameters and calculation complexity under the printed circuit board(PCB)defect detection application scenario.In order t...Current you only look once(YOLO)-based algorithm model is facing the challenge of overwhelming parameters and calculation complexity under the printed circuit board(PCB)defect detection application scenario.In order to solve this problem,we propose a new method,which combined the lightweight network mobile vision transformer(Mobile Vi T)with the convolutional block attention module(CBAM)mechanism and the new regression loss function.This method needed less computation resources,making it more suitable for embedded edge detection devices.Meanwhile,the new loss function improved the positioning accuracy of the bounding box and enhanced the robustness of the model.In addition,experiments on public datasets demonstrate that the improved model achieves an average accuracy of 87.9%across six typical defect detection tasks,while reducing computational costs by nearly 90%.It significantly reduces the model's computational requirements while maintaining accuracy,ensuring reliable performance for edge deployment.展开更多
To solve the problem of low detection accuracy for complex weld defects,the paper proposes a weld defects detection method based on improved YOLOv5s.To enhance the ability to focus on key information in feature maps,t...To solve the problem of low detection accuracy for complex weld defects,the paper proposes a weld defects detection method based on improved YOLOv5s.To enhance the ability to focus on key information in feature maps,the scSE attention mechanism is intro-duced into the backbone network of YOLOv5s.A Fusion-Block module and additional layers are added to the neck network of YOLOv5s to improve the effect of feature fusion,which is to meet the needs of complex object detection.To reduce the computation-al complexity of the model,the C3Ghost module is used to replace the CSP2_1 module in the neck network of YOLOv5s.The scSE-ASFF module is constructed and inserted between the neck network and the prediction end,which is to realize the fusion of features between the different layers.To address the issue of imbalanced sample quality in the dataset and improve the regression speed and accuracy of the loss function,the CIoU loss function in the YOLOv5s model is replaced with the Focal-EIoU loss function.Finally,ex-periments are conducted based on the collected weld defect dataset to verify the feasibility of the improved YOLOv5s for weld defects detection.The experimental results show that the precision and mAP of the improved YOLOv5s in detecting complex weld defects are as high as 83.4%and 76.1%,respectively,which are 2.5%and 7.6%higher than the traditional YOLOv5s model.The proposed weld defects detection method based on the improved YOLOv5s in this paper can effectively solve the problem of low weld defects detection accuracy.展开更多
This study investigates the application of Learnable Memory Vision Transformers(LMViT)for detecting metal surface flaws,comparing their performance with traditional CNNs,specifically ResNet18 and ResNet50,as well as o...This study investigates the application of Learnable Memory Vision Transformers(LMViT)for detecting metal surface flaws,comparing their performance with traditional CNNs,specifically ResNet18 and ResNet50,as well as other transformer-based models including Token to Token ViT,ViT withoutmemory,and Parallel ViT.Leveraging awidely-used steel surface defect dataset,the research applies data augmentation and t-distributed stochastic neighbor embedding(t-SNE)to enhance feature extraction and understanding.These techniques mitigated overfitting,stabilized training,and improved generalization capabilities.The LMViT model achieved a test accuracy of 97.22%,significantly outperforming ResNet18(88.89%)and ResNet50(88.90%),aswell as the Token to TokenViT(88.46%),ViT without memory(87.18),and Parallel ViT(91.03%).Furthermore,LMViT exhibited superior training and validation performance,attaining a validation accuracy of 98.2%compared to 91.0%for ResNet 18,96.0%for ResNet50,and 89.12%,87.51%,and 91.21%for Token to Token ViT,ViT without memory,and Parallel ViT,respectively.The findings highlight the LMViT’s ability to capture long-range dependencies in images,an areawhere CNNs struggle due to their reliance on local receptive fields and hierarchical feature extraction.The additional transformer-based models also demonstrate improved performance in capturing complex features over CNNs,with LMViT excelling particularly at detecting subtle and complex defects,which is critical for maintaining product quality and operational efficiency in industrial applications.For instance,the LMViT model successfully identified fine scratches and minor surface irregularities that CNNs often misclassify.This study not only demonstrates LMViT’s potential for real-world defect detection but also underscores the promise of other transformer-based architectures like Token to Token ViT,ViT without memory,and Parallel ViT in industrial scenarios where complex spatial relationships are key.Future research may focus on enhancing LMViT’s computational efficiency for deployment in real-time quality control systems.展开更多
For the characteristics of small,dense distribution,high diversity of defects and high precision and fast detection in the process of PCB(Printed Circuit Board)defect detection,a defect detection algorithm based on YO...For the characteristics of small,dense distribution,high diversity of defects and high precision and fast detection in the process of PCB(Printed Circuit Board)defect detection,a defect detection algorithm based on YOLOv8n was proposed in this study.Firstly,the original C2f module of YOLOv8n was improved into a C2FFaster-EMA module to reduce the number of parameters and floating-point operations(FLOPs).Additionally,the WIoUv3 loss function was introduced to mitigate the negative impact of low-quality defect images on model training.Consequently,a reduction in model size and an enhancement in detection precision were achieved.Finally,the ablation and comparative experiments were conducted on an augmented Deep PCB dataset,and the generalization experiments were performed on the PCB Defect-Augmented dataset.The results indicated that the proposed model reduces the number of parameters by 23.3%and FLOPs by 20%,P by 0.7%,mAP@0.5 by 0.3%,and mAP@0.5:0.95 by 3.9%,respectively,compared to the original YOLOv8n model.Furthermore,the comparative experiments demonstrated that the proposed model achieves higher accuracy and mAP compared to YOLOv5n and YOLOv5s.It was concluded that the proposed method satisfies the requirements for both accuracy and speed in PCB defect detection.展开更多
To address the problems of large number of parameters and high complexity of calculation in the current steel surface defect detection model,a steel surface defect lightweight algorithm CGV-YOLO based on the improveme...To address the problems of large number of parameters and high complexity of calculation in the current steel surface defect detection model,a steel surface defect lightweight algorithm CGV-YOLO based on the improvement of YOLOv8 was proposed in this study.Firstly,in the process of optimizing the network architecture,the algorithm designed the FRC module and embeds it in the backbone network.Then,the GSConv convolution was employed to construct the Slim-neck network architecture,which further reduces computational load while maintaining model accuracy.Finally,the optimized CBAM replaced the C2f module in the YOLOv8 backbone network,reducing both model parameters and computational load.Based on the database of NEU-DET and BSD,the F1-Score of the CGV-YOLO algorithm is improved by 1.3%and 1.1%respectively compared with the baseline model.Based on the database of NEU-DET,the Params and computational complexity of the model are reduced by 30.6%and 35.3%respectively against the baseline.The results demonstrated that the proposed algorithm drastically reduces the number of parameters and computational cost with the maintenance of the accuracy of the model and realizes the lightweight effect.展开更多
Accurate and timely insulator defect detection is crucial for maintaining the reliability and safety of the power supply.However,the development of deep-learning-based insulator defect detection is hindered by the sca...Accurate and timely insulator defect detection is crucial for maintaining the reliability and safety of the power supply.However,the development of deep-learning-based insulator defect detection is hindered by the scarcity of comprehensive,high-quality datasets for insulator defects.To address this gap,the synthetic insulator defect imaging and annotation(SYNTHIDIA)system was proposed.SYNTHIDIA generates synthetic defect images in a 3D virtual environment using domain randomisation,offering a cost-effective and versatile solution for creating diverse and annotated data.Our dataset includes 22,000 images with accurate pixel-level and instance-level annotations,covering broken defect and drop defect types.Through rigorous experiments,SYNTHIDIA demonstrates strong generalisation capabilities to real-world data and provides valuable insights into the impact of various domain factors on model performance.The inclusion of 3D models further supports broader research initiatives.SYNTHIDIA addresses data insufficiency in insulator defect detection and enhances model performance in data-limited scenarios,contributing significantly to the advancement of power inspection.展开更多
Aiming at the problems of low detection efficiency and difficult positioning of traditional steel surface defect detection methods,a lightweight steel surface defect detection model based on you only look once version...Aiming at the problems of low detection efficiency and difficult positioning of traditional steel surface defect detection methods,a lightweight steel surface defect detection model based on you only look once version 7(YOLOv7)is proposed.First,a cascading style sheets(CSS)block module is proposed,which uses more lightweight operations to obtain redundant information in the feature map,reduces the amount of computation,and effectively improves the detection speed.Secondly,the improved spatial pyramid pooling with cross stage partial convolutions(SPPCSPC)structure is adopted to ensure that the model can also pay attention to the defect location information while predicting the defect category information,obtain richer defect features.In addition,the convolution operation in the original model is simplified,which significantly reduces the size of the model and helps to improve the detection speed.Finally,using efficient intersection over union(EIOU)loss to focus on high-quality anchors,speed up convergence and improve positioning accuracy.Experiments were carried out on the Northeastern University-defect(NEU-DET)steel surface defect dataset.Compared with the original YOLOv7 model,the number of parameters of this model was reduced by 40%,the frames per second(FPS)reached 112,and the average accuracy reached 79.1%,the detection accuracy and speed have been improved,which can meet the needs of steel surface defect detection.展开更多
The detection of surface defects in concrete bridges using deep learning is of significant importance for reducing operational risks,saving maintenance costs,and driving the intelligent transformation of bridge defect...The detection of surface defects in concrete bridges using deep learning is of significant importance for reducing operational risks,saving maintenance costs,and driving the intelligent transformation of bridge defect detection.In contrast to the subjective and inefficient manual visual inspection,deep learning-based algorithms for concrete defect detection exhibit remarkable advantages,emerging as a focal point in recent research.This paper comprehensively analyzes the research progress of deep learning algorithms in the field of surface defect detection in concrete bridges in recent years.It introduces the early detection methods for surface defects in concrete bridges and the development of deep learning.Subsequently,it provides an overview of deep learning-based concrete bridge surface defect detection research from three aspects:image classification,object detection,and semantic segmentation.The paper summarizes the strengths and weaknesses of existing methods and the challenges they face.Additionally,it analyzes and prospects the development trends of surface defect detection in concrete bridges.展开更多
With the growing demand for higher product quality in manufacturing,X-ray non-destructive testing has found widespread application not only in industrial quality control but also in a wide range of industrial applicat...With the growing demand for higher product quality in manufacturing,X-ray non-destructive testing has found widespread application not only in industrial quality control but also in a wide range of industrial applications,owing to its unique capability to penetrate materials and reveal both internal and surface defects.This paper presents a systematic review of recent advances and current applications of X-ray-based defect detection in industrial components.It begins with an overview of the fundamental principles of X-ray imaging and typical inspection workflows,followed by a review of classical image processing methods for defect detection,segmentation,and classification,with particular emphasis on their limitations in feature extraction and robustness.The focus then shifts to recent developments in deep learning techniques—particularly convolutional neural networks,object detection,and segmentation algorithms—and their innovative applications in X-ray defect analysis,which demonstrate substantial advantages in terms of automation and accuracy.In addition,the paper summarizes newly released public datasets and performance evaluation metrics reported in recent years.Finally,it discusses the current challenges and potential solutions in X-ray-based defect detection for industrial components,outlines key directions for future research,and highlights the practical relevance of these advances to real-world industrial applications.展开更多
To address the high cost of online detection equipment and the low adaptability and accuracy of online detection models that are caused by uneven lighting,high noise,low contrast and so on,a block-based template match...To address the high cost of online detection equipment and the low adaptability and accuracy of online detection models that are caused by uneven lighting,high noise,low contrast and so on,a block-based template matching method incorporating fabric texture characteristics is proposed.Firstly,the template image set is evenly divided into N groups of sub-templates at the same positions to mitigate the effects of image illumination,reduce the model computation,and enhance the detection speed,with all image blocks being preprocessed.Then,the feature value information is extracted from the processed set of subtemplates at the same position,extracting two gray-level cooccurrence matrix(GLCM)feature values for each image block.These two feature values are then fused to construct a matching template.The mean feature value of all image blocks at the same position is calculated and used as the threshold for template detection,enabling automatic selection of template thresholds for different positions.Finally,the feature values of the image blocks in the experimental set are traversed and matched with subtemplates at the same positions to obtain fabric defect detection results.The detection experiments are conducted on a platform that simulates a fabric weaving environment,using defective gray fabrics from a weaving factory as the detected objects.The outcomes demonstrate the efficacy of the proposed method in detecting defects in gray fabrics,the mitigation of the impact of uneven external lighting on detection outcomes,and the enhancement of detection accuracy and adaptability.展开更多
The service life of internal combustion engines is significantly influenced by surface defects in cylinder liners.To address the limitations of traditional detection methods,we propose an enhanced YOLOv8 model with Sw...The service life of internal combustion engines is significantly influenced by surface defects in cylinder liners.To address the limitations of traditional detection methods,we propose an enhanced YOLOv8 model with Swin Transformer as the backbone network.This approach leverages Swin Transformer's multi-head self-attention mechanism for improved feature extraction of defects spanning various scales.Integrated with the YOLOv8 detection head,our model achieves a mean average precision of 85.1%on our dataset,outperforming baseline methods by 1.4%.The model's effectiveness is further demonstrated on a steel-surface defect dataset,indicating its broad applicability in industrial surface defect detection.Our work highlights the potential of combining Swin Transformer and YOLOv8 for accurate and efficient defect detection.展开更多
Rail defects can pose significant safety risks in railway operations, raising the need for effective detection methods. Acoustic Emission (AE) technology has shown promise for identifying and monitoring these defects,...Rail defects can pose significant safety risks in railway operations, raising the need for effective detection methods. Acoustic Emission (AE) technology has shown promise for identifying and monitoring these defects, and this study evaluates an advanced on-vehicle AE detection approach using bone-conduct sensors—a solution to improve upon previous AE methods of using on-rail sensor installations, which required extensive, costly on-rail sensor networks with limited effectiveness. In response to these challenges, the study specifically explored bone-conduct sensors mounted directly on the vehicle rather than rails by evaluating AE signals generated by the interaction between rails and the train’s wheels while in motion. In this research, a prototype detection system was developed and tested through initial trials at the Nevada Railroad Museum using a track with pre-damaged welding defects. Further testing was conducted at the Transportation Technology Center Inc. (rebranded as MxV Rail) in Colorado, where the system’s performance was evaluated across various defect types and train speeds. The results indicated that bone-conduct sensors were insufficient for detecting AE signals when mounted on moving vehicles. These findings highlight the limitations of contact-based methods in real-world applications and indicate the need for exploring improved, non-contact approaches.展开更多
基金funded by Princess Nourah bint Abdulrahman University Researchers Supporting Project number(PNURSP2025R746),Princess Nourah bint Abdulrahman University,Riyadh,Saudi Arabia。
文摘Differentiating between regular and abnormal noises in machine-generated sounds is a crucial but difficult problem.For accurate audio signal classification,suitable and efficient techniques are needed,particularly machine learning approaches for automated classification.Due to the dynamic and diverse representative characteristics of audio data,the probability of achieving high classification accuracy is relatively low and requires further research efforts.This study proposes an ensemble model based on the LeNet and hierarchical attention mechanism(HAM)models with MFCC features to enhance the models’capacity to handle bias.Additionally,CNNs,bidirectional LSTM(BiLSTM),CRNN,LSTM,capsule network model(CNM),attention mechanism(AM),gated recurrent unit(GRU),ResNet,EfficientNet,and HAM models are implemented for performance comparison.Experiments involving the DCASE2020 dataset reveal that the proposed approach works better than the others,achieving an impressive 99.13%accuracy and 99.56%k-fold cross-validation accuracy.Comparison with state-of-the-art studies further validates this performance.The study’s findings highlight the potential of the proposed approach for accurate fault detection in vehicles,particularly involving the use of acoustic data.
文摘Defect detection in printed circuit boards(PCB)remains challenging due to the difficulty of identifying small-scale defects,the inefficiency of conventional approaches,and the interference from complex backgrounds.To address these issues,this paper proposes SIM-Net,an enhanced detection framework derived from YOLOv11.The model integrates SPDConv to preserve fine-grained features for small object detection,introduces a novel convolutional partial attention module(C2PAM)to suppress redundant background information and highlight salient regions,and employs a multi-scale fusion network(MFN)with a multi-grain contextual module(MGCT)to strengthen contextual representation and accelerate inference.Experimental evaluations demonstrate that SIM-Net achieves 92.4%mAP,92%accuracy,and 89.4%recall with an inference speed of 75.1 FPS,outperforming existing state-of-the-art methods.These results confirm the robustness and real-time applicability of SIM-Net for PCB defect inspection.
基金supported by the Jilin Science and Technology Development Plan (20240101029JJ) for the following study:synchronized high-speed detection of surface shape and defects in the grinding stage of complex surfaces (KLMSZZ202305)for the high-precision wide dynamic large aperture optical inspection system for fine astronomical observation by the National Major Research Instrument Development Project (62127901)+2 种基金for ultrasmooth manufacturing technology of large diameter complex curved surface by the National Key R&D Program(2022YFB3403405)for research on the key technology of rapid synchronous detection of surface shape and subsurface defects in the grinding stage of large diameter complex surfaces by the International Cooperation Project(2025010157)The Key Laboratory of Optical System Advanced Manufacturing Technology,Chinese Academy of Sciences (2022KLOMT02-04) also supported this study
文摘To address the challenges of high-precision optical surface defect detection,we propose a novel design for a wide-field and broadband light field camera in this work.The proposed system can achieve a 50°field of view and operates at both visible and near-infrared wavelengths.Using the principles of light field imaging,the proposed design enables 3D reconstruction of optical surfaces,thus enabling vertical surface height measurements with enhanced accuracy.Using Zemax-based simulations,we evaluate the system’s modulation transfer function,its optical aberrations,and its tolerance to shape variations through Zernike coefficient adjustments.The results demonstrate that this camera can achieve the required spatial resolution while also maintaining high imaging quality and thus offers a promising solution for advanced optical surface defect inspection.
基金funded by the Joint Funds of the National Natural Science Foundation of China(U2341223)the Beijing Municipal Natural Science Foundation(No.4232067).
文摘In printed circuit board(PCB)manufacturing,surface defects can significantly affect product quality.To address the performance degradation,high false detection rates,and missed detections caused by complex backgrounds in current intelligent inspection algorithms,this paper proposes CG-YOLOv8,a lightweight and improved model based on YOLOv8n for PCB surface defect detection.The proposed method optimizes the network architecture and compresses parameters to reduce model complexity while maintaining high detection accuracy,thereby enhancing the capability of identifying diverse defects under complex conditions.Specifically,a cascaded multi-receptive field(CMRF)module is adopted to replace the SPPF module in the backbone to improve feature perception,and an inverted residual mobile block(IRMB)is integrated into the C2f module to further enhance performance.Additionally,conventional convolution layers are replaced with GSConv to reduce computational cost,and a lightweight Convolutional Block Attention Module based Convolution(CBAMConv)module is introduced after Grouped Spatial Convolution(GSConv)to preserve accuracy through attention mechanisms.The detection head is also optimized by removing medium and large-scale detection layers,thereby enhancing the model’s ability to detect small-scale defects and further reducing complexity.Experimental results show that,compared to the original YOLOv8n,the proposed CG-YOLOv8 reduces parameter count by 53.9%,improves mAP@0.5 by 2.2%,and increases precision and recall by 2.0%and 1.8%,respectively.These improvements demonstrate that CG-YOLOv8 offers an efficient and lightweight solution for PCB surface defect detection.
基金supported by Natural Science Foundation of China(Grant No.52175488)Scientific Research Program for Young Outstanding Talent of Higher Education of Hebei Province(China)(Grant No.BJ2021045)S&T Program of Hebei(China)(Grant No.236Z1808G).
文摘High-performance lattice structures produced through powder bed fusion-laser beam exhibit high specific strength and energy absorption capabilities.However,a significant deviation exists between the mechanical properties,service life of lattice structures,and design expectations.This deviation arises from the intense interaction between the laser and powder,which leads to the formation of numerous defects within the lattice structure.To address these issues,this paper proposes a high-performance defect detection model for metal lattice structures based on YOLOv4,called YOLO-Lattice(YOLO-L).The main objectives of this paper are as follows:(1)utilize computed tomography to construct datasets of the diamond lattice and body-centered cubic lattice structures;(2)in the backbone network of YOLOv4,employ deformable convolution to enhance the feature extraction capability of the model for small-scale defects;(3)adopt a dual-attention mechanism to suppress invalid feature information and amplify the distinction between defect and background regions;and(4)implement a channel pruning strategy to eliminate channels carrying less feature information,thereby improving the inference speed of the model.The experimental results on the diamond lattice structure dataset demonstrate that the mean average precision of the YOLO-L model increased from 96.98% to 98.8%(with an intersection over union of 0.5),and the inference speed decreased from 51.3 ms to 32.5 ms when compared to YOLOv4.Thus,the YOLO-L model can be effectively used to detect defects in metal lattice structures.
基金Supported by the National Natural Science Foundation of China(No.51174151)Major Scientific Research Projects of Hubei Provincial Department of Education(No.2010Z19003)+1 种基金Natural Science Foundation of Science and Technology Department of Hubei Province(No.2010CDB03403)Student Research Fund of WUST(No.14ZRB047)
文摘An experimental platform accompanying with the improved Roberts algorithm has been developed to achieve accurate and real-time edge detection of surface defects on heavy rails.Detection results of scratching defects show that the improved Roberts operator can attain accurate positioning to defect contour and get complete edge information.Meanwhile,a decreasing amount of interference noises as well as more precise characteristic parameters of the extracted defects can also be confirmed for the improved algorithm.Furthermore,the BP neural network adopted for defects classification with the improved Roberts operator can obtain the target training precision with 98 iterative steps and time of 2s while that of traditional Roberts operator is 118 steps and 4s.Finally,an enhanced defects identification rate of 13.33%has also been confirmed after the Roberts operator is improved.The proposed detecting platform will be positive in producing high-quality heavy rails and guaranteeing the national transportation safety.
基金supported by the Natural Science Foundation of Heilongjiang Province(Grant Number:LH2021F002).
文摘With the advent of Industry 4.0,marked by a surge in intelligent manufacturing,advanced sensors embedded in smart factories now enable extensive data collection on equipment operation.The analysis of such data is pivotal for ensuring production safety,a critical factor in monitoring the health status of manufacturing apparatus.Conventional defect detection techniques,typically limited to specific scenarios,often require manual feature extraction,leading to inefficiencies and limited versatility in the overall process.Our research presents an intelligent defect detection methodology that leverages deep learning techniques to automate feature extraction and defect localization processes.Our proposed approach encompasses a suite of components:the high-level feature learning block(HLFLB),the multi-scale feature learning block(MSFLB),and a dynamic adaptive fusion block(DAFB),working in tandem to extract meticulously and synergistically aggregate defect-related characteristics across various scales and hierarchical levels.We have conducted validation of the proposed method using datasets derived from gearbox and bearing assessments.The empirical outcomes underscore the superior defect detection capability of our approach.It demonstrates consistently high performance across diverse datasets and possesses the accuracy required to categorize defects,taking into account their specific locations and the extent of damage,proving the method’s effectiveness and reliability in identifying defects in industrial components.
文摘Railway turnouts often develop defects such as chipping,cracks,and wear during use.If not detected and addressed promptly,these defects can pose significant risks to train operation safety and passenger security.Despite advances in defect detection technologies,research specifically targeting railway turnout defects remains limited.To address this gap,we collected images from railway inspectors and constructed a dataset of railway turnout defects in complex environments.To enhance detection accuracy,we propose an improved YOLOv8 model named YOLO-VSS-SOUP-Inner-CIoU(YOLO-VSI).The model employs a state-space model(SSM)to enhance the C2f module in the YOLOv8 backbone,proposed the C2f-VSS module to better capture long-range dependencies and contextual features,thus improving feature extraction in complex environments.In the network’s neck layer,we integrate SPDConv and Omni-Kernel Network(OKM)modules to improve the original PAFPN(Path Aggregation Feature Pyramid Network)structure,and proposed the Small Object Upgrade Pyramid(SOUP)structure to enhance small object detection capabilities.Additionally,the Inner-CIoU loss function with a scale factor is applied to further enhance the model’s detection capabilities.Compared to the baseline model,YOLO-VSI demonstrates a 3.5%improvement in average precision on our railway turnout dataset,showcasing increased accuracy and robustness.Experiments on the public NEU-DET dataset reveal a 2.3%increase in average precision over the baseline,indicating that YOLO-VSI has good generalization capabilities.
基金supported by the National Natural Science Foundation of China(Nos.62373215,62373219 and 62073193)the Natural Science Foundation of Shandong Province(No.ZR2023MF100)+1 种基金the Key Projects of the Ministry of Industry and Information Technology(No.TC220H057-2022)the Independently Developed Instrument Funds of Shandong University(No.zy20240201)。
文摘Current you only look once(YOLO)-based algorithm model is facing the challenge of overwhelming parameters and calculation complexity under the printed circuit board(PCB)defect detection application scenario.In order to solve this problem,we propose a new method,which combined the lightweight network mobile vision transformer(Mobile Vi T)with the convolutional block attention module(CBAM)mechanism and the new regression loss function.This method needed less computation resources,making it more suitable for embedded edge detection devices.Meanwhile,the new loss function improved the positioning accuracy of the bounding box and enhanced the robustness of the model.In addition,experiments on public datasets demonstrate that the improved model achieves an average accuracy of 87.9%across six typical defect detection tasks,while reducing computational costs by nearly 90%.It significantly reduces the model's computational requirements while maintaining accuracy,ensuring reliable performance for edge deployment.
基金supported by Postgraduate Research&Practice Innovation Program of Jiangsu Province(Grant No.KYCX24_4084).
文摘To solve the problem of low detection accuracy for complex weld defects,the paper proposes a weld defects detection method based on improved YOLOv5s.To enhance the ability to focus on key information in feature maps,the scSE attention mechanism is intro-duced into the backbone network of YOLOv5s.A Fusion-Block module and additional layers are added to the neck network of YOLOv5s to improve the effect of feature fusion,which is to meet the needs of complex object detection.To reduce the computation-al complexity of the model,the C3Ghost module is used to replace the CSP2_1 module in the neck network of YOLOv5s.The scSE-ASFF module is constructed and inserted between the neck network and the prediction end,which is to realize the fusion of features between the different layers.To address the issue of imbalanced sample quality in the dataset and improve the regression speed and accuracy of the loss function,the CIoU loss function in the YOLOv5s model is replaced with the Focal-EIoU loss function.Finally,ex-periments are conducted based on the collected weld defect dataset to verify the feasibility of the improved YOLOv5s for weld defects detection.The experimental results show that the precision and mAP of the improved YOLOv5s in detecting complex weld defects are as high as 83.4%and 76.1%,respectively,which are 2.5%and 7.6%higher than the traditional YOLOv5s model.The proposed weld defects detection method based on the improved YOLOv5s in this paper can effectively solve the problem of low weld defects detection accuracy.
基金funded by Woosong University Academic Research 2024.
文摘This study investigates the application of Learnable Memory Vision Transformers(LMViT)for detecting metal surface flaws,comparing their performance with traditional CNNs,specifically ResNet18 and ResNet50,as well as other transformer-based models including Token to Token ViT,ViT withoutmemory,and Parallel ViT.Leveraging awidely-used steel surface defect dataset,the research applies data augmentation and t-distributed stochastic neighbor embedding(t-SNE)to enhance feature extraction and understanding.These techniques mitigated overfitting,stabilized training,and improved generalization capabilities.The LMViT model achieved a test accuracy of 97.22%,significantly outperforming ResNet18(88.89%)and ResNet50(88.90%),aswell as the Token to TokenViT(88.46%),ViT without memory(87.18),and Parallel ViT(91.03%).Furthermore,LMViT exhibited superior training and validation performance,attaining a validation accuracy of 98.2%compared to 91.0%for ResNet 18,96.0%for ResNet50,and 89.12%,87.51%,and 91.21%for Token to Token ViT,ViT without memory,and Parallel ViT,respectively.The findings highlight the LMViT’s ability to capture long-range dependencies in images,an areawhere CNNs struggle due to their reliance on local receptive fields and hierarchical feature extraction.The additional transformer-based models also demonstrate improved performance in capturing complex features over CNNs,with LMViT excelling particularly at detecting subtle and complex defects,which is critical for maintaining product quality and operational efficiency in industrial applications.For instance,the LMViT model successfully identified fine scratches and minor surface irregularities that CNNs often misclassify.This study not only demonstrates LMViT’s potential for real-world defect detection but also underscores the promise of other transformer-based architectures like Token to Token ViT,ViT without memory,and Parallel ViT in industrial scenarios where complex spatial relationships are key.Future research may focus on enhancing LMViT’s computational efficiency for deployment in real-time quality control systems.
文摘For the characteristics of small,dense distribution,high diversity of defects and high precision and fast detection in the process of PCB(Printed Circuit Board)defect detection,a defect detection algorithm based on YOLOv8n was proposed in this study.Firstly,the original C2f module of YOLOv8n was improved into a C2FFaster-EMA module to reduce the number of parameters and floating-point operations(FLOPs).Additionally,the WIoUv3 loss function was introduced to mitigate the negative impact of low-quality defect images on model training.Consequently,a reduction in model size and an enhancement in detection precision were achieved.Finally,the ablation and comparative experiments were conducted on an augmented Deep PCB dataset,and the generalization experiments were performed on the PCB Defect-Augmented dataset.The results indicated that the proposed model reduces the number of parameters by 23.3%and FLOPs by 20%,P by 0.7%,mAP@0.5 by 0.3%,and mAP@0.5:0.95 by 3.9%,respectively,compared to the original YOLOv8n model.Furthermore,the comparative experiments demonstrated that the proposed model achieves higher accuracy and mAP compared to YOLOv5n and YOLOv5s.It was concluded that the proposed method satisfies the requirements for both accuracy and speed in PCB defect detection.
文摘To address the problems of large number of parameters and high complexity of calculation in the current steel surface defect detection model,a steel surface defect lightweight algorithm CGV-YOLO based on the improvement of YOLOv8 was proposed in this study.Firstly,in the process of optimizing the network architecture,the algorithm designed the FRC module and embeds it in the backbone network.Then,the GSConv convolution was employed to construct the Slim-neck network architecture,which further reduces computational load while maintaining model accuracy.Finally,the optimized CBAM replaced the C2f module in the YOLOv8 backbone network,reducing both model parameters and computational load.Based on the database of NEU-DET and BSD,the F1-Score of the CGV-YOLO algorithm is improved by 1.3%and 1.1%respectively compared with the baseline model.Based on the database of NEU-DET,the Params and computational complexity of the model are reduced by 30.6%and 35.3%respectively against the baseline.The results demonstrated that the proposed algorithm drastically reduces the number of parameters and computational cost with the maintenance of the accuracy of the model and realizes the lightweight effect.
基金supported by Guangdong Power Grid Co.Ltd.Science and Technology Project(GDKJXM20231455).
文摘Accurate and timely insulator defect detection is crucial for maintaining the reliability and safety of the power supply.However,the development of deep-learning-based insulator defect detection is hindered by the scarcity of comprehensive,high-quality datasets for insulator defects.To address this gap,the synthetic insulator defect imaging and annotation(SYNTHIDIA)system was proposed.SYNTHIDIA generates synthetic defect images in a 3D virtual environment using domain randomisation,offering a cost-effective and versatile solution for creating diverse and annotated data.Our dataset includes 22,000 images with accurate pixel-level and instance-level annotations,covering broken defect and drop defect types.Through rigorous experiments,SYNTHIDIA demonstrates strong generalisation capabilities to real-world data and provides valuable insights into the impact of various domain factors on model performance.The inclusion of 3D models further supports broader research initiatives.SYNTHIDIA addresses data insufficiency in insulator defect detection and enhances model performance in data-limited scenarios,contributing significantly to the advancement of power inspection.
基金supported by the National Natural Science Foundation of China(No.62103298)the Natural Science Foundation of Hebei Province(No.F2018209289)。
文摘Aiming at the problems of low detection efficiency and difficult positioning of traditional steel surface defect detection methods,a lightweight steel surface defect detection model based on you only look once version 7(YOLOv7)is proposed.First,a cascading style sheets(CSS)block module is proposed,which uses more lightweight operations to obtain redundant information in the feature map,reduces the amount of computation,and effectively improves the detection speed.Secondly,the improved spatial pyramid pooling with cross stage partial convolutions(SPPCSPC)structure is adopted to ensure that the model can also pay attention to the defect location information while predicting the defect category information,obtain richer defect features.In addition,the convolution operation in the original model is simplified,which significantly reduces the size of the model and helps to improve the detection speed.Finally,using efficient intersection over union(EIOU)loss to focus on high-quality anchors,speed up convergence and improve positioning accuracy.Experiments were carried out on the Northeastern University-defect(NEU-DET)steel surface defect dataset.Compared with the original YOLOv7 model,the number of parameters of this model was reduced by 40%,the frames per second(FPS)reached 112,and the average accuracy reached 79.1%,the detection accuracy and speed have been improved,which can meet the needs of steel surface defect detection.
基金supported by the Key Research and Development Program of Shaanxi Province-International Science and Technology Cooperation Program Project (No.2020KW-001)the Contract for Xi'an Municipal Science and Technology Plan Project-Xi'an City Strong Foundation Innovation Plan (No.21XJZZ0074)the Key Project of Graduate Student Innovation Fund at Xi'an University of Posts and Telecommunications (No.CXJJZL2023013)。
文摘The detection of surface defects in concrete bridges using deep learning is of significant importance for reducing operational risks,saving maintenance costs,and driving the intelligent transformation of bridge defect detection.In contrast to the subjective and inefficient manual visual inspection,deep learning-based algorithms for concrete defect detection exhibit remarkable advantages,emerging as a focal point in recent research.This paper comprehensively analyzes the research progress of deep learning algorithms in the field of surface defect detection in concrete bridges in recent years.It introduces the early detection methods for surface defects in concrete bridges and the development of deep learning.Subsequently,it provides an overview of deep learning-based concrete bridge surface defect detection research from three aspects:image classification,object detection,and semantic segmentation.The paper summarizes the strengths and weaknesses of existing methods and the challenges they face.Additionally,it analyzes and prospects the development trends of surface defect detection in concrete bridges.
基金supported in part by the Project of National Key Laboratory of Advanced Casting Technologies under Grant CAT2023-002.
文摘With the growing demand for higher product quality in manufacturing,X-ray non-destructive testing has found widespread application not only in industrial quality control but also in a wide range of industrial applications,owing to its unique capability to penetrate materials and reveal both internal and surface defects.This paper presents a systematic review of recent advances and current applications of X-ray-based defect detection in industrial components.It begins with an overview of the fundamental principles of X-ray imaging and typical inspection workflows,followed by a review of classical image processing methods for defect detection,segmentation,and classification,with particular emphasis on their limitations in feature extraction and robustness.The focus then shifts to recent developments in deep learning techniques—particularly convolutional neural networks,object detection,and segmentation algorithms—and their innovative applications in X-ray defect analysis,which demonstrate substantial advantages in terms of automation and accuracy.In addition,the paper summarizes newly released public datasets and performance evaluation metrics reported in recent years.Finally,it discusses the current challenges and potential solutions in X-ray-based defect detection for industrial components,outlines key directions for future research,and highlights the practical relevance of these advances to real-world industrial applications.
文摘To address the high cost of online detection equipment and the low adaptability and accuracy of online detection models that are caused by uneven lighting,high noise,low contrast and so on,a block-based template matching method incorporating fabric texture characteristics is proposed.Firstly,the template image set is evenly divided into N groups of sub-templates at the same positions to mitigate the effects of image illumination,reduce the model computation,and enhance the detection speed,with all image blocks being preprocessed.Then,the feature value information is extracted from the processed set of subtemplates at the same position,extracting two gray-level cooccurrence matrix(GLCM)feature values for each image block.These two feature values are then fused to construct a matching template.The mean feature value of all image blocks at the same position is calculated and used as the threshold for template detection,enabling automatic selection of template thresholds for different positions.Finally,the feature values of the image blocks in the experimental set are traversed and matched with subtemplates at the same positions to obtain fabric defect detection results.The detection experiments are conducted on a platform that simulates a fabric weaving environment,using defective gray fabrics from a weaving factory as the detected objects.The outcomes demonstrate the efficacy of the proposed method in detecting defects in gray fabrics,the mitigation of the impact of uneven external lighting on detection outcomes,and the enhancement of detection accuracy and adaptability.
基金supported by the Scientific and technological key project in Henan Province 22210224002the Natural Science Foundation of Henan Polytechnic University B2021-38.
文摘The service life of internal combustion engines is significantly influenced by surface defects in cylinder liners.To address the limitations of traditional detection methods,we propose an enhanced YOLOv8 model with Swin Transformer as the backbone network.This approach leverages Swin Transformer's multi-head self-attention mechanism for improved feature extraction of defects spanning various scales.Integrated with the YOLOv8 detection head,our model achieves a mean average precision of 85.1%on our dataset,outperforming baseline methods by 1.4%.The model's effectiveness is further demonstrated on a steel-surface defect dataset,indicating its broad applicability in industrial surface defect detection.Our work highlights the potential of combining Swin Transformer and YOLOv8 for accurate and efficient defect detection.
文摘Rail defects can pose significant safety risks in railway operations, raising the need for effective detection methods. Acoustic Emission (AE) technology has shown promise for identifying and monitoring these defects, and this study evaluates an advanced on-vehicle AE detection approach using bone-conduct sensors—a solution to improve upon previous AE methods of using on-rail sensor installations, which required extensive, costly on-rail sensor networks with limited effectiveness. In response to these challenges, the study specifically explored bone-conduct sensors mounted directly on the vehicle rather than rails by evaluating AE signals generated by the interaction between rails and the train’s wheels while in motion. In this research, a prototype detection system was developed and tested through initial trials at the Nevada Railroad Museum using a track with pre-damaged welding defects. Further testing was conducted at the Transportation Technology Center Inc. (rebranded as MxV Rail) in Colorado, where the system’s performance was evaluated across various defect types and train speeds. The results indicated that bone-conduct sensors were insufficient for detecting AE signals when mounted on moving vehicles. These findings highlight the limitations of contact-based methods in real-world applications and indicate the need for exploring improved, non-contact approaches.