To solve the problems of complex lesion region morphology,blurred edges,and limited hardware resources for deploying the recognition model in pneumonia image recognition,an improved EfficientNetV2 pneumo-nia recogniti...To solve the problems of complex lesion region morphology,blurred edges,and limited hardware resources for deploying the recognition model in pneumonia image recognition,an improved EfficientNetV2 pneumo-nia recognition model based on multiscale attention is proposed.First,the number of main module stacks of the model is reduced to avoid overfitting,while the dilated convolution is introduced in the first convolutional layer to expand the receptive field of the model;second,a redesigned improved mobile inverted bottleneck convolution(IMBConv)module is proposed,in which GSConv is introduced to enhance the model’s attention to inter-channel information,and a SimAM module is introduced to reduce the number of model parameters while guaranteeing the model’s recognition performance;finally,an improved multi-scale efficient local attention(MELA)module is proposed to ensure the model’s recognition ability for pneumonia images with complex lesion regions.The experimental results show that the improved model has a computational complexity of 1.96 GFLOPs,which is reduced by 32%relative to the baseline model,and the number of model parameters is also reduced,and achieves an accuracy of 86.67%on the triple classification task of the public dataset Chest X-ray,representing an improvement of 2.74%compared to the baseline model.The recognition accuracies of ResNet50,Inception-V4,and Swin Transformer V2 on this dataset are 84.36%,85.98%,and 83.42%,respectively,and their computational complexities and model parameter counts are all higher than those of the proposed model.This indicates that the proposed model has very high feasibility for deployment in edge computing or mobile healthcare systems.In addition,the improved model achieved the highest accuracy of 90.98%on the four-classification public dataset compared to other models,indicating that the model has better recognition accuracy and generalization ability for pneumonia image recognition.展开更多
Automatic pancreas segmentation plays a pivotal role in assisting physicians with diagnosing pancreatic diseases,facilitating treatment evaluations,and designing surgical plans.Due to the pancreas’s tiny size,signifi...Automatic pancreas segmentation plays a pivotal role in assisting physicians with diagnosing pancreatic diseases,facilitating treatment evaluations,and designing surgical plans.Due to the pancreas’s tiny size,significant variability in shape and location,and low contrast with surrounding tissues,achieving high segmentation accuracy remains challenging.To improve segmentation precision,we propose a novel network utilizing EfficientNetV2 and multi-branch structures for automatically segmenting the pancreas fromCT images.Firstly,an EfficientNetV2 encoder is employed to extract complex and multi-level features,enhancing the model’s ability to capture the pancreas’s intricate morphology.Then,a residual multi-branch dilated attention(RMDA)module is designed to suppress irrelevant background noise and highlight useful pancreatic features.And re-parameterization Visual Geometry Group(RepVGG)blocks with amulti-branch structure are introduced in the decoder to effectively integrate deep features and low-level details,improving segmentation accuracy.Furthermore,we apply re-parameterization to the model,reducing computations and parameters while accelerating inference and reducing memory usage.Our approach achieves average dice similarity coefficient(DSC)of 85.59%,intersection over union(IoU)of 75.03%,precision of 85.09%,and recall of 86.57%on the NIH pancreas dataset.Compared with other methods,our model has fewer parameters and faster inference speed,demonstrating its enormous potential in practical applications of pancreatic segmentation.展开更多
针对工程实际故障诊断环境下,可用数据稀缺,导致智能诊断模型对轴承健康状态识别精度较低这一问题,提出一种基于二次迁移学习和EfficientNetV2(Two-Step Transfer of Efficient⁃NetV2,TSTE)的滚动轴承故障诊断新方法。首先,将模型在轴...针对工程实际故障诊断环境下,可用数据稀缺,导致智能诊断模型对轴承健康状态识别精度较低这一问题,提出一种基于二次迁移学习和EfficientNetV2(Two-Step Transfer of Efficient⁃NetV2,TSTE)的滚动轴承故障诊断新方法。首先,将模型在轴承全寿命周期数据集中训练,之后冻结模型浅层权重,将其在多工况轴承数据集中训练,进行第一次迁移学习。其次,通过构造类不平衡数据集,研究实际故障环境下可用数据稀缺对故障诊断性能的影响。然后,基于合成少数类过采样技术(Synthetic Minority Oversampling Technique,SMOTE)过采样方法与编辑最近邻(Edited Nearest Neighbors,ENN)欠采样方法对故障数据进行扩充,使类不平衡数据集重构为类平衡数据集。最后,将模型在类平衡数据集中训练,冻结模型底层权重,训练模型深层,进行第二次迁移学习,使模型掌握平衡数据集故障特征。通过多种指标进行实验评估,同时与其他方法进行对比,并使用Grad-CAM方法进行了特征可视化。结果表明,所提方法能够将模型在实验室环境下积累的故障诊断知识应用于实际工程设备,适用于检测数据稀缺情形下的滚动轴承故障诊断。展开更多
基金supported by the Scientific Research Fund of Hunan Provincial Education Department,China(Grant Nos.21C0439,22A0408).
文摘To solve the problems of complex lesion region morphology,blurred edges,and limited hardware resources for deploying the recognition model in pneumonia image recognition,an improved EfficientNetV2 pneumo-nia recognition model based on multiscale attention is proposed.First,the number of main module stacks of the model is reduced to avoid overfitting,while the dilated convolution is introduced in the first convolutional layer to expand the receptive field of the model;second,a redesigned improved mobile inverted bottleneck convolution(IMBConv)module is proposed,in which GSConv is introduced to enhance the model’s attention to inter-channel information,and a SimAM module is introduced to reduce the number of model parameters while guaranteeing the model’s recognition performance;finally,an improved multi-scale efficient local attention(MELA)module is proposed to ensure the model’s recognition ability for pneumonia images with complex lesion regions.The experimental results show that the improved model has a computational complexity of 1.96 GFLOPs,which is reduced by 32%relative to the baseline model,and the number of model parameters is also reduced,and achieves an accuracy of 86.67%on the triple classification task of the public dataset Chest X-ray,representing an improvement of 2.74%compared to the baseline model.The recognition accuracies of ResNet50,Inception-V4,and Swin Transformer V2 on this dataset are 84.36%,85.98%,and 83.42%,respectively,and their computational complexities and model parameter counts are all higher than those of the proposed model.This indicates that the proposed model has very high feasibility for deployment in edge computing or mobile healthcare systems.In addition,the improved model achieved the highest accuracy of 90.98%on the four-classification public dataset compared to other models,indicating that the model has better recognition accuracy and generalization ability for pneumonia image recognition.
基金supported by the Science and Technology Innovation Programof Hunan Province(Grant No.2022RC1021)the Hunan Provincial Natural Science Foundation Project(Grant No.2023JJ60124)+1 种基金the Changsha Natural Science Foundation Project(Grant No.kq2202265)the key project of the Hunan Provincial of Education(Grant No.22A0255).
文摘Automatic pancreas segmentation plays a pivotal role in assisting physicians with diagnosing pancreatic diseases,facilitating treatment evaluations,and designing surgical plans.Due to the pancreas’s tiny size,significant variability in shape and location,and low contrast with surrounding tissues,achieving high segmentation accuracy remains challenging.To improve segmentation precision,we propose a novel network utilizing EfficientNetV2 and multi-branch structures for automatically segmenting the pancreas fromCT images.Firstly,an EfficientNetV2 encoder is employed to extract complex and multi-level features,enhancing the model’s ability to capture the pancreas’s intricate morphology.Then,a residual multi-branch dilated attention(RMDA)module is designed to suppress irrelevant background noise and highlight useful pancreatic features.And re-parameterization Visual Geometry Group(RepVGG)blocks with amulti-branch structure are introduced in the decoder to effectively integrate deep features and low-level details,improving segmentation accuracy.Furthermore,we apply re-parameterization to the model,reducing computations and parameters while accelerating inference and reducing memory usage.Our approach achieves average dice similarity coefficient(DSC)of 85.59%,intersection over union(IoU)of 75.03%,precision of 85.09%,and recall of 86.57%on the NIH pancreas dataset.Compared with other methods,our model has fewer parameters and faster inference speed,demonstrating its enormous potential in practical applications of pancreatic segmentation.
文摘针对工程实际故障诊断环境下,可用数据稀缺,导致智能诊断模型对轴承健康状态识别精度较低这一问题,提出一种基于二次迁移学习和EfficientNetV2(Two-Step Transfer of Efficient⁃NetV2,TSTE)的滚动轴承故障诊断新方法。首先,将模型在轴承全寿命周期数据集中训练,之后冻结模型浅层权重,将其在多工况轴承数据集中训练,进行第一次迁移学习。其次,通过构造类不平衡数据集,研究实际故障环境下可用数据稀缺对故障诊断性能的影响。然后,基于合成少数类过采样技术(Synthetic Minority Oversampling Technique,SMOTE)过采样方法与编辑最近邻(Edited Nearest Neighbors,ENN)欠采样方法对故障数据进行扩充,使类不平衡数据集重构为类平衡数据集。最后,将模型在类平衡数据集中训练,冻结模型底层权重,训练模型深层,进行第二次迁移学习,使模型掌握平衡数据集故障特征。通过多种指标进行实验评估,同时与其他方法进行对比,并使用Grad-CAM方法进行了特征可视化。结果表明,所提方法能够将模型在实验室环境下积累的故障诊断知识应用于实际工程设备,适用于检测数据稀缺情形下的滚动轴承故障诊断。