Confocal laser endomicroscopy(CLE)has become an indispensable tool in the diagnosis and detection of gastrointestinal(GI)diseases due to its high-resolution and high-contrast imaging capabilities.However,the early-sta...Confocal laser endomicroscopy(CLE)has become an indispensable tool in the diagnosis and detection of gastrointestinal(GI)diseases due to its high-resolution and high-contrast imaging capabilities.However,the early-stage imaging changes of gastrointestinal disorders are often subtle,and traditional medical image analysis methods rely heavily on manual interpretation,which is time-consuming,subject to observer variability,and inefficient for accurate lesion identification across large-scale image datasets.With the introduction of artificial intelligence(AI)technologies,AI-driven CLE image analysis systems can automatically extract pathological features and have demonstrated significant clinical value in lesion recognition,classification diagnosis,and malignancy prediction of GI diseases.These systems greatly enhance diagnostic efficiency and early detection capabilities.This review summarizes the applications of AI-assisted CLE in GI diseases,analyzes the limitations of current technologies,and explores future research directions.It is expected that the deep integration of AI and confocal imaging technologies will provide strong support for precision diagnosis and personalized treatment in the field of gastrointestinal disorders.展开更多
Deep learning models have become a core technological tool in the field of medical image analysis.However,these models often suffer from a lack of transparency in their decision-making processes,leading to challenges ...Deep learning models have become a core technological tool in the field of medical image analysis.However,these models often suffer from a lack of transparency in their decision-making processes,leading to challenges related to trust and interpret ability in clinical applications.To address this issue,explainable artificial intelligence(XAI)techniques have been applied to medical image analysis.While showing promising potential,XAI also brings significant ethical risks in practice—most notably,the problem of spurious explanations.Such explanations may rise further concerns regarding patient privacy,data security,and the attribution of decisionmaking authority in medical contexts.This paper analyzes the application of XAI methods—particularly saliency aps—in medical image interpretation,identifies the underlying causes of spurious explanations,and proposes possible mitigation strategies.The aim is to contribute to the responsible and sustainable integration of explainable AI into clinical practice.展开更多
Background:Accurate classification of normal blood cells is a critical foundation for automated hematological analysis,including the detection of pathological conditions like leukemia.While convolutional neural networ...Background:Accurate classification of normal blood cells is a critical foundation for automated hematological analysis,including the detection of pathological conditions like leukemia.While convolutional neural networks(CNNs)excel in local feature extraction,their ability to capture global contextual relationships in complex cellular morphologies is limited.This study introduces a hybrid CNN-Transformer framework to enhance normal blood cell classification,laying the groundwork for future leukemia diagnostics.Methods:The proposed architecture integrates pre-trained CNNs(ResNet50,EfficientNetB3,InceptionV3,CustomCNN)with Vision Transformer(ViT)layers to combine local and global feature modeling.Four hybrid models were evaluated on the publicly available Blood Cell Images dataset from Kaggle,comprising 17,092 annotated normal blood cell images across eight classes.The models were trained using transfer learning,fine-tuning,and computational optimizations,including cross-model parameter sharing to reduce redundancy by reusing weights across CNN backbones and attention-guided layer pruning to eliminate low-contribution layers based on attention scores,improving efficiency without sacrificing accuracy.Results:The InceptionV3-ViT model achieved a weighted accuracy of 97.66%(accounting for class imbalance by weighting each class’s contribution),a macro F1-score of 0.98,and a ROC-AUC of 0.998.The framework excelled in distinguishing morphologically similar cell types demonstrating robustness and reliable calibration(ECE of 0.019).The framework addresses generalization challenges,including class imbalance and morphological similarities,ensuring robust performance across diverse cell types.Conclusion:The hybrid CNN-Transformer framework significantly improves normal blood cell classification by capturing multi-scale features and long-range dependencies.Its high accuracy,efficiency,and generalization position it as a strong baseline for automated hematological analysis,with potential for extension to leukemia subtype classification through future validation on pathological samples.展开更多
Medical image analysis has become a cornerstone of modern healthcare,driven by the exponential growth of data from imaging modalities such as MRI,CT,PET,ultrasound,and X-ray.Traditional machine learning methods have m...Medical image analysis has become a cornerstone of modern healthcare,driven by the exponential growth of data from imaging modalities such as MRI,CT,PET,ultrasound,and X-ray.Traditional machine learning methods have made early contributions;however,recent advancements in deep learning(DL)have revolutionized the field,offering state-of-the-art performance in image classification,segmentation,detection,fusion,registration,and enhancement.This comprehensive review presents an in-depth analysis of deep learning methodologies applied across medical image analysis tasks,highlighting both foundational models and recent innovations.The article begins by introducing conventional techniques and their limitations,setting the stage for DL-based solutions.Core DL architectures,including Convolutional Neural Networks(CNNs),Recurrent Neural Networks(RNNs),Generative Adversarial Networks(GANs),Vision Transformers(ViTs),and hybrid models,are discussed in detail,including their advantages and domain-specific adaptations.Advanced learning paradigms such as semi-supervised learning,selfsupervised learning,and few-shot learning are explored for their potential to mitigate data annotation challenges in clinical datasets.This review further categorizes major tasks in medical image analysis,elaborating on how DL techniques have enabled precise tumor segmentation,lesion detection,modality fusion,super-resolution,and robust classification across diverse clinical settings.Emphasis is placed on applications in oncology,cardiology,neurology,and infectious diseases,including COVID-19.Challenges such as data scarcity,label imbalance,model generalizability,interpretability,and integration into clinical workflows are critically examined.Ethical considerations,explainable AI(XAI),federated learning,and regulatory compliance are discussed as essential components of real-world deployment.Benchmark datasets,evaluation metrics,and comparative performance analyses are presented to support future research.The article concludes with a forward-looking perspective on the role of foundation models,multimodal learning,edge AI,and bio-inspired computing in the future of medical imaging.Overall,this review serves as a valuable resource for researchers,clinicians,and developers aiming to harness deep learning for intelligent,efficient,and clinically viable medical image analysis.展开更多
Objective:By data mining,to analyze the characteristics of Professor Han Fei’s medication in the treatment of children with epilepsy,to explore the rules of medication,in order to provide reference for clinical treat...Objective:By data mining,to analyze the characteristics of Professor Han Fei’s medication in the treatment of children with epilepsy,to explore the rules of medication,in order to provide reference for clinical treatment of children with epilepsy by Chinese medicine.Methods:From January 2008 to March 2021,we collected the diagnosis and treatment data of the children with epilepsy who were treated by Professor Han Fei in the outpatient department of Guang’Anmen Hospital of Chinese Academy of Medical Sciences.Using the software of IBM SPSS Statistics 24.0 and IBM SPSS Modeler 18.0,the characteristics and rules of Professor Hanfei’s Chinese materia medica used were summarized through the descriptive analysis,correlation analysis and cluster analysis of drug cumulative frequency,drug flavor,drug channel tropism and efficacy.Results:A total of 224 cases were included in this study,excluding 1 case with other neurological disorders.Finally,223 prescriptions were included,involving 176 kinds of Chinese materia medica and the total medication frequency was 4712.The first 10 highfrequency Chinese materia medica were Chaihu(95.52%),Bombyx batryticatus(94.17%),keels(83.41%),oysters(72.65%),earthworm(72.20%),fructus aurantii(66.37%),Scorpion(64.57%),Gastrodia elata(60.99%),Acorus gramineus(59.19%)and Dannan Xing(58.30%).The main Chinese materia medica used were mainly for suppressing hyperactive liver for calming endogenous wind,relieving exterior syndromes and tranquillizing mind.The medicine properties were mainly to be flat,slight cold,pungent,bitter and willing,and they were mainly for liver,lung and heart meridian tropism.Correlation Analysis:Bupleurum chinense,Bombyx batryticatus,Dragon Bone,oyster as its core medicine group,Semen Ziziphi spinosae and semen platycladi are effective strong correlation medicine pair.Three medicine combinations were obtained by cluster analysis.Conclusion:Hanshi has the characteristics of“calming liver,tranquilizing mind,calming endogenous wind,removing the phlegm and extravasated blood”in treating epilepsy.展开更多
Automated grading of dandruff severity is a clinically significant but challenging task due to the inherent ordinal nature of severity levels and the high prevalence of label noise from subjective expert annotations.S...Automated grading of dandruff severity is a clinically significant but challenging task due to the inherent ordinal nature of severity levels and the high prevalence of label noise from subjective expert annotations.Standard classification methods fail to address these dual challenges,limiting their real-world performance.In this paper,a novel,three-phase training framework is proposed that learns a robust ordinal classifier directly from noisy labels.The approach synergistically combines a rank-based ordinal regression backbone with a cooperative,semi-supervised learning strategy to dynamically partition the data into clean and noisy subsets.A hybrid training objective is then employed,applying a supervised ordinal loss to the clean set.The noisy set is simultaneously trained using a dualobjective that combines a semi-supervised ordinal loss with a parallel,label-agnostic contrastive loss.This design allows themodel to learn fromthe entire noisy subset while using contrastive learning to mitigate the risk of error propagation frompotentially corrupt supervision.Extensive experiments on a new,large-scale,multi-site clinical dataset validate our approach.Themethod achieves state-of-the-art performance with 80.71%accuracy and a 76.86%F1-score,significantly outperforming existing approaches,including a 2.26%improvement over the strongest baseline method.This work provides not only a robust solution for a practical medical imaging problem but also a generalizable framework for other tasks plagued by noisy ordinal labels.展开更多
Visual diagnosis of skin cancer is challenging due to subtle inter-class similarities,variations in skin texture,the presence of hair,and inconsistent illumination.Deep learning models have shown promise in assisting ...Visual diagnosis of skin cancer is challenging due to subtle inter-class similarities,variations in skin texture,the presence of hair,and inconsistent illumination.Deep learning models have shown promise in assisting early detection,yet their performance is often limited by the severe class imbalance present in dermoscopic datasets.This paper proposes CANNSkin,a skin cancer classification framework that integrates a convolutional autoencoder with latent-space oversampling to address this imbalance.The autoencoder is trained to reconstruct lesion images,and its latent embeddings are used as features for classification.To enhance minority-class representation,the Synthetic Minority Oversampling Technique(SMOTE)is applied directly to the latent vectors before classifier training.The encoder and classifier are first trained independently and later fine-tuned end-to-end.On the HAM10000 dataset,CANNSkin achieves an accuracy of 93.01%,a macro-F1 of 88.54%,and an ROC–AUC of 98.44%,demonstrating strong robustness across ten test subsets.Evaluation on the more complex ISIC 2019 dataset further confirms the model’s effectiveness,where CANNSkin achieves 94.27%accuracy,93.95%precision,94.09%recall,and 99.02%F1-score,supported by high reconstruction fidelity(PSNR 35.03 dB,SSIM 0.86).These results demonstrate the effectiveness of our proposed latent-space balancing and fine-tuned representation learning as a new benchmark method for robust and accurate skin cancer classification across heterogeneous datasets.展开更多
Recent studies indicate that millions of individuals suffer from renal diseases,with renal carcinoma,a type of kidney cancer,emerging as both a chronic illness and a significant cause of mortality.Magnetic Resonance I...Recent studies indicate that millions of individuals suffer from renal diseases,with renal carcinoma,a type of kidney cancer,emerging as both a chronic illness and a significant cause of mortality.Magnetic Resonance Imaging(MRI)and Computed Tomography(CT)have become essential tools for diagnosing and assessing kidney disorders.However,accurate analysis of thesemedical images is critical for detecting and evaluating tumor severity.This study introduces an integrated hybrid framework that combines three complementary deep learning models for kidney tumor segmentation from MRI images.The proposed framework fuses a customized U-Net and Mask R-CNN using a weighted scheme to achieve semantic and instance-level segmentation.The fused outputs are further refined through edge detection using Stochastic FeatureMapping Neural Networks(SFMNN),while volumetric consistency is ensured through Improved Mini-Batch K-Means(IMBKM)clustering integrated with an Encoder-Decoder Convolutional Neural Network(EDCNN).The outputs of these three stages are combined through a weighted fusion mechanism,with optimal weights determined empirically.Experiments on MRI scans from the TCGA-KIRC dataset demonstrate that the proposed hybrid framework significantly outperforms standalone models,achieving a Dice Score of 92.5%,an IoU of 87.8%,a Precision of 93.1%,a Recall of 90.8%,and a Hausdorff Distance of 2.8 mm.These findings validate that the weighted integration of complementary architectures effectively overcomes key limitations in kidney tumor segmentation,leading to improved diagnostic accuracy and robustness in medical image analysis.展开更多
Objective:To mine the medication patterns of ancient prescriptions for diabetic retinopathy(DR)from databases of traditional Chinese medicine(TCM)ancient books,and provide evidence for clinical practice and scientific...Objective:To mine the medication patterns of ancient prescriptions for diabetic retinopathy(DR)from databases of traditional Chinese medicine(TCM)ancient books,and provide evidence for clinical practice and scientific research of TCM treatment for DR.Methods:The traditional library retrieval and modern data retrieval technology were combined to collect the ancient prescriptions in these databases,including the library ofHunan University ofChinese Medicine,Chinese Medical Dictionary,Duxiu,and Chaoxing Digital Library.And the TCM inheritance auxiliary platform(V3.0)was used for data mining,mainly including drug frequency analysis,medicinal property and meridian tropism analysis,efficacy analysis,correlation analysis,complex network analysis,and cluster analysis.Results:A total of 271 ancient prescriptions for the treatment of DR were collected,involving 296 drugs.The total medication frequency was 2,727.Most of them were cold and sweet drugs.The meridians primarily targeted were the liver,kidney,and spleen.The main effects of drugs were supplementing deficiency,clearing heat,releasing the exterior,inducing urination to drain dampness,pacifying liver and extinguishing wind,and circulating blood and transforming stasis.Saposhnikovia divaricata was the most frequently Chinese herbal medicine for DR in TCM ancient books.Saposhnikovia divaricata and ligusticum wallichi,saposhnikovia divaricata and notopterygium root,angelica sinensis and ligusticum wallichii were common herbal pairs.Saposhnikovia divaricata,ginseng,plantain seed,angelica sinensis,prepared rehmannia root and cassia seed constituted the core formula with the highest frequency.Conclusion:The core prescriptions for treating DR are mainly crafted from Dihuang pill,Ruiren powder,Siwu decoction,and Zhujing pill.Saposhnikovia divaricata is an important meridian-guiding medicine to open Xuanfu for DR.In clinical practice,the prescriptions should be modified according to the evolution of pathogenesis.展开更多
Parkinson’s disease is a neurodegenerative disorder that significantly impacts patients’lives.Currently,treatment primarily relies on drug therapy,while effective nursing interventions can help mitigate adverse reac...Parkinson’s disease is a neurodegenerative disorder that significantly impacts patients’lives.Currently,treatment primarily relies on drug therapy,while effective nursing interventions can help mitigate adverse reactions associated with medication use.This article reviews medication selection and nursing interventions for patients with Parkinson’s disease,aiming to alleviate symptoms,improve quality of life,and provide a scientific and comprehensive basis for medication and clinical nursing practices.展开更多
Medical image analysis is an active research topic,with thousands of studies published in the past few years.Transfer learning(TL)including convolutional neural networks(CNNs)focused to enhance efficiency on an innova...Medical image analysis is an active research topic,with thousands of studies published in the past few years.Transfer learning(TL)including convolutional neural networks(CNNs)focused to enhance efficiency on an innovative task using the knowledge of the same tasks learnt in advance.It has played a major role in medical image analysis since it solves the data scarcity issue along with that it saves hardware resources and time.This study develops an EnhancedTunicate SwarmOptimization withTransfer Learning EnabledMedical Image Analysis System(ETSOTL-MIAS).The goal of the ETSOTL-MIAS technique lies in the identification and classification of diseases through medical imaging.The ETSOTL-MIAS technique involves the Chan Vese segmentation technique to identify the affected regions in the medical image.For feature extraction purposes,the ETSOTL-MIAS technique designs a modified DarkNet-53 model.To avoid the manual hyperparameter adjustment process,the ETSOTLMIAS technique exploits the ETSO algorithm,showing the novelty of the work.Finally,the classification of medical images takes place by random forest(RF)classifier.The performance validation of the ETSOTL-MIAS technique is tested on a benchmark medical image database.The extensive experimental analysis showed the promising performance of the ETSOTL-MIAS technique under different measures.展开更多
Deep learning (DL) has seen an exponential development in recent years, with major impact in many medical fields, especially in the field of medical image. The purpose of the work converges in determining the importan...Deep learning (DL) has seen an exponential development in recent years, with major impact in many medical fields, especially in the field of medical image. The purpose of the work converges in determining the importance of each component, describing the specificity and correlations of these elements involved in achieving the precision of interpretation of medical images using DL. The major contribution of this work is primarily to the updated characterisation of the characteristics of the constituent elements of the deep learning process, scientific data, methods of knowledge incorporation, DL models according to the objectives for which they were designed and the presentation of medical applications in accordance with these tasks. Secondly, it describes the specific correlations between the quality, type and volume of data, the deep learning patterns used in the interpretation of diagnostic medical images and their applications in medicine. Finally presents problems and directions of future research. Data quality and volume, annotations and labels, identification and automatic extraction of specific medical terms can help deep learning models perform image analysis tasks. Moreover, the development of models capable of extracting unattended features and easily incorporated into the architecture of DL networks and the development of techniques to search for a certain network architecture according to the objectives set lead to performance in the interpretation of medical images.展开更多
Rational nutritional support shall be based on nutritional screening and nutritional assessment. This study is aimed to explore nutritional risk screening and its influencing factors of hospitalized patients in centra...Rational nutritional support shall be based on nutritional screening and nutritional assessment. This study is aimed to explore nutritional risk screening and its influencing factors of hospitalized patients in central urban area. It is helpful for the early detection of problems in nutritional supports, nutrition management and the implementation of intervention measures, which will contribute a lot to improving the patient's poor clinical outcome. A total of three tertiary medical institutions were enrolled in this study. From October 2015 to June 2016, 1202 hospitalized patients aged ≥18 years were enrolled in Nutrition Risk Screening 2002(NRS2002) for nutritional risk screening, including 8 cases who refused to participate, 5 cases of same-day surgery and 5 cases of coma. A single-factor chi-square test was performed on 312 patients with nutritional risk and 872 hospitalized patients without nutritional risk. Logistic regression analysis was performed with univariate analysis(P〈0.05), to investigate the incidence of nutritional risk and influencing factors. The incidence of nutritional risk was 26.35% in the inpatients, 25.90% in male and 26.84% in female, respectively. The single-factor analysis showed that the age ≥60, sleeping disorder, fasting, intraoperative bleeding, the surgery in recent month, digestive diseases, metabolic diseases and endocrine system diseases had significant effects on nutritional risk(P〈0.05). Having considered the above-mentioned factors as independent variables and nutritional risk(Y=1, N=0) as dependent variable, logistic regression analysis revealed that the age ≥60, fasting, sleeping disorders, the surgery in recent month and digestive diseases are hazardous factors for nutritional risk. Nutritional risk exists in hospitalized patients in central urban areas. Nutritional risk screening should be conducted for inpatients. Nutritional intervention programs should be formulated in consideration of those influencing factors, which enable to reduce the nutritional risk and to promote the rehabilitation of inpatients.展开更多
Electrocardiogram(ECG)is a low-cost,simple,fast,and non-invasive test.It can reflect the heart’s electrical activity and provide valuable diagnostic clues about the health of the entire body.Therefore,ECG has been wi...Electrocardiogram(ECG)is a low-cost,simple,fast,and non-invasive test.It can reflect the heart’s electrical activity and provide valuable diagnostic clues about the health of the entire body.Therefore,ECG has been widely used in various biomedical applications such as arrhythmia detection,disease-specific detection,mortality prediction,and biometric recognition.In recent years,ECG-related studies have been carried out using a variety of publicly available datasets,with many differences in the datasets used,data preprocessing methods,targeted challenges,and modeling and analysis techniques.Here we systematically summarize and analyze the ECGbased automatic analysis methods and applications.Specifically,we first reviewed 22 commonly used ECG public datasets and provided an overview of data preprocessing processes.Then we described some of the most widely used applications of ECG signals and analyzed the advanced methods involved in these applications.Finally,we elucidated some of the challenges in ECG analysis and provided suggestions for further research.展开更多
Image segmentation is attracting increasing attention in the field of medical image analysis.Since widespread utilization across various medical applications,ensuring and improving segmentation accuracy has become a c...Image segmentation is attracting increasing attention in the field of medical image analysis.Since widespread utilization across various medical applications,ensuring and improving segmentation accuracy has become a crucial topic of research.With advances in deep learning,researchers have developed numerous methods that combine Transformers and convolutional neural networks(CNNs)to create highly accurate models for medical image segmentation.However,efforts to further enhance accuracy by developing larger and more complex models or training with more extensive datasets,significantly increase computational resource consumption.To address this problem,we propose BiCLIP-nnFormer(the prefix"Bi"refers to the use of two distinct CLIP models),a virtual multimodal instrument that leverages CLIP models to enhance the segmentation performance of a medical segmentation model nnFormer.Since two CLIP models(PMC-CLIP and CoCa-CLIP)are pre-trained on large datasets,they do not require additional training,thus conserving computation resources.These models are used offline to extract image and text embeddings from medical images.These embeddings are then processed by the proposed 3D CLIP adapter,which adapts the CLIP knowledge for segmentation tasks by fine-tuning.Finally,the adapted embeddings are fused with feature maps extracted from the nnFormer encoder for generating predicted masks.This process enriches the representation capabilities of the feature maps by integrating global multimodal information,leading to more precise segmentation predictions.We demonstrate the superiority of BiCLIP-nnFormer and the effectiveness of using CLIP models to enhance nnFormer through experiments on two public datasets,namely the Synapse multi-organ segmentation dataset(Synapse)and the Automatic Cardiac Diagnosis Challenge dataset(ACDC),as well as a self-annotated lung multi-category segmentation dataset(LMCS).展开更多
Imaging evaluation of lymph node metastasis and infiltration faces problems such as low artificial outline efficiency and insufficient consistency.Deep learning technology based on convolutional neural networks has gr...Imaging evaluation of lymph node metastasis and infiltration faces problems such as low artificial outline efficiency and insufficient consistency.Deep learning technology based on convolutional neural networks has greatly improved the technical effect of radiomics in lymph node pathological characteristics analysis and efficacy monitoring through automatic lymph node detection,precise segmentation and three-dimensional reconstruction algorithms.This review focuses on the automatic lymph node segmentation model,treatment response prediction algorithm and benign and malignant differential diagnosis system for multimodal imaging,in order to provide a basis for further research on artificial intelligence to assist lymph node disease management and clinical decision-making,and provide a reference for promoting the construction of a system for accurate diagnosis,personalized treatment and prognostic evaluation of lymph node-related diseases.展开更多
Background:Early and accurate diagnosis of cataracts,which ranks among the leading preventable causes of blindness,is critical to securing positive outcomes for patients.Recently,eye image analyses have used deep lear...Background:Early and accurate diagnosis of cataracts,which ranks among the leading preventable causes of blindness,is critical to securing positive outcomes for patients.Recently,eye image analyses have used deep learning(DL)approaches to automate cataract classification more precisely,leading to the development of the Multiscale Parallel Feature Aggregation Network with Attention Fusion(MPFAN-AF).Focused on improving a model’s performance,this approach applies multiscale feature extraction,parallel feature fusion,along with attention-based fusion to sharpen its focus on salient features,which are crucial in detecting cataracts.Methods:Coarse-level features are captured through the application of convolutional layers,and these features undergo refinement through layered kernels of varying sizes.Moreover,this method captures all the diverse representations of cataracts accurately by parallel feature aggregation.Utilizing the Cataract Eye Dataset available on Kaggle,containing 612 labelled images of eyes with and without cataracts proportionately(normal vs.pathological),this model was trained and tested.Results:Results using the proposed model reflect greater precision over traditional convolutional neural networks(CNNs)models,achieving a classification accuracy of 97.52%.Additionally,the model demonstrated exceptional performance in classification tasks.The ablation studies validated that all applications added value to the prediction process,particularly emphasizing the attention fusion module.Conclusion:The MPFAN-AF model demonstrates high efficiency together with interpretability because it shows promise as an integration solution for real-time mobile cataract detection screening systems.Standard performance indicators indicate that AI-based ophthalmology tools have a promising future for use in remote conditions that lack medical resources.展开更多
Artificial intelligence(AI),particularly deep learning algorithms utilizing convolutional neural networks,plays an increasingly pivotal role in enhancing medical image examination.It demonstrates the potential for imp...Artificial intelligence(AI),particularly deep learning algorithms utilizing convolutional neural networks,plays an increasingly pivotal role in enhancing medical image examination.It demonstrates the potential for improving diagnostic accuracy within dental care.Orthopantomograms(OPGs)are essential in dentistry;however,their manual interpretation is often inconsistent and tedious.To the best of our knowledge,this is the first comprehensive application of YOLOv5m for the simultaneous detection and classification of six distinct dental pathologies using panoramic OPG images.The model was trained and refined on a custom dataset that began with 232 panoramic radiographs and was later expanded to 604 samples.These included annotated subclasses representing Caries,Infection,Impacted Teeth,Fractured Teeth,Broken Crowns,and Healthy conditions.The training was performed using GPU resources alongside tuned hyperparameters of batch size,learning rate schedule,and early stopping tailored for generalization to prevent overfitting.Evaluation on a held-out test set showed strong performance in the detection and localization of various dental pathologies and robust overall accuracy.At an IoU of 0.5,the system obtained a mean precision of 94.22%and recall of 90.42%,with mAP being 93.71%.This research confirms the use of YOLOv5m as a robust,highly efficient AI technology for the analysis of dental pathologies using OPGs,providing a clinically useful solution to enhance workflow efficiency and aid in sustaining consistency in complex multi-dimensional case evaluations.展开更多
The application of artificial intelligence(AI)in carotid atherosclerotic plaque detection via computed tomography angiography(CTA)has significantly ad-vanced over the past decade.This mini-review consolidates recent i...The application of artificial intelligence(AI)in carotid atherosclerotic plaque detection via computed tomography angiography(CTA)has significantly ad-vanced over the past decade.This mini-review consolidates recent innovations in deep learning architectures,domain adaptation techniques,and automated pl-aque characterization methodologies.Hybrid models,such as residual U-Net-Pyramid Scene Parsing Network,exhibit a remarkable precision of 80.49%in plaque segmentation,outperforming radiologists in diagnostic efficiency by reducing analysis time from minutes to mere seconds.Domain-adaptive fra-meworks,such as Lesion Assessment through Tracklet Evaluation,demonstrate robust performance across heterogeneous imaging datasets,achieving an area under the curve(AUC)greater than 0.88.Furthermore,novel approaches inte-grating U-Net and Efficient-Net architectures,enhanced by Bayesian optimi-zation,have achieved impressive correlation coefficients(0.89)for plaque quanti-fication.AI-powered CTA also enables high-precision three-dimensional vascular segmentation,with a Dice coefficient of 0.9119,and offers superior cardiovascular risk stratification compared to traditional Agatston scoring,yielding AUC values of 0.816 vs 0.729 at a 15-year follow-up.These breakthroughs address key challenges in plaque motion analysis,with systolic retractive motion biomarkers successfully identifying 80%of vulnerable plaques.Looking ahead,future directions focus on enhancing the interpretability of AI models through explainable AI and leveraging federated learning to mitigate data heterogeneity.This mini-review underscores the transformative potential of AI in carotid plaque assessment,offering substantial implic-ations for stroke prevention and personalized cerebrovascular management strategies.展开更多
基金Supported by Interdisciplinary Program of Shanghai Jiao Tong University,No.YG2024 LC01National Natural Science Foundation of China,No.62406190.
文摘Confocal laser endomicroscopy(CLE)has become an indispensable tool in the diagnosis and detection of gastrointestinal(GI)diseases due to its high-resolution and high-contrast imaging capabilities.However,the early-stage imaging changes of gastrointestinal disorders are often subtle,and traditional medical image analysis methods rely heavily on manual interpretation,which is time-consuming,subject to observer variability,and inefficient for accurate lesion identification across large-scale image datasets.With the introduction of artificial intelligence(AI)technologies,AI-driven CLE image analysis systems can automatically extract pathological features and have demonstrated significant clinical value in lesion recognition,classification diagnosis,and malignancy prediction of GI diseases.These systems greatly enhance diagnostic efficiency and early detection capabilities.This review summarizes the applications of AI-assisted CLE in GI diseases,analyzes the limitations of current technologies,and explores future research directions.It is expected that the deep integration of AI and confocal imaging technologies will provide strong support for precision diagnosis and personalized treatment in the field of gastrointestinal disorders.
文摘Deep learning models have become a core technological tool in the field of medical image analysis.However,these models often suffer from a lack of transparency in their decision-making processes,leading to challenges related to trust and interpret ability in clinical applications.To address this issue,explainable artificial intelligence(XAI)techniques have been applied to medical image analysis.While showing promising potential,XAI also brings significant ethical risks in practice—most notably,the problem of spurious explanations.Such explanations may rise further concerns regarding patient privacy,data security,and the attribution of decisionmaking authority in medical contexts.This paper analyzes the application of XAI methods—particularly saliency aps—in medical image interpretation,identifies the underlying causes of spurious explanations,and proposes possible mitigation strategies.The aim is to contribute to the responsible and sustainable integration of explainable AI into clinical practice.
基金the Deanship of Graduate Studies and Scientific Research at Najran University,Saudi Arabia,for their financial support through the Easy Track Research program,grant code(NU/EFP/MRC/13).
文摘Background:Accurate classification of normal blood cells is a critical foundation for automated hematological analysis,including the detection of pathological conditions like leukemia.While convolutional neural networks(CNNs)excel in local feature extraction,their ability to capture global contextual relationships in complex cellular morphologies is limited.This study introduces a hybrid CNN-Transformer framework to enhance normal blood cell classification,laying the groundwork for future leukemia diagnostics.Methods:The proposed architecture integrates pre-trained CNNs(ResNet50,EfficientNetB3,InceptionV3,CustomCNN)with Vision Transformer(ViT)layers to combine local and global feature modeling.Four hybrid models were evaluated on the publicly available Blood Cell Images dataset from Kaggle,comprising 17,092 annotated normal blood cell images across eight classes.The models were trained using transfer learning,fine-tuning,and computational optimizations,including cross-model parameter sharing to reduce redundancy by reusing weights across CNN backbones and attention-guided layer pruning to eliminate low-contribution layers based on attention scores,improving efficiency without sacrificing accuracy.Results:The InceptionV3-ViT model achieved a weighted accuracy of 97.66%(accounting for class imbalance by weighting each class’s contribution),a macro F1-score of 0.98,and a ROC-AUC of 0.998.The framework excelled in distinguishing morphologically similar cell types demonstrating robustness and reliable calibration(ECE of 0.019).The framework addresses generalization challenges,including class imbalance and morphological similarities,ensuring robust performance across diverse cell types.Conclusion:The hybrid CNN-Transformer framework significantly improves normal blood cell classification by capturing multi-scale features and long-range dependencies.Its high accuracy,efficiency,and generalization position it as a strong baseline for automated hematological analysis,with potential for extension to leukemia subtype classification through future validation on pathological samples.
文摘Medical image analysis has become a cornerstone of modern healthcare,driven by the exponential growth of data from imaging modalities such as MRI,CT,PET,ultrasound,and X-ray.Traditional machine learning methods have made early contributions;however,recent advancements in deep learning(DL)have revolutionized the field,offering state-of-the-art performance in image classification,segmentation,detection,fusion,registration,and enhancement.This comprehensive review presents an in-depth analysis of deep learning methodologies applied across medical image analysis tasks,highlighting both foundational models and recent innovations.The article begins by introducing conventional techniques and their limitations,setting the stage for DL-based solutions.Core DL architectures,including Convolutional Neural Networks(CNNs),Recurrent Neural Networks(RNNs),Generative Adversarial Networks(GANs),Vision Transformers(ViTs),and hybrid models,are discussed in detail,including their advantages and domain-specific adaptations.Advanced learning paradigms such as semi-supervised learning,selfsupervised learning,and few-shot learning are explored for their potential to mitigate data annotation challenges in clinical datasets.This review further categorizes major tasks in medical image analysis,elaborating on how DL techniques have enabled precise tumor segmentation,lesion detection,modality fusion,super-resolution,and robust classification across diverse clinical settings.Emphasis is placed on applications in oncology,cardiology,neurology,and infectious diseases,including COVID-19.Challenges such as data scarcity,label imbalance,model generalizability,interpretability,and integration into clinical workflows are critically examined.Ethical considerations,explainable AI(XAI),federated learning,and regulatory compliance are discussed as essential components of real-world deployment.Benchmark datasets,evaluation metrics,and comparative performance analyses are presented to support future research.The article concludes with a forward-looking perspective on the role of foundation models,multimodal learning,edge AI,and bio-inspired computing in the future of medical imaging.Overall,this review serves as a valuable resource for researchers,clinicians,and developers aiming to harness deep learning for intelligent,efficient,and clinically viable medical image analysis.
基金Special Fund Support for Basic Scientific Research Business Fee of Central Level Public Welfare Research Institute(No.ZZ13-024-05,ZZ15-XY-PT-03)。
文摘Objective:By data mining,to analyze the characteristics of Professor Han Fei’s medication in the treatment of children with epilepsy,to explore the rules of medication,in order to provide reference for clinical treatment of children with epilepsy by Chinese medicine.Methods:From January 2008 to March 2021,we collected the diagnosis and treatment data of the children with epilepsy who were treated by Professor Han Fei in the outpatient department of Guang’Anmen Hospital of Chinese Academy of Medical Sciences.Using the software of IBM SPSS Statistics 24.0 and IBM SPSS Modeler 18.0,the characteristics and rules of Professor Hanfei’s Chinese materia medica used were summarized through the descriptive analysis,correlation analysis and cluster analysis of drug cumulative frequency,drug flavor,drug channel tropism and efficacy.Results:A total of 224 cases were included in this study,excluding 1 case with other neurological disorders.Finally,223 prescriptions were included,involving 176 kinds of Chinese materia medica and the total medication frequency was 4712.The first 10 highfrequency Chinese materia medica were Chaihu(95.52%),Bombyx batryticatus(94.17%),keels(83.41%),oysters(72.65%),earthworm(72.20%),fructus aurantii(66.37%),Scorpion(64.57%),Gastrodia elata(60.99%),Acorus gramineus(59.19%)and Dannan Xing(58.30%).The main Chinese materia medica used were mainly for suppressing hyperactive liver for calming endogenous wind,relieving exterior syndromes and tranquillizing mind.The medicine properties were mainly to be flat,slight cold,pungent,bitter and willing,and they were mainly for liver,lung and heart meridian tropism.Correlation Analysis:Bupleurum chinense,Bombyx batryticatus,Dragon Bone,oyster as its core medicine group,Semen Ziziphi spinosae and semen platycladi are effective strong correlation medicine pair.Three medicine combinations were obtained by cluster analysis.Conclusion:Hanshi has the characteristics of“calming liver,tranquilizing mind,calming endogenous wind,removing the phlegm and extravasated blood”in treating epilepsy.
文摘Automated grading of dandruff severity is a clinically significant but challenging task due to the inherent ordinal nature of severity levels and the high prevalence of label noise from subjective expert annotations.Standard classification methods fail to address these dual challenges,limiting their real-world performance.In this paper,a novel,three-phase training framework is proposed that learns a robust ordinal classifier directly from noisy labels.The approach synergistically combines a rank-based ordinal regression backbone with a cooperative,semi-supervised learning strategy to dynamically partition the data into clean and noisy subsets.A hybrid training objective is then employed,applying a supervised ordinal loss to the clean set.The noisy set is simultaneously trained using a dualobjective that combines a semi-supervised ordinal loss with a parallel,label-agnostic contrastive loss.This design allows themodel to learn fromthe entire noisy subset while using contrastive learning to mitigate the risk of error propagation frompotentially corrupt supervision.Extensive experiments on a new,large-scale,multi-site clinical dataset validate our approach.Themethod achieves state-of-the-art performance with 80.71%accuracy and a 76.86%F1-score,significantly outperforming existing approaches,including a 2.26%improvement over the strongest baseline method.This work provides not only a robust solution for a practical medical imaging problem but also a generalizable framework for other tasks plagued by noisy ordinal labels.
基金supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University(IMSIU)(grant number IMSIU-DDRSP2601).
文摘Visual diagnosis of skin cancer is challenging due to subtle inter-class similarities,variations in skin texture,the presence of hair,and inconsistent illumination.Deep learning models have shown promise in assisting early detection,yet their performance is often limited by the severe class imbalance present in dermoscopic datasets.This paper proposes CANNSkin,a skin cancer classification framework that integrates a convolutional autoencoder with latent-space oversampling to address this imbalance.The autoencoder is trained to reconstruct lesion images,and its latent embeddings are used as features for classification.To enhance minority-class representation,the Synthetic Minority Oversampling Technique(SMOTE)is applied directly to the latent vectors before classifier training.The encoder and classifier are first trained independently and later fine-tuned end-to-end.On the HAM10000 dataset,CANNSkin achieves an accuracy of 93.01%,a macro-F1 of 88.54%,and an ROC–AUC of 98.44%,demonstrating strong robustness across ten test subsets.Evaluation on the more complex ISIC 2019 dataset further confirms the model’s effectiveness,where CANNSkin achieves 94.27%accuracy,93.95%precision,94.09%recall,and 99.02%F1-score,supported by high reconstruction fidelity(PSNR 35.03 dB,SSIM 0.86).These results demonstrate the effectiveness of our proposed latent-space balancing and fine-tuned representation learning as a new benchmark method for robust and accurate skin cancer classification across heterogeneous datasets.
基金funded by the Ongoing Research Funding Program-Research Chairs(ORF-RC-2025-2400),King Saud University,Riyadh,Saudi Arabia。
文摘Recent studies indicate that millions of individuals suffer from renal diseases,with renal carcinoma,a type of kidney cancer,emerging as both a chronic illness and a significant cause of mortality.Magnetic Resonance Imaging(MRI)and Computed Tomography(CT)have become essential tools for diagnosing and assessing kidney disorders.However,accurate analysis of thesemedical images is critical for detecting and evaluating tumor severity.This study introduces an integrated hybrid framework that combines three complementary deep learning models for kidney tumor segmentation from MRI images.The proposed framework fuses a customized U-Net and Mask R-CNN using a weighted scheme to achieve semantic and instance-level segmentation.The fused outputs are further refined through edge detection using Stochastic FeatureMapping Neural Networks(SFMNN),while volumetric consistency is ensured through Improved Mini-Batch K-Means(IMBKM)clustering integrated with an Encoder-Decoder Convolutional Neural Network(EDCNN).The outputs of these three stages are combined through a weighted fusion mechanism,with optimal weights determined empirically.Experiments on MRI scans from the TCGA-KIRC dataset demonstrate that the proposed hybrid framework significantly outperforms standalone models,achieving a Dice Score of 92.5%,an IoU of 87.8%,a Precision of 93.1%,a Recall of 90.8%,and a Hausdorff Distance of 2.8 mm.These findings validate that the weighted integration of complementary architectures effectively overcomes key limitations in kidney tumor segmentation,leading to improved diagnostic accuracy and robustness in medical image analysis.
基金supported by Research Project of Traditional Chinese Medicine in Hunan Province(No.B2023043)Scientific Research Project of Hunan Provincial Department of Education(No.22B0386)Research Fund of Hunan University of Chinese Medicine(No.2022XJZKC004).
文摘Objective:To mine the medication patterns of ancient prescriptions for diabetic retinopathy(DR)from databases of traditional Chinese medicine(TCM)ancient books,and provide evidence for clinical practice and scientific research of TCM treatment for DR.Methods:The traditional library retrieval and modern data retrieval technology were combined to collect the ancient prescriptions in these databases,including the library ofHunan University ofChinese Medicine,Chinese Medical Dictionary,Duxiu,and Chaoxing Digital Library.And the TCM inheritance auxiliary platform(V3.0)was used for data mining,mainly including drug frequency analysis,medicinal property and meridian tropism analysis,efficacy analysis,correlation analysis,complex network analysis,and cluster analysis.Results:A total of 271 ancient prescriptions for the treatment of DR were collected,involving 296 drugs.The total medication frequency was 2,727.Most of them were cold and sweet drugs.The meridians primarily targeted were the liver,kidney,and spleen.The main effects of drugs were supplementing deficiency,clearing heat,releasing the exterior,inducing urination to drain dampness,pacifying liver and extinguishing wind,and circulating blood and transforming stasis.Saposhnikovia divaricata was the most frequently Chinese herbal medicine for DR in TCM ancient books.Saposhnikovia divaricata and ligusticum wallichi,saposhnikovia divaricata and notopterygium root,angelica sinensis and ligusticum wallichii were common herbal pairs.Saposhnikovia divaricata,ginseng,plantain seed,angelica sinensis,prepared rehmannia root and cassia seed constituted the core formula with the highest frequency.Conclusion:The core prescriptions for treating DR are mainly crafted from Dihuang pill,Ruiren powder,Siwu decoction,and Zhujing pill.Saposhnikovia divaricata is an important meridian-guiding medicine to open Xuanfu for DR.In clinical practice,the prescriptions should be modified according to the evolution of pathogenesis.
文摘Parkinson’s disease is a neurodegenerative disorder that significantly impacts patients’lives.Currently,treatment primarily relies on drug therapy,while effective nursing interventions can help mitigate adverse reactions associated with medication use.This article reviews medication selection and nursing interventions for patients with Parkinson’s disease,aiming to alleviate symptoms,improve quality of life,and provide a scientific and comprehensive basis for medication and clinical nursing practices.
基金support for this work from the Deanship of Scientific Research (DSR),University of Tabuk,Tabuk,Saudi Arabia,under grant number S-1440-0262.
文摘Medical image analysis is an active research topic,with thousands of studies published in the past few years.Transfer learning(TL)including convolutional neural networks(CNNs)focused to enhance efficiency on an innovative task using the knowledge of the same tasks learnt in advance.It has played a major role in medical image analysis since it solves the data scarcity issue along with that it saves hardware resources and time.This study develops an EnhancedTunicate SwarmOptimization withTransfer Learning EnabledMedical Image Analysis System(ETSOTL-MIAS).The goal of the ETSOTL-MIAS technique lies in the identification and classification of diseases through medical imaging.The ETSOTL-MIAS technique involves the Chan Vese segmentation technique to identify the affected regions in the medical image.For feature extraction purposes,the ETSOTL-MIAS technique designs a modified DarkNet-53 model.To avoid the manual hyperparameter adjustment process,the ETSOTLMIAS technique exploits the ETSO algorithm,showing the novelty of the work.Finally,the classification of medical images takes place by random forest(RF)classifier.The performance validation of the ETSOTL-MIAS technique is tested on a benchmark medical image database.The extensive experimental analysis showed the promising performance of the ETSOTL-MIAS technique under different measures.
文摘Deep learning (DL) has seen an exponential development in recent years, with major impact in many medical fields, especially in the field of medical image. The purpose of the work converges in determining the importance of each component, describing the specificity and correlations of these elements involved in achieving the precision of interpretation of medical images using DL. The major contribution of this work is primarily to the updated characterisation of the characteristics of the constituent elements of the deep learning process, scientific data, methods of knowledge incorporation, DL models according to the objectives for which they were designed and the presentation of medical applications in accordance with these tasks. Secondly, it describes the specific correlations between the quality, type and volume of data, the deep learning patterns used in the interpretation of diagnostic medical images and their applications in medicine. Finally presents problems and directions of future research. Data quality and volume, annotations and labels, identification and automatic extraction of specific medical terms can help deep learning models perform image analysis tasks. Moreover, the development of models capable of extracting unattended features and easily incorporated into the architecture of DL networks and the development of techniques to search for a certain network architecture according to the objectives set lead to performance in the interpretation of medical images.
基金supported by Soft Science Application Program of Wuhan Scientific and Technological Bureau of China(No.2016040306010211)
文摘Rational nutritional support shall be based on nutritional screening and nutritional assessment. This study is aimed to explore nutritional risk screening and its influencing factors of hospitalized patients in central urban area. It is helpful for the early detection of problems in nutritional supports, nutrition management and the implementation of intervention measures, which will contribute a lot to improving the patient's poor clinical outcome. A total of three tertiary medical institutions were enrolled in this study. From October 2015 to June 2016, 1202 hospitalized patients aged ≥18 years were enrolled in Nutrition Risk Screening 2002(NRS2002) for nutritional risk screening, including 8 cases who refused to participate, 5 cases of same-day surgery and 5 cases of coma. A single-factor chi-square test was performed on 312 patients with nutritional risk and 872 hospitalized patients without nutritional risk. Logistic regression analysis was performed with univariate analysis(P〈0.05), to investigate the incidence of nutritional risk and influencing factors. The incidence of nutritional risk was 26.35% in the inpatients, 25.90% in male and 26.84% in female, respectively. The single-factor analysis showed that the age ≥60, sleeping disorder, fasting, intraoperative bleeding, the surgery in recent month, digestive diseases, metabolic diseases and endocrine system diseases had significant effects on nutritional risk(P〈0.05). Having considered the above-mentioned factors as independent variables and nutritional risk(Y=1, N=0) as dependent variable, logistic regression analysis revealed that the age ≥60, fasting, sleeping disorders, the surgery in recent month and digestive diseases are hazardous factors for nutritional risk. Nutritional risk exists in hospitalized patients in central urban areas. Nutritional risk screening should be conducted for inpatients. Nutritional intervention programs should be formulated in consideration of those influencing factors, which enable to reduce the nutritional risk and to promote the rehabilitation of inpatients.
基金Supported by the NSFC-Zhejiang Joint Fund for the Integration of Industrialization and Informatization(U1909208)the Science and Technology Major Project of Changsha(kh2202004)the Changsha Municipal Natural Science Foundation(kq2202106).
文摘Electrocardiogram(ECG)is a low-cost,simple,fast,and non-invasive test.It can reflect the heart’s electrical activity and provide valuable diagnostic clues about the health of the entire body.Therefore,ECG has been widely used in various biomedical applications such as arrhythmia detection,disease-specific detection,mortality prediction,and biometric recognition.In recent years,ECG-related studies have been carried out using a variety of publicly available datasets,with many differences in the datasets used,data preprocessing methods,targeted challenges,and modeling and analysis techniques.Here we systematically summarize and analyze the ECGbased automatic analysis methods and applications.Specifically,we first reviewed 22 commonly used ECG public datasets and provided an overview of data preprocessing processes.Then we described some of the most widely used applications of ECG signals and analyzed the advanced methods involved in these applications.Finally,we elucidated some of the challenges in ECG analysis and provided suggestions for further research.
基金funded by the National Natural Science Foundation of China(Grant No.6240072655)the Hubei Provincial Key Research and Development Program(Grant No.2023BCB151)+1 种基金the Wuhan Natural Science Foundation Exploration Program(Chenguang Program,Grant No.2024040801020202)the Natural Science Foundation of Hubei Province of China(Grant No.2025AFB148).
文摘Image segmentation is attracting increasing attention in the field of medical image analysis.Since widespread utilization across various medical applications,ensuring and improving segmentation accuracy has become a crucial topic of research.With advances in deep learning,researchers have developed numerous methods that combine Transformers and convolutional neural networks(CNNs)to create highly accurate models for medical image segmentation.However,efforts to further enhance accuracy by developing larger and more complex models or training with more extensive datasets,significantly increase computational resource consumption.To address this problem,we propose BiCLIP-nnFormer(the prefix"Bi"refers to the use of two distinct CLIP models),a virtual multimodal instrument that leverages CLIP models to enhance the segmentation performance of a medical segmentation model nnFormer.Since two CLIP models(PMC-CLIP and CoCa-CLIP)are pre-trained on large datasets,they do not require additional training,thus conserving computation resources.These models are used offline to extract image and text embeddings from medical images.These embeddings are then processed by the proposed 3D CLIP adapter,which adapts the CLIP knowledge for segmentation tasks by fine-tuning.Finally,the adapted embeddings are fused with feature maps extracted from the nnFormer encoder for generating predicted masks.This process enriches the representation capabilities of the feature maps by integrating global multimodal information,leading to more precise segmentation predictions.We demonstrate the superiority of BiCLIP-nnFormer and the effectiveness of using CLIP models to enhance nnFormer through experiments on two public datasets,namely the Synapse multi-organ segmentation dataset(Synapse)and the Automatic Cardiac Diagnosis Challenge dataset(ACDC),as well as a self-annotated lung multi-category segmentation dataset(LMCS).
基金Supported by Clinical Trials from the Nanjing Drum Tower Hospital,Affiliated Hospital of Medical School,Nanjing University,No.2021-LCYJ-MS-11Nanjing Drum Tower Hospital National Natural Science Foundation Youth Cultivation Project,No.2024-JCYJQP-15.
文摘Imaging evaluation of lymph node metastasis and infiltration faces problems such as low artificial outline efficiency and insufficient consistency.Deep learning technology based on convolutional neural networks has greatly improved the technical effect of radiomics in lymph node pathological characteristics analysis and efficacy monitoring through automatic lymph node detection,precise segmentation and three-dimensional reconstruction algorithms.This review focuses on the automatic lymph node segmentation model,treatment response prediction algorithm and benign and malignant differential diagnosis system for multimodal imaging,in order to provide a basis for further research on artificial intelligence to assist lymph node disease management and clinical decision-making,and provide a reference for promoting the construction of a system for accurate diagnosis,personalized treatment and prognostic evaluation of lymph node-related diseases.
文摘Background:Early and accurate diagnosis of cataracts,which ranks among the leading preventable causes of blindness,is critical to securing positive outcomes for patients.Recently,eye image analyses have used deep learning(DL)approaches to automate cataract classification more precisely,leading to the development of the Multiscale Parallel Feature Aggregation Network with Attention Fusion(MPFAN-AF).Focused on improving a model’s performance,this approach applies multiscale feature extraction,parallel feature fusion,along with attention-based fusion to sharpen its focus on salient features,which are crucial in detecting cataracts.Methods:Coarse-level features are captured through the application of convolutional layers,and these features undergo refinement through layered kernels of varying sizes.Moreover,this method captures all the diverse representations of cataracts accurately by parallel feature aggregation.Utilizing the Cataract Eye Dataset available on Kaggle,containing 612 labelled images of eyes with and without cataracts proportionately(normal vs.pathological),this model was trained and tested.Results:Results using the proposed model reflect greater precision over traditional convolutional neural networks(CNNs)models,achieving a classification accuracy of 97.52%.Additionally,the model demonstrated exceptional performance in classification tasks.The ablation studies validated that all applications added value to the prediction process,particularly emphasizing the attention fusion module.Conclusion:The MPFAN-AF model demonstrates high efficiency together with interpretability because it shows promise as an integration solution for real-time mobile cataract detection screening systems.Standard performance indicators indicate that AI-based ophthalmology tools have a promising future for use in remote conditions that lack medical resources.
基金funding from the Princess Nourah bint Abdulrahman University Researchers Supporting Project(PNURSP2025R195)the University of Bisha through its Fast-Track Research Support Program.
文摘Artificial intelligence(AI),particularly deep learning algorithms utilizing convolutional neural networks,plays an increasingly pivotal role in enhancing medical image examination.It demonstrates the potential for improving diagnostic accuracy within dental care.Orthopantomograms(OPGs)are essential in dentistry;however,their manual interpretation is often inconsistent and tedious.To the best of our knowledge,this is the first comprehensive application of YOLOv5m for the simultaneous detection and classification of six distinct dental pathologies using panoramic OPG images.The model was trained and refined on a custom dataset that began with 232 panoramic radiographs and was later expanded to 604 samples.These included annotated subclasses representing Caries,Infection,Impacted Teeth,Fractured Teeth,Broken Crowns,and Healthy conditions.The training was performed using GPU resources alongside tuned hyperparameters of batch size,learning rate schedule,and early stopping tailored for generalization to prevent overfitting.Evaluation on a held-out test set showed strong performance in the detection and localization of various dental pathologies and robust overall accuracy.At an IoU of 0.5,the system obtained a mean precision of 94.22%and recall of 90.42%,with mAP being 93.71%.This research confirms the use of YOLOv5m as a robust,highly efficient AI technology for the analysis of dental pathologies using OPGs,providing a clinically useful solution to enhance workflow efficiency and aid in sustaining consistency in complex multi-dimensional case evaluations.
基金Supported by Henan Province International Science and Technology Cooperation Project,2024,No.242102520054.
文摘The application of artificial intelligence(AI)in carotid atherosclerotic plaque detection via computed tomography angiography(CTA)has significantly ad-vanced over the past decade.This mini-review consolidates recent innovations in deep learning architectures,domain adaptation techniques,and automated pl-aque characterization methodologies.Hybrid models,such as residual U-Net-Pyramid Scene Parsing Network,exhibit a remarkable precision of 80.49%in plaque segmentation,outperforming radiologists in diagnostic efficiency by reducing analysis time from minutes to mere seconds.Domain-adaptive fra-meworks,such as Lesion Assessment through Tracklet Evaluation,demonstrate robust performance across heterogeneous imaging datasets,achieving an area under the curve(AUC)greater than 0.88.Furthermore,novel approaches inte-grating U-Net and Efficient-Net architectures,enhanced by Bayesian optimi-zation,have achieved impressive correlation coefficients(0.89)for plaque quanti-fication.AI-powered CTA also enables high-precision three-dimensional vascular segmentation,with a Dice coefficient of 0.9119,and offers superior cardiovascular risk stratification compared to traditional Agatston scoring,yielding AUC values of 0.816 vs 0.729 at a 15-year follow-up.These breakthroughs address key challenges in plaque motion analysis,with systolic retractive motion biomarkers successfully identifying 80%of vulnerable plaques.Looking ahead,future directions focus on enhancing the interpretability of AI models through explainable AI and leveraging federated learning to mitigate data heterogeneity.This mini-review underscores the transformative potential of AI in carotid plaque assessment,offering substantial implic-ations for stroke prevention and personalized cerebrovascular management strategies.