Wearable sensors integrated with deep learning techniques have the potential to revolutionize seamless human-machine interfaces for real-time health monitoring,clinical diagnosis,and robotic applications.Nevertheless,...Wearable sensors integrated with deep learning techniques have the potential to revolutionize seamless human-machine interfaces for real-time health monitoring,clinical diagnosis,and robotic applications.Nevertheless,it remains a critical challenge to simultaneously achieve desirable mechanical and electrical performance along with biocompatibility,adhesion,self-healing,and environmental robustness with excellent sensing metrics.Herein,we report a multifunctional,anti-freezing,selfadhesive,and self-healable organogel pressure sensor composed of cobalt nanoparticle encapsulated nitrogen-doped carbon nanotubes(CoN CNT)embedded in a polyvinyl alcohol-gelatin(PVA/GLE)matrix.Fabricated using a binary solvent system of water and ethylene glycol(EG),the CoN CNT/PVA/GLE organogel exhibits excellent flexibility,biocompatibility,and temperature tolerance with remarkable environmental stability.Electrochemical impedance spectroscopy confirms near-stable performance across a broad humidity range(40%-95%RH).Freeze-tolerant conductivity under sub-zero conditions(-20℃)is attributed to the synergistic role of CoN CNT and EG,preserving mobility and network integrity.The Co N CNT/PVA/GLE organogel sensor exhibits high sensitivity of 5.75 k Pa^(-1)in the detection range from 0 to 20 k Pa,ideal for subtle biomechanical motion detection.A smart human-machine interface for English letter recognition using deep learning achieved 98%accuracy.The organogel sensor utility was extended to detect human gestures like finger bending,wrist motion,and throat vibration during speech.展开更多
The rapid advancements in computer vision(CV)technology have transformed the traditional approaches to material microstructure analysis.This review outlines the history of CV and explores the applications of deep-lear...The rapid advancements in computer vision(CV)technology have transformed the traditional approaches to material microstructure analysis.This review outlines the history of CV and explores the applications of deep-learning(DL)-driven CV in four key areas of materials science:microstructure-based performance prediction,microstructure information generation,microstructure defect detection,and crystal structure-based property prediction.The CV has significantly reduced the cost of traditional experimental methods used in material performance prediction.Moreover,recent progress made in generating microstructure images and detecting microstructural defects using CV has led to increased efficiency and reliability in material performance assessments.The DL-driven CV models can accelerate the design of new materials with optimized performance by integrating predictions based on both crystal and microstructural data,thereby allowing for the discovery and innovation of next-generation materials.Finally,the review provides insights into the rapid interdisciplinary developments in the field of materials science and future prospects.展开更多
BACKGROUND The accurate prediction of lymph node metastasis(LNM)is crucial for managing locally advanced(T3/T4)colorectal cancer(CRC).However,both traditional histopathology and standard slide-level deep learning ofte...BACKGROUND The accurate prediction of lymph node metastasis(LNM)is crucial for managing locally advanced(T3/T4)colorectal cancer(CRC).However,both traditional histopathology and standard slide-level deep learning often fail to capture the sparse and diagnostically critical features of metastatic potential.AIM To develop and validate a case-level multiple-instance learning(MIL)framework mimicking a pathologist's comprehensive review and improve T3/T4 CRC LNM prediction.METHODS The whole-slide images of 130 patients with T3/T4 CRC were retrospectively collected.A case-level MIL framework utilising the CONCH v1.5 and UNI2-h deep learning models was trained on features from all haematoxylin and eosinstained primary tumour slides for each patient.These pathological features were subsequently integrated with clinical data,and model performance was evaluated using the area under the curve(AUC).RESULTS The case-level framework demonstrated superior LNM prediction over slide-level training,with the CONCH v1.5 model achieving a mean AUC(±SD)of 0.899±0.033 vs 0.814±0.083,respectively.Integrating pathology features with clinical data further enhanced performance,yielding a top model with a mean AUC of 0.904±0.047,in sharp contrast to a clinical-only model(mean AUC 0.584±0.084).Crucially,a pathologist’s review confirmed that the model-identified high-attention regions correspond to known high-risk histopathological features.CONCLUSION A case-level MIL framework provides a superior approach for predicting LNM in advanced CRC.This method shows promise for risk stratification and therapy decisions,requiring further validation.展开更多
Underwater pipeline inspection plays a vital role in the proactive maintenance and management of critical marine infrastructure and subaquatic systems.However,the inspection of underwater pipelines presents a challeng...Underwater pipeline inspection plays a vital role in the proactive maintenance and management of critical marine infrastructure and subaquatic systems.However,the inspection of underwater pipelines presents a challenge due to factors such as light scattering,absorption,restricted visibility,and ambient noise.The advancement of deep learning has introduced powerful techniques for processing large amounts of unstructured and imperfect data collected from underwater environments.This study evaluated the efficacy of the You Only Look Once(YOLO)algorithm,a real-time object detection and localization model based on convolutional neural networks,in identifying and classifying various types of pipeline defects in underwater settings.YOLOv8,the latest evolution in the YOLO family,integrates advanced capabilities,such as anchor-free detection,a cross-stage partial network backbone for efficient feature extraction,and a feature pyramid network+path aggregation network neck for robust multi-scale object detection,which make it particularly well-suited for complex underwater environments.Due to the lack of suitable open-access datasets for underwater pipeline defects,a custom dataset was captured using a remotely operated vehicle in a controlled environment.This application has the following assets available for use.Extensive experimentation demonstrated that YOLOv8 X-Large consistently outperformed other models in terms of pipe defect detection and classification and achieved a strong balance between precision and recall in identifying pipeline cracks,rust,corners,defective welds,flanges,tapes,and holes.This research establishes the baseline performance of YOLOv8 for underwater defect detection and showcases its potential to enhance the reliability and efficiency of pipeline inspection tasks in challenging underwater environments.展开更多
This review comprehensively summarized the potential of artificial intelligence(AI)in the management of esophageal cancer.It highlighted the significance of AI-assisted endoscopy in Japan where endoscopy is central to...This review comprehensively summarized the potential of artificial intelligence(AI)in the management of esophageal cancer.It highlighted the significance of AI-assisted endoscopy in Japan where endoscopy is central to both screening and diagnosis.For the clinical adaptation of AI,several challenges remain for its effective translation.The establishment of high-quality clinical databases,such as the National Clinical Database and Japan Endoscopy Database in Japan,which covers almost all cases of esophageal cancer,is essential for validating multimodal AI models.This requires rigorous external validation using diverse datasets,including those from different endoscope manufacturers and image qualities.Furthermore,endoscopists’skills significantly affect diagnostic accuracy,suggesting that AI should serve as a supportive tool rather than a replacement.Addressing these challenges,along with country-specific legal and ethical considerations,will facilitate the successful integration of multimodal AI into the management of esophageal cancer,particularly in endoscopic diagnosis,and contribute to improved patient outcomes.Although this review focused on Japan as a case study,the challenges and solutions described are broadly applicable to other high-incidence regions.展开更多
Flexible electronics face critical challenges in achieving monolithic three-dimensional(3D)integration,including material compatibility,structural stability,and scalable fabrication methods.Inspired by the tactile sen...Flexible electronics face critical challenges in achieving monolithic three-dimensional(3D)integration,including material compatibility,structural stability,and scalable fabrication methods.Inspired by the tactile sensing mechanism of the human skin,we have developed a flexible monolithic 3D-integrated tactile sensing system based on a holey MXene paste,where each vertical one-body unit simultaneously functions as a microsupercapacitor and pressure sensor.The in-plane mesopores of MXene significantly improve ion accessibility,mitigate the self-stacking of nanosheets,and allow the holey MXene to multifunctionally act as a sensing material,an active electrode,and a conductive interconnect,thus drastically reducing the interface mismatch and enhancing the mechanical robustness.Furthermore,we fabricate a large-scale device using a blade-coating and stamping method,which demonstrates excellent mechanical flexibility,low-power consumption,rapid response,and stable long-term operation.As a proof-of-concept application,we integrate our sensing array into a smart access control system,leveraging deep learning to accurately identify users based on their unique pressing behaviors.This study provides a promising approach for designing highly integrated,intelligent,and flexible electronic systems for advanced human-computer interactions and personalized electronics.展开更多
Artificial intelligence(AI)is revolutionizing medical imaging,particularly in chronic liver diseases assessment.AI technologies,including machine learning and deep learning,are increasingly integrated with multiparame...Artificial intelligence(AI)is revolutionizing medical imaging,particularly in chronic liver diseases assessment.AI technologies,including machine learning and deep learning,are increasingly integrated with multiparametric ultrasound(US)techniques to provide more accurate,objective,and non-invasive evaluations of liver fibrosis and steatosis.Analyzing large datasets from US images,AI enhances diagnostic precision,enabling better quantification of liver stiffness and fat content,which are essential for diagnosing and staging liver fibrosis and steatosis.Combining advanced US modalities,such as elastography and doppler imaging with AI,has demonstrated improved sensitivity in identifying different stages of liver disease and distinguishing various degrees of steatotic liver.These advancements also contribute to greater reproducibility and reduced operator dependency,addressing some of the limitations of traditional methods.The clinical implications of AI in liver disease are vast,ranging from early detection to predicting disease progression and evaluating treatment response.Despite these promising developments,challenges such as the need for large-scale datasets,algorithm transparency,and clinical validation remain.The aim of this review is to explore the current applications and future potential of AI in liver fibrosis and steatosis assessment using multiparametric US,highlighting the technological advances and clinical relevance of this emerging field.展开更多
Deep learning-based methods have become alternatives to traditional numerical weather prediction systems,offering faster computation and the ability to utilize large historical datasets.However,the application of deep...Deep learning-based methods have become alternatives to traditional numerical weather prediction systems,offering faster computation and the ability to utilize large historical datasets.However,the application of deep learning to medium-range regional weather forecasting with limited data remains a significant challenge.In this work,three key solutions are proposed:(1)motivated by the need to improve model performance in data-scarce regional forecasting scenarios,the authors innovatively apply semantic segmentation models,to better capture spatiotemporal features and improve prediction accuracy;(2)recognizing the challenge of overfitting and the inability of traditional noise-based data augmentation methods to effectively enhance model robustness,a novel learnable Gaussian noise mechanism is introduced that allows the model to adaptively optimize perturbations for different locations,ensuring more effective learning;and(3)to address the issue of error accumulation in autoregressive prediction,as well as the challenge of learning difficulty and the lack of intermediate data utilization in one-shot prediction,the authors propose a cascade prediction approach that effectively resolves these problems while significantly improving model forecasting performance.The method achieves a competitive result in The East China Regional AI Medium Range Weather Forecasting Competition.Ablation experiments further validate the effectiveness of each component,highlighting their contributions to enhancing prediction performance.展开更多
Alzheimer's disease is the most common type of cognitive disorder,and there is an urgent need to develop more effective,targeted and safer therapies for patients with this condition.Deep brain stimulation is an in...Alzheimer's disease is the most common type of cognitive disorder,and there is an urgent need to develop more effective,targeted and safer therapies for patients with this condition.Deep brain stimulation is an invasive surgical treatment that modulates abnormal neural activity by implanting electrodes into specific brain areas followed by electrical stimulation.As an emerging therapeutic approach,deep brain stimulation shows significant promise as a potential new therapy for Alzheimer's disease.Here,we review the potential mechanisms and therapeutic effects of deep brain stimulation in the treatment of Alzheimer's disease based on existing clinical and basic research.In clinical studies,the most commonly targeted sites include the fornix,the nucleus basalis of Meynert,and the ventral capsule/ventral striatum.Basic research has found that the most frequently targeted areas include the fornix,nucleus basalis of Meynert,hippocampus,entorhinal cortex,and rostral intralaminar thalamic nucleus.All of these individual targets exhibit therapeutic potential for patients with Alzheimer's disease and associated mechanisms of action have been investigated.Deep brain stimulation may exert therapeutic effects on Alzheimer's disease through various mechanisms,including reducing the deposition of amyloid-β,activation of the cholinergic system,increasing the levels of neurotrophic factors,enhancing synaptic activity and plasticity,promoting neurogenesis,and improving glucose metabolism.Currently,clinical trials investigating deep brain stimulation for Alzheimer's disease remain insufficient.In the future,it is essential to focus on translating preclinical mechanisms into clinical trials.Furthermore,consecutive follow-up studies are needed to evaluate the long-term safety and efficacy of deep brain stimulation for Alzheimer's disease,including cognitive function,neuropsychiatric symptoms,quality of life and changes in Alzheimer's disease biomarkers.Researchers must also prioritize the initiation of multi-center clinical trials of deep brain stimulation with large sample sizes and target earlier therapeutic windows,such as the prodromal and even the preclinical stages of Alzheimer's disease.Adopting these approaches will permit the efficient exploration of more effective and safer deep brain stimulation therapies for patients with Alzheimer's disease.展开更多
The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities...The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities and obstacles.The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods.As a consequence,deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships.This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining.We demonstrate how autoencoders,variational autoencoders,multimodal models,attention mechanisms,transformers,and graph neural networks enable pattern analysis and recognition across all omics data.Deep learning has been found to be effective in illness classification,biomarker identification,gene network learning,and therapeutic efficacy prediction.We also consider critical problems like as data quality,model explainability,whether findings can be repeated,and computational power requirements.We now consider future elements of combining omics with clinical and imaging data,explainable AI,federated learning,and real-time diagnostics.Overall,this study emphasises the need of collaborating across disciplines to advance deep learning-based multi-omics research for precision medicine and comprehending complicated disorders.展开更多
An image processing and deep learning method for identifying different types of rock images was proposed.Preprocessing,such as rock image acquisition,gray scaling,Gaussian blurring,and feature dimensionality reduction...An image processing and deep learning method for identifying different types of rock images was proposed.Preprocessing,such as rock image acquisition,gray scaling,Gaussian blurring,and feature dimensionality reduction,was conducted to extract useful feature information and recognize and classify rock images using Tensor Flow-based convolutional neural network(CNN)and Py Qt5.A rock image dataset was established and separated into workouts,confirmation sets,and test sets.The framework was subsequently compiled and trained.The categorization approach was evaluated using image data from the validation and test datasets,and key metrics,such as accuracy,precision,and recall,were analyzed.Finally,the classification model conducted a probabilistic analysis of the measured data to determine the equivalent lithological type for each image.The experimental results indicated that the method combining deep learning,Tensor Flow-based CNN,and Py Qt5 to recognize and classify rock images has an accuracy rate of up to 98.8%,and can be successfully utilized for rock image recognition.The system can be extended to geological exploration,mine engineering,and other rock and mineral resource development to more efficiently and accurately recognize rock samples.Moreover,it can match them with the intelligent support design system to effectively improve the reliability and economy of the support scheme.The system can serve as a reference for supporting the design of other mining and underground space projects.展开更多
Metabolic dysfunction-associated steatotic liver disease(MASLD)is an increasingly prevalent condition associated with hepatic complications and cardiovascular and renal events.Given its significant clinical impact,the...Metabolic dysfunction-associated steatotic liver disease(MASLD)is an increasingly prevalent condition associated with hepatic complications and cardiovascular and renal events.Given its significant clinical impact,the development of new strategies for early diagnosis and treatment is essential to improve patient outcomes.Over the past decade,the integration of artificial intelligence(AI)into gastroenterology has led to transformative advancements in medical practice.AI represents a major step towards personalized medicine,offering the potential to enhance diagnostic accuracy,refine prognostic assessments,and optimize treatment strategies.Its applications are rapidly expanding.This article explores the emerging role of AI in the management of MASLD,emphasizing its ability to improve clinical prediction,enhance the diagnostic performance of imaging modalities,and support histopathological confirmation.Additionally,it examines the development of AI-guided personalized treatments,where lifestyle modifications and close monitoring play a pivotal role in achieving therapeutic success.展开更多
Gastrointestinal tumors require personalized treatment strategies due to their heterogeneity and complexity.Multimodal artificial intelligence(AI)addresses this challenge by integrating diverse data sources-including ...Gastrointestinal tumors require personalized treatment strategies due to their heterogeneity and complexity.Multimodal artificial intelligence(AI)addresses this challenge by integrating diverse data sources-including computed tomography(CT),magnetic resonance imaging(MRI),endoscopic imaging,and genomic profiles-to enable intelligent decision-making for individualized therapy.This approach leverages AI algorithms to fuse imaging,endoscopic,and omics data,facilitating comprehensive characterization of tumor biology,prediction of treatment response,and optimization of therapeutic strategies.By combining CT and MRI for structural assessment,endoscopic data for real-time visual inspection,and genomic information for molecular profiling,multimodal AI enhances the accuracy of patient stratification and treatment personalization.The clinical implementation of this technology demonstrates potential for improving patient outcomes,advancing precision oncology,and supporting individualized care in gastrointestinal cancers.Ultimately,multimodal AI serves as a transformative tool in oncology,bridging data integration with clinical application to effectively tailor therapies.展开更多
Early identification and treatment of stroke can greatly improve patient outcomes and quality of life.Although clinical tests such as the Cincinnati Pre-hospital Stroke Scale(CPSS)and the Face Arm Speech Test(FAST)are...Early identification and treatment of stroke can greatly improve patient outcomes and quality of life.Although clinical tests such as the Cincinnati Pre-hospital Stroke Scale(CPSS)and the Face Arm Speech Test(FAST)are commonly used for stroke screening,accurate administration is dependent on specialized training.In this study,we proposed a novel multimodal deep learning approach,based on the FAST,for assessing suspected stroke patients exhibiting symptoms such as limb weakness,facial paresis,and speech disorders in acute settings.We collected a dataset comprising videos and audio recordings of emergency room patients performing designated limb movements,facial expressions,and speech tests based on the FAST.We compared the constructed deep learning model,which was designed to process multi-modal datasets,with six prior models that achieved good action classification performance,including the I3D,SlowFast,X3D,TPN,TimeSformer,and MViT.We found that the findings of our deep learning model had a higher clinical value compared with the other approaches.Moreover,the multi-modal model outperformed its single-module variants,highlighting the benefit of utilizing multiple types of patient data,such as action videos and speech audio.These results indicate that a multi-modal deep learning model combined with the FAST could greatly improve the accuracy and sensitivity of early stroke identification of stroke,thus providing a practical and powerful tool for assessing stroke patients in an emergency clinical setting.展开更多
基金supported by the Basic Science Research Program(2023R1A2C3004336,RS-202300243807)&Regional Leading Research Center(RS-202400405278)through the National Research Foundation of Korea(NRF)grant funded by the Korea Government(MSIT)。
文摘Wearable sensors integrated with deep learning techniques have the potential to revolutionize seamless human-machine interfaces for real-time health monitoring,clinical diagnosis,and robotic applications.Nevertheless,it remains a critical challenge to simultaneously achieve desirable mechanical and electrical performance along with biocompatibility,adhesion,self-healing,and environmental robustness with excellent sensing metrics.Herein,we report a multifunctional,anti-freezing,selfadhesive,and self-healable organogel pressure sensor composed of cobalt nanoparticle encapsulated nitrogen-doped carbon nanotubes(CoN CNT)embedded in a polyvinyl alcohol-gelatin(PVA/GLE)matrix.Fabricated using a binary solvent system of water and ethylene glycol(EG),the CoN CNT/PVA/GLE organogel exhibits excellent flexibility,biocompatibility,and temperature tolerance with remarkable environmental stability.Electrochemical impedance spectroscopy confirms near-stable performance across a broad humidity range(40%-95%RH).Freeze-tolerant conductivity under sub-zero conditions(-20℃)is attributed to the synergistic role of CoN CNT and EG,preserving mobility and network integrity.The Co N CNT/PVA/GLE organogel sensor exhibits high sensitivity of 5.75 k Pa^(-1)in the detection range from 0 to 20 k Pa,ideal for subtle biomechanical motion detection.A smart human-machine interface for English letter recognition using deep learning achieved 98%accuracy.The organogel sensor utility was extended to detect human gestures like finger bending,wrist motion,and throat vibration during speech.
基金financially supported by the National Science Fund for Distinguished Young Scholars,China(No.52025041)the National Natural Science Foundation of China(Nos.52450003,U2341267,and 52174294)+1 种基金the National Postdoctoral Program for Innovative Talents,China(No.BX20240437)the Fundamental Research Funds for the Central Universities,China(Nos.FRF-IDRY-23-037 and FRF-TP-20-02C2)。
文摘The rapid advancements in computer vision(CV)technology have transformed the traditional approaches to material microstructure analysis.This review outlines the history of CV and explores the applications of deep-learning(DL)-driven CV in four key areas of materials science:microstructure-based performance prediction,microstructure information generation,microstructure defect detection,and crystal structure-based property prediction.The CV has significantly reduced the cost of traditional experimental methods used in material performance prediction.Moreover,recent progress made in generating microstructure images and detecting microstructural defects using CV has led to increased efficiency and reliability in material performance assessments.The DL-driven CV models can accelerate the design of new materials with optimized performance by integrating predictions based on both crystal and microstructural data,thereby allowing for the discovery and innovation of next-generation materials.Finally,the review provides insights into the rapid interdisciplinary developments in the field of materials science and future prospects.
基金Supported by Chongqing Medical Scientific Research Project(Joint Project of Chongqing Health Commission and Science and Technology Bureau),No.2023MSXM060.
文摘BACKGROUND The accurate prediction of lymph node metastasis(LNM)is crucial for managing locally advanced(T3/T4)colorectal cancer(CRC).However,both traditional histopathology and standard slide-level deep learning often fail to capture the sparse and diagnostically critical features of metastatic potential.AIM To develop and validate a case-level multiple-instance learning(MIL)framework mimicking a pathologist's comprehensive review and improve T3/T4 CRC LNM prediction.METHODS The whole-slide images of 130 patients with T3/T4 CRC were retrospectively collected.A case-level MIL framework utilising the CONCH v1.5 and UNI2-h deep learning models was trained on features from all haematoxylin and eosinstained primary tumour slides for each patient.These pathological features were subsequently integrated with clinical data,and model performance was evaluated using the area under the curve(AUC).RESULTS The case-level framework demonstrated superior LNM prediction over slide-level training,with the CONCH v1.5 model achieving a mean AUC(±SD)of 0.899±0.033 vs 0.814±0.083,respectively.Integrating pathology features with clinical data further enhanced performance,yielding a top model with a mean AUC of 0.904±0.047,in sharp contrast to a clinical-only model(mean AUC 0.584±0.084).Crucially,a pathologist’s review confirmed that the model-identified high-attention regions correspond to known high-risk histopathological features.CONCLUSION A case-level MIL framework provides a superior approach for predicting LNM in advanced CRC.This method shows promise for risk stratification and therapy decisions,requiring further validation.
文摘Underwater pipeline inspection plays a vital role in the proactive maintenance and management of critical marine infrastructure and subaquatic systems.However,the inspection of underwater pipelines presents a challenge due to factors such as light scattering,absorption,restricted visibility,and ambient noise.The advancement of deep learning has introduced powerful techniques for processing large amounts of unstructured and imperfect data collected from underwater environments.This study evaluated the efficacy of the You Only Look Once(YOLO)algorithm,a real-time object detection and localization model based on convolutional neural networks,in identifying and classifying various types of pipeline defects in underwater settings.YOLOv8,the latest evolution in the YOLO family,integrates advanced capabilities,such as anchor-free detection,a cross-stage partial network backbone for efficient feature extraction,and a feature pyramid network+path aggregation network neck for robust multi-scale object detection,which make it particularly well-suited for complex underwater environments.Due to the lack of suitable open-access datasets for underwater pipeline defects,a custom dataset was captured using a remotely operated vehicle in a controlled environment.This application has the following assets available for use.Extensive experimentation demonstrated that YOLOv8 X-Large consistently outperformed other models in terms of pipe defect detection and classification and achieved a strong balance between precision and recall in identifying pipeline cracks,rust,corners,defective welds,flanges,tapes,and holes.This research establishes the baseline performance of YOLOv8 for underwater defect detection and showcases its potential to enhance the reliability and efficiency of pipeline inspection tasks in challenging underwater environments.
基金Supported by Japan Society for the Promotion of Science,No.24K11935.
文摘This review comprehensively summarized the potential of artificial intelligence(AI)in the management of esophageal cancer.It highlighted the significance of AI-assisted endoscopy in Japan where endoscopy is central to both screening and diagnosis.For the clinical adaptation of AI,several challenges remain for its effective translation.The establishment of high-quality clinical databases,such as the National Clinical Database and Japan Endoscopy Database in Japan,which covers almost all cases of esophageal cancer,is essential for validating multimodal AI models.This requires rigorous external validation using diverse datasets,including those from different endoscope manufacturers and image qualities.Furthermore,endoscopists’skills significantly affect diagnostic accuracy,suggesting that AI should serve as a supportive tool rather than a replacement.Addressing these challenges,along with country-specific legal and ethical considerations,will facilitate the successful integration of multimodal AI into the management of esophageal cancer,particularly in endoscopic diagnosis,and contribute to improved patient outcomes.Although this review focused on Japan as a case study,the challenges and solutions described are broadly applicable to other high-incidence regions.
基金supported by the National Natural Science Foundation of China(52272177,12204010)the Foundation for the Introduction of High-Level Talents of Anhui University(S020118002/097)+1 种基金the University Synergy Innovation Program of Anhui Province(GXXT-2023-066)the Scientific Research Project of Anhui Provincial Higher Education Institution(2023AH040008)。
文摘Flexible electronics face critical challenges in achieving monolithic three-dimensional(3D)integration,including material compatibility,structural stability,and scalable fabrication methods.Inspired by the tactile sensing mechanism of the human skin,we have developed a flexible monolithic 3D-integrated tactile sensing system based on a holey MXene paste,where each vertical one-body unit simultaneously functions as a microsupercapacitor and pressure sensor.The in-plane mesopores of MXene significantly improve ion accessibility,mitigate the self-stacking of nanosheets,and allow the holey MXene to multifunctionally act as a sensing material,an active electrode,and a conductive interconnect,thus drastically reducing the interface mismatch and enhancing the mechanical robustness.Furthermore,we fabricate a large-scale device using a blade-coating and stamping method,which demonstrates excellent mechanical flexibility,low-power consumption,rapid response,and stable long-term operation.As a proof-of-concept application,we integrate our sensing array into a smart access control system,leveraging deep learning to accurately identify users based on their unique pressing behaviors.This study provides a promising approach for designing highly integrated,intelligent,and flexible electronic systems for advanced human-computer interactions and personalized electronics.
文摘Artificial intelligence(AI)is revolutionizing medical imaging,particularly in chronic liver diseases assessment.AI technologies,including machine learning and deep learning,are increasingly integrated with multiparametric ultrasound(US)techniques to provide more accurate,objective,and non-invasive evaluations of liver fibrosis and steatosis.Analyzing large datasets from US images,AI enhances diagnostic precision,enabling better quantification of liver stiffness and fat content,which are essential for diagnosing and staging liver fibrosis and steatosis.Combining advanced US modalities,such as elastography and doppler imaging with AI,has demonstrated improved sensitivity in identifying different stages of liver disease and distinguishing various degrees of steatotic liver.These advancements also contribute to greater reproducibility and reduced operator dependency,addressing some of the limitations of traditional methods.The clinical implications of AI in liver disease are vast,ranging from early detection to predicting disease progression and evaluating treatment response.Despite these promising developments,challenges such as the need for large-scale datasets,algorithm transparency,and clinical validation remain.The aim of this review is to explore the current applications and future potential of AI in liver fibrosis and steatosis assessment using multiparametric US,highlighting the technological advances and clinical relevance of this emerging field.
基金supported by the National Natural Science Foundation of China[grant number 62376217]the Young Elite Scientists Sponsorship Program by CAST[grant number 2023QNRC001]the Joint Research Project for Meteorological Capacity Improvement[grant number 24NLTSZ003]。
文摘Deep learning-based methods have become alternatives to traditional numerical weather prediction systems,offering faster computation and the ability to utilize large historical datasets.However,the application of deep learning to medium-range regional weather forecasting with limited data remains a significant challenge.In this work,three key solutions are proposed:(1)motivated by the need to improve model performance in data-scarce regional forecasting scenarios,the authors innovatively apply semantic segmentation models,to better capture spatiotemporal features and improve prediction accuracy;(2)recognizing the challenge of overfitting and the inability of traditional noise-based data augmentation methods to effectively enhance model robustness,a novel learnable Gaussian noise mechanism is introduced that allows the model to adaptively optimize perturbations for different locations,ensuring more effective learning;and(3)to address the issue of error accumulation in autoregressive prediction,as well as the challenge of learning difficulty and the lack of intermediate data utilization in one-shot prediction,the authors propose a cascade prediction approach that effectively resolves these problems while significantly improving model forecasting performance.The method achieves a competitive result in The East China Regional AI Medium Range Weather Forecasting Competition.Ablation experiments further validate the effectiveness of each component,highlighting their contributions to enhancing prediction performance.
基金supported by the Capital Fund for Health Improvement and Research,No.2022-2-2048(to WZ)the National Natural Science Foundation of China,No.81970992(to WZ)+3 种基金Capital Clinical Characteristic Application Research,No.Z121107001012161(to WZ)the Natural Science Foundation of Beijing,No.7082032(to WZ)the Key Technology R&D Program of Beijing Municipal Education Commission,No.KZ201610025030(to WZ)Project of Scientific and Technological Development of Traditional Chinese Medicine in Beijing,No.JJ2018-48(to WZ)。
文摘Alzheimer's disease is the most common type of cognitive disorder,and there is an urgent need to develop more effective,targeted and safer therapies for patients with this condition.Deep brain stimulation is an invasive surgical treatment that modulates abnormal neural activity by implanting electrodes into specific brain areas followed by electrical stimulation.As an emerging therapeutic approach,deep brain stimulation shows significant promise as a potential new therapy for Alzheimer's disease.Here,we review the potential mechanisms and therapeutic effects of deep brain stimulation in the treatment of Alzheimer's disease based on existing clinical and basic research.In clinical studies,the most commonly targeted sites include the fornix,the nucleus basalis of Meynert,and the ventral capsule/ventral striatum.Basic research has found that the most frequently targeted areas include the fornix,nucleus basalis of Meynert,hippocampus,entorhinal cortex,and rostral intralaminar thalamic nucleus.All of these individual targets exhibit therapeutic potential for patients with Alzheimer's disease and associated mechanisms of action have been investigated.Deep brain stimulation may exert therapeutic effects on Alzheimer's disease through various mechanisms,including reducing the deposition of amyloid-β,activation of the cholinergic system,increasing the levels of neurotrophic factors,enhancing synaptic activity and plasticity,promoting neurogenesis,and improving glucose metabolism.Currently,clinical trials investigating deep brain stimulation for Alzheimer's disease remain insufficient.In the future,it is essential to focus on translating preclinical mechanisms into clinical trials.Furthermore,consecutive follow-up studies are needed to evaluate the long-term safety and efficacy of deep brain stimulation for Alzheimer's disease,including cognitive function,neuropsychiatric symptoms,quality of life and changes in Alzheimer's disease biomarkers.Researchers must also prioritize the initiation of multi-center clinical trials of deep brain stimulation with large sample sizes and target earlier therapeutic windows,such as the prodromal and even the preclinical stages of Alzheimer's disease.Adopting these approaches will permit the efficient exploration of more effective and safer deep brain stimulation therapies for patients with Alzheimer's disease.
文摘The rapid growth of biomedical data,particularly multi-omics data including genomes,transcriptomics,proteomics,metabolomics,and epigenomics,medical research and clinical decision-making confront both new opportunities and obstacles.The huge and diversified nature of these datasets cannot always be managed using traditional data analysis methods.As a consequence,deep learning has emerged as a strong tool for analysing numerous omics data due to its ability to handle complex and non-linear relationships.This paper explores the fundamental concepts of deep learning and how they are used in multi-omics medical data mining.We demonstrate how autoencoders,variational autoencoders,multimodal models,attention mechanisms,transformers,and graph neural networks enable pattern analysis and recognition across all omics data.Deep learning has been found to be effective in illness classification,biomarker identification,gene network learning,and therapeutic efficacy prediction.We also consider critical problems like as data quality,model explainability,whether findings can be repeated,and computational power requirements.We now consider future elements of combining omics with clinical and imaging data,explainable AI,federated learning,and real-time diagnostics.Overall,this study emphasises the need of collaborating across disciplines to advance deep learning-based multi-omics research for precision medicine and comprehending complicated disorders.
基金financially supported by the National Science and Technology Major Project——Deep Earth Probe and Mineral Resources Exploration(No.2024ZD1003701)the National Key R&D Program of China(No.2022YFC2905004)。
文摘An image processing and deep learning method for identifying different types of rock images was proposed.Preprocessing,such as rock image acquisition,gray scaling,Gaussian blurring,and feature dimensionality reduction,was conducted to extract useful feature information and recognize and classify rock images using Tensor Flow-based convolutional neural network(CNN)and Py Qt5.A rock image dataset was established and separated into workouts,confirmation sets,and test sets.The framework was subsequently compiled and trained.The categorization approach was evaluated using image data from the validation and test datasets,and key metrics,such as accuracy,precision,and recall,were analyzed.Finally,the classification model conducted a probabilistic analysis of the measured data to determine the equivalent lithological type for each image.The experimental results indicated that the method combining deep learning,Tensor Flow-based CNN,and Py Qt5 to recognize and classify rock images has an accuracy rate of up to 98.8%,and can be successfully utilized for rock image recognition.The system can be extended to geological exploration,mine engineering,and other rock and mineral resource development to more efficiently and accurately recognize rock samples.Moreover,it can match them with the intelligent support design system to effectively improve the reliability and economy of the support scheme.The system can serve as a reference for supporting the design of other mining and underground space projects.
文摘Metabolic dysfunction-associated steatotic liver disease(MASLD)is an increasingly prevalent condition associated with hepatic complications and cardiovascular and renal events.Given its significant clinical impact,the development of new strategies for early diagnosis and treatment is essential to improve patient outcomes.Over the past decade,the integration of artificial intelligence(AI)into gastroenterology has led to transformative advancements in medical practice.AI represents a major step towards personalized medicine,offering the potential to enhance diagnostic accuracy,refine prognostic assessments,and optimize treatment strategies.Its applications are rapidly expanding.This article explores the emerging role of AI in the management of MASLD,emphasizing its ability to improve clinical prediction,enhance the diagnostic performance of imaging modalities,and support histopathological confirmation.Additionally,it examines the development of AI-guided personalized treatments,where lifestyle modifications and close monitoring play a pivotal role in achieving therapeutic success.
基金Supported by Xuhui District Health Commission,No.SHXH202214.
文摘Gastrointestinal tumors require personalized treatment strategies due to their heterogeneity and complexity.Multimodal artificial intelligence(AI)addresses this challenge by integrating diverse data sources-including computed tomography(CT),magnetic resonance imaging(MRI),endoscopic imaging,and genomic profiles-to enable intelligent decision-making for individualized therapy.This approach leverages AI algorithms to fuse imaging,endoscopic,and omics data,facilitating comprehensive characterization of tumor biology,prediction of treatment response,and optimization of therapeutic strategies.By combining CT and MRI for structural assessment,endoscopic data for real-time visual inspection,and genomic information for molecular profiling,multimodal AI enhances the accuracy of patient stratification and treatment personalization.The clinical implementation of this technology demonstrates potential for improving patient outcomes,advancing precision oncology,and supporting individualized care in gastrointestinal cancers.Ultimately,multimodal AI serves as a transformative tool in oncology,bridging data integration with clinical application to effectively tailor therapies.
基金supported by the Ministry of Science and Technology of China,No.2020AAA0109605(to XL)Meizhou Major Scientific and Technological Innovation PlatformsProjects of Guangdong Provincial Science & Technology Plan Projects,No.2019A0102005(to HW).
文摘Early identification and treatment of stroke can greatly improve patient outcomes and quality of life.Although clinical tests such as the Cincinnati Pre-hospital Stroke Scale(CPSS)and the Face Arm Speech Test(FAST)are commonly used for stroke screening,accurate administration is dependent on specialized training.In this study,we proposed a novel multimodal deep learning approach,based on the FAST,for assessing suspected stroke patients exhibiting symptoms such as limb weakness,facial paresis,and speech disorders in acute settings.We collected a dataset comprising videos and audio recordings of emergency room patients performing designated limb movements,facial expressions,and speech tests based on the FAST.We compared the constructed deep learning model,which was designed to process multi-modal datasets,with six prior models that achieved good action classification performance,including the I3D,SlowFast,X3D,TPN,TimeSformer,and MViT.We found that the findings of our deep learning model had a higher clinical value compared with the other approaches.Moreover,the multi-modal model outperformed its single-module variants,highlighting the benefit of utilizing multiple types of patient data,such as action videos and speech audio.These results indicate that a multi-modal deep learning model combined with the FAST could greatly improve the accuracy and sensitivity of early stroke identification of stroke,thus providing a practical and powerful tool for assessing stroke patients in an emergency clinical setting.