This study proposes a deep learning-based approach for shaft resistance evaluation of cast-in-site piles on reclaimed ground, independent of theoretical hypotheses and engineering experience. A series of field tests was first performed to investigate the characteristics of the shaft resistance of cast-in-site piles on reclaimed ground. Then, an intelligent approach based on the long short-term memory deep-learning technique was proposed to calculate the shaft resistance of the cast-in-site pile. The proposed method allows accurate estimation of the shaft resistance of cast-in-site piles, not only under the ultimate load but also under the working load. Comparisons with empirical methods confirmed the effectiveness of the proposed method for the shaft resistance estimation of cast-in-site piles on reclaimed ground in offshore areas.
Films available over the Internet and multimedia sources are numerous, and their contents are complex. It is time-consuming for a viewer to select a favorite film. This paper presents an automatic recognition system for film types. First, a film is sampled as frame sequences. The color space, including hue, saturation, and brightness value (HSV), is analyzed for each sampled frame by computing the deviation and mean of HSV for each film. These features are utilized as inputs to a deep-learning neural network (DNN) for the recognition of film types. One hundred films are utilized to train and validate the model parameters of the DNN. In the testing phase, a film is recognized by the trained DNN as one of five categories: action, comedy, horror thriller, romance, and science fiction. The experimental results reveal that film types can be effectively recognized by the proposed approach, enabling the viewer to select an interesting film accurately and quickly.
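The feature-extraction step described in this abstract can be illustrated with a minimal sketch. It assumes that "deviation" means the population standard deviation, that hue is averaged linearly rather than circularly, and that each frame is a list of RGB tuples scaled to [0, 1]. The function name `hsv_features` and the six-value layout are hypothetical and are not taken from the paper.

```python
import colorsys
from statistics import mean, pstdev


def hsv_features(frames):
    """Return [mean_H, std_H, mean_S, std_S, mean_V, std_V] for one film.

    `frames` is a list of sampled frames; each frame is a list of
    (r, g, b) tuples with components in [0, 1]. This layout is an
    assumption made for illustration only.
    """
    hues, sats, vals = [], [], []
    for frame in frames:
        for r, g, b in frame:
            # Convert each pixel from RGB to HSV (all components in [0, 1]).
            h, s, v = colorsys.rgb_to_hsv(r, g, b)
            hues.append(h)
            sats.append(s)
            vals.append(v)
    # Pool all pixels of all sampled frames into per-film statistics.
    return [
        mean(hues), pstdev(hues),
        mean(sats), pstdev(sats),
        mean(vals), pstdev(vals),
    ]


# Toy input: one frame holding a pure red pixel and a black pixel.
features = hsv_features([[(1.0, 0.0, 0.0), (0.0, 0.0, 0.0)]])
```

A vector like this would then be fed to a classifier; the network architecture itself is not specified in the abstract and is left out here.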
Realizing large materials models has emerged as a critical endeavor for materials research in the new era of artificial intelligence, but how to achieve this fantastic and challenging objective remains elusive. Here, we propose a feasible pathway to address this paramount pursuit by developing universal materials models of deep-learning density functional theory Hamiltonian (DeepH), enabling computational modeling of the complicated structure-property relationship of materials in general. By constructing a large materials database and substantially improving the DeepH method, we obtain a universal materials model of DeepH capable of handling diverse elemental compositions and material structures, achieving remarkable accuracy in predicting material properties. We further showcase a promising application of fine-tuning universal materials models for enhancing specific materials models. This work not only demonstrates the concept of DeepH's universal materials model but also lays the groundwork for developing large materials models, opening up significant opportunities for advancing artificial intelligence-driven materials discovery.
Accurately predicting the concentration of fine particulate matter (PM2.5) is crucial for evaluating air pollution levels and public exposure. Recent advancements have seen a significant rise in using deep learning (DL) models for forecasting PM2.5 concentrations. Nonetheless, there is a lack of unified and standardized frameworks for assessing the performance of DL-based PM2.5 prediction models. Here we extensively reviewed DL-based hybrid models for forecasting PM2.5 levels according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. We examined the similarities and differences among various DL models in predicting PM2.5 by comparing their complexity and effectiveness. We categorized PM2.5 DL methodologies into seven types based on performance and application conditions, including four types of DL-based models and three types of hybrid learning models. Our research indicates that established deep learning architectures are commonly used and respected for their efficiency. However, many of these models fall short in terms of innovation and interpretability. Conversely, models hybridized with traditional approaches, such as deterministic and statistical models, exhibit high interpretability but compromise on accuracy and speed. In addition, hybrid DL models, representing the pinnacle of innovation among the studied models, encounter issues with interpretability. We introduce a novel three-dimensional evaluation framework, the Dataset-Method-Experiment Standard (DMES), to unify and standardize the evaluation of PM2.5 predictions using DL models. This review provides a framework for future evaluations of DL-based models, which could inspire researchers to standardize DL model usage in PM2.5 prediction and improve the quality of related studies.
Dear Editor, Insulin is important for body metabolism regulation and glucose homeostasis, and its dysregulation often leads to metabolic syndrome (MS) and diabetes. Insulin is normally stored in large dense-core vesicles (LDCVs) in pancreatic beta cells, and significant reductions in the number, size, gray level and density of insulin granules confer diabetes both in mice (Xue et al., 2012) and humans (Masini et al., 2012).
Recently, wearable gait-assist robots have been evolving towards using soft materials designed for the elderly rather than individuals with disabilities, which emphasize modularization, simplification, and weight reduction. Thus, synchronizing the robotic assistive force with that of the user’s leg movements is crucial for usability, which requires accurate recognition of the user’s gait intent. In this study, we propose a deep learning model capable of identifying not only gait mode and gait phase but also phase progression. Utilizing data from five inertial measurement units placed on the body, the proposed two-stage architecture incorporates a bidirectional long short-term memory-based model for robust classification of locomotion modes and phases. Subsequently, phase progression is estimated through 1D convolutional neural network-based regressors, each dedicated to a specific phase. The model was evaluated on a diverse dataset encompassing level walking, stair ascent and descent, and sit-to-stand activities from 10 healthy participants. The results demonstrate its ability to accurately classify locomotion phases and estimate phase progression. Accurate phase progression estimation is essential due to the age-related variability in gait phase durations, particularly evident in older adults, the primary demographic for gait-assist robots. These findings underscore the potential to enhance the assistance, comfort, and safety provided by gait-assist robots.
The Conditional Nonlinear Optimal Perturbation (CNOP) method works essentially for conventional numerical models; however, it is not fully applicable to the commonly used deep-learning forecasting models (DLMs), which typically input multiple time slices without deterministic dependencies. In this study, the CNOP for DLMs (CNOP-DL) is proposed as an extension of the CNOP in the time dimension. This method is useful for targeted observations as it indicates not only where but also when to deploy additional observations. The CNOP-DL is calculated for a forecast case of sea surface temperature in the South China Sea with a DLM. The CNOP-DL identifies a sensitive area northwest of Palawan Island at the last input time. Sensitivity experiments demonstrate that the sensitive area identified by the CNOP-DL is effective not only for the CNOP-DL itself, but also for random perturbations. Therefore, this approach holds potential for guiding practical field campaigns. Notably, forecast errors are more sensitive to time than to location in the sensitive area. This highlights the crucial role of identifying the time of the sensitive area in targeted observations, corroborating the usefulness of extending the CNOP in the time dimension.
Gesture recognition utilizing flexible strain sensors is a highly valuable technology widely applied in human-machine interfaces. However, achieving rapid detection of subtle motions and timely processing of dynamic signals remains a challenge for sensors. Here, highly resilient and durable ionogels are developed by introducing micro-scale incompatible phases into a macroscopically homogeneous polymeric network. The compatible network disperses in the conductive ionic liquid to form a highly resilient and stretchable skeleton, while the incompatible phase forms hydrogen bonds to dissipate energy, thus strengthening the ionogels. The ionogel-derived strain sensors show high sensitivity, fast response time (<10 ms), a low detection limit (~50 μm), and remarkable durability (>5000 cycles), allowing for precise monitoring of human motions. More importantly, a self-adaptive recognition program empowered by deep-learning algorithms is designed to compensate for the sensors, creating a comprehensive system capable of dynamic gesture recognition. This system can comprehensively analyze both the temporal and spatial features of sensor data, enabling a deeper understanding of the dynamic process underlying gestures. The system classifies 10 hand gestures across five participants with an accuracy of 93.66%. Moreover, it maintains robust recognition performance without the need for further training even when different sensors or subjects are involved. This technological breakthrough paves the way for intuitive and seamless interaction between humans and machines, presenting significant opportunities in diverse applications, such as human-robot interaction, virtual reality control, and assistive devices for disabled individuals.
Using a first-principles-based machine-learning potential, molecular dynamics (MD) simulations are performed to investigate the micro-mechanism of the phase transition of NbO2. Treating the DFT results of the low- and intermediate-temperature phases of NbO2 as training data in the deep-learning model, we successfully constructed an interatomic potential capable of accurately reproducing the phase transitions from low-temperature (pressure) to high-temperature (pressure) regimes. Notably, our simulations predict a high-pressure monoclinic phase (>14 GPa) without including its information in the training set, consistent with previous experimental findings, demonstrating the reliability of the constructed interatomic potential. We identified the Nb-dimers as the key structural motif governing the phase transitions. At low temperatures, the displacements of the Nb-dimers drive the transition between the I41/a (α-NbO2) and I41 (β-NbO2) phases, while at high temperatures, Nb ions are prone to being equally distributed and the disappearance of Nb-dimers leads to the stabilization of a high-symmetry P42/mnm phase. These findings elucidate the structural and dynamical mechanisms underlying the structural properties of NbO2 and highlight the utility of combining DFT and deep potential MD methods for studying complex phase transitions in transition metal oxides.
The representation of spatial variation of soil properties in the form of random fields permits advanced probabilistic assessment of slope stability. In many studies, the safety margin of the system is typically characterized by the term “probability of failure (Pfailure)”. As the intensity and spatial distribution of soil properties vary in different random field realizations, the failure mechanism and deformation field of a slope can vary as well. Not only can the location of the failure surfaces vary, but the mode of failure also changes. Such information is equally valuable to engineering practitioners. In this paper, two slope examples that are modified from a real case study are presented. The first example pertains to the stability analysis of a multi-layer slope, while the second example deals with the serviceability analysis of a multi-layer c-φ slope. In addition, due to the large number of simulations needed to reveal the full picture of the failure mechanism, a convolutional neural network (CNN) that adopts a U-Net architecture is proposed to offer a soft computing strategy to facilitate the investigation. The spatial distribution of the failure surfaces, the statistics of the sliding volume, and the statistics of the deformation field are presented. The results also show that the proposed deep-learning model is effective in predicting the failure mechanism and deformation field of slopes in spatially variable soils, therefore encouraging probabilistic study of slopes in practical scenarios.
Pedestrian positioning systems (PPS) using wearable inertial sensors have wide applications in various emerging fields such as smart healthcare, emergency rescue, and soldier positioning. The performance of traditional PPS is limited by the cumulative error of inertial sensors, complex motion modes of pedestrians, and the low robustness of the multi-sensor collaboration structure. This paper presents a hybrid pedestrian positioning system using the combination of wearable inertial sensors and ultrasonic ranging (H-PPS). A robust two-node integration structure is developed to adaptively combine the motion data acquired from the single waist-mounted and foot-mounted nodes, and is enhanced by a novel ellipsoid constraint model. In addition, a deep-learning-based walking speed estimator is proposed by considering all the motion features provided by different nodes, which effectively reduces the cumulative error originating from inertial sensors. Finally, a comprehensive data and model dual-driven model is presented to effectively combine the motion data provided by different sensor nodes and the walking speed estimator, and multi-level constraints are extracted to further improve the performance of the overall system. Experimental results indicate that the proposed H-PPS significantly improves the performance of the single PPS and outperforms existing algorithms in accuracy index under complex indoor scenarios.
This study describes improving network security by implementing and assessing an intrusion detection system (IDS) based on deep neural networks (DNNs). The paper investigates contemporary technical ways for enhancing intrusion detection performance, given the vital relevance of safeguarding computer networks against harmful activity. The DNN-based IDS is trained and validated using the NSL-KDD dataset, a popular benchmark for IDS research. The model performs well in both the training and validation stages, with 91.30% training accuracy and 94.38% validation accuracy. Thus, the model shows good learning and generalization capabilities with minor losses of 0.22 in training and 0.1553 in validation. Furthermore, for both macro and micro averages across class 0 (normal) and class 1 (anomalous) data, the study evaluates the model using a variety of assessment measures, such as accuracy scores, precision, recall, and F1 scores. The macro-average recall is 0.9422, the macro-average precision is 0.9482, and the accuracy score is 0.942. Furthermore, macro-averaged F1 scores of 0.9245 for class 1 and 0.9434 for class 0 demonstrate the model’s ability to precisely identify anomalies. The research also highlights how real-time threat monitoring and enhanced resistance against new online attacks may be achieved by DNN-based intrusion detection systems, which can significantly improve network security. The study underscores the critical function of DNN-based IDS in contemporary cybersecurity procedures by setting the foundation for further developments in this field. Upcoming research aims to enhance intrusion detection systems by examining cooperative learning techniques and integrating up-to-date threat knowledge.
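For readers unfamiliar with the measures quoted in this abstract, the sketch below shows how per-class precision, recall, and F1, and their macro averages, are conventionally computed for a two-class problem. It is a generic illustration under the standard definitions, not the authors' evaluation code. The function names and the toy labels are hypothetical.

```python
def per_class_metrics(y_true, y_pred, label):
    """Precision, recall and F1 for one class, treating `label` as positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def macro_average(y_true, y_pred, labels=(0, 1)):
    """Unweighted mean of the per-class precision, recall and F1."""
    rows = [per_class_metrics(y_true, y_pred, c) for c in labels]
    return tuple(sum(row[i] for row in rows) / len(rows) for i in range(3))


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)


# Toy labels (0 = normal, 1 = anomalous); not data from the study.
y_true = [0, 0, 1, 1]
y_pred = [0, 1, 1, 1]
macro_p, macro_r, macro_f1 = macro_average(y_true, y_pred)
acc = accuracy(y_true, y_pred)
```

For single-label classification, micro-averaged precision and recall both reduce to the overall accuracy, which is why the macro average is usually the more informative of the two on imbalanced intrusion data.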
Breast cancer is a type of cancer responsible for higher mortality rates among women. The severity of breast cancer calls for a promising approach to its earlier detection. In light of this, the proposed research leverages the representation ability of a pretrained EfficientNet-B0 model and the classification ability of the XGBoost model for the binary classification of breast tumors. In addition, the above transfer learning model is modified in such a way that it focuses more on tumor cells in the input mammogram. Accordingly, the work proposes an EfficientNet-B0 having a Spatial Attention Layer with XGBoost (ESA-XGBNet) for binary classification of mammograms. The model is trained, tested, and validated using original and augmented mammogram images from three public datasets, namely the CBIS-DDSM, INbreast, and MIAS databases. Maximum classification accuracies of 97.585% (CBIS-DDSM), 98.255% (INbreast), and 98.91% (MIAS) are obtained using the proposed ESA-XGBNet architecture as compared with the existing models. Furthermore, the decision-making of the proposed ESA-XGBNet architecture is visualized and validated using the Attention Guided GradCAM-based Explainable AI technique.
Intrusion detection systems (IDS) are essential in the field of cybersecurity because they protect networks from a wide range of online threats. The goal of this research is to meet the urgent need for small-footprint, highly adaptable Network Intrusion Detection Systems (NIDS) that can identify anomalies. The NSL-KDD dataset is used in the study; it is a sizable collection comprising 43 variables with the labels “attack” and “level.” The study proposes a novel approach to intrusion detection based on the combination of channel attention and convolutional neural networks (CNN). Furthermore, this dataset makes it easier to conduct a thorough assessment of the suggested intrusion detection strategy. Maintaining operating efficiency while improving detection accuracy is the primary goal of this work. Moreover, a typical NIDS examines both risky and typical behavior using a variety of techniques. On the NSL-KDD dataset, our CNN-based approach achieves a 99.728% accuracy rate when paired with channel attention. Compared to previous approaches such as ensemble learning, CNN, restricted Boltzmann machine (RBM), ANN, hybrid auto-encoders with CNN, MCNN, and ANN, and adaptive algorithms, our solution significantly improves intrusion detection performance. Moreover, the results highlight the effectiveness of our suggested method in improving intrusion detection precision, signifying a noteworthy advancement in this field. Subsequent efforts will focus on strengthening and expanding our approach in order to counteract growing cyberthreats and adjust to changing network circumstances.
BACKGROUND Hip dysplasia (HD) is characterized by insufficient acetabular coverage of the femoral head, leading to a predisposition for osteoarthritis. While radiographic measurements such as the lateral center edge angle (LCEA) and Tönnis angle are essential in evaluating HD severity, patient-reported outcome measures (PROMs) offer insights into the subjective health impact on patients. AIM To investigate the correlations between machine-learning automated and manual radiographic measurements of HD and PROMs, with the hypothesis that artificial intelligence (AI)-generated HD measurements indicating less severe dysplasia correlate with better PROMs. METHODS Retrospective study evaluating 256 hips from 130 HD patients from a hip preservation clinic database. Manual and AI-derived radiographic measurements were collected, and PROMs such as the Harris hip score (HHS), international hip outcome tool (iHOT-12), short form 12 (SF-12), and Visual Analogue Scale of the European Quality of Life Group survey were correlated using Spearman's rank-order correlation. RESULTS The median patient age was 28.6 years (range 15.7-62.3 years), with 82.3% of patients being women and 17.7% being men. The median interpretation time for manual readers and AI ranged between 4-12 minutes per patient and 31 seconds, respectively. Manual measurements exhibited weak correlations with HHS, including LCEA (r = 0.18) and Tönnis angle (r = -0.24). AI-derived metrics showed similar weak correlations, with the most significant being Caput-Collum-Diaphyseal (CCD) with iHOT-12 at r = -0.25 (P = 0.042) and CCD with SF-12 at r = 0.25 (P = 0.048). Other measured correlations were not significant (P > 0.05). CONCLUSION This study suggests AI can aid in HD assessment, but weak PROM correlations highlight their continued importance in predicting subjective health and outcomes, complementing AI-derived measurements in HD management.
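Spearman's rank-order correlation, the statistic used in this abstract, can be sketched as the Pearson correlation of the ranks, with tied values sharing their average rank. The code below is a generic illustration of that definition, not the study's analysis pipeline. The function names and the sample values are hypothetical, and the sketch does not compute P values.

```python
from math import sqrt


def average_ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole group of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Pearson correlation of the rank vectors of x and y.

    Raises ZeroDivisionError if either input is constant.
    """
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sqrt(sum((a - mx) ** 2 for a in rx))
    sy = sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy)


# Hypothetical angle measurements and outcome scores, not study data.
angles = [10.0, 20.0, 30.0, 40.0]
scores = [1.0, 4.0, 9.0, 16.0]
rho_up = spearman_rho(angles, scores)
rho_down = spearman_rho(angles, list(reversed(scores)))
```

Because only ranks enter the calculation, any monotonic relationship yields a coefficient of magnitude 1, which makes the statistic suitable for ordinal questionnaire scores such as PROMs.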
Artificial intelligence (AI) using deep learning (DL) has emerged as a breakthrough computer technology. In the era of big data, the accumulation of an enormous number of digital images and medical records drove the need for the utilization of AI to efficiently deal with these data, which have become fundamental resources for a machine to learn by itself. Among several DL models, the convolutional neural network showed outstanding performance in image analysis. In the field of gastroenterology, physicians handle large amounts of clinical data and various kinds of imaging devices such as endoscopy and ultrasound. AI has been applied in gastroenterology in terms of diagnosis, prognosis, and image analysis. However, potential inherent selection bias cannot be excluded in retrospective studies. Because overfitting and spectrum bias (class imbalance) can lead to overestimation of the accuracy, external validation using datasets not used for model development, collected in a way that minimizes the spectrum bias, is mandatory. For robust verification, prospective studies with adequate inclusion/exclusion criteria, which represent the target populations, are needed. DL also suffers from a lack of interpretability. Because interpretability is important in that it can provide safety measures, help to detect bias, and create social acceptance, further investigations should be performed.
Oral disintegrating tablets (ODTs) are a novel dosage form that can be dissolved on the tongue within 3 min or less, especially for geriatric and pediatric patients. Current ODT formulation studies usually rely on the personal experience of pharmaceutical experts and trial-and-error in the laboratory, which is inefficient and time-consuming. The aim of the current research was to establish a prediction model of ODT formulations with the direct compression process by artificial neural network (ANN) and deep neural network (DNN) techniques. Data for 145 formulations were extracted from Web of Science. All datasets were divided into three parts: training set (105 data), validation set (20) and testing set (20). ANN and DNN were compared for the prediction of the disintegrating time. The accuracy of the ANN model reached 85.60%, 80.00% and 75.00% on the training set, validation set and testing set respectively, whereas that of the DNN model was 85.60%, 85.00% and 80.00%, respectively. Compared with the ANN, the DNN showed better prediction for ODT formulations. It is the first time that a deep neural network with the improved dataset selection algorithm has been applied to formulation prediction on small data. The proposed predictive approach could evaluate the critical parameters for quality control of formulations, and guide research and process development. The implementation of this prediction model could effectively reduce the drug product development timeline and material usage, and proactively facilitate the development of a robust drug product.
Prevention is the most effective way to reduce dental caries. In order to provide a simple way to achieve oral healthcare direction in daily life, a dual-channel, portable dental imaging system that combines white light with autofluorescence techniques was established. A group of volunteers was then recruited, and 7200 tooth pictures of different dental caries stages and dental plaque were taken and collected. In this work, customized convolutional neural networks (CNNs) have been designed to classify dental images with early-stage caries and dental plaque. Eighty percent (n=6000) of the pictures taken were used for supervised training of the CNNs based on experienced dentists' advice, and the remaining 20% (n=1200) were used as a test dataset to test the trained CNNs. The accuracy, sensitivity and specificity were calculated to evaluate the performance of the CNNs. The accuracies for early-stage caries and dental plaque were 95.3% and 95.9%, respectively. These results show that the designed imaging system combined with the customized CNNs could automatically and efficiently find early caries and dental plaque on occlusal, lingual and buccal surfaces. Therefore, this will provide a novel approach to dental caries prevention for everyone in daily life.
This paper proposes a simple and powerful optimal integration (OPI) method for improving hourly quantitative precipitation forecasts (QPFs, 0-24 h) of a single model by integrating the benefits of different bias-corrected methods using the high-resolution CMA-GD model from the Guangzhou Institute of Tropical and Marine Meteorology of China Meteorological Administration (CMA). Three techniques are used to generate multi-method calibrated members for OPI: deep neural network (DNN), frequency-matching (FM), and optimal threat score (OTS). The results are as follows: (1) The QPF using DNN follows the basic physical patterns of CMA-GD. Despite providing superior improvements for clear-rainy and weak precipitation, DNN cannot improve the predictions for severe precipitation, while OTS can significantly strengthen these predictions. As a result, DNN and OTS are the optimal members to be incorporated into OPI. (2) Our new approach achieves state-of-the-art performance on a single model for all magnitudes of precipitation. Compared with the CMA-GD, OPI improves the TS by 2.5%, 5.4%, 7.8%, 8.3%, and 6.1% for QPFs from clear-rainy to rainstorms in the verification dataset. Moreover, OPI shows good stability in the test dataset. (3) It is also noted that the rainstorm pattern of OPI relies heavily on the original model and that OPI cannot correct for deviations in the location of severe precipitation. Therefore, improvements in predicting severe precipitation using this method should be further realized by improving the numerical model's forecasting capability.
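The threat score (TS) reported in this abstract is conventionally defined as hits divided by the sum of hits, misses, and false alarms for events exceeding a precipitation threshold. The sketch below illustrates that standard definition only; it is not the paper's verification code. The function name, the threshold, and the sample values are hypothetical.

```python
import math


def threat_score(forecast, observed, threshold):
    """TS = hits / (hits + misses + false alarms) for one threshold.

    An "event" is a value greater than or equal to `threshold`.
    Returns NaN when no event is forecast or observed at all.
    """
    hits = misses = false_alarms = 0
    for f, o in zip(forecast, observed):
        f_event, o_event = f >= threshold, o >= threshold
        if f_event and o_event:
            hits += 1
        elif o_event:
            misses += 1        # observed but not forecast
        elif f_event:
            false_alarms += 1  # forecast but not observed
    denominator = hits + misses + false_alarms
    return hits / denominator if denominator else math.nan


# Hypothetical hourly rainfall in mm at four grid points, not study data.
forecast = [0.0, 5.0, 12.0, 0.2]
observed = [0.0, 6.0, 0.0, 3.0]
ts = threat_score(forecast, observed, threshold=1.0)
```

Correct rejections (neither forecast nor observed) do not enter the score, so TS is not inflated by the many dry grid points typical of precipitation fields.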
Automatic cell counting provides an effective tool for medical research and diagnosis. Currently, cell counting can be completed with a transmitted-light microscope; however, it requires expert knowledge, and the counting accuracy is unsatisfactory for overlapping cells. Further, an image-translation-based detection method has been proposed and has shown the potential to accomplish cell counting from transmitted-light microscope images automatically and effectively. In this work, a new deep-learning (DL)-based two-stage detection method (cGAN-YOLO) is designed to further enhance the performance of cell counting, which is achieved by combining a DL-based fluorescent image translation model and a DL-based cell detection model. The results show that cGAN-YOLO can effectively detect and count several different types of cells from the acquired transmitted-light microscope images. Compared with the previously reported YOLO-based one-stage detection method, high recognition accuracy (RA) is achieved by the cGAN-YOLO method, with an improvement of 29.80%. Furthermore, cGAN-YOLO obtains an improvement of 12.11% in RA compared with the previously reported image-translation-based detection method. In short, cGAN-YOLO makes it possible to implement cell counting directly from experimentally acquired transmitted-light microscopy images with high flexibility and performance, which extends its applicability in clinical research.
Funding: the Research Funding of Shantou University for New Faculty Member (No. NTF19024-2019); the National Natural Science Foundation of China (No. 41372283).
Abstract: This study proposes a deep learning-based approach for evaluating the shaft resistance of cast-in-site piles on reclaimed ground, independent of theoretical hypotheses and engineering experience. A series of field tests was first performed to investigate the characteristics of the shaft resistance of cast-in-site piles on reclaimed ground. Then, an intelligent approach based on the long short-term memory (LSTM) deep-learning technique was proposed to calculate the shaft resistance of the cast-in-site pile. The proposed method allows accurate estimation of the shaft resistance of cast-in-site piles, not only under the ultimate load but also under the working load. Comparisons with empirical methods confirmed the effectiveness of the proposed method for estimating the shaft resistance of cast-in-site piles on reclaimed ground in offshore areas.
Funding: Supported by MOST under Grant No. MOST 104-2221-E-468-007.
Abstract: Films available over the Internet and multimedia sources are numerous and their contents are complex, so selecting a favorite film is time-consuming for a viewer. This paper presents an automatic recognition system for film types. First, a film is sampled as a sequence of frames. The color space, comprising hue, saturation, and brightness value (HSV), is analyzed for each sampled frame by computing the deviation and mean of HSV for each film. These features are used as inputs to a deep-learning neural network (DNN) for the recognition of film types. One hundred films are used to train and validate the model parameters of the DNN. In the testing phase, the trained DNN assigns a film to one of five categories: action, comedy, horror thriller, romance, and science fiction. The experimental results reveal that film types can be effectively recognized by the proposed approach, enabling the viewer to select an interesting film accurately and quickly.
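The feature step described in this abstract (mean and deviation of HSV over sampled frames) can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function name `hsv_features`, the frame representation (lists of RGB tuples), and the use of the population standard deviation are all hypothetical choices, and hue is averaged linearly even though it is a circular quantity.

```python
import colorsys
from statistics import mean, pstdev


def hsv_features(frames):
    """Return [mean_h, mean_s, mean_v, std_h, std_s, std_v] for a film.

    frames: list of sampled frames, each a list of (r, g, b) tuples in 0..255.
    Assumption: pixels from all sampled frames are pooled before computing
    the statistics; the abstract does not specify the exact pooling.
    """
    hues, sats, vals = [], [], []
    for frame in frames:
        for r, g, b in frame:
            # colorsys expects channel values scaled to the range 0..1
            h, s, v = colorsys.rgb_to_hsv(r / 255.0, g / 255.0, b / 255.0)
            hues.append(h)
            sats.append(s)
            vals.append(v)
    return [
        mean(hues), mean(sats), mean(vals),
        pstdev(hues), pstdev(sats), pstdev(vals),
    ]


# Two tiny "frames" of pure red pixels: hue 0, full saturation and value.
features = hsv_features([[(255, 0, 0)], [(255, 0, 0)]])
```

A six-element vector like this would then be the per-film input to a classifier; a real pipeline would read frames with a video library rather than hand-built lists.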
Funding: Supported by the Basic Science Center Project of the National Natural Science Foundation of China (52388201); the National Natural Science Foundation of China (12334003); the National Science Fund for Distinguished Young Scholars (12025405); the National Key Basic Research and Development Program of China (2023YFA1406400); the Beijing Advanced Innovation Center for Future Chip (ICFC); the Beijing Advanced Innovation Center for Materials Genome Engineering; and the Shuimu Tsinghua Scholar program.
Abstract: Realizing large materials models has emerged as a critical endeavor for materials research in the new era of artificial intelligence, but how to achieve this challenging objective remains elusive. Here, we propose a feasible pathway by developing universal materials models of the deep-learning density functional theory Hamiltonian (DeepH), enabling computational modeling of the complicated structure-property relationship of materials in general. By constructing a large materials database and substantially improving the DeepH method, we obtain a universal materials model of DeepH capable of handling diverse elemental compositions and material structures, achieving remarkable accuracy in predicting material properties. We further showcase a promising application of fine-tuning universal materials models to enhance specific materials models. This work not only demonstrates the concept of DeepH's universal materials model but also lays the groundwork for developing large materials models, opening up significant opportunities for advancing artificial intelligence-driven materials discovery.
Funding: Supported by the Fundamental Research Funds for the Central Public-interest Scientific Institution (2022YSKY-73).
Abstract: Accurately predicting the concentration of fine particulate matter (PM2.5) is crucial for evaluating air pollution levels and public exposure. Recent years have seen a significant rise in the use of deep learning (DL) models for forecasting PM2.5 concentrations. Nonetheless, there is a lack of unified and standardized frameworks for assessing the performance of DL-based PM2.5 prediction models. Here we extensively reviewed DL-based hybrid models for forecasting PM2.5 levels according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. We examined the similarities and differences among various DL models in predicting PM2.5 by comparing their complexity and effectiveness. We categorized PM2.5 DL methodologies into seven types based on performance and application conditions, including four types of DL-based models and three types of hybrid learning models. Our research indicates that established deep learning architectures are commonly used and respected for their efficiency. However, many of these models fall short in terms of innovation and interpretability. Conversely, models hybridized with traditional approaches, such as deterministic and statistical models, exhibit high interpretability but compromise on accuracy and speed. In addition, hybrid DL models, which represent the pinnacle of innovation among the studied models, encounter issues with interpretability. We introduce a novel three-dimensional evaluation framework, the Dataset-Method-Experiment Standard (DMES), to unify and standardize the evaluation of PM2.5 predictions using DL models. This review provides a framework for future evaluations of DL-based models, which could inspire researchers to standardize DL model usage in PM2.5 prediction and improve the quality of related studies.
Funding: This work was supported by grants from the National Key R&D Program of China (Grant Nos. 2017YFA0504700 and 2016YFA0500400); the National Natural Science Foundation of China (Grant Nos. 31570839, 31661143041, 61472395, 31327901, 31521062 and 31730054); the Beijing Natural Science Foundation (L172003); and the Joint Program between the Chinese Academy of Sciences and Peking University.
Abstract: Dear Editor, Insulin is important for the regulation of body metabolism and glucose homeostasis, and its dysregulation often leads to metabolic syndrome (MS) and diabetes. Insulin is normally stored in large dense-core vesicles (LDCVs) in pancreatic beta cells, and significant reductions in the number, size, gray level and density of insulin granules confer diabetes both in mice (Xue et al., 2012) and humans (Masini et al., 2012).
Funding: Supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health & Welfare, Republic of Korea (Grant Number: RS-2022-KH129263).
Abstract: Recently, wearable gait-assist robots have been evolving towards soft-material designs intended for the elderly rather than individuals with disabilities, with an emphasis on modularization, simplification, and weight reduction. Synchronizing the robotic assistive force with the user's leg movements is therefore crucial for usability, which requires accurate recognition of the user's gait intent. In this study, we propose a deep learning model capable of identifying not only gait mode and gait phase but also phase progression. Using data from five inertial measurement units placed on the body, the proposed two-stage architecture incorporates a bidirectional long short-term memory-based model for robust classification of locomotion modes and phases. Subsequently, phase progression is estimated through 1D convolutional neural network-based regressors, each dedicated to a specific phase. The model was evaluated on a diverse dataset encompassing level walking, stair ascent and descent, and sit-to-stand activities from 10 healthy participants. The results demonstrate its ability to accurately classify locomotion phases and estimate phase progression. Accurate phase progression estimation is essential because of the age-related variability in gait phase durations, which is particularly evident in older adults, the primary demographic for gait-assist robots. These findings underscore the potential to enhance the assistance, comfort, and safety provided by gait-assist robots.
Funding: Supported by the National Natural Science Foundation of China (Grant Nos. 42288101, 42375062, 42476192, 42275158); the National Key Scientific and Technological Infrastructure project "Earth System Science Numerical Simulator Facility" (Earth Lab); and the GHfund C (202407036001).
Abstract: The Conditional Nonlinear Optimal Perturbation (CNOP) method works essentially for conventional numerical models; however, it is not fully applicable to the commonly used deep-learning forecasting models (DLMs), which typically take multiple time slices as input without deterministic dependencies. In this study, the CNOP for DLMs (CNOP-DL) is proposed as an extension of the CNOP in the time dimension. This method is useful for targeted observations because it indicates not only where but also when to deploy additional observations. The CNOP-DL is calculated for a forecast case of sea surface temperature in the South China Sea with a DLM. The CNOP-DL identifies a sensitive area northwest of Palawan Island at the last input time. Sensitivity experiments demonstrate that the sensitive area identified by the CNOP-DL is effective not only for the CNOP-DL itself but also for random perturbations. Therefore, this approach holds potential for guiding practical field campaigns. Notably, forecast errors are more sensitive to time than to location in the sensitive area. This highlights the crucial role of identifying the time of the sensitive area in targeted observations, corroborating the usefulness of extending the CNOP in the time dimension.
Funding: Supported by the National Key Research and Development Program of China (No. 2021YFA1401103) and the National Natural Science Foundation of China (Nos. 61825403, 61921005, and 82370520).
Abstract: Gesture recognition using flexible strain sensors is a highly valuable technology widely applied in human-machine interfaces. However, achieving rapid detection of subtle motions and timely processing of dynamic signals remains a challenge for sensors. Here, highly resilient and durable ionogels are developed by introducing micro-scale incompatible phases into a macroscopically homogeneous polymeric network. The compatible network disperses in the conductive ionic liquid to form a highly resilient and stretchable skeleton, while the incompatible phase forms hydrogen bonds to dissipate energy, thus strengthening the ionogels. The ionogel-derived strain sensors show high sensitivity, fast response time (<10 ms), a low detection limit (~50 μm), and remarkable durability (>5000 cycles), allowing precise monitoring of human motions. More importantly, a self-adaptive recognition program empowered by deep-learning algorithms is designed to compensate for the sensors, creating a comprehensive system capable of dynamic gesture recognition. This system can analyze both the temporal and spatial features of sensor data, enabling a deeper understanding of the dynamic process underlying gestures. The system classifies 10 hand gestures across five participants with an accuracy of 93.66%. Moreover, it maintains robust recognition performance without further training even when different sensors or subjects are involved. This technology paves the way for intuitive and seamless interaction between humans and machines, presenting significant opportunities in diverse applications such as human-robot interaction, virtual reality control, and assistive devices for disabled individuals.
Funding: Support from the National Natural Science Foundation of China (Grant Nos. 12422407 and 12204496); the Zhejiang Provincial Natural Science Foundation (Grant No. Q23A040003); and the Ningbo Nature Science Foundation (No. 2023J360).
Abstract: Using a first-principles-based machine-learning potential, molecular dynamics (MD) simulations are performed to investigate the micro-mechanism of the phase transitions of NbO2. Treating the DFT results for the low- and intermediate-temperature phases of NbO2 as training data for the deep-learning model, we successfully constructed an interatomic potential capable of accurately reproducing the phase transitions from low-temperature (pressure) to high-temperature (pressure) regimes. Notably, our simulations predict a high-pressure monoclinic phase (>14 GPa) without including its information in the training set, consistent with previous experimental findings, demonstrating the reliability of the constructed interatomic potential. We identified the Nb-dimers as the key structural motif governing the phase transitions. At low temperatures, the displacements of the Nb-dimers drive the transition between the I41/a (α-NbO2) and I41 (β-NbO2) phases, while at high temperatures, Nb ions tend to be equally distributed and the disappearance of Nb-dimers leads to the stabilization of a high-symmetry P42/mnm phase. These findings elucidate the structural and dynamical mechanisms underlying the structural properties of NbO2 and highlight the utility of combining DFT and deep potential MD methods for studying complex phase transitions in transition metal oxides.
Funding: Supported by the National Natural Science Foundation of China (Grant No. 52130805); the China National Postdoctoral Program for Innovative Talents (BX20220234); and the Shanghai Science and Technology Committee Program (20dz1202200).
Abstract: Representing the spatial variation of soil properties as random fields permits advanced probabilistic assessment of slope stability. In many studies, the safety margin of the system is characterized by the "probability of failure (Pfailure)". Because the intensity and spatial distribution of soil properties vary between random field realizations, the failure mechanism and deformation field of a slope can vary as well. Not only can the location of the failure surfaces vary, but the mode of failure can also change. Such information is equally valuable to engineering practitioners. In this paper, two slope examples modified from a real case study are presented. The first example pertains to the stability analysis of a multi-layer slope, while the second deals with the serviceability analysis of a multi-layer c-φ slope. In addition, because of the large number of simulations needed to reveal the full picture of the failure mechanism, a convolutional neural network (CNN) with a U-Net architecture is proposed as a soft computing strategy to facilitate the investigation. The spatial distribution of the failure surfaces, the statistics of the sliding volume, and the statistics of the deformation field are presented. The results also show that the proposed deep-learning model is effective in predicting the failure mechanism and deformation field of slopes in spatially variable soils, thereby encouraging probabilistic study of slopes in practical scenarios.
Funding: Supported by the National Natural Science Foundation of China (Grant No. 52175531) and in part by the Science and Technology Research Program of Chongqing Municipal Education Commission (Grant Nos. KJQN202000605 and KJZD-M202000602).
Abstract: Pedestrian positioning systems (PPSs) using wearable inertial sensors have wide applications in emerging fields such as smart healthcare, emergency rescue, and soldier positioning. The performance of a traditional PPS is limited by the cumulative error of inertial sensors, the complex motion modes of pedestrians, and the low robustness of the multi-sensor collaboration structure. This paper presents a hybrid pedestrian positioning system (H-PPS) that combines wearable inertial sensors with ultrasonic ranging. A robust two-node integration structure is developed to adaptively combine the motion data acquired from single waist-mounted and foot-mounted nodes, and it is enhanced by a novel ellipsoid constraint model. In addition, a deep-learning-based walking speed estimator is proposed that considers all the motion features provided by the different nodes, which effectively reduces the cumulative error originating from the inertial sensors. Finally, a comprehensive data and model dual-driven model is presented to effectively combine the motion data provided by the different sensor nodes and the walking speed estimator, and multi-level constraints are extracted to further improve the performance of the overall system. Experimental results indicate that the proposed H-PPS significantly improves on the performance of the single PPS and outperforms existing algorithms in accuracy under complex indoor scenarios.
Funding: Princess Nourah bint Abdulrahman University, through the Researchers Supporting Project (PNURSP2024R319); Prince Sultan University, Riyadh, Saudi Arabia.
Abstract: This study describes improving network security by implementing and assessing an intrusion detection system (IDS) based on deep neural networks (DNNs). Given the vital importance of safeguarding computer networks against harmful activity, the paper investigates contemporary technical approaches for enhancing intrusion detection performance. The DNN-based IDS is trained and validated using the NSL-KDD dataset, a popular benchmark for IDS research. The model performs well in both the training and validation stages, with 91.30% training accuracy and 94.38% validation accuracy. The model thus shows good learning and generalization capabilities, with minor losses of 0.22 in training and 0.1553 in validation. Furthermore, for both macro and micro averages across class 0 (normal) and class 1 (anomalous) data, the study evaluates the model using a variety of assessment measures, such as accuracy, precision, recall, and F1 scores. The macro-average recall is 0.9422, the macro-average precision is 0.9482, and the accuracy score is 0.942. In addition, macro-averaged F1 scores of 0.9245 for class 1 and 0.9434 for class 0 demonstrate the model's ability to identify anomalies precisely. The research also highlights how real-time threat monitoring and enhanced resistance against new online attacks may be achieved by DNN-based intrusion detection systems, which can significantly improve network security. The study underscores the critical function of DNN-based IDS in contemporary cybersecurity procedures by laying the foundation for further developments in this field. Upcoming research aims to enhance intrusion detection systems by examining cooperative learning techniques and integrating up-to-date threat knowledge.
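The macro-averaged precision, recall, and F1 scores quoted in this abstract follow standard definitions, which the sketch below spells out for a two-class (normal versus anomalous) problem. It is an illustration only: the function names and the toy labels are hypothetical, and the numbers it produces are unrelated to the results reported in the paper.

```python
def per_class_metrics(tp, fp, fn):
    """Precision, recall, and F1 for one class from its confusion counts."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


def macro_average(y_true, y_pred, classes=(0, 1)):
    """Unweighted mean of the per-class precision, recall, and F1."""
    rows = []
    for c in classes:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        rows.append(per_class_metrics(tp, fp, fn))
    n = len(rows)
    return tuple(sum(row[i] for row in rows) / n for i in range(3))


# Toy labels: 0 = normal traffic, 1 = anomalous traffic (made-up data).
macro_p, macro_r, macro_f1 = macro_average([0, 0, 1, 1], [0, 1, 1, 1])
```

Macro averaging weights each class equally regardless of how many samples it has, which is why it is often reported alongside micro averages on imbalanced intrusion data.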
Funding: Supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project Number (PNURSP2024R432), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
Abstract: Breast cancer is a type of cancer responsible for high mortality rates among women. The severity of breast cancer calls for a promising approach to its earlier detection. In light of this, the proposed research leverages the representation ability of the pretrained EfficientNet-B0 model and the classification ability of the XGBoost model for the binary classification of breast tumors. In addition, the transfer learning model is modified so that it focuses more on tumor cells in the input mammogram. Accordingly, the work proposes an EfficientNet-B0 with a Spatial Attention Layer and XGBoost (ESA-XGBNet) for the binary classification of mammograms. The model is trained, tested, and validated using original and augmented mammogram images from three public datasets, namely the CBIS-DDSM, INbreast, and MIAS databases. Maximum classification accuracies of 97.585% (CBIS-DDSM), 98.255% (INbreast), and 98.91% (MIAS) are obtained using the proposed ESA-XGBNet architecture, as compared with existing models. Furthermore, the decision-making of the proposed ESA-XGBNet architecture is visualized and validated using the Attention Guided GradCAM-based Explainable AI technique.
Funding: The authors thank Princess Nourah bint Abdulrahman University for funding this project through the Researchers Supporting Project (PNURSP2023R319); this research was also funded by Prince Sultan University, Riyadh, Saudi Arabia.
Abstract: Intrusion detection systems (IDS) are essential in cybersecurity because they protect networks from a wide range of online threats. The goal of this research is to meet the urgent need for small-footprint, highly adaptable Network Intrusion Detection Systems (NIDS) that can identify anomalies. The study uses the NSL-KDD dataset, a sizable collection comprising 43 variables, including the labels "attack" and "level". It proposes a novel approach to intrusion detection based on the combination of channel attention and convolutional neural networks (CNN). This dataset also makes it easier to conduct a thorough assessment of the suggested intrusion detection strategy. Maintaining operating efficiency while improving detection accuracy is the primary goal of this work. A typical NIDS examines both risky and typical behavior using a variety of techniques. On the NSL-KDD dataset, our CNN-based approach achieves a 99.728% accuracy rate when paired with channel attention. Compared to previous approaches such as ensemble learning, CNN, RBM (restricted Boltzmann machine), ANN, hybrid auto-encoders with CNN, MCNN, and ANN, and adaptive algorithms, our solution significantly improves intrusion detection performance. The results highlight the effectiveness of the suggested method in improving intrusion detection precision, signifying a noteworthy advancement in this field. Subsequent efforts will focus on strengthening and expanding the approach in order to counteract growing cyberthreats and adjust to changing network circumstances.
Funding: University of Texas Southwestern Institutional Review Board (approval No. Stu-2022-1014).
Abstract: BACKGROUND: Hip dysplasia (HD) is characterized by insufficient acetabular coverage of the femoral head, leading to a predisposition for osteoarthritis. While radiographic measurements such as the lateral center edge angle (LCEA) and Tönnis angle are essential in evaluating HD severity, patient-reported outcome measures (PROMs) offer insights into the subjective health impact on patients. AIM: To investigate the correlations between machine-learning automated and manual radiographic measurements of HD and PROMs, with the hypothesis that artificial intelligence (AI)-generated HD measurements indicating less severe dysplasia correlate with better PROMs. METHODS: Retrospective study evaluating 256 hips from 130 HD patients from a hip preservation clinic database. Manual and AI-derived radiographic measurements were collected, and PROMs such as the Harris hip score (HHS), international hip outcome tool (iHOT-12), 12-item Short Form survey (SF-12), and Visual Analogue Scale of the European Quality of Life Group survey were correlated using Spearman's rank-order correlation. RESULTS: The median patient age was 28.6 years (range 15.7-62.3 years), with 82.3% of patients being women and 17.7% being men. The median interpretation time was 4-12 minutes per patient for manual readers and 31 seconds for AI. Manual measurements exhibited weak correlations with HHS, including LCEA (r=0.18) and Tönnis angle (r=-0.24). AI-derived metrics showed similar weak correlations, with the most significant being Caput-Collum-Diaphyseal (CCD) angle with iHOT-12 at r=-0.25 (P=0.042) and CCD with SF-12 at r=0.25 (P=0.048). Other measured correlations were not significant (P>0.05). CONCLUSION: This study suggests AI can aid in HD assessment, but the weak correlations with PROMs highlight the continued importance of PROMs in predicting subjective health and outcomes, complementing AI-derived measurements in HD management.
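Spearman's rank-order correlation, the statistic used in this abstract, is the Pearson correlation computed on the ranks of the two variables, with tied values sharing their average rank. The sketch below shows that computation in plain Python. The function names and the sample numbers are hypothetical and are not data from the study; in practice a statistics library such as SciPy (`scipy.stats.spearmanr`) would normally be used, since it also reports a P value.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # mean of the 1-based ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def spearman(x, y):
    """Spearman's rho: Pearson correlation of the rank-transformed data."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = sum((a - mx) ** 2 for a in rx) ** 0.5
    sy = sum((b - my) ** 2 for b in ry) ** 0.5
    return cov / (sx * sy)


# Made-up example: an angle measurement against an outcome score.
rho = spearman([18.0, 22.5, 25.0, 30.0], [55, 60, 58, 72])
```

Because only ranks enter the calculation, the coefficient is insensitive to monotonic rescaling of either variable, which suits ordinal questionnaire scores.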
Abstract: Artificial intelligence (AI) using deep learning (DL) has emerged as a breakthrough computer technology. In the era of big data, the accumulation of an enormous number of digital images and medical records has driven the need to use AI to deal with these data efficiently, and they have become fundamental resources from which a machine can learn by itself. Among several DL models, the convolutional neural network has shown outstanding performance in image analysis. In the field of gastroenterology, physicians handle large amounts of clinical data and various kinds of imaging devices such as endoscopy and ultrasound. AI has been applied in gastroenterology for diagnosis, prognosis, and image analysis. However, potential inherent selection bias cannot be excluded in retrospective studies. Because overfitting and spectrum bias (class imbalance) can lead to overestimation of accuracy, external validation using datasets not used for model development, collected in a way that minimizes spectrum bias, is mandatory. For robust verification, prospective studies with adequate inclusion/exclusion criteria that represent the target populations are needed. DL also suffers from a lack of interpretability. Because interpretability is important in that it can provide safety measures, help to detect bias, and create social acceptance, further investigations should be performed.
Funding: Financially supported by the University of Macao Research Grant (MYRG2016-00038-ICMS-QRCM and MYRG2016-00040-ICMS-QRCM); the Macao Science and Technology Development Fund (FDCT) (Grant No. 103/2015/A3); and the National Natural Science Foundation of China (Grant No. 61562011).
Abstract: Oral disintegrating tablets (ODTs) are a novel dosage form that can be dissolved on the tongue within 3 min or less, especially for geriatric and pediatric patients. Current ODT formulation studies usually rely on the personal experience of pharmaceutical experts and trial and error in the laboratory, which is inefficient and time-consuming. The aim of the current research was to establish a prediction model for ODT formulations made by the direct compression process using artificial neural network (ANN) and deep neural network (DNN) techniques. Formulation data for 145 formulations were extracted from Web of Science. The data were divided into three parts: a training set (105 data), a validation set (20), and a testing set (20). ANN and DNN were compared for the prediction of the disintegrating time. The accuracy of the ANN model reached 85.60%, 80.00% and 75.00% on the training set, validation set and testing set, respectively, whereas that of the DNN model was 85.60%, 85.00% and 80.00%, respectively. Compared with the ANN, the DNN showed better prediction for ODT formulations. This is the first time that a deep neural network with the improved dataset selection algorithm has been applied to formulation prediction on small data. The proposed predictive approach could evaluate the critical parameters for quality control of formulations and guide research and process development. The implementation of this prediction model could effectively reduce the drug product development timeline and material usage, and proactively facilitate the development of a robust drug product.
Funding: Supported by the National Natural Science Foundation of China (61775140).
Abstract: Prevention is the most effective way to reduce dental caries. To provide a simple way to guide oral healthcare in daily life, a dual-channel, portable dental imaging system that combines white light with autofluorescence techniques was established. A group of volunteers was then recruited, and 7200 tooth pictures covering different stages of dental caries and dental plaque were taken and collected. In this work, customized convolutional neural networks (CNNs) were designed to classify dental images with early-stage caries and dental plaque. Eighty percent (n=6000) of the pictures were used for supervised training of the CNNs based on the advice of experienced dentists, and the remaining 20% (n=1200) were used as a test dataset to evaluate the trained CNNs. The accuracy, sensitivity and specificity were calculated to evaluate the performance of the CNNs. The accuracies for early-stage caries and dental plaque were 95.3% and 95.9%, respectively. These results show that the designed imaging system combined with the customized CNNs can automatically and efficiently find early caries and dental plaque on occlusal, lingual and buccal surfaces. Therefore, this will provide a novel approach to dental caries prevention for everyone in daily life.
Funding: Open Project Fund of Guangdong Provincial Key Laboratory of Regional Numerical Weather Prediction, CMA (J202009); Heavy Rain and Drought-Flood Disasters in Plateau and Basin Key Laboratory of Sichuan Province (SZKT202005); Innovation and Development Project of China Meteorological Administration (CXFZ2021J020).
Abstract: This paper proposes a simple and powerful optimal integration (OPI) method for improving the hourly quantitative precipitation forecasts (QPFs, 0-24 h) of a single model by integrating the benefits of different bias-corrected methods, using the high-resolution CMA-GD model from the Guangzhou Institute of Tropical and Marine Meteorology of the China Meteorological Administration (CMA). Three techniques are used to generate multi-method calibrated members for OPI: deep neural network (DNN), frequency-matching (FM), and optimal threat score (OTS). The results are as follows: (1) The QPF using DNN follows the basic physical patterns of CMA-GD. Despite providing superior improvements for clear-rainy and weak precipitation, DNN cannot improve the predictions for severe precipitation, while OTS can significantly strengthen these predictions. As a result, DNN and OTS are the optimal members to be incorporated into OPI. (2) The new approach achieves state-of-the-art performance on a single model for all magnitudes of precipitation. Compared with CMA-GD, OPI improves the threat score (TS) by 2.5%, 5.4%, 7.8%, 8.3%, and 6.1% for QPFs from clear-rainy to rainstorms in the verification dataset. Moreover, OPI shows good stability in the test dataset. (3) The rainstorm pattern of OPI relies heavily on the original model, and OPI cannot correct for deviations in the location of severe precipitation. Therefore, further improvements in predicting severe precipitation with this method depend on improving the forecasting capability of the numerical model.
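The threat score (TS) used as the verification metric in this abstract is conventionally defined as hits / (hits + misses + false alarms) for a given precipitation threshold. The sketch below computes it from paired forecast and observed values. It illustrates the standard definition only: the function name, the threshold, and the sample values are hypothetical, and the paper's exact thresholds and verification setup are not given in the abstract.

```python
def threat_score(forecast, observed, threshold):
    """Threat score for events defined as precipitation >= threshold.

    forecast, observed: equal-length sequences of precipitation amounts.
    Returns NaN when the event is neither forecast nor observed anywhere.
    """
    hits = misses = false_alarms = 0
    for f, o in zip(forecast, observed):
        forecast_event = f >= threshold
        observed_event = o >= threshold
        if forecast_event and observed_event:
            hits += 1
        elif observed_event:
            misses += 1
        elif forecast_event:
            false_alarms += 1
        # Correct negatives are deliberately ignored by the threat score.
    denominator = hits + misses + false_alarms
    return hits / denominator if denominator else float("nan")


# Made-up hourly amounts in mm with a 1.0 mm event threshold.
ts = threat_score([0.0, 1.2, 5.0, 0.3], [0.0, 2.0, 0.0, 1.5], threshold=1.0)
```

Since correct negatives do not enter the ratio, the score is not inflated by the many dry grid points, which is one reason it is common for rare, heavy-rain categories.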
Funding: Supported by the National Natural Science Foundation of China under Grant Nos. 12274092, 61871263, and 12034005; partially by the Explorer Program of Shanghai under Grant No. 21TS1400200; partially by the Natural Science Foundation of Shanghai under Grant No. 21ZR1405200; and partially by the Medical Engineering Fund of Fudan University under Grant No. YG2022-6. Mengyang Lu and Wei Shi contributed equally to this work.
Abstract: Automatic cell counting provides an effective tool for medical research and diagnosis. Currently, cell counting can be performed with a transmitted-light microscope; however, it requires expert knowledge, and the counting accuracy is unsatisfactory for overlapped cells. An image-translation-based detection method has since been proposed and has shown the potential to accomplish cell counting from transmitted-light microscopy automatically and effectively. In this work, a new deep-learning (DL)-based two-stage detection method (cGAN-YOLO) is designed to further enhance the performance of cell counting by combining a DL-based fluorescent image translation model and a DL-based cell detection model. The results show that cGAN-YOLO can effectively detect and count different types of cells from the acquired transmitted-light microscope images. Compared with the previously reported YOLO-based one-stage detection method, the cGAN-YOLO method achieves high recognition accuracy (RA), with an improvement of 29.80%. Furthermore, cGAN-YOLO obtains an improvement of 12.11% in RA compared with the previously reported image-translation-based detection method. In short, cGAN-YOLO makes it possible to implement cell counting directly from experimentally acquired transmitted-light microscopy images with high flexibility and performance, which extends its applicability in clinical research.