Heart disease remains a leading cause of mortality worldwide,emphasizing the urgent need for reliable and interpretable predictive models to support early diagnosis and timely intervention.However,existing Deep Learni...Heart disease remains a leading cause of mortality worldwide,emphasizing the urgent need for reliable and interpretable predictive models to support early diagnosis and timely intervention.However,existing Deep Learning(DL)approaches often face several limitations,including inefficient feature extraction,class imbalance,suboptimal classification performance,and limited interpretability,which collectively hinder their deployment in clinical settings.To address these challenges,we propose a novel DL framework for heart disease prediction that integrates a comprehensive preprocessing pipeline with an advanced classification architecture.The preprocessing stage involves label encoding and feature scaling.To address the issue of class imbalance inherent in the personal key indicators of the heart disease dataset,the localized random affine shadowsampling technique is employed,which enhances minority class representation while minimizing overfitting.At the core of the framework lies the Deep Residual Network(DeepResNet),which employs hierarchical residual transformations to facilitate efficient feature extraction and capture complex,non-linear relationships in the data.Experimental results demonstrate that the proposed model significantly outperforms existing techniques,achieving improvements of 3.26%in accuracy,3.16%in area under the receiver operating characteristics,1.09%in recall,and 1.07%in F1-score.Furthermore,robustness is validated using 10-fold crossvalidation,confirming the model’s generalizability across diverse data distributions.Moreover,model interpretability is ensured through the integration of Shapley additive explanations and local interpretable model-agnostic explanations,offering valuable insights into the contribution of individual features to model predictions.Overall,the proposed DL framework presents a robust,interpretable,and clinically applicable solution for heart disease prediction.展开更多
Numerous researches have been published on the application of landslide susceptibility assessment models;however,they were only applied in the same areas as the models were originated,the effect of applying the models...Numerous researches have been published on the application of landslide susceptibility assessment models;however,they were only applied in the same areas as the models were originated,the effect of applying the models to other areas than the origin of the models has not been explored.This study is purposed to develop an optimized random forest(RF)model with best ratios of positive-to-negative cells and 10-fold cross-validation for landslide susceptibility mapping(LSM),and then explore its generalization ability not only in the area where the model is originated but also in area other than the origin of the model.Two typical counties(Fengjie County and Wushan County)in the Three Gorges Reservoir area,China,which have the same terrain and geological conditions,were selected as an example.To begin with,landslide inventory was prepared based on field investigations,satellite images,and historical records,and 1522 landslides were then identified in Fengjie County.22 landslide-conditioning factors under the influence of topography,geology,environmental conditions,and human activities were prepared.Then,combined with 10-fold cross-validation,three typical ratios of positive-to-negative cells,i.e.,1:1,1:5,and 1:10,were adopted for comparative analyses.An optimized RF model(Fengjie-based model)with the best ratios of positive-to-negative cells and 10-fold cross-validation was constructed.Finally,the Fengjie-based model was applied to Fengjie County and Wushan County,and the confusion matrix and area under the receiver operating characteristic(ROC)curve value(AUC)were used to estimate the accuracy.The Fengjie-based model delivered high stability and predictive capability in Fengjie County,indicating a great generalization ability of the model to the area where the model is originated.The LSM in Wushan County generated by the Fengjie-based model had a reasonable reference value,indicating the Fengjiebased model had a great generalization ability in area other than the origin of the model.The Fengjiebased model in this study could be applied in other similar areas/countries with the same terrain and geological conditions,and a LSM may be generated without collecting landslide information for modeling,so as to reduce workload and improve efficiency in practice.展开更多
Predicting neuron growth is valuable to understand the morphology of neurons, thus it is helpful in the research of neuron classification. This study sought to propose a new method of predicting the growth of human ne...Predicting neuron growth is valuable to understand the morphology of neurons, thus it is helpful in the research of neuron classification. This study sought to propose a new method of predicting the growth of human neurons using 1 907 sets of data in human brain pyramidal neurons obtained from the website of NeuroMorpho.Org. First, we analyzed neurons in a morphology field and used an expectation-maximization algorithm to specify the neurons into six clusters. Second, naive Bayes classifier was used to verify the accuracy of the expectation-maximization algorithm. Experiment results proved that the cluster groups here were efficient and feasible. Finally, a new method to rank the six expectation-maximization algorithm clustered classes was used in predicting the growth of human pyramidal neurons.展开更多
基金funded by Ongoing Research Funding Program for Project number(ORF-2025-648),King Saud University,Riyadh,Saudi Arabia.
文摘Heart disease remains a leading cause of mortality worldwide,emphasizing the urgent need for reliable and interpretable predictive models to support early diagnosis and timely intervention.However,existing Deep Learning(DL)approaches often face several limitations,including inefficient feature extraction,class imbalance,suboptimal classification performance,and limited interpretability,which collectively hinder their deployment in clinical settings.To address these challenges,we propose a novel DL framework for heart disease prediction that integrates a comprehensive preprocessing pipeline with an advanced classification architecture.The preprocessing stage involves label encoding and feature scaling.To address the issue of class imbalance inherent in the personal key indicators of the heart disease dataset,the localized random affine shadowsampling technique is employed,which enhances minority class representation while minimizing overfitting.At the core of the framework lies the Deep Residual Network(DeepResNet),which employs hierarchical residual transformations to facilitate efficient feature extraction and capture complex,non-linear relationships in the data.Experimental results demonstrate that the proposed model significantly outperforms existing techniques,achieving improvements of 3.26%in accuracy,3.16%in area under the receiver operating characteristics,1.09%in recall,and 1.07%in F1-score.Furthermore,robustness is validated using 10-fold crossvalidation,confirming the model’s generalizability across diverse data distributions.Moreover,model interpretability is ensured through the integration of Shapley additive explanations and local interpretable model-agnostic explanations,offering valuable insights into the contribution of individual features to model predictions.Overall,the proposed DL framework presents a robust,interpretable,and clinically applicable solution for heart disease prediction.
基金the National Natural Science Foundation of China(No.41807498)the National Key Research and Development Program of China(No.2018YFC1505501)the Humanities and Social Sciences Foundation of the Ministry of Education of China(No.20XJAZH002)。
文摘Numerous researches have been published on the application of landslide susceptibility assessment models;however,they were only applied in the same areas as the models were originated,the effect of applying the models to other areas than the origin of the models has not been explored.This study is purposed to develop an optimized random forest(RF)model with best ratios of positive-to-negative cells and 10-fold cross-validation for landslide susceptibility mapping(LSM),and then explore its generalization ability not only in the area where the model is originated but also in area other than the origin of the model.Two typical counties(Fengjie County and Wushan County)in the Three Gorges Reservoir area,China,which have the same terrain and geological conditions,were selected as an example.To begin with,landslide inventory was prepared based on field investigations,satellite images,and historical records,and 1522 landslides were then identified in Fengjie County.22 landslide-conditioning factors under the influence of topography,geology,environmental conditions,and human activities were prepared.Then,combined with 10-fold cross-validation,three typical ratios of positive-to-negative cells,i.e.,1:1,1:5,and 1:10,were adopted for comparative analyses.An optimized RF model(Fengjie-based model)with the best ratios of positive-to-negative cells and 10-fold cross-validation was constructed.Finally,the Fengjie-based model was applied to Fengjie County and Wushan County,and the confusion matrix and area under the receiver operating characteristic(ROC)curve value(AUC)were used to estimate the accuracy.The Fengjie-based model delivered high stability and predictive capability in Fengjie County,indicating a great generalization ability of the model to the area where the model is originated.The LSM in Wushan County generated by the Fengjie-based model had a reasonable reference value,indicating the Fengjiebased model had a great generalization ability in area other than the origin of the model.The Fengjiebased model in this study could be applied in other similar areas/countries with the same terrain and geological conditions,and a LSM may be generated without collecting landslide information for modeling,so as to reduce workload and improve efficiency in practice.
基金supported by the National Natural Science Foundation of China,No.10872069
文摘Predicting neuron growth is valuable to understand the morphology of neurons, thus it is helpful in the research of neuron classification. This study sought to propose a new method of predicting the growth of human neurons using 1 907 sets of data in human brain pyramidal neurons obtained from the website of NeuroMorpho.Org. First, we analyzed neurons in a morphology field and used an expectation-maximization algorithm to specify the neurons into six clusters. Second, naive Bayes classifier was used to verify the accuracy of the expectation-maximization algorithm. Experiment results proved that the cluster groups here were efficient and feasible. Finally, a new method to rank the six expectation-maximization algorithm clustered classes was used in predicting the growth of human pyramidal neurons.