Surface-supported clusters forming by aggregation of excessive adatoms could be the main defects of 2D materials after chemical vapor deposition.They will significantly impact the electronic/magnetic properties.Moreov...Surface-supported clusters forming by aggregation of excessive adatoms could be the main defects of 2D materials after chemical vapor deposition.They will significantly impact the electronic/magnetic properties.Moreover,surface supported atoms are also widely explored for high active and selecting catalysts.Severe deformation,even dipping into the surface,of these clusters can be expected because of the very active edge of clusters and strong interaction between supported clusters and surfaces.However,most models of these clusters are supposed to simply float on the top of the surface because ab initio simulations cannot afford the complex reconstructions.Here,we develop an accurate graph neural network machine learning potential(MLP)from ab initio data by active learning architecture through fine-tuning pre-trained models,and then employ the MLP into Monte Carlo to explore the structural evolutions of Mo and S clusters(1-8 atoms)on perfect and various defective MoS2 monolayers.Interestingly,Mo clusters can always sink and embed themselves into MoS2 layers.In contrast,S clusters float on perfect surfaces.On the defective surface,a few S atoms will fill the vacancy and rest S clusters float on the top.Such significant structural reconstructions should be carefully taken into account.展开更多
In materials science,a significant correlation often exists between material input parameters and their corresponding performance attributes.Nevertheless,the inherent challenges associated with small data obscure thes...In materials science,a significant correlation often exists between material input parameters and their corresponding performance attributes.Nevertheless,the inherent challenges associated with small data obscure these statistical correlations,impeding machine learning models from effectively capturing the underlying patterns,thereby hampering efficient optimization of material properties.This work presents a novel active learning framework that integrates generative adversarial networks(GAN)with a directionally constrained expected absolute improvement(EAI)acquisition function to accelerate the discovery of ultra-high temperature ceramics(UHTCs)using small data.The framework employs GAN for data augmentation,symbolic regression for feature weight derivation,and a self-developed EAI function that incorporates input feature importance weighting to quantify bidirectional deviations from zero ablation rate.Through only two iterations,this framework successfully identified the optimal composition of HfB_(2)-3.52SiC-5.23TaSi_(2),which exhibits robust near-zero ablation rates under plasma ablation at 2500℃ for 200 s,demonstrating superior sampling efficiency compared to conventional active learning approaches.Microstructural analysis reveals that the exceptional performance stems from the formation of a highly viscous HfO_(2)-SiO_(2)-Ta_(2)O_(5)-HfSiO_(4)-Hf_(3)(BO_(3))_(4) oxide layer,which provides effective oxygen barrier protection.This work demonstrates an efficient and universal approach for rapid materials discovery using small data.展开更多
Industrial decarbonization is critical for achieving net-zero goals.The carbon dioxide electrochemical reduction reaction(CO_(2)RR)is a promising approach for converting CO_(2)into high-value chemicals,offering the po...Industrial decarbonization is critical for achieving net-zero goals.The carbon dioxide electrochemical reduction reaction(CO_(2)RR)is a promising approach for converting CO_(2)into high-value chemicals,offering the potential for decarbonizing industrial processes toward a sustainable,carbon-neutral future.However,developing CO_(2)RR catalysts with high selectivity and activity remains a challenge due to the complexity of finding such catalysts and the inefficiency of traditional computational or experimental approaches.Here,we present a methodology integrating density functional theory(DFT)calculations,deep learning models,and an active learning strategy to rapidly screen high-performance catalysts.The proposed methodology is then demonstrated on graphene-based single-atom catalysts for selective CO_(2)electroreduction to methanol.First,we conduct systematic binding energy calculations for 3045 single-atom catalysts to identify thermodynamically stable catalysts as the design space.We then use a graph neural network,fine-tuned with a specialized adsorption energy database,to predict the relative activity and selectivity of the candidate catalysts.An autonomous active learning framework is used to facilitate the exploration of designs.After six learning cycles and 2180 adsorption calculations across 15 intermediates,we develop a surrogate model that identifies four novel catalysts on the Pareto front of activity and selectivity.Our work demonstrates the effectiveness of leveraging a domain foundation model with an active learning framework and holds potential to significantly accelerate the discovery of high-performance CO_(2)RR catalysts.展开更多
Dynamical systems often exhibit multiple attractors representing significantly different functioning conditions.A global map of attraction basins can offer valuable guidance for stabilizing or transitioning system sta...Dynamical systems often exhibit multiple attractors representing significantly different functioning conditions.A global map of attraction basins can offer valuable guidance for stabilizing or transitioning system states.Such a map can be constructed without prior system knowledge by identifying attractors across a sufficient number of points in the state space.However,determining the attractor for each initial state can be a laborious task.Here,we tackle the challenge of reconstructing attraction basins using as few initial points as possible.In each iteration of our approach,informative points are selected through random seeding and are driven along the current classification boundary,promoting the eventual selection of points that are both diverse and enlightening.The results across various experimental dynamical systems demonstrate that our approach requires fewer points than baseline methods while achieving comparable mapping accuracy.Additionally,the reconstructed map allows us to accurately estimate the minimum escape distance required to transition the system state to a target basin.展开更多
Human Activity Recognition(HAR)has become increasingly critical in civic surveillance,medical care monitoring,and institutional protection.Current deep learning-based approaches often suffer from excessive computation...Human Activity Recognition(HAR)has become increasingly critical in civic surveillance,medical care monitoring,and institutional protection.Current deep learning-based approaches often suffer from excessive computational complexity,limited generalizability under varying conditions,and compromised real-time performance.To counter these,this paper introduces an Active Learning-aided Heuristic Deep Spatio-Textural Ensemble Learning(ALH-DSEL)framework.The model initially identifies keyframes from the surveillance videos with a Multi-Constraint Active Learning(MCAL)approach,with features extracted from DenseNet121.The frames are then segmented employing an optimized Fuzzy C-Means clustering algorithm with Firefly to identify areas of interest.A deep ensemble feature extractor,comprising DenseNet121,EfficientNet-B7,MobileNet,and GLCM,extracts varied spatial and textural features.Fused characteristics are enhanced through PCA and Min-Max normalization and discriminated by a maximum voting ensemble of RF,AdaBoost,and XGBoost.The experimental results show that ALH-DSEL provides higher accuracy,precision,recall,and F1-score,validating its superiority for real-time HAR in surveillance scenarios.展开更多
To capture the nonlinear dynamics and gain evolution in chirped pulse amplification(CPA)systems,the split-step Fourier method and the fourth-order Runge–Kutta method are integrated to iteratively address the generali...To capture the nonlinear dynamics and gain evolution in chirped pulse amplification(CPA)systems,the split-step Fourier method and the fourth-order Runge–Kutta method are integrated to iteratively address the generalized nonlinear Schrödinger equation and the rate equations.However,this approach is burdened by substantial computational demands,resulting in significant time expenditures.In the context of intelligent laser optimization and inverse design,the necessity for numerous simulations further exacerbates this issue,highlighting the need for fast and accurate simulation methodologies.Here,we introduce an end-to-end model augmented with active learning(E2E-AL)with decent generalization through different dedicated embedding methods over various parameters.On an identical computational platform,the artificial intelligence–driven model is 2000 times faster than the conventional simulation method.Benefiting from the active learning strategy,the E2E-AL model achieves decent precision with only two-thirds of the training samples compared with the case without such a strategy.Furthermore,we demonstrate a multi-objective inverse design of the CPA systems enabled by the E2E-AL model.The E2E-AL framework manifests the potential of becoming a standard approach for the rapid and accurate modeling of ultrafast lasers and is readily extended to simulate other complex systems.展开更多
The Reliability-Based Design Optimization(RBDO)of complex engineering structures considering uncertainties has problems of being high-dimensional,highly nonlinear,and timeconsuming,which requires a significant amount ...The Reliability-Based Design Optimization(RBDO)of complex engineering structures considering uncertainties has problems of being high-dimensional,highly nonlinear,and timeconsuming,which requires a significant amount of sampling simulation computation.In this paper,a basis-adaptive Polynomial Chaos(PC)-Kriging surrogate model is proposed,in order to relieve the computational burden and enhance the predictive accuracy of a metamodel.The active learning basis-adaptive PC-Kriging model is combined with a quantile-based RBDO framework.Finally,five engineering cases have been implemented,including a benchmark RBDO problem,three high-dimensional explicit problems,and a high-dimensional implicit problem.Compared with Support Vector Regression(SVR),Kriging,and polynomial chaos expansion models,results show that the proposed basis-adaptive PC-Kriging model is more accurate and efficient for RBDO problems of complex engineering structures.展开更多
One-step direct production of methanol from methane and water(PMMW)under mild conditions is challenging in heterogeneous catalysis owing to the absence of highly effective catalysts.Herein,we designed a series of“Sin...One-step direct production of methanol from methane and water(PMMW)under mild conditions is challenging in heterogeneous catalysis owing to the absence of highly effective catalysts.Herein,we designed a series of“Single-Atom”-“Frustrated Lewis Pair”(SA-FLP)dual active sites for the direct PMMW via density functional theory(DFT)calculations combined with a machine learning(ML)approach.The results indicate that the nine designed SA-FLP catalysts are capable of efficiently activate CH4 and H_(2)O and facilitate the coupling of OH^(*)and CH_(3)^(*)into methanol.The DFT-based microkinetic simulation(MKM)results indicate that CH_(3)OH production on Co1-FLP and Pt1-FLP catalysts can reach the turnover frequencies(TOFs)of 1.01×10^(−3)s^(-1)and 8.80×10^(−4)s^(-1),respectively,which exceed the experimentally reported values by three orders of magnitude.ML results unveil that the gradient boosted regression model with 13 simple features could give satisfactory predictions for the TOFs of CH_(3)OH production with RMSE and R^(2)of 0.009 s^(-1)and 1.00,respectively.The ML-predicted MKM results indicate that four catalysts including V_(1-),Fe_(1-),Ti_(1-),and Mn_(1)-FLP exhibit higher TOFs of CH_(3)OH production than the value that the most relevant experiments reported,indicating that the four catalysts are also promising catalysts for the PMMW.This study not only develops a simple and efficient approach for design and screening SA-FLP catalysts but also provides mechanistic insights into the direct PMMW.展开更多
Objective:Deep learning(DL)has become the prevailing method in chest radiograph analysis,yet its performance heavily depends on large quantities of annotated images.To mitigate the cost,cold-start active learning(AL),...Objective:Deep learning(DL)has become the prevailing method in chest radiograph analysis,yet its performance heavily depends on large quantities of annotated images.To mitigate the cost,cold-start active learning(AL),comprising an initialization followed by subsequent learning,selects a small subset of informative data points for labeling.Recent advancements in pretrained models by supervised or self-supervised learning tailored to chest radiograph have shown broad applicability to diverse downstream tasks.However,their potential in cold-start AL remains unexplored.Methods:To validate the efficacy of domain-specific pretraining,we compared two foundation models:supervised TXRV and self-supervised REMEDIS with their general domain counterparts pretrained on ImageNet.Model performance was evaluated at both initialization and subsequent learning stages on two diagnostic tasks:psychiatric pneumonia and COVID-19.For initialization,we assessed their integration with three strategies:diversity,uncertainty,and hybrid sampling.For subsequent learning,we focused on uncertainty sampling powered by different pretrained models.We also conducted statistical tests to compare the foundation models with ImageNet counterparts,investigate the relationship between initialization and subsequent learning,examine the performance of one-shot initialization against the full AL process,and investigate the influence of class balance in initialization samples on initialization and subsequent learning.Results:First,domain-specific foundation models failed to outperform ImageNet counterparts in six out of eight experiments on informative sample selection.Both domain-specific and general pretrained models were unable to generate representations that could substitute for the original images as model inputs in seven of the eight scenarios.However,pretrained model-based initialization surpassed random sampling,the default approach in cold-start AL.Second,initialization performance was positively correlated with subsequent learning performance,highlighting the importance of initialization strategies.Third,one-shot initialization performed comparably to the full AL process,demonstrating the potential of reducing experts'repeated waiting during AL iterations.Last,a U-shaped correlation was observed between the class balance of initialization samples and model performance,suggesting that the class balance is more strongly associated with performance at middle budget levels than at low or high budgets.Conclusions:In this study,we highlighted the limitations of medical pretraining compared to general pretraining in the context of cold-start AL.We also identified promising outcomes related to cold-start AL,including initialization based on pretrained models,the positive influence of initialization on subsequent learning,the potential for one-shot initialization,and the influence of class balance on middle-budget AL.Researchers are encouraged to improve medical pretraining for versatile DL foundations and explore novel AL methods.展开更多
1.Colors of chemical reaction engineering models Kinetic models of chemical reactions are a crucial asset for understanding and optimizing chemical processes[1].These models are critical for reactor design,process opt...1.Colors of chemical reaction engineering models Kinetic models of chemical reactions are a crucial asset for understanding and optimizing chemical processes[1].These models are critical for reactor design,process optimization,catalyst design,scale-up,and process control,making them indispensable in the chemical industry.Kinetic models predict the change in temperature and concentration of the relevant species,given an actual concentration and temperature.Reaction predictions are made by integrating the kinetic model with a reactor model,which accounts for external constraints,such as flow,inlet concentration。展开更多
Support vector machines(SVMs) are a popular class of supervised learning algorithms, and are particularly applicable to large and high-dimensional classification problems. Like most machine learning methods for data...Support vector machines(SVMs) are a popular class of supervised learning algorithms, and are particularly applicable to large and high-dimensional classification problems. Like most machine learning methods for data classification and information retrieval, they require manually labeled data samples in the training stage. However, manual labeling is a time consuming and errorprone task. One possible solution to this issue is to exploit the large number of unlabeled samples that are easily accessible via the internet. This paper presents a novel active learning method for text categorization. The main objective of active learning is to reduce the labeling effort, without compromising the accuracy of classification, by intelligently selecting which samples should be labeled.The proposed method selects a batch of informative samples using the posterior probabilities provided by a set of multi-class SVM classifiers, and these samples are then manually labeled by an expert. Experimental results indicate that the proposed active learning method significantly reduces the labeling effort, while simultaneously enhancing the classification accuracy.展开更多
This paper describes a new method for active learning in content-based image retrieval. The proposed method firstly uses support vector machine (SVM) classifiers to learn an initial query concept. Then the proposed ac...This paper describes a new method for active learning in content-based image retrieval. The proposed method firstly uses support vector machine (SVM) classifiers to learn an initial query concept. Then the proposed active learning scheme employs similarity measure to check the current version space and selects images with maximum expected information gain to solicit user's label. Finally, the learned query is refined based on the user's further feedback. With the combination of SVM classifier and similarity measure, the proposed method can alleviate model bias existing in each of them. Our experiments on several query concepts show that the proposed method can learn the user's query concept quickly and effectively only with several iterations.展开更多
Activity cliffs(ACs)are generally defined as pairs of similar compounds that only differ by a minor structural modification but exhibit a large difference in their binding affinity for a given target.ACs offer crucial...Activity cliffs(ACs)are generally defined as pairs of similar compounds that only differ by a minor structural modification but exhibit a large difference in their binding affinity for a given target.ACs offer crucial insights that aid medicinal chemists in optimizing molecular structures.Nonetheless,they also form a major source of prediction error in structure-activity relationship(SAR)models.To date,several studies have demonstrated that deep neural networks based on molecular images or graphs might need to be improved further in predicting the potency of ACs.In this paper,we integrated the triplet loss in face recognition with pre-training strategy to develop a prediction model ACtriplet,tailored for ACs.Through extensive comparison with multiple baseline models on 30 benchmark datasets,the results showed that ACtriplet was significantly better than those deep learning(DL)models without pretraining.In addition,we explored the effect of pre-training on data representation.Finally,the case study demonstrated that our model's interpretability module could explain the prediction results reasonably.In the dilemma that the amount of data could not be increased rapidly,this innovative framework would better make use of the existing data,which would propel the potential of DL in the early stage of drug discovery and optimization.展开更多
Education philosophy,which plays a major role in teacher beliefs,is crucial for teacher educators. Ever since the end of last century, there has been a trend in North America to promote, or restore active learning in ...Education philosophy,which plays a major role in teacher beliefs,is crucial for teacher educators. Ever since the end of last century, there has been a trend in North America to promote, or restore active learning in college/university classrooms. This paper, based on some explorations of the social constructivism, the six assumptions in Andragogy, the ARCS model, the active learning theory and some practical activities, proposes an integrated education philosophy about adult learning. It is assumed this will be able to provide some insight and guidance for college practitioners.展开更多
In active-learning classroom,the students play an active role in learning,and they are not only actively participating in classroom activities,but are interacting with the teacher and their peers.The teachers shoul tr...In active-learning classroom,the students play an active role in learning,and they are not only actively participating in classroom activities,but are interacting with the teacher and their peers.The teachers shoul try to avoid a noisy class with no real effects at all and they should help those who are poor at expressing in English.They should be encouraged to express themselves.To conclude,active-study method has brought us reward.展开更多
Automated recognition of violent activities from videos is vital for public safety,but often raises significant privacy concerns due to the sensitive nature of the footage.Moreover,resource constraints often hinder th...Automated recognition of violent activities from videos is vital for public safety,but often raises significant privacy concerns due to the sensitive nature of the footage.Moreover,resource constraints often hinder the deployment of deep learning-based complex video classification models on edge devices.With this motivation,this study aims to investigate an effective violent activity classifier while minimizing computational complexity,attaining competitive performance,and mitigating user data privacy concerns.We present a lightweight deep learning architecture with fewer parameters for efficient violent activity recognition.We utilize a two-stream formation of 3D depthwise separable convolution coupled with a linear self-attention mechanism for effective feature extraction,incorporating federated learning to address data privacy concerns.Experimental findings demonstrate the model’s effectiveness with test accuracies from 96%to above 97%on multiple datasets by incorporating the FedProx aggregation strategy.These findings underscore the potential to develop secure,efficient,and reliable solutions for violent activity recognition in real-world scenarios.展开更多
In the field of optoelectronics,certain types of data may be difficult to accurately annotate,such as high-resolution optoelectronic imaging or imaging in certain special spectral ranges.Weakly supervised learning can...In the field of optoelectronics,certain types of data may be difficult to accurately annotate,such as high-resolution optoelectronic imaging or imaging in certain special spectral ranges.Weakly supervised learning can provide a more reliable approach in these situations.Current popular approaches mainly adopt the classification-based class activation maps(CAM)as initial pseudo labels to solve the task.展开更多
The development of fungicides is time-consuming and costly.Introducing a fungicide-likeness assessment strategy at the early screening stage can help reduce development risks and improve the success rate.However,exist...The development of fungicides is time-consuming and costly.Introducing a fungicide-likeness assessment strategy at the early screening stage can help reduce development risks and improve the success rate.However,existing assessment methods are often plagued by low accuracy and poor generalization,while fragment-based design strategies commonly fail to account for synergistic effects between structural units.Therefore,based on a small-scale sample set,this study developed a more efficient global predictive model for fungicidal activity—-named APPf—by integrating multi-scale feature screening methods and machine learning algorithms,which also accounts for synergistic effects among different structural fragments.We utilized three independent external test sets for model validation:External Test Set 1 for general validation,External Test Set 2 for comparison with existing models,and External Test Set 3 for disease-specific fungicide evaluation.On External Test Set 1,the APPf model achieved a precision of 0.6454,a recall of 0.8535,and an F1 score of 0.7350,demonstrating its robust predictive performance.It also exhibited strong enrichment capability for positive samples in External Test Set 2.For External Test Set 3,APPf achieved a prediction accuracy exceeding 80%for each disease,suggesting its promising potential in practical fungicide development.Furthermore,we quantified the contribution of molecular descriptors to the model predictions using SHAP value analysis and identified nHdNH and NssssNp as strong indicative features for predicting fungicidal activity,thereby enhancing the interpretability of the model.APPf has been deployed on a public web server(http://pesticides.cau.edu.cn/APPf),providing a user-friendly online prediction service to support the discovery of novel fungicides.Meanwhile,we employed a molecular fragmentation strategy to analyze the co-occurrence relationships between fragments in fungicides and constructed a network map of fragment co-occurrence associated with fungicidal activity.This study provides both an active fragment library and a global fungicide-likeness assessment tool for AI-based de novo molecular generation aimed at discovering novel fungicidal leads,which is expected to enhance the efficiency of developing new fungicides.展开更多
This study applied machine learning methods to predict the durability performance(specifically shrinkage and freeze-thaw resistance)of solid waste-activated cementitious materials.It also offered insights for optimizi...This study applied machine learning methods to predict the durability performance(specifically shrinkage and freeze-thaw resistance)of solid waste-activated cementitious materials.It also offered insights for optimizing material formulations through feature impact analysis.The study collected a total of 130 sets of shrinkage data and 106 sets of freeze-thaw data,establishing various models,including BP,GA-BP,SVM,RF,RBF,and LSTM.The results revealed that the SVM model performed the best on the test dataset.It achieved an R^(2) of 0.9358 for shrinkage prediction,with MAE and RMSE values of 0.4644 and 0.6254,respectively.Regarding freeze-thaw quality loss prediction,the R^(2) was 0.9178,with MAE and RMSE values of 0.3139 and 0.5328,respectively.The study analyzed the impact of different features on the outcomes using the SHAP method,highlighting that the alkaline activator dosage,Al_(2)O_(3),SiO_(2),and water glass modulus were critical factors influencing shrinkage,while CaO,water-cement ratio,water,and Al_(2)O_(3) were crucial for freeze-thaw resistance.By investigating feature interactions through single-factor and two-factor analysis,the study proposed recommendations for optimizing material formulations.This research validated the efficacy of machine learning in predicting the durability of solid waste cementitious materials and offered insights for material optimization through feature impact analysis,thereby laying the groundwork for the development of related materials.展开更多
基金supported by the National Natural Science Foundation of China(Grant No.12374253,12074053,12004064)J.G.thanks the Foreign talents project(G2022127004L),The authors also acknowledge computer support from the Shanghai Supercomputer Center,the DUT Supercomputing Center,and the Tianhe supercomputer of Tianjin Center.
文摘Surface-supported clusters forming by aggregation of excessive adatoms could be the main defects of 2D materials after chemical vapor deposition.They will significantly impact the electronic/magnetic properties.Moreover,surface supported atoms are also widely explored for high active and selecting catalysts.Severe deformation,even dipping into the surface,of these clusters can be expected because of the very active edge of clusters and strong interaction between supported clusters and surfaces.However,most models of these clusters are supposed to simply float on the top of the surface because ab initio simulations cannot afford the complex reconstructions.Here,we develop an accurate graph neural network machine learning potential(MLP)from ab initio data by active learning architecture through fine-tuning pre-trained models,and then employ the MLP into Monte Carlo to explore the structural evolutions of Mo and S clusters(1-8 atoms)on perfect and various defective MoS2 monolayers.Interestingly,Mo clusters can always sink and embed themselves into MoS2 layers.In contrast,S clusters float on perfect surfaces.On the defective surface,a few S atoms will fill the vacancy and rest S clusters float on the top.Such significant structural reconstructions should be carefully taken into account.
基金supported by the Natural Science Foundation of China[grant numbers 52302093]Natural Science Foundation of Jiangxi Province[grant numbers 20224BAB204021].
文摘In materials science,a significant correlation often exists between material input parameters and their corresponding performance attributes.Nevertheless,the inherent challenges associated with small data obscure these statistical correlations,impeding machine learning models from effectively capturing the underlying patterns,thereby hampering efficient optimization of material properties.This work presents a novel active learning framework that integrates generative adversarial networks(GAN)with a directionally constrained expected absolute improvement(EAI)acquisition function to accelerate the discovery of ultra-high temperature ceramics(UHTCs)using small data.The framework employs GAN for data augmentation,symbolic regression for feature weight derivation,and a self-developed EAI function that incorporates input feature importance weighting to quantify bidirectional deviations from zero ablation rate.Through only two iterations,this framework successfully identified the optimal composition of HfB_(2)-3.52SiC-5.23TaSi_(2),which exhibits robust near-zero ablation rates under plasma ablation at 2500℃ for 200 s,demonstrating superior sampling efficiency compared to conventional active learning approaches.Microstructural analysis reveals that the exceptional performance stems from the formation of a highly viscous HfO_(2)-SiO_(2)-Ta_(2)O_(5)-HfSiO_(4)-Hf_(3)(BO_(3))_(4) oxide layer,which provides effective oxygen barrier protection.This work demonstrates an efficient and universal approach for rapid materials discovery using small data.
基金supported by the National Key Research and Development Program of China(2022ZD0117501)the Scientific Research Innovation Capability Support Project for Young Faculty(ZYGXQNJSKYCXNLZCXM-E7)the Tsinghua University Initiative Scientific Research Program and the Carbon Neutrality and Energy System Transformation(CNEST)Program led by Tsinghua University.
文摘Industrial decarbonization is critical for achieving net-zero goals.The carbon dioxide electrochemical reduction reaction(CO_(2)RR)is a promising approach for converting CO_(2)into high-value chemicals,offering the potential for decarbonizing industrial processes toward a sustainable,carbon-neutral future.However,developing CO_(2)RR catalysts with high selectivity and activity remains a challenge due to the complexity of finding such catalysts and the inefficiency of traditional computational or experimental approaches.Here,we present a methodology integrating density functional theory(DFT)calculations,deep learning models,and an active learning strategy to rapidly screen high-performance catalysts.The proposed methodology is then demonstrated on graphene-based single-atom catalysts for selective CO_(2)electroreduction to methanol.First,we conduct systematic binding energy calculations for 3045 single-atom catalysts to identify thermodynamically stable catalysts as the design space.We then use a graph neural network,fine-tuned with a specialized adsorption energy database,to predict the relative activity and selectivity of the candidate catalysts.An autonomous active learning framework is used to facilitate the exploration of designs.After six learning cycles and 2180 adsorption calculations across 15 intermediates,we develop a surrogate model that identifies four novel catalysts on the Pareto front of activity and selectivity.Our work demonstrates the effectiveness of leveraging a domain foundation model with an active learning framework and holds potential to significantly accelerate the discovery of high-performance CO_(2)RR catalysts.
基金supported by the National Natural Science Foundation of China(Grant Nos.T2225022,12350710786,62088101,and 12161141016)Shuguang Program of Shanghai Education Development Foundation(Grant No.22SG21)Shanghai Municipal Education Commission,and the Fundamental Research Funds for the Central Universities。
文摘Dynamical systems often exhibit multiple attractors representing significantly different functioning conditions.A global map of attraction basins can offer valuable guidance for stabilizing or transitioning system states.Such a map can be constructed without prior system knowledge by identifying attractors across a sufficient number of points in the state space.However,determining the attractor for each initial state can be a laborious task.Here,we tackle the challenge of reconstructing attraction basins using as few initial points as possible.In each iteration of our approach,informative points are selected through random seeding and are driven along the current classification boundary,promoting the eventual selection of points that are both diverse and enlightening.The results across various experimental dynamical systems demonstrate that our approach requires fewer points than baseline methods while achieving comparable mapping accuracy.Additionally,the reconstructed map allows us to accurately estimate the minimum escape distance required to transition the system state to a target basin.
文摘Human Activity Recognition(HAR)has become increasingly critical in civic surveillance,medical care monitoring,and institutional protection.Current deep learning-based approaches often suffer from excessive computational complexity,limited generalizability under varying conditions,and compromised real-time performance.To counter these,this paper introduces an Active Learning-aided Heuristic Deep Spatio-Textural Ensemble Learning(ALH-DSEL)framework.The model initially identifies keyframes from the surveillance videos with a Multi-Constraint Active Learning(MCAL)approach,with features extracted from DenseNet121.The frames are then segmented employing an optimized Fuzzy C-Means clustering algorithm with Firefly to identify areas of interest.A deep ensemble feature extractor,comprising DenseNet121,EfficientNet-B7,MobileNet,and GLCM,extracts varied spatial and textural features.Fused characteristics are enhanced through PCA and Min-Max normalization and discriminated by a maximum voting ensemble of RF,AdaBoost,and XGBoost.The experimental results show that ALH-DSEL provides higher accuracy,precision,recall,and F1-score,validating its superiority for real-time HAR in surveillance scenarios.
基金supported by the National Natural Science Foundation of China(Grant Nos.62227821,62025503,and 62205199).
文摘To capture the nonlinear dynamics and gain evolution in chirped pulse amplification(CPA)systems,the split-step Fourier method and the fourth-order Runge–Kutta method are integrated to iteratively address the generalized nonlinear Schrödinger equation and the rate equations.However,this approach is burdened by substantial computational demands,resulting in significant time expenditures.In the context of intelligent laser optimization and inverse design,the necessity for numerous simulations further exacerbates this issue,highlighting the need for fast and accurate simulation methodologies.Here,we introduce an end-to-end model augmented with active learning(E2E-AL)with decent generalization through different dedicated embedding methods over various parameters.On an identical computational platform,the artificial intelligence–driven model is 2000 times faster than the conventional simulation method.Benefiting from the active learning strategy,the E2E-AL model achieves decent precision with only two-thirds of the training samples compared with the case without such a strategy.Furthermore,we demonstrate a multi-objective inverse design of the CPA systems enabled by the E2E-AL model.The E2E-AL framework manifests the potential of becoming a standard approach for the rapid and accurate modeling of ultrafast lasers and is readily extended to simulate other complex systems.
基金supported by the National Key R&D Program of China(No.2021YFB1715000)the National Natural Science Foundation of China(No.52375073)。
文摘The Reliability-Based Design Optimization(RBDO)of complex engineering structures considering uncertainties has problems of being high-dimensional,highly nonlinear,and timeconsuming,which requires a significant amount of sampling simulation computation.In this paper,a basis-adaptive Polynomial Chaos(PC)-Kriging surrogate model is proposed,in order to relieve the computational burden and enhance the predictive accuracy of a metamodel.The active learning basis-adaptive PC-Kriging model is combined with a quantile-based RBDO framework.Finally,five engineering cases have been implemented,including a benchmark RBDO problem,three high-dimensional explicit problems,and a high-dimensional implicit problem.Compared with Support Vector Regression(SVR),Kriging,and polynomial chaos expansion models,results show that the proposed basis-adaptive PC-Kriging model is more accurate and efficient for RBDO problems of complex engineering structures.
文摘One-step direct production of methanol from methane and water(PMMW)under mild conditions is challenging in heterogeneous catalysis owing to the absence of highly effective catalysts.Herein,we designed a series of“Single-Atom”-“Frustrated Lewis Pair”(SA-FLP)dual active sites for the direct PMMW via density functional theory(DFT)calculations combined with a machine learning(ML)approach.The results indicate that the nine designed SA-FLP catalysts are capable of efficiently activate CH4 and H_(2)O and facilitate the coupling of OH^(*)and CH_(3)^(*)into methanol.The DFT-based microkinetic simulation(MKM)results indicate that CH_(3)OH production on Co1-FLP and Pt1-FLP catalysts can reach the turnover frequencies(TOFs)of 1.01×10^(−3)s^(-1)and 8.80×10^(−4)s^(-1),respectively,which exceed the experimentally reported values by three orders of magnitude.ML results unveil that the gradient boosted regression model with 13 simple features could give satisfactory predictions for the TOFs of CH_(3)OH production with RMSE and R^(2)of 0.009 s^(-1)and 1.00,respectively.The ML-predicted MKM results indicate that four catalysts including V_(1-),Fe_(1-),Ti_(1-),and Mn_(1)-FLP exhibit higher TOFs of CH_(3)OH production than the value that the most relevant experiments reported,indicating that the four catalysts are also promising catalysts for the PMMW.This study not only develops a simple and efficient approach for design and screening SA-FLP catalysts but also provides mechanistic insights into the direct PMMW.
文摘Objective:Deep learning(DL)has become the prevailing method in chest radiograph analysis,yet its performance heavily depends on large quantities of annotated images.To mitigate the cost,cold-start active learning(AL),comprising an initialization followed by subsequent learning,selects a small subset of informative data points for labeling.Recent advancements in pretrained models by supervised or self-supervised learning tailored to chest radiograph have shown broad applicability to diverse downstream tasks.However,their potential in cold-start AL remains unexplored.Methods:To validate the efficacy of domain-specific pretraining,we compared two foundation models:supervised TXRV and self-supervised REMEDIS with their general domain counterparts pretrained on ImageNet.Model performance was evaluated at both initialization and subsequent learning stages on two diagnostic tasks:psychiatric pneumonia and COVID-19.For initialization,we assessed their integration with three strategies:diversity,uncertainty,and hybrid sampling.For subsequent learning,we focused on uncertainty sampling powered by different pretrained models.We also conducted statistical tests to compare the foundation models with ImageNet counterparts,investigate the relationship between initialization and subsequent learning,examine the performance of one-shot initialization against the full AL process,and investigate the influence of class balance in initialization samples on initialization and subsequent learning.Results:First,domain-specific foundation models failed to outperform ImageNet counterparts in six out of eight experiments on informative sample selection.Both domain-specific and general pretrained models were unable to generate representations that could substitute for the original images as model inputs in seven of the eight scenarios.However,pretrained model-based initialization surpassed random sampling,the default approach in cold-start AL.Second,initialization performance was positively correlated with subsequent learning performance,highlighting the importance of initialization strategies.Third,one-shot initialization performed comparably to the full AL process,demonstrating the potential of reducing experts'repeated waiting during AL iterations.Last,a U-shaped correlation was observed between the class balance of initialization samples and model performance,suggesting that the class balance is more strongly associated with performance at middle budget levels than at low or high budgets.Conclusions:In this study,we highlighted the limitations of medical pretraining compared to general pretraining in the context of cold-start AL.We also identified promising outcomes related to cold-start AL,including initialization based on pretrained models,the positive influence of initialization on subsequent learning,the potential for one-shot initialization,and the influence of class balance on middle-budget AL.Researchers are encouraged to improve medical pretraining for versatile DL foundations and explore novel AL methods.
基金Yannick Ureel and Maarten Dobbelaere acknowledge financial support from the Fund for Scientific Research Flanders(FWO Flanders)respectively through doctoral fellowship grants(1185822N and 1S45522N)The authors acknowledge funding from the European Research Council under the European Union’s Horizon 2020 research and innovation programme/ERC(818607).
文摘1.Colors of chemical reaction engineering models Kinetic models of chemical reactions are a crucial asset for understanding and optimizing chemical processes[1].These models are critical for reactor design,process optimization,catalyst design,scale-up,and process control,making them indispensable in the chemical industry.Kinetic models predict the change in temperature and concentration of the relevant species,given an actual concentration and temperature.Reaction predictions are made by integrating the kinetic model with a reactor model,which accounts for external constraints,such as flow,inlet concentration。
文摘Support vector machines(SVMs) are a popular class of supervised learning algorithms, and are particularly applicable to large and high-dimensional classification problems. Like most machine learning methods for data classification and information retrieval, they require manually labeled data samples in the training stage. However, manual labeling is a time consuming and errorprone task. One possible solution to this issue is to exploit the large number of unlabeled samples that are easily accessible via the internet. This paper presents a novel active learning method for text categorization. The main objective of active learning is to reduce the labeling effort, without compromising the accuracy of classification, by intelligently selecting which samples should be labeled.The proposed method selects a batch of informative samples using the posterior probabilities provided by a set of multi-class SVM classifiers, and these samples are then manually labeled by an expert. Experimental results indicate that the proposed active learning method significantly reduces the labeling effort, while simultaneously enhancing the classification accuracy.
文摘This paper describes a new method for active learning in content-based image retrieval. The proposed method firstly uses support vector machine (SVM) classifiers to learn an initial query concept. Then the proposed active learning scheme employs similarity measure to check the current version space and selects images with maximum expected information gain to solicit user's label. Finally, the learned query is refined based on the user's further feedback. With the combination of SVM classifier and similarity measure, the proposed method can alleviate model bias existing in each of them. Our experiments on several query concepts show that the proposed method can learn the user's query concept quickly and effectively only with several iterations.
基金supported by the National Natural Science Foundation of China(Grant Nos.:U23A20530,82273858,and 82173746)the National Key Research and Development Programof China(Grant No.:2023YFF1204904)Shanghai Frontiers Science Center of Optogenetic Techniques for Cell Metabolism(Shanghai Municipal Education Commission,China).
文摘Activity cliffs(ACs)are generally defined as pairs of similar compounds that only differ by a minor structural modification but exhibit a large difference in their binding affinity for a given target.ACs offer crucial insights that aid medicinal chemists in optimizing molecular structures.Nonetheless,they also form a major source of prediction error in structure-activity relationship(SAR)models.To date,several studies have demonstrated that deep neural networks based on molecular images or graphs might need to be improved further in predicting the potency of ACs.In this paper,we integrated the triplet loss in face recognition with pre-training strategy to develop a prediction model ACtriplet,tailored for ACs.Through extensive comparison with multiple baseline models on 30 benchmark datasets,the results showed that ACtriplet was significantly better than those deep learning(DL)models without pretraining.In addition,we explored the effect of pre-training on data representation.Finally,the case study demonstrated that our model's interpretability module could explain the prediction results reasonably.In the dilemma that the amount of data could not be increased rapidly,this innovative framework would better make use of the existing data,which would propel the potential of DL in the early stage of drug discovery and optimization.
文摘Education philosophy,which plays a major role in teacher beliefs,is crucial for teacher educators. Ever since the end of last century, there has been a trend in North America to promote, or restore active learning in college/university classrooms. This paper, based on some explorations of the social constructivism, the six assumptions in Andragogy, the ARCS model, the active learning theory and some practical activities, proposes an integrated education philosophy about adult learning. It is assumed this will be able to provide some insight and guidance for college practitioners.
文摘In active-learning classroom,the students play an active role in learning,and they are not only actively participating in classroom activities,but are interacting with the teacher and their peers.The teachers shoul try to avoid a noisy class with no real effects at all and they should help those who are poor at expressing in English.They should be encouraged to express themselves.To conclude,active-study method has brought us reward.
基金Supported by the Research Chair of Online Dialogue and Cultural Communication,King Saud University,Saudi Arabia.
文摘Automated recognition of violent activities from videos is vital for public safety,but often raises significant privacy concerns due to the sensitive nature of the footage.Moreover,resource constraints often hinder the deployment of deep learning-based complex video classification models on edge devices.With this motivation,this study aims to investigate an effective violent activity classifier while minimizing computational complexity,attaining competitive performance,and mitigating user data privacy concerns.We present a lightweight deep learning architecture with fewer parameters for efficient violent activity recognition.We utilize a two-stream formation of 3D depthwise separable convolution coupled with a linear self-attention mechanism for effective feature extraction,incorporating federated learning to address data privacy concerns.Experimental findings demonstrate the model’s effectiveness with test accuracies from 96%to above 97%on multiple datasets by incorporating the FedProx aggregation strategy.These findings underscore the potential to develop secure,efficient,and reliable solutions for violent activity recognition in real-world scenarios.
文摘In the field of optoelectronics,certain types of data may be difficult to accurately annotate,such as high-resolution optoelectronic imaging or imaging in certain special spectral ranges.Weakly supervised learning can provide a more reliable approach in these situations.Current popular approaches mainly adopt the classification-based class activation maps(CAM)as initial pseudo labels to solve the task.
基金the National Key R&D Program of China(No.2022YFD1700200).
文摘The development of fungicides is time-consuming and costly.Introducing a fungicide-likeness assessment strategy at the early screening stage can help reduce development risks and improve the success rate.However,existing assessment methods are often plagued by low accuracy and poor generalization,while fragment-based design strategies commonly fail to account for synergistic effects between structural units.Therefore,based on a small-scale sample set,this study developed a more efficient global predictive model for fungicidal activity—-named APPf—by integrating multi-scale feature screening methods and machine learning algorithms,which also accounts for synergistic effects among different structural fragments.We utilized three independent external test sets for model validation:External Test Set 1 for general validation,External Test Set 2 for comparison with existing models,and External Test Set 3 for disease-specific fungicide evaluation.On External Test Set 1,the APPf model achieved a precision of 0.6454,a recall of 0.8535,and an F1 score of 0.7350,demonstrating its robust predictive performance.It also exhibited strong enrichment capability for positive samples in External Test Set 2.For External Test Set 3,APPf achieved a prediction accuracy exceeding 80%for each disease,suggesting its promising potential in practical fungicide development.Furthermore,we quantified the contribution of molecular descriptors to the model predictions using SHAP value analysis and identified nHdNH and NssssNp as strong indicative features for predicting fungicidal activity,thereby enhancing the interpretability of the model.APPf has been deployed on a public web server(http://pesticides.cau.edu.cn/APPf),providing a user-friendly online prediction service to support the discovery of novel fungicides.Meanwhile,we employed a molecular fragmentation strategy to analyze the co-occurrence relationships between fragments in fungicides and constructed a network map of fragment co-occurrence associated with fungicidal activity.This study provides both an active fragment library and a global fungicide-likeness assessment tool for AI-based de novo molecular generation aimed at discovering novel fungicidal leads,which is expected to enhance the efficiency of developing new fungicides.
文摘This study applied machine learning methods to predict the durability performance(specifically shrinkage and freeze-thaw resistance)of solid waste-activated cementitious materials.It also offered insights for optimizing material formulations through feature impact analysis.The study collected a total of 130 sets of shrinkage data and 106 sets of freeze-thaw data,establishing various models,including BP,GA-BP,SVM,RF,RBF,and LSTM.The results revealed that the SVM model performed the best on the test dataset.It achieved an R^(2) of 0.9358 for shrinkage prediction,with MAE and RMSE values of 0.4644 and 0.6254,respectively.Regarding freeze-thaw quality loss prediction,the R^(2) was 0.9178,with MAE and RMSE values of 0.3139 and 0.5328,respectively.The study analyzed the impact of different features on the outcomes using the SHAP method,highlighting that the alkaline activator dosage,Al_(2)O_(3),SiO_(2),and water glass modulus were critical factors influencing shrinkage,while CaO,water-cement ratio,water,and Al_(2)O_(3) were crucial for freeze-thaw resistance.By investigating feature interactions through single-factor and two-factor analysis,the study proposed recommendations for optimizing material formulations.This research validated the efficacy of machine learning in predicting the durability of solid waste cementitious materials and offered insights for material optimization through feature impact analysis,thereby laying the groundwork for the development of related materials.