Machine Learning(ML)potentials can be used for the construction of potential energy surfaces(PES)to avoid computationally expensive ab initio methods.However,many such applications still require a significant number o...Machine Learning(ML)potentials can be used for the construction of potential energy surfaces(PES)to avoid computationally expensive ab initio methods.However,many such applications still require a significant number of first-principles calculations to train the ML model,prior to use.Active learning methods can address this issue by performing these calculations and trainings“on-the-fly”(OTF),based on the ML model’s uncertainty estimation.Nevertheless,current active learning approaches suffer from problems in complex simulations were frequent retraining is required,since repeated training of a largeMLmodel increases training times substantially.This work presents a solution to this limitation by introducing the FALCON(Fast Active Learning for Computational ab initio mOlecular dyNamics)calculator.Instead of relying on a single largeMLmodel,FALCON clusters the training data into subsets of similar structures and distributes them across multiple smaller ML models.This approach significantly increases the efficiency of the OTF training,drastically reducing the computational cost of training-intensive simulations.The use of FALCON is demonstrated on various molecular dynamics(MD)simulations of bulk metals,metal clusters,cathode materials and water diffusion in a carbon nanotube.However,the FALCON calculator is highly flexible and could be easily adapted for various applications and different ML models.展开更多
基金funding by the Central Research Development Fund of the University of Bremen.
文摘Machine Learning(ML)potentials can be used for the construction of potential energy surfaces(PES)to avoid computationally expensive ab initio methods.However,many such applications still require a significant number of first-principles calculations to train the ML model,prior to use.Active learning methods can address this issue by performing these calculations and trainings“on-the-fly”(OTF),based on the ML model’s uncertainty estimation.Nevertheless,current active learning approaches suffer from problems in complex simulations were frequent retraining is required,since repeated training of a largeMLmodel increases training times substantially.This work presents a solution to this limitation by introducing the FALCON(Fast Active Learning for Computational ab initio mOlecular dyNamics)calculator.Instead of relying on a single largeMLmodel,FALCON clusters the training data into subsets of similar structures and distributes them across multiple smaller ML models.This approach significantly increases the efficiency of the OTF training,drastically reducing the computational cost of training-intensive simulations.The use of FALCON is demonstrated on various molecular dynamics(MD)simulations of bulk metals,metal clusters,cathode materials and water diffusion in a carbon nanotube.However,the FALCON calculator is highly flexible and could be easily adapted for various applications and different ML models.