Drug discovery and development affects various aspects of human health and dramatically impacts the pharmaceutical market.However,investments in a new drug often go unrewarded due to the long and complex process of dr...Drug discovery and development affects various aspects of human health and dramatically impacts the pharmaceutical market.However,investments in a new drug often go unrewarded due to the long and complex process of drug research and development(R&D).With the advancement of experimental technology and computer hardware,artificial intelligence(AI)has recently emerged as a leading tool in analyzing abundant and high-dimensional data.Explosive growth in the size of biomedical data provides advantages in applying AI in all stages of drug R&D.Driven by big data in biomedicine,AI has led to a revolution in drug R&D,due to its ability to discover new drugs more efficiently and at lower cost.This review begins with a brief overview of common AI models in the field of drug discovery;then,it summarizes and discusses in depth their specific applications in various stages of drug R&D,such as target discovery,drug discovery and design,preclinical research,automated drug synthesis,and influences in the pharmaceutical market.Finally,the major limitations of AI in drug R&D are fully discussed and possible solutions are proposed.展开更多
The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have be...The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have been developed to accelerate the identification of PPI sites. However, existing methods suffer from the low predictive accuracy or the limited scope of application. Specifically, some methods learned only global or local sequential features, leading to low predictive accuracy, while others achieved improved performance by extracting residue interactions from structures but were limited in their application scope for the serious dependence on precise structure information. There is an urgent need to develop a method that integrates comprehensive information to realize proteome-wide accurate profiling of PPI sites. Herein, a novel ensemble framework for PPI sites prediction, EnsemPPIS, was therefore proposed based on transformer and gated convolutional networks. EnsemPPIS can effectively capture not only global and local patterns but also residue interactions. Specifically, EnsemPPIS was unique in (a) extracting residue interactions from protein sequences with transformer and (b) further integrating global and local sequential features with the ensemble learning strategy. Compared with various existing methods, EnsemPPIS exhibited either superior performance or broader applicability on multiple PPI sites prediction tasks. Moreover, pattern analysis based on the interpretability of EnsemPPIS demonstrated that EnsemPPIS was fully capable of learning residue interactions within the local structure of PPI sites using only sequence information. The web server of EnsemPPIS is freely available at http://idrblab.org/ensemppis.展开更多
基金funded by the Natural Science Foundation of Zhejiang Province(LR21H300001)National Key R&D Program of China(2022YFC3400501)+4 种基金National Natural Science Foundation of China(22220102001,U1909208,81872798,and 81825020)Leading Talent of the“Ten Thousand Plan”-National High-Level Talents Special Support Plan of ChinaFundamental Research Fund of Central University(2018QNA7023)Key R&D Program of Zhejiang Province(2020C03010)“Double Top-Class”University(181201*194232101)。
文摘Drug discovery and development affects various aspects of human health and dramatically impacts the pharmaceutical market.However,investments in a new drug often go unrewarded due to the long and complex process of drug research and development(R&D).With the advancement of experimental technology and computer hardware,artificial intelligence(AI)has recently emerged as a leading tool in analyzing abundant and high-dimensional data.Explosive growth in the size of biomedical data provides advantages in applying AI in all stages of drug R&D.Driven by big data in biomedicine,AI has led to a revolution in drug R&D,due to its ability to discover new drugs more efficiently and at lower cost.This review begins with a brief overview of common AI models in the field of drug discovery;then,it summarizes and discusses in depth their specific applications in various stages of drug R&D,such as target discovery,drug discovery and design,preclinical research,automated drug synthesis,and influences in the pharmaceutical market.Finally,the major limitations of AI in drug R&D are fully discussed and possible solutions are proposed.
基金supported by the National Natural Science Foundation of China(82373790,U1909208,22220102001,and 81872798)Natural Science Foundation of Zhejiang Province(LR21H300001)+4 种基金Leading Talent of the"Ten Thousand Plan"-National High-Level Talents Special Supports Plan of China,National Key R&D Program of China(2022YFC3400501)Key R&D Program of Zhejiang Province(2020C03010)"Double Top-Class"Universities Projects(181201*194232101)Fundamental Research Funds for Central University(2018QNA7023)Funds for the open access charge:Natural Science Foundation of Zhejiang Province(LR21H300001).
文摘The identification of protein-protein interaction (PPI) sites is essential in the research of protein function and the discovery of new drugs. So far, a variety of computational tools based on machine learning have been developed to accelerate the identification of PPI sites. However, existing methods suffer from the low predictive accuracy or the limited scope of application. Specifically, some methods learned only global or local sequential features, leading to low predictive accuracy, while others achieved improved performance by extracting residue interactions from structures but were limited in their application scope for the serious dependence on precise structure information. There is an urgent need to develop a method that integrates comprehensive information to realize proteome-wide accurate profiling of PPI sites. Herein, a novel ensemble framework for PPI sites prediction, EnsemPPIS, was therefore proposed based on transformer and gated convolutional networks. EnsemPPIS can effectively capture not only global and local patterns but also residue interactions. Specifically, EnsemPPIS was unique in (a) extracting residue interactions from protein sequences with transformer and (b) further integrating global and local sequential features with the ensemble learning strategy. Compared with various existing methods, EnsemPPIS exhibited either superior performance or broader applicability on multiple PPI sites prediction tasks. Moreover, pattern analysis based on the interpretability of EnsemPPIS demonstrated that EnsemPPIS was fully capable of learning residue interactions within the local structure of PPI sites using only sequence information. The web server of EnsemPPIS is freely available at http://idrblab.org/ensemppis.