Artificial intelligence(AI)has revolutionized the protein engineering process from multiple aspects,including representing protein information,generating protein designs,and evaluating protein properties.This review a...Artificial intelligence(AI)has revolutionized the protein engineering process from multiple aspects,including representing protein information,generating protein designs,and evaluating protein properties.This review aims to introduce the recent progress of AI in protein research.We first introduce how AI models represent protein sequences,structures,and other properties.Further,the applications of generative models in protein design are introduced.The use of predictive models and smart agents in the evaluation process is then discussed,including high-precision protein property simulation and wet lab experimental design.Additionally,we discuss the future development of AI in protein research and the potential challenges it may encounter.展开更多
The primary goal of protein engineering has always been to create molecules with optimal functions and characteristics.One of the most exciting avenues of research in this field is de novo protein design.This approach...The primary goal of protein engineering has always been to create molecules with optimal functions and characteristics.One of the most exciting avenues of research in this field is de novo protein design.This approach facilitates the synthesis of entirely new molecules without relying on existing protein,thereby offering a novel method for generating molecular entities that were previously unimaginable.The application of artificial intelligence in this field has been a significant advancement.By leveraging machine learning algorithms trained on extensive sequence and structure datasets,scientists have been able to make de novo protein design a practical reality.In this paper,we will delve into the key artificial intelligence innovations that have driven this progress and explore how they unlock groundbreaking opportunities.These advancements,we believe,have the potential to push beyond the current state of the art,enabling us to design proteins strategically and robustly.Moreover,they offer solutions to pressing societal challenges,such as developing new therapeutics,creating sustainable biomaterials,or engineering enzymes for environmental remediation.展开更多
Computational design of proteins is a relatively new field, where scientists search the enormous sequence space for sequences that can fold into desired structure and perform desired functions. With the computational ...Computational design of proteins is a relatively new field, where scientists search the enormous sequence space for sequences that can fold into desired structure and perform desired functions. With the computational approach, proteins can be designed, for example, as regulators of biological processes, novel enzymes, or as biotherapeutics. These approaches not only provide valuable information for understanding of sequence-structure-function relations in proteins, but also hold promise for applications to protein engineering and biomedical research. In this review, we briefly introduce the rationale for computational protein design, then summarize the recent progress in this field, including de novo protein design, enzyme design, and design of protein-protein interactions. Challenges and future prospects of this field are also discussed.展开更多
Rational protein design is a powerful strategy,not only for revealing the structure and function relationship of natural metalloproteins,but also for creating artificial metalloproteins with improved properties and fu...Rational protein design is a powerful strategy,not only for revealing the structure and function relationship of natural metalloproteins,but also for creating artificial metalloproteins with improved properties and functions.Myoglobin(Mb),a small heme protein created by nature with diverse functions,has been shown to be an ideal scaffold for rational protein design.The progress reviewed herein includes fine-tuning its native functions of O2binding and transport,peroxidase activity and nitrite reductase(NIR)activity,and rational expanding its functionalities to peroxygenase,heme-copper oxidase(HCO),nitric oxide reductase(NOR),as well as hydroxylamine reductase.These studies have enhanced our understanding of how metalloproteins work in nature,and provided insights for rational design of functional metalloproteins for practical applications in the future.展开更多
We designed a disulfide-crosslinked mini-protein with a two-helical topology consisting of L-and Damino acids,which was exceptionally stable in serum.Therefore,we further used it as a scaffold to design mini-proteins ...We designed a disulfide-crosslinked mini-protein with a two-helical topology consisting of L-and Damino acids,which was exceptionally stable in serum.Therefore,we further used it as a scaffold to design mini-proteins targeting p53 positive tumor cells.Based on bifunctional grafting,key residues from the transactivation domain of p53 and a designed unnatural amino acid were grafted into the helix constituted by L-amino acids to confer the mini-protein with MDM2 inhibitory activity.Meanwhile,ten Arg residues were introduced to improve its membrane penetrating capacity.Among the mini-proteins,UPROL-10e showed nano-molar binding affinity on MDM2 and cellular toxicity on p53 expressing HCT116cells.展开更多
The protein inverse folding problem,designing amino acid sequences that fold into desired protein structures,is a critical challenge in biological sciences.Despite numerous data-driven and knowledge-driven methods,the...The protein inverse folding problem,designing amino acid sequences that fold into desired protein structures,is a critical challenge in biological sciences.Despite numerous data-driven and knowledge-driven methods,there remains a need for a user-friendly toolkit that effectively integrates these approaches for in-silico protein design.In this paper,we present DIProT,an interactive protein design toolkit.DIProT leverages a non-autoregressive deep generative model to solve the inverse folding problem,combined with a protein structure prediction model.This integration allows users to incorporate prior knowledge into the design process,evaluate designs in silico,and form a virtual design loop with human feedback.Our inverse folding model demonstrates competitive performance in terms of effectiveness and efficiency on TS50 and CATH4.2 datasets,with promising sequence recovery and inference time.Case studies further illustrate how DIProT can facilitate user-guided protein design.展开更多
Proteins perform a variety of functions in living organisms and their functions are largely determined by their shape. In this paper, we propose a novel mathematical method for designing protein-like molecules of a gi...Proteins perform a variety of functions in living organisms and their functions are largely determined by their shape. In this paper, we propose a novel mathematical method for designing protein-like molecules of a given shape. In the mathematical model, molecules are represented as loops of n-simplices (2-simplices are triangles and 3-simplices are tetrahedra). We design a new molecule of a given shape by patching together a set of smaller molecules that cover the shape. The covering set of small molecules is defined using a binary relation between sets of molecules. A new molecule is then obtained as a sum of the smaller molecules, where addition of molecules is defined using transformations acting on a set of (n + 1)-dimensional cones. Due to page limitations, only the two-dimensional case (i.e., loops of triangles) is considered. No prior knowledge of Sheaf Theory, Category Theory, or Protein Science is required. The author hopes that this paper will encourage further collaboration between Mathematics and Protein Science.展开更多
A new effective and fast minimization approach completely based on the physical theory is proposed for protein design. The sequence space is essentially searched according to the Boltzmann distribution. In this approa...A new effective and fast minimization approach completely based on the physical theory is proposed for protein design. The sequence space is essentially searched according to the Boltzmann distribution. In this approach, the relative entropy is used as a minimization object function. The method has been tested on an off-lattice model of proteins and the results are better than those obtained from other similar work. Therefore, it can be applied as a uniform frame for both folding and inverse folding of proteins.展开更多
In the present study, we have developed the method brought forward recently for protein design based on the relative entropy. The new approach can be used in more common situation other than the special limits in the ...In the present study, we have developed the method brought forward recently for protein design based on the relative entropy. The new approach can be used in more common situation other than the special limits in the anterior method. The results indicate that our generalized method has increased the prediction precision for protein sequence and will be in favor of the study for protein design.展开更多
Oleanolic acid derivatives act as newer protein tyrosine phosphatase 1B (PTP-1B) inhibitors for type 2 diabetes mellitus (T2DM). In order to understand the structural requirement of PTP-1B inhibitors, 52 oleanolic...Oleanolic acid derivatives act as newer protein tyrosine phosphatase 1B (PTP-1B) inhibitors for type 2 diabetes mellitus (T2DM). In order to understand the structural requirement of PTP-1B inhibitors, 52 oleanolic acid derivatives were divided into a training set (34 compounds) and a test set (18 compounds). The highly reliable and predictive 3D-QSAR models were constructed by CoMFA, CoMSIA and topomer CoMFA methods, respectively. The results showed that the cross validated coefficient (q2) and non-cross-validated coefficient (R2) were 0.554 and 0.999 in the CoMFA model, 0.675 and 0.971 in the CoMSIA model, and 0.628 and 0.939 in the topomer CoMFA model, which suggests that three models are robust and have good exterior predictive capabilities. Furthermore, ten novel inhibitors with much higher inhibitory potency were designed. Our design strategy was that (i) the electronegative substituents (Cl, -CH2OH, OH and -CH2Cl) were introduced into the double bond of ring C, (ii) the hydrogen bond acceptor groups (C≡N and N atom), electronegative groups (C≡N, N atom, -COOH and -COOCH3) and bulky substituents (C6H5N) were connected to the C-3 position, which would result in generating potent and selective PTP-1B inhibitors. We expect that the results in this paper have the potential to facilitate the process of design and to develop new potent PTP-1B inhibitors.展开更多
In the last few years, there have been important new insights into the structural biology of G-protein coupled receptors. It is now known that allosteric binding sites are involved in the affinity and selec- tivity of...In the last few years, there have been important new insights into the structural biology of G-protein coupled receptors. It is now known that allosteric binding sites are involved in the affinity and selec- tivity of ligands for G-protein coupled receptors, and that signaling by these receptors involves both G-protein dependent and independent pathways. The present review outlines the physiological and pharmacological implications of this perspective for the design of new drugs to treat disorders of the central nervous system. Specifically, new possibilities are explored in relation to allosteric and or- thosteric binding sites on dopamine receptors for the treatment of Parkinson's disease, and on muscarinic receptors for Alzheimer's disease. Future research can seek to identify ligands that can bind to more than one site on the same receptor, or simultaneously bind to two receptors and form a dimer. For example, the design of bivalent drugs that can reach homo/hetero-dimers of D2 dopa- mine receptor holds promise as a relevant therapeutic strategy for Parkinson's disease. Regarding the treatment of Alzheimer's disease, the design of dualsteric ligands for mono-oligomeric mus- carinic receptors could increase therapeutic effectiveness by generating potent compounds that could activate more than one signaling pathway.展开更多
Conversion of one cell type into another cell type by forcibly expressing specific cocktails of transcription factors (TFs) has demonstrated that cell fates are not fixed and that cellular differentiation can be a t...Conversion of one cell type into another cell type by forcibly expressing specific cocktails of transcription factors (TFs) has demonstrated that cell fates are not fixed and that cellular differentiation can be a two-way street with many intersections. These experiments also illustrated the sweeping potential of TFs to "read" genetically hardwired regulatory information even in cells where they are not normally expressed and to access and open up tightly packed chromatin to execute gene expression programs. Cellular reprogramming enables the modeling of diseases in a dish, to test the efficacy and toxicity of drugs in patient-derived cells and ultimately, could enable cell-based therapies to cure degenerative diseases. Yet, producing terminally differentiated cells that fully resemble their in vivo counterparts in sufficient quantities is still an unmet clinical need. While efforts are being made to reprogram cells nongeneticaUy by using drug.like molecules, defined TF cocktails still dominate reprogramming protocols. Therefore; the optimization of TFs by protein engineering has emerged as a strategy to enhance reprogramming to produce functional, stable and safe cells for regenerative biomedicine. Engineering approaches focused on Oct4, MyoD, Sox17, Nanog and Mef2c and range from chimeric TFs with added transactivation domains, designer transcription activator-like effectors to activate endogenous TFs to reprogramming TFs with rationally engineered DNA recognition principles. Possibly, applying the complete toolkit of protein design to cellular reprogramming can help to remove the hurdles that, thus far, impeded the clinical use of cells derived from reprogramming technologies.展开更多
This paper proposes a novel category theoretic approach to describe protein’s shape, i.e., a description of their shape by a set of algebraic equations. The focus of the approach is on the relations between proteins,...This paper proposes a novel category theoretic approach to describe protein’s shape, i.e., a description of their shape by a set of algebraic equations. The focus of the approach is on the relations between proteins, rather than on the proteins themselves. Knowledge of category theory is not required as mathematical notions are defined concretely. In this paper, proteins are represented as closed trajectories (i.e., loops) of flows of triangles. The relations between proteins are defined using the fusion and fission of loops of triangles, where allostery occurs naturally. The shape of a protein is then described with quantities that are measurable with unity elements called “unit loops”. That is, protein’s shape is described with the loops that are obtained by the fusion of unit loops. Measurable loops are called “integral”. In the approach, the unit loops play a role similar to the role “1” plays in the set Z of integers. In particular, the author considers two categories of loops, the “integral” loops and the “rational” loops. Rational loops are then defined using algebraic equations with “integral loop” coefficients. Because of the approach, our theory has some similarities to quantum mechanics, where only observable quantities are admitted in physical theory. The author believes that this paper not only provides a new perspective on protein engineering, but also promotes further collaboration between biology and other disciplines.展开更多
Some highly designable protein structures have dented on the surface of their native structures, and are not full compactly folded. According to hydrophobic-polar (HP) model the most designable structures are full c...Some highly designable protein structures have dented on the surface of their native structures, and are not full compactly folded. According to hydrophobic-polar (HP) model the most designable structures are full compactly folded. To investigate the designability of the dented structures, we introduce the hydrogen bond energy in the secondary structures by using the secondary-structure-favored HP model proposed by Ou-yang etc. The result shows that the average designability increases with the strength of the hydrogen bond. The designabilities of the structures with same dented shape increase exponentially with the number of secondary structure sites. The dented structures can have the highest designabilities for a certain value of hydrogen bond energy density.展开更多
In the past decade,the p53-MDM2 protein-protein interaction by small molecules has been confirmed as a successful strategy for cancer therapy.In our previous work,pyrrolo[3,4-c]pyrazol-6(1H)-ones were found to be po...In the past decade,the p53-MDM2 protein-protein interaction by small molecules has been confirmed as a successful strategy for cancer therapy.In our previous work,pyrrolo[3,4-c]pyrazol-6(1H)-ones were found to be potent p53-MDM2 inhibitors.Further optimization and structure-activity relationship studies were described in the present work.The result revealed that benzyl group on position N1 of imidazole and bromine on C4-phenyl of pyrrolidone showed higher inhibitory activities.In vitro antiproliferative assay demonstrated the potent p53-MDM2 inhibitor 5c with 4-fold selectivity for U2 OS and Saos-2 cells.These data indicated that 4,5-dihydropyrrolo[3,4-c]pyrazol-6(1H)-one moiety is a valuable scaffold for further development of p53-MDM2 inhibitors.展开更多
Transformer models have emerged as pivotal tools within the realm of drug discovery,distinguished by their unique architectural features and exceptional performance in managing intricate data landscapes.Leveraging the...Transformer models have emerged as pivotal tools within the realm of drug discovery,distinguished by their unique architectural features and exceptional performance in managing intricate data landscapes.Leveraging the innate capabilities of transformer architectures to comprehend intricate hierarchical dependencies inherent in sequential data,these models showcase remarkable efficacy across various tasks,including new drug design and drug target identification.The adaptability of pre-trained trans-former-based models renders them indispensable assets for driving data-centric advancements in drug discovery,chemistry,and biology,furnishing a robust framework that expedites innovation and dis-covery within these domains.Beyond their technical prowess,the success of transformer-based models in drug discovery,chemistry,and biology extends to their interdisciplinary potential,seamlessly combining biological,physical,chemical,and pharmacological insights to bridge gaps across diverse disciplines.This integrative approach not only enhances the depth and breadth of research endeavors but also fosters synergistic collaborations and exchange of ideas among disparate fields.In our review,we elucidate the myriad applications of transformers in drug discovery,as well as chemistry and biology,spanning from protein design and protein engineering,to molecular dynamics(MD),drug target iden-tification,transformer-enabled drug virtual screening(VS),drug lead optimization,drug addiction,small data set challenges,chemical and biological image analysis,chemical language understanding,and single cell data.Finally,we conclude the survey by deliberating on promising trends in transformer models within the context of drug discovery and other sciences.展开更多
On 9 October 2024,in a high-profile vote of confidence for the promise of using artificial intelligence(AI)in scientific discovery,the Royal Swedish Academy of Sciences awarded Demis Hassabis(co-founder and chief exec...On 9 October 2024,in a high-profile vote of confidence for the promise of using artificial intelligence(AI)in scientific discovery,the Royal Swedish Academy of Sciences awarded Demis Hassabis(co-founder and chief executive officer)and John M.Jumper(direc-tor)of Google DeepMind(London,UK)the 2024 Nobel Prize in Chemistry for their pioneering work in developing the AI-powered protein structure prediction model AlphaFold2(AF2)[1].Also shar-ing the prize was David Baker(half to Hassabis and Jumper;half to Baker),professor of biochemistry at the University of Washington(Seattle,WA,USA),for his work on computational protein design that started with the mid-1990s development of Rosetta,a since-evolving suite of software tools that model protein structures using physical principles[2]-and now also AI[3].展开更多
Traditional development of small protein scaffolds has relied on display technologies and mutation-based engineering,which limit sequence and functional diversity,thereby constraining their therapeutic and application...Traditional development of small protein scaffolds has relied on display technologies and mutation-based engineering,which limit sequence and functional diversity,thereby constraining their therapeutic and application potential.Protein design tools have significantly advanced the creation of novel protein sequences,structures,and functions.However,further improvements in design strategies are still needed to more efficiently optimize the functional performance of protein-based drugs and enhance their druggability.Here,we extended an evolution-based design protocol to create a novel minibinder,BindHer,against the human epidermal growth factor receptor 2(HER2).It not only exhibits super stability and binding selectivity but also demonstrates remarkable properties in tissue specificity.Radiolabeling experiments with ^(99)mTc,^(68)Ga,and ^(18)F revealed that BindHer efficiently targets tumors in HER2-positive breast cancer mouse models,with minimal nonspecific liver absorption,outperforming scaffolds designed through traditional engineering.These findings highlight a new rational approach to automated protein design,offering significant potential for large-scale applications in therapeutic mini-protein development.展开更多
基金supported by the Shanghai Agricultural Science and Technology Innovation Program(K2023001)the Key Research and Development Program of Shandong ProvinceChina(2024CXGC010213).
文摘Artificial intelligence(AI)has revolutionized the protein engineering process from multiple aspects,including representing protein information,generating protein designs,and evaluating protein properties.This review aims to introduce the recent progress of AI in protein research.We first introduce how AI models represent protein sequences,structures,and other properties.Further,the applications of generative models in protein design are introduced.The use of predictive models and smart agents in the evaluation process is then discussed,including high-precision protein property simulation and wet lab experimental design.Additionally,we discuss the future development of AI in protein research and the potential challenges it may encounter.
基金supported by the National Natural Science Foundation Projects of China(NO.82350003).
文摘The primary goal of protein engineering has always been to create molecules with optimal functions and characteristics.One of the most exciting avenues of research in this field is de novo protein design.This approach facilitates the synthesis of entirely new molecules without relying on existing protein,thereby offering a novel method for generating molecular entities that were previously unimaginable.The application of artificial intelligence in this field has been a significant advancement.By leveraging machine learning algorithms trained on extensive sequence and structure datasets,scientists have been able to make de novo protein design a practical reality.In this paper,we will delve into the key artificial intelligence innovations that have driven this progress and explore how they unlock groundbreaking opportunities.These advancements,we believe,have the potential to push beyond the current state of the art,enabling us to design proteins strategically and robustly.Moreover,they offer solutions to pressing societal challenges,such as developing new therapeutics,creating sustainable biomaterials,or engineering enzymes for environmental remediation.
基金supported by the National Basic Research Program of China(Grant No.2015CB910300)the National High Technology Research and Development Program of China(Grant No.2012AA020308)the National Natural Science Foundation of China(Grant No.11021463)
文摘Computational design of proteins is a relatively new field, where scientists search the enormous sequence space for sequences that can fold into desired structure and perform desired functions. With the computational approach, proteins can be designed, for example, as regulators of biological processes, novel enzymes, or as biotherapeutics. These approaches not only provide valuable information for understanding of sequence-structure-function relations in proteins, but also hold promise for applications to protein engineering and biomedical research. In this review, we briefly introduce the rationale for computational protein design, then summarize the recent progress in this field, including de novo protein design, enzyme design, and design of protein-protein interactions. Challenges and future prospects of this field are also discussed.
基金supported by the National Natural Science Foundation of China(21101091,31370812)the Scientific Research Foundation for the Returned Overseas Chinese Scholars,Ministry of Education of China.J.Wang is supported by the National Basic Research Program of China(2010CB912301,2009CB82 5505)+1 种基金the National Natural Science Foundation of China(90913022)Y.Lu is supported by the US National Institute of Health(GM062211)
文摘Rational protein design is a powerful strategy,not only for revealing the structure and function relationship of natural metalloproteins,but also for creating artificial metalloproteins with improved properties and functions.Myoglobin(Mb),a small heme protein created by nature with diverse functions,has been shown to be an ideal scaffold for rational protein design.The progress reviewed herein includes fine-tuning its native functions of O2binding and transport,peroxidase activity and nitrite reductase(NIR)activity,and rational expanding its functionalities to peroxygenase,heme-copper oxidase(HCO),nitric oxide reductase(NOR),as well as hydroxylamine reductase.These studies have enhanced our understanding of how metalloproteins work in nature,and provided insights for rational design of functional metalloproteins for practical applications in the future.
基金supported by the National Natural Science Foundation of China(Nos.3217110331 and 8212200560)Major new drug development in Shandong Province(No.2020CXGC010503)。
文摘We designed a disulfide-crosslinked mini-protein with a two-helical topology consisting of L-and Damino acids,which was exceptionally stable in serum.Therefore,we further used it as a scaffold to design mini-proteins targeting p53 positive tumor cells.Based on bifunctional grafting,key residues from the transactivation domain of p53 and a designed unnatural amino acid were grafted into the helix constituted by L-amino acids to confer the mini-protein with MDM2 inhibitory activity.Meanwhile,ten Arg residues were introduced to improve its membrane penetrating capacity.Among the mini-proteins,UPROL-10e showed nano-molar binding affinity on MDM2 and cellular toxicity on p53 expressing HCT116cells.
基金This work was supported by the National Natural Science Foundation of China(Nos.62250007,62225307,61721003)a grant from the Guoqiang Institute,Tsinghua University(2021GQG1023).
文摘The protein inverse folding problem,designing amino acid sequences that fold into desired protein structures,is a critical challenge in biological sciences.Despite numerous data-driven and knowledge-driven methods,there remains a need for a user-friendly toolkit that effectively integrates these approaches for in-silico protein design.In this paper,we present DIProT,an interactive protein design toolkit.DIProT leverages a non-autoregressive deep generative model to solve the inverse folding problem,combined with a protein structure prediction model.This integration allows users to incorporate prior knowledge into the design process,evaluate designs in silico,and form a virtual design loop with human feedback.Our inverse folding model demonstrates competitive performance in terms of effectiveness and efficiency on TS50 and CATH4.2 datasets,with promising sequence recovery and inference time.Case studies further illustrate how DIProT can facilitate user-guided protein design.
文摘Proteins perform a variety of functions in living organisms and their functions are largely determined by their shape. In this paper, we propose a novel mathematical method for designing protein-like molecules of a given shape. In the mathematical model, molecules are represented as loops of n-simplices (2-simplices are triangles and 3-simplices are tetrahedra). We design a new molecule of a given shape by patching together a set of smaller molecules that cover the shape. The covering set of small molecules is defined using a binary relation between sets of molecules. A new molecule is then obtained as a sum of the smaller molecules, where addition of molecules is defined using transformations acting on a set of (n + 1)-dimensional cones. Due to page limitations, only the two-dimensional case (i.e., loops of triangles) is considered. No prior knowledge of Sheaf Theory, Category Theory, or Protein Science is required. The author hopes that this paper will encourage further collaboration between Mathematics and Protein Science.
基金the National Natural Science Foundation of China (Grant Nos. 10174005 & 30170230)the Beijing Natural Science Foundation (No. 5032002)
文摘A new effective and fast minimization approach completely based on the physical theory is proposed for protein design. The sequence space is essentially searched according to the Boltzmann distribution. In this approach, the relative entropy is used as a minimization object function. The method has been tested on an off-lattice model of proteins and the results are better than those obtained from other similar work. Therefore, it can be applied as a uniform frame for both folding and inverse folding of proteins.
文摘In the present study, we have developed the method brought forward recently for protein design based on the relative entropy. The new approach can be used in more common situation other than the special limits in the anterior method. The results indicate that our generalized method has increased the prediction precision for protein sequence and will be in favor of the study for protein design.
基金Supported by the Natural Science Foundation of Guangxi Province(Nos.2013GXNSFAA019019 and 2013GXNSFAA019041)
文摘Oleanolic acid derivatives act as newer protein tyrosine phosphatase 1B (PTP-1B) inhibitors for type 2 diabetes mellitus (T2DM). In order to understand the structural requirement of PTP-1B inhibitors, 52 oleanolic acid derivatives were divided into a training set (34 compounds) and a test set (18 compounds). The highly reliable and predictive 3D-QSAR models were constructed by CoMFA, CoMSIA and topomer CoMFA methods, respectively. The results showed that the cross validated coefficient (q2) and non-cross-validated coefficient (R2) were 0.554 and 0.999 in the CoMFA model, 0.675 and 0.971 in the CoMSIA model, and 0.628 and 0.939 in the topomer CoMFA model, which suggests that three models are robust and have good exterior predictive capabilities. Furthermore, ten novel inhibitors with much higher inhibitory potency were designed. Our design strategy was that (i) the electronegative substituents (Cl, -CH2OH, OH and -CH2Cl) were introduced into the double bond of ring C, (ii) the hydrogen bond acceptor groups (C≡N and N atom), electronegative groups (C≡N, N atom, -COOH and -COOCH3) and bulky substituents (C6H5N) were connected to the C-3 position, which would result in generating potent and selective PTP-1B inhibitors. We expect that the results in this paper have the potential to facilitate the process of design and to develop new potent PTP-1B inhibitors.
基金supported by SIP-IPN,CONACYT (CB-168116)FIS/IMSS (FIS/IMSS/PROT/G11-2/1013)
文摘In the last few years, there have been important new insights into the structural biology of G-protein coupled receptors. It is now known that allosteric binding sites are involved in the affinity and selec- tivity of ligands for G-protein coupled receptors, and that signaling by these receptors involves both G-protein dependent and independent pathways. The present review outlines the physiological and pharmacological implications of this perspective for the design of new drugs to treat disorders of the central nervous system. Specifically, new possibilities are explored in relation to allosteric and or- thosteric binding sites on dopamine receptors for the treatment of Parkinson's disease, and on muscarinic receptors for Alzheimer's disease. Future research can seek to identify ligands that can bind to more than one site on the same receptor, or simultaneously bind to two receptors and form a dimer. For example, the design of bivalent drugs that can reach homo/hetero-dimers of D2 dopa- mine receptor holds promise as a relevant therapeutic strategy for Parkinson's disease. Regarding the treatment of Alzheimer's disease, the design of dualsteric ligands for mono-oligomeric mus- carinic receptors could increase therapeutic effectiveness by generating potent compounds that could activate more than one signaling pathway.
文摘Conversion of one cell type into another cell type by forcibly expressing specific cocktails of transcription factors (TFs) has demonstrated that cell fates are not fixed and that cellular differentiation can be a two-way street with many intersections. These experiments also illustrated the sweeping potential of TFs to "read" genetically hardwired regulatory information even in cells where they are not normally expressed and to access and open up tightly packed chromatin to execute gene expression programs. Cellular reprogramming enables the modeling of diseases in a dish, to test the efficacy and toxicity of drugs in patient-derived cells and ultimately, could enable cell-based therapies to cure degenerative diseases. Yet, producing terminally differentiated cells that fully resemble their in vivo counterparts in sufficient quantities is still an unmet clinical need. While efforts are being made to reprogram cells nongeneticaUy by using drug.like molecules, defined TF cocktails still dominate reprogramming protocols. Therefore; the optimization of TFs by protein engineering has emerged as a strategy to enhance reprogramming to produce functional, stable and safe cells for regenerative biomedicine. Engineering approaches focused on Oct4, MyoD, Sox17, Nanog and Mef2c and range from chimeric TFs with added transactivation domains, designer transcription activator-like effectors to activate endogenous TFs to reprogramming TFs with rationally engineered DNA recognition principles. Possibly, applying the complete toolkit of protein design to cellular reprogramming can help to remove the hurdles that, thus far, impeded the clinical use of cells derived from reprogramming technologies.
文摘This paper proposes a novel category theoretic approach to describe protein’s shape, i.e., a description of their shape by a set of algebraic equations. The focus of the approach is on the relations between proteins, rather than on the proteins themselves. Knowledge of category theory is not required as mathematical notions are defined concretely. In this paper, proteins are represented as closed trajectories (i.e., loops) of flows of triangles. The relations between proteins are defined using the fusion and fission of loops of triangles, where allostery occurs naturally. The shape of a protein is then described with quantities that are measurable with unity elements called “unit loops”. That is, protein’s shape is described with the loops that are obtained by the fusion of unit loops. Measurable loops are called “integral”. In the approach, the unit loops play a role similar to the role “1” plays in the set Z of integers. In particular, the author considers two categories of loops, the “integral” loops and the “rational” loops. Rational loops are then defined using algebraic equations with “integral loop” coefficients. Because of the approach, our theory has some similarities to quantum mechanics, where only observable quantities are admitted in physical theory. The author believes that this paper not only provides a new perspective on protein engineering, but also promotes further collaboration between biology and other disciplines.
基金Supported by the Foundation for the Author of National Excellent Doctoral Dissertation of China (200525)the Science and Tech-nology Program of Wuhan City (20067003111-07)
文摘Some highly designable protein structures have dented on the surface of their native structures, and are not full compactly folded. According to hydrophobic-polar (HP) model the most designable structures are full compactly folded. To investigate the designability of the dented structures, we introduce the hydrogen bond energy in the secondary structures by using the secondary-structure-favored HP model proposed by Ou-yang etc. The result shows that the average designability increases with the strength of the hydrogen bond. The designabilities of the structures with same dented shape increase exponentially with the number of secondary structure sites. The dented structures can have the highest designabilities for a certain value of hydrogen bond energy density.
基金supported in part by the National Natural Science Foundation of China (Nos. 81373331 and 81502978)the Bio-Pharmaceutical Project of Science and Technology of Shanghai (No. 14431902300)Youth grant of Second Military Medical University (No. 2014QN08)
文摘In the past decade,the p53-MDM2 protein-protein interaction by small molecules has been confirmed as a successful strategy for cancer therapy.In our previous work,pyrrolo[3,4-c]pyrazol-6(1H)-ones were found to be potent p53-MDM2 inhibitors.Further optimization and structure-activity relationship studies were described in the present work.The result revealed that benzyl group on position N1 of imidazole and bromine on C4-phenyl of pyrrolidone showed higher inhibitory activities.In vitro antiproliferative assay demonstrated the potent p53-MDM2 inhibitor 5c with 4-fold selectivity for U2 OS and Saos-2 cells.These data indicated that 4,5-dihydropyrrolo[3,4-c]pyrazol-6(1H)-one moiety is a valuable scaffold for further development of p53-MDM2 inhibitors.
基金supported in part by National Institute of Health(NIH),USA(Grant Nos.:R01GM126189,R01AI164266,and R35GM148196)the National Science Foundation,USA(Grant Nos.DMS2052983,DMS-1761320,and IIS-1900473)+3 种基金National Aero-nautics and Space Administration(NASA),USA(Grant No.:80NSSC21M0023)Michigan State University(MSU)Foundation,USA,Bristol-Myers Squibb(Grant No.:65109)USA,and Pfizer,USAsupported by the National Natural Science Foundation of China(Grant Nos.:11971367,12271416,and 11972266).
文摘Transformer models have emerged as pivotal tools within the realm of drug discovery,distinguished by their unique architectural features and exceptional performance in managing intricate data landscapes.Leveraging the innate capabilities of transformer architectures to comprehend intricate hierarchical dependencies inherent in sequential data,these models showcase remarkable efficacy across various tasks,including new drug design and drug target identification.The adaptability of pre-trained trans-former-based models renders them indispensable assets for driving data-centric advancements in drug discovery,chemistry,and biology,furnishing a robust framework that expedites innovation and dis-covery within these domains.Beyond their technical prowess,the success of transformer-based models in drug discovery,chemistry,and biology extends to their interdisciplinary potential,seamlessly combining biological,physical,chemical,and pharmacological insights to bridge gaps across diverse disciplines.This integrative approach not only enhances the depth and breadth of research endeavors but also fosters synergistic collaborations and exchange of ideas among disparate fields.In our review,we elucidate the myriad applications of transformers in drug discovery,as well as chemistry and biology,spanning from protein design and protein engineering,to molecular dynamics(MD),drug target iden-tification,transformer-enabled drug virtual screening(VS),drug lead optimization,drug addiction,small data set challenges,chemical and biological image analysis,chemical language understanding,and single cell data.Finally,we conclude the survey by deliberating on promising trends in transformer models within the context of drug discovery and other sciences.
文摘On 9 October 2024,in a high-profile vote of confidence for the promise of using artificial intelligence(AI)in scientific discovery,the Royal Swedish Academy of Sciences awarded Demis Hassabis(co-founder and chief executive officer)and John M.Jumper(direc-tor)of Google DeepMind(London,UK)the 2024 Nobel Prize in Chemistry for their pioneering work in developing the AI-powered protein structure prediction model AlphaFold2(AF2)[1].Also shar-ing the prize was David Baker(half to Hassabis and Jumper;half to Baker),professor of biochemistry at the University of Washington(Seattle,WA,USA),for his work on computational protein design that started with the mid-1990s development of Rosetta,a since-evolving suite of software tools that model protein structures using physical principles[2]-and now also AI[3].
基金supported in part by the National Science and Technology Major Project(2019ZX09201003-003,China)the Key Research and Development Program of Sichuan Province(2020YFS0271,China)+2 种基金the National Natural Science Foundation of China(81973243)the Natural Science Foundation of Sichuan Province(2025ZNSFSC0663,China)the National University of Singapore(A-8001129-00-00).
文摘Traditional development of small protein scaffolds has relied on display technologies and mutation-based engineering,which limit sequence and functional diversity,thereby constraining their therapeutic and application potential.Protein design tools have significantly advanced the creation of novel protein sequences,structures,and functions.However,further improvements in design strategies are still needed to more efficiently optimize the functional performance of protein-based drugs and enhance their druggability.Here,we extended an evolution-based design protocol to create a novel minibinder,BindHer,against the human epidermal growth factor receptor 2(HER2).It not only exhibits super stability and binding selectivity but also demonstrates remarkable properties in tissue specificity.Radiolabeling experiments with ^(99)mTc,^(68)Ga,and ^(18)F revealed that BindHer efficiently targets tumors in HER2-positive breast cancer mouse models,with minimal nonspecific liver absorption,outperforming scaffolds designed through traditional engineering.These findings highlight a new rational approach to automated protein design,offering significant potential for large-scale applications in therapeutic mini-protein development.