As a high-value eudicot family,many famous horticultural crop genomes have been deciphered in Oleaceae.However,there are currently no bioinformatics platforms focused on empowering genome research in Oleaceae.Herein,w...As a high-value eudicot family,many famous horticultural crop genomes have been deciphered in Oleaceae.However,there are currently no bioinformatics platforms focused on empowering genome research in Oleaceae.Herein,we developed the first comprehensive Oleaceae Genome Research Platform(OGRP,https://oleaceae.cgrpoee.top/).In OGRP,70 genomes of 10 Oleaceae species and 46 eudicots and 366 transcriptomes involving 18 Oleaceae plant tissues can be obtained.We built 34 window-operated bioinformatics tools,collected 38 professional practical software programs,and proposed 3 new pipelines,namely ancient polyploidization identification,ancestral karyotype reconstruction,and gene family evolution.Employing these pipelines to reanalyze the Oleaceae genomes,we clarified the polyploidization,reconstructed the ancestral karyotypes,and explored the effects of paleogenome evolution on genes with specific biological regulatory roles.Significantly,we generated a series of comparative genomic resources focusing on the Oleaceae,comprising 108 genomic synteny dot plots,1952225 collinear gene pairs,multiple genome alignments,and imprints of paleochromosome rearrangements.Moreover,in Oleaceae genomes,researchers can efficiently search for 1785987 functional annotations,22584 orthogroups,29582 important trait genes from 74 gene families,12664 transcription factor-related genes,9178872 transposable elements,and all involved regulatory pathways.In addition,we provided downloads and usage instructions for the tools,a species encyclopedia,ecological resources,relevant literatures,and external database links.In short,ORGP integrates rich data resources and powerful analytical tools with the characteristic of continuous updating,which can efficiently empower genome research and agricultural breeding in Oleaceae and other plants.展开更多
Rice is one of cereal crops and a model species for monocots.Since the release of the first draft rice genome sequences in 2002,considerable progress has been achieved in rice genomic researches,thanks to rapid develo...Rice is one of cereal crops and a model species for monocots.Since the release of the first draft rice genome sequences in 2002,considerable progress has been achieved in rice genomic researches,thanks to rapid development and efficient utilization of bioinformatics methods and tools.In this review,we summarize the progress of studies of rice genome sequencing and other omics and introduce the wellmaintained bioinformatics databases and tools developed for rice genome resources and breeding.After reviewing the history of rice bioinformatics,we use single-cell sequencing and machine learning as examples showing how bioinformatics integrates emerging technologies and how it continues to develop for future rice research.展开更多
In this editorial preface, I briefly r eview cancer bioinformatics and introduce the four articles in this special issue highlighting important applications of the field: detection of chromatin states; detection of SN...In this editorial preface, I briefly r eview cancer bioinformatics and introduce the four articles in this special issue highlighting important applications of the field: detection of chromatin states; detection of SNP- containing motifs and association with transcription factor-binding sites; improvements in functional enrichment modules; and gene association studies on aging and cancer. We expect this issue to provide bioinformatics scientists, cancer biologists, and clinical doctors with a better understanding of how cancer bioinformatics can be used to identify candidate biomarkers and targets and to conduct functional analysis.展开更多
Severe acute respiratory syndrome coronavirus(SARS-CoV)and SARS-CoV-2 are thought to transmit to humans via wild mammals,especially bats.However,evidence for direct bat-to-human transmission is lacking.Involvement of ...Severe acute respiratory syndrome coronavirus(SARS-CoV)and SARS-CoV-2 are thought to transmit to humans via wild mammals,especially bats.However,evidence for direct bat-to-human transmission is lacking.Involvement of intermediate hosts is considered a reason for SARS-CoV-2 transmission to humans and emergence of outbreak.Large biodiversity is found in tropical territories,such as Brazil.On the similar line,this study aimed to predict potential coronavirus hosts among Brazilian wild mammals based on angiotensin-converting enzyme 2(ACE2)sequences using evolutionary bioinformatics.Cougar,maned wolf,and bush dogs were predicted as potential hosts for coronavirus.These indigenous carnivores are philogenetically closer to the known SARS-CoV/SARS-CoV-2 hosts and presented low ACE2 divergence.A new coronavirus transmission chain was developed in which white-tailed deer,a susceptible SARS-CoV-2 host,have the central position.Cougar play an important role because of its low divergent ACE2 level in deer and humans.The discovery of these potential coronavirus hosts will be useful for epidemiological surveillance and discovery of interventions that can contribute to break the transmission chain.展开更多
Bioinformatics analysis often requires the filtering of multi-datasets,based on frequency or frequency of occurrence,for decisions on retention or deletion.Existing tools for this purpose often present a challenge wit...Bioinformatics analysis often requires the filtering of multi-datasets,based on frequency or frequency of occurrence,for decisions on retention or deletion.Existing tools for this purpose often present a challenge with complex installation,which necessitate custom coding,thereby impeding efficient data processing activities.To address this issue,Filterx,a user-friendly command line tool that written in C language,was developed that supports multi-condition filtering,based on frequency or occurrence.This tool enables users to complete the data processing tasks through a simple command line,greatly reducing both workload and data processing time.In addition,future development of this tool could facilitate its integration into various bioinformatics data analysis pipelines.展开更多
Transformer-based foundation models such as ChatGPTs have revolutionized our daily life and affected many fields including bioinformatics.In this perspective,we first discuss about the direct application of textual fo...Transformer-based foundation models such as ChatGPTs have revolutionized our daily life and affected many fields including bioinformatics.In this perspective,we first discuss about the direct application of textual foundation models on bioinformatics tasks,focusing on how to make the most out of canonical large language models and mitigate their inherent flaws.Meanwhile,we go through the transformer-based,bioinformaticstailored foundation models for both sequence and non-sequence data.In particular,we envision the further development directions as well as challenges for bioinformatics foundation models.展开更多
In the year 1971,the world’s biggest structural biology collaboration name—The Research Collaboratory for Structural Bioinformatics(RCSB),was formed to gather all the structural biologists at a single platform and t...In the year 1971,the world’s biggest structural biology collaboration name—The Research Collaboratory for Structural Bioinformatics(RCSB),was formed to gather all the structural biologists at a single platform and then extended out to be the world’s most extensive structural data repository named RCSB-Protein Data Bank(PDB)(https://www.rcsb.org/)that has provided the service for more than 50 years and continues its legacy for the discoveries and repositories for structural data.The RCSB has evolved from being a collaboratory network to a full-fledged database and tool with a huge list of protein structures,nucleic acid-containing structures,ModelArchive,and AlphaFold structures,and the best is that it is expanding day by day with computational advancement with tools and visual experiences.In this review article,we have discussed how RCSB has been a successful collaboratory network,its expansion in each decade,and how it has helped the ground-breaking research.The PDB tools that are helping the researchers,yearly data deposition,validation,processing,and suggestions that can help the developer improve for upcoming years are also discussed.This review will help future researchers understand the complete history of RCSB and its developments in each decade and how various future collaborative networks can be developed in various scientific areas and can be successful by keeping RCSB as a case study.展开更多
Realizing personalized medicine requires integrating diverse data types with bioinformatics.The most vital data are genomic information for individuals that are from advanced next-generation sequencing(NGS) technologi...Realizing personalized medicine requires integrating diverse data types with bioinformatics.The most vital data are genomic information for individuals that are from advanced next-generation sequencing(NGS) technologies at present.The technologies continue to advance in terms of both decreasing cost and sequencing speed with concomitant increase in the amount and complexity of the data.The prodigious data together with the requisite computational pipelines for data analysis and interpretation are stressors to IT infrastructure and the scientists conducting the work alike.Bioinformatics is increasingly becoming the rate-limiting step with numerous challenges to be overcome for translating NGS data for personalized medicine.We review some key bioinformatics tasks,issues,and challenges in contexts of IT requirements,data quality,analysis tools and pipelines,and validation of biomarkers.展开更多
Though a relatively young discipline, translational bioinformatics (TBI) has become a key component of biomedical research in the era of precision medicine. Development of high-throughput technologies and electronic...Though a relatively young discipline, translational bioinformatics (TBI) has become a key component of biomedical research in the era of precision medicine. Development of high-throughput technologies and electronic health records has caused a paradigm shift in both healthcare and biomedical research. Novel tools and methods are required to convert increasingly voluminous datasets into information and actionable knowledge. This review provides a definition and contex- tualization of the term TBI, describes the discipline's brief history and past accomplishments, as well as current loci, and concludes with predictions of future directions in the field.展开更多
Natural products are among the most important sources of lead molecules for drug discovery.With the development of affordable whole-genome sequencing technologies and other‘omics tools,the field of natural products r...Natural products are among the most important sources of lead molecules for drug discovery.With the development of affordable whole-genome sequencing technologies and other‘omics tools,the field of natural products research is currently undergoing a shift in paradigms.While,for decades,mainly analytical and chemical methods gave access to this group of compounds,nowadays genomics-based methods offer complementary approaches to find,identify and characterize such molecules.This paradigm shift also resulted in a high demand for computational tools to assist researchers in their daily work.In this context,this review gives a summary of tools and databases that currently are available to mine,identify and characterize natural product biosynthesis pathways and their producers based on‘omics data.A web portal called Secondary Metabolite Bioinformatics Portal(SMBP at http://www.secondarymetabolites.org)is introduced to provide a one-stop catalog and links to these bioinformatics resources.In addition,an outlook is presented how the existing tools and those to be developed will influence synthetic biology approaches in the natural products field.展开更多
In the 2017 first issue of this Journal - Genomes, Proteomes and Bioinformatics - a special database article entitled "GSA: Gen- ome Sequence Archive" is published. This article provides a brief introduction to th...In the 2017 first issue of this Journal - Genomes, Proteomes and Bioinformatics - a special database article entitled "GSA: Gen- ome Sequence Archive" is published. This article provides a brief introduction to the platform developed by the authors from the BIG Data Center (BIGD) of Beijing Institute of Genomics (BIG), Chinese Academy of Sciences (CAS). The aim of the GSA project is to collect, integrate, and archive raw sequence data submitted by domestic and international users. It is one of the major activities being carried on by a team of around 50 young bioinformaticians at BIGD. In addition to the GSA system, they are also working on several bioinformatics service-orientated projects as described in one of their recent publications .展开更多
The research and development of new traditional Chinese medicine(TCM)drugs have progressively established a novel system founded on the integration of TCM theory,human experience,and clinical trials(termed the“Three ...The research and development of new traditional Chinese medicine(TCM)drugs have progressively established a novel system founded on the integration of TCM theory,human experience,and clinical trials(termed the“Three Combinations”).However,considering TCM's distinctive features of“syndrome differentiation and treatment”and“multicomponent formulations and complex mechanisms”,current TCM drug development faces challenges such as insufficient understanding of the material basis and the overall mechanism of action and an incomplete evidence chain system.Moreover,significant obstacles persist in gathering human experience data,evaluating clinical efficacy,and controlling the quality of active ingredients,which impede the innovation process in TCM drug development.Network pharmacology,centered on the“network targets”theory,transcends the limitations of the conventional“single target”reductionist research model.It emphasizes the comprehensive effects of disease or syndrome biological networks as targets to elucidate the overall regulatory mechanism of TCM prescriptions.This approach aligns with the holistic perspective of TCM,offering a novel method consistent with TCM's holistic view for investigating the complex mechanisms of TCM and developing new TCM drugs.It is internationally recognized as a“next-generation drug research model”.To advance the research of new tools,methods,and standards for TCM evaluation and to overcome fundamental,critical,and cutting-edge technical challenges in TCM regulation,this consensus aims to explore the characteristics,progress,challenges,applicable pathways,and specific applications of network pharmacology as a new theory,method,and tool in TCM drug development.The goal is to enhance the quality of TCM drug research and development and accelerate the efficiency of developing new TCM products.展开更多
Corticotomy is a clinical procedure to accelerate orthodontic tooth movement characterized by the regional acceleratory phenomenon(RAP).Despite its therapeutic effects,the surgical risk and unclear mechanism hamper th...Corticotomy is a clinical procedure to accelerate orthodontic tooth movement characterized by the regional acceleratory phenomenon(RAP).Despite its therapeutic effects,the surgical risk and unclear mechanism hamper the clinical application.Numerous evidences support macrophages as the key immune cells during bone remodeling.Our study discovered that the monocyte-derived macrophages primarily exhibited a pro-inflammatory phenotype that dominated bone remodeling in corticotomy by CX3CR1CreERT2;R26GFP lineage tracing system.Fluorescence staining,flow cytometry analysis,and western blot determined the significantly enhanced expression of binding immunoglobulin protein(BiP)and emphasized the activation of sensor activating transcription factor 6(ATF6)in macrophages.Then,we verified that macrophage specific ATF6 deletion(ATF6f/f;CX3CR1CreERT2 mice)decreased the proportion of pro-inflammatory macrophages and therefore blocked the acceleration effect of corticotomy.In contrast,macrophage ATF6 overexpression exaggerated the acceleration of orthodontic tooth movement.In vitro experiments also proved that higher proportion of pro-inflammatory macrophages was positively correlated with higher expression of ATF6.At the mechanism level,RNA-seq and CUT&Tag analysis demonstrated that ATF6 modulated the macrophage-orchestrated inflammation through interacting with Tnfαpromotor and augmenting its transcription.Additionally,molecular docking simulation and dual-luciferase reporter system indicated the possible binding sites outside of the traditional endoplasmic reticulum-stress response element(ERSE).Taken together,ATF6 may aggravate orthodontic bone remodeling by promoting Tnfαtranscription in macrophages,suggesting that ATF6 may represent a promising therapeutic target for non-invasive accelerated orthodontics.展开更多
Our previous studies have reported that activation of the NLRP3(NOD-,LRR-and pyrin domain-containing protein 3)-inflammasome complex in ethanol-treated astrocytes and chronic alcohol-fed mice could be associated with ...Our previous studies have reported that activation of the NLRP3(NOD-,LRR-and pyrin domain-containing protein 3)-inflammasome complex in ethanol-treated astrocytes and chronic alcohol-fed mice could be associated with neuroinflammation and brain damage.Mesenchymal stem cell-derived extracellular vesicles(MSC-EVs)have been shown to restore the neuroinflammatory response,along with myelin and synaptic structural alterations in the prefrontal cortex,and alleviate cognitive and memory dysfunctions induced by binge-like ethanol treatment in adolescent mice.Considering the therapeutic role of the molecules contained in mesenchymal stem cell-derived extracellular vesicles,the present study analyzed whether the administration of mesenchymal stem cell-derived extracellular vesicles isolated from adipose tissue,which inhibited the activation of the NLRP3 inflammasome,was capable of reducing hippocampal neuroinflammation in adolescent mice treated with binge drinking.We demonstrated that the administration of mesenchymal stem cell-derived extracellular vesicles ameliorated the activation of the hippocampal NLRP3 inflammasome complex and other NLRs inflammasomes(e.g.,pyrin domain-containing 1,caspase recruitment domain-containing 4,and absent in melanoma 2,as well as the alterations in inflammatory genes(interleukin-1β,interleukin-18,inducible nitric oxide synthase,nuclear factor-kappa B,monocyte chemoattractant protein-1,and C–X3–C motif chemokine ligand 1)and miRNAs(miR-21a-5p,miR-146a-5p,and miR-141-5p)induced by binge-like ethanol treatment in adolescent mice.Bioinformatic analysis further revealed the involvement of miR-21a-5p and miR-146a-5p with inflammatory target genes and NOD-like receptor signaling pathways.Taken together,these findings provide novel evidence of the therapeutic potential of MSC-derived EVs to ameliorate the hippocampal neuroinflammatory response associated with NLRP3 inflammasome activation induced by binge drinking in adolescence.展开更多
Combined with elastic network model(ENM),the perturbation response scanning(PRS)has emerged as a robust technique for pinpointing allosteric interactions within proteins.Here,we proposed the PRS analysis of drug-targe...Combined with elastic network model(ENM),the perturbation response scanning(PRS)has emerged as a robust technique for pinpointing allosteric interactions within proteins.Here,we proposed the PRS analysis of drug-target networks(DTNs),which could provide a promising avenue in network medicine.We demonstrated the utility of the method by introducing a deep learning and network perturbation-based framework,for drug repurposing of multiple sclerosis(MS).First,the MS comorbidity network was constructed by performing a random walk with restart algorithm based on shared genes between MS and other diseases as seed nodes.Then,based on topological analysis and functional annotation,the neurotransmission module was identified as the“therapeutic module”of MS.Further,perturbation scores of drugs on the module were calculated by constructing the DTN and introducing the PRS analysis,giving a list of repurposable drugs for MS.Mechanism of action analysis both at pathway and structural levels screened dihydroergocristine as a candidate drug of MS by targeting a serotonin receptor of se-rotonin 2B receptor(HTR2B).Finally,we established a cuprizone-induced chronic mouse model to evaluate the alteration of HTR2B in mouse brain regions and observed that HTR2B was significantly reduced in the cuprizone-induced mouse cortex.These findings proved that the network perturbation modeling is a promising avenue for drug repurposing of MS.As a useful systematic method,our approach can also be used to discover the new molecular mechanism and provide effective candidate drugs for other complex diseases.展开更多
Comprehensive studies identify motor neuron spectrum disorders including amyotrophic lateral sclerosis(ALS)as globally rising fatal disorders with the highest prevalence in aging populations,influenced by ethnicity an...Comprehensive studies identify motor neuron spectrum disorders including amyotrophic lateral sclerosis(ALS)as globally rising fatal disorders with the highest prevalence in aging populations,influenced by ethnicity and ancestry(GBD 2016 Motor Neuron Disease Colla borators,2018).While~10% of diagnoses involve a family history(fALS),most cases are considered sporadic(sALS).However,population-based studies suggest that even cases without a common index mutation impart heritability(Ryan et al.,2019),indicating a crucial role of rare and as yet unknown genetic denominators.展开更多
We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database(YPED) that is used by investigators at more than 300 institutions worldwide. YPED ...We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database(YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a singlelaboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography–tandem mass spectrometry(LC–MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring(MRM)/selective reaction monitoring(SRM) assay development. We have linked YPED's database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results.展开更多
Silicon stands as a key anode material in lithium-ion battery ascribing to its high energy density.Nevertheless,the poor rate performance and limited cycling life remain unresolved through conventional approaches that...Silicon stands as a key anode material in lithium-ion battery ascribing to its high energy density.Nevertheless,the poor rate performance and limited cycling life remain unresolved through conventional approaches that involve carbon composites or nanostructures,primarily due to the un-controllable effects arising from the substantial formation of a solid electrolyte interphase(SEI)during the cycling.Here,an ultra-thin and homogeneous Ti doping alumina oxide catalytic interface is meticulously applied on the porous Si through a synergistic etching and hydrolysis process.This defect-rich oxide interface promotes a selective adsorption of fluoroethylene carbonate,leading to a catalytic reaction that can be aptly described as“molecular concentration-in situ conversion”.The resultant inorganic-rich SEI layer is electrochemical stable and favors ion-transport,particularly at high-rate cycling and high temperature.The robustly shielded porous Si,with a large surface area,achieves a high initial Coulombic efficiency of 84.7%and delivers exceptional high-rate performance at 25 A g^(−1)(692 mAh g^(−1))and a high Coulombic efficiency of 99.7%over 1000 cycles.The robust SEI constructed through a precious catalytic layer promises significant advantages for the fast development of silicon-based anode in fast-charging batteries.展开更多
INTRODUCTION.Depressive disorders are mental illnesses that seriously affect public health.There are approximately 320 million patients with depression worldwide,accounting for 4.4% of the total disease burden.1Depres...INTRODUCTION.Depressive disorders are mental illnesses that seriously affect public health.There are approximately 320 million patients with depression worldwide,accounting for 4.4% of the total disease burden.1Depression leads to social and occupational impairment,diminished quality of life and an elevated risk of death by suicide.展开更多
Traditional Chinese medicine(TCM)demonstrates distinctive advantages in disease prevention and treatment.However,analyzing its biological mechanisms through the modern medical research paradigm of“single drug,single ...Traditional Chinese medicine(TCM)demonstrates distinctive advantages in disease prevention and treatment.However,analyzing its biological mechanisms through the modern medical research paradigm of“single drug,single target”presents significant challenges due to its holistic approach.Network pharmacology and its core theory of network targets connect drugs and diseases from a holistic and systematic perspective based on biological networks,overcoming the limitations of reductionist research models and showing considerable value in TCM research.Recent integration of network target computational and experimental methods with artificial intelligence(AI)and multi-modal multi-omics technologies has substantially enhanced network pharmacology methodology.The advancement in computational and experimental techniques provides complementary support for network target theory in decoding TCM principles.This review,centered on network targets,examines the progress of network target methods combined with AI in predicting disease molecular mechanisms and drug-target relationships,alongside the application of multi-modal multi-omics technologies in analyzing TCM formulae,syndromes,and toxicity.Looking forward,network target theory is expected to incorporate emerging technologies while developing novel approaches aligned with its unique characteristics,potentially leading to significant breakthroughs in TCM research and advancing scientific understanding and innovation in TCM.展开更多
基金supported by the National Natural Science Foundation of China(32470676 and 32170236)Central Guidance on Local Science and Technology Development Fund of Hebei Province(246Z2508G)+2 种基金Hebei Natural Science Foundation(C2020209064)Tangshan Science and Technology Program Project(21130217C)Key research project of North China University of Science and Technology(ZD-YG-202313-23).
文摘As a high-value eudicot family,many famous horticultural crop genomes have been deciphered in Oleaceae.However,there are currently no bioinformatics platforms focused on empowering genome research in Oleaceae.Herein,we developed the first comprehensive Oleaceae Genome Research Platform(OGRP,https://oleaceae.cgrpoee.top/).In OGRP,70 genomes of 10 Oleaceae species and 46 eudicots and 366 transcriptomes involving 18 Oleaceae plant tissues can be obtained.We built 34 window-operated bioinformatics tools,collected 38 professional practical software programs,and proposed 3 new pipelines,namely ancient polyploidization identification,ancestral karyotype reconstruction,and gene family evolution.Employing these pipelines to reanalyze the Oleaceae genomes,we clarified the polyploidization,reconstructed the ancestral karyotypes,and explored the effects of paleogenome evolution on genes with specific biological regulatory roles.Significantly,we generated a series of comparative genomic resources focusing on the Oleaceae,comprising 108 genomic synteny dot plots,1952225 collinear gene pairs,multiple genome alignments,and imprints of paleochromosome rearrangements.Moreover,in Oleaceae genomes,researchers can efficiently search for 1785987 functional annotations,22584 orthogroups,29582 important trait genes from 74 gene families,12664 transcription factor-related genes,9178872 transposable elements,and all involved regulatory pathways.In addition,we provided downloads and usage instructions for the tools,a species encyclopedia,ecological resources,relevant literatures,and external database links.In short,ORGP integrates rich data resources and powerful analytical tools with the characteristic of continuous updating,which can efficiently empower genome research and agricultural breeding in Oleaceae and other plants.
基金supported by the National Natural Science Foundation of China(31971865)Zhejiang Natural Science Foundation(LZ17C130001)+1 种基金the Innovation Method Project of China(2018IM0301002)the Jiangsu Collaborative Innovation Center for Modern Crop Production。
文摘Rice is one of cereal crops and a model species for monocots.Since the release of the first draft rice genome sequences in 2002,considerable progress has been achieved in rice genomic researches,thanks to rapid development and efficient utilization of bioinformatics methods and tools.In this review,we summarize the progress of studies of rice genome sequencing and other omics and introduce the wellmaintained bioinformatics databases and tools developed for rice genome resources and breeding.After reviewing the history of rice bioinformatics,we use single-cell sequencing and machine learning as examples showing how bioinformatics integrates emerging technologies and how it continues to develop for future rice research.
文摘In this editorial preface, I briefly r eview cancer bioinformatics and introduce the four articles in this special issue highlighting important applications of the field: detection of chromatin states; detection of SNP- containing motifs and association with transcription factor-binding sites; improvements in functional enrichment modules; and gene association studies on aging and cancer. We expect this issue to provide bioinformatics scientists, cancer biologists, and clinical doctors with a better understanding of how cancer bioinformatics can be used to identify candidate biomarkers and targets and to conduct functional analysis.
文摘Severe acute respiratory syndrome coronavirus(SARS-CoV)and SARS-CoV-2 are thought to transmit to humans via wild mammals,especially bats.However,evidence for direct bat-to-human transmission is lacking.Involvement of intermediate hosts is considered a reason for SARS-CoV-2 transmission to humans and emergence of outbreak.Large biodiversity is found in tropical territories,such as Brazil.On the similar line,this study aimed to predict potential coronavirus hosts among Brazilian wild mammals based on angiotensin-converting enzyme 2(ACE2)sequences using evolutionary bioinformatics.Cougar,maned wolf,and bush dogs were predicted as potential hosts for coronavirus.These indigenous carnivores are philogenetically closer to the known SARS-CoV/SARS-CoV-2 hosts and presented low ACE2 divergence.A new coronavirus transmission chain was developed in which white-tailed deer,a susceptible SARS-CoV-2 host,have the central position.Cougar play an important role because of its low divergent ACE2 level in deer and humans.The discovery of these potential coronavirus hosts will be useful for epidemiological surveillance and discovery of interventions that can contribute to break the transmission chain.
基金supported by grant CNTC-110202101039(JY-16)and YNTC-2022530000241008.
文摘Bioinformatics analysis often requires the filtering of multi-datasets,based on frequency or frequency of occurrence,for decisions on retention or deletion.Existing tools for this purpose often present a challenge with complex installation,which necessitate custom coding,thereby impeding efficient data processing activities.To address this issue,Filterx,a user-friendly command line tool that written in C language,was developed that supports multi-condition filtering,based on frequency or occurrence.This tool enables users to complete the data processing tasks through a simple command line,greatly reducing both workload and data processing time.In addition,future development of this tool could facilitate its integration into various bioinformatics data analysis pipelines.
基金National Key Research and Development Program of China,Grant/Award Number:2022ZD0115004。
文摘Transformer-based foundation models such as ChatGPTs have revolutionized our daily life and affected many fields including bioinformatics.In this perspective,we first discuss about the direct application of textual foundation models on bioinformatics tasks,focusing on how to make the most out of canonical large language models and mitigate their inherent flaws.Meanwhile,we go through the transformer-based,bioinformaticstailored foundation models for both sequence and non-sequence data.In particular,we envision the further development directions as well as challenges for bioinformatics foundation models.
文摘In the year 1971,the world’s biggest structural biology collaboration name—The Research Collaboratory for Structural Bioinformatics(RCSB),was formed to gather all the structural biologists at a single platform and then extended out to be the world’s most extensive structural data repository named RCSB-Protein Data Bank(PDB)(https://www.rcsb.org/)that has provided the service for more than 50 years and continues its legacy for the discoveries and repositories for structural data.The RCSB has evolved from being a collaboratory network to a full-fledged database and tool with a huge list of protein structures,nucleic acid-containing structures,ModelArchive,and AlphaFold structures,and the best is that it is expanding day by day with computational advancement with tools and visual experiences.In this review article,we have discussed how RCSB has been a successful collaboratory network,its expansion in each decade,and how it has helped the ground-breaking research.The PDB tools that are helping the researchers,yearly data deposition,validation,processing,and suggestions that can help the developer improve for upcoming years are also discussed.This review will help future researchers understand the complete history of RCSB and its developments in each decade and how various future collaborative networks can be developed in various scientific areas and can be successful by keeping RCSB as a case study.
文摘Realizing personalized medicine requires integrating diverse data types with bioinformatics.The most vital data are genomic information for individuals that are from advanced next-generation sequencing(NGS) technologies at present.The technologies continue to advance in terms of both decreasing cost and sequencing speed with concomitant increase in the amount and complexity of the data.The prodigious data together with the requisite computational pipelines for data analysis and interpretation are stressors to IT infrastructure and the scientists conducting the work alike.Bioinformatics is increasingly becoming the rate-limiting step with numerous challenges to be overcome for translating NGS data for personalized medicine.We review some key bioinformatics tasks,issues,and challenges in contexts of IT requirements,data quality,analysis tools and pipelines,and validation of biomarkers.
基金supported in part by the Clinical and Translational Science Award(Grant No.UL1TR001117)to Duke University from the National Institutes of Health(NIH),United States
文摘Though a relatively young discipline, translational bioinformatics (TBI) has become a key component of biomedical research in the era of precision medicine. Development of high-throughput technologies and electronic health records has caused a paradigm shift in both healthcare and biomedical research. Novel tools and methods are required to convert increasingly voluminous datasets into information and actionable knowledge. This review provides a definition and contex- tualization of the term TBI, describes the discipline's brief history and past accomplishments, as well as current loci, and concludes with predictions of future directions in the field.
文摘Natural products are among the most important sources of lead molecules for drug discovery.With the development of affordable whole-genome sequencing technologies and other‘omics tools,the field of natural products research is currently undergoing a shift in paradigms.While,for decades,mainly analytical and chemical methods gave access to this group of compounds,nowadays genomics-based methods offer complementary approaches to find,identify and characterize such molecules.This paradigm shift also resulted in a high demand for computational tools to assist researchers in their daily work.In this context,this review gives a summary of tools and databases that currently are available to mine,identify and characterize natural product biosynthesis pathways and their producers based on‘omics data.A web portal called Secondary Metabolite Bioinformatics Portal(SMBP at http://www.secondarymetabolites.org)is introduced to provide a one-stop catalog and links to these bioinformatics resources.In addition,an outlook is presented how the existing tools and those to be developed will influence synthetic biology approaches in the natural products field.
文摘In the 2017 first issue of this Journal - Genomes, Proteomes and Bioinformatics - a special database article entitled "GSA: Gen- ome Sequence Archive" is published. This article provides a brief introduction to the platform developed by the authors from the BIG Data Center (BIGD) of Beijing Institute of Genomics (BIG), Chinese Academy of Sciences (CAS). The aim of the GSA project is to collect, integrate, and archive raw sequence data submitted by domestic and international users. It is one of the major activities being carried on by a team of around 50 young bioinformaticians at BIGD. In addition to the GSA system, they are also working on several bioinformatics service-orientated projects as described in one of their recent publications .
基金supported by the National Medical Products Administration Commissioned Research Project (No.20211440216)the National Administration of Traditional Chinese Medicine Science and Technology Project (No.GZY-KJS-2024-03)+3 种基金the State Key Laboratory of Drug Regulatory Science Project (No.2023SKLDRS0104)the Basic Research Program Natural Science Fund-Frontier Leading Technology Basic Research Special Project of Jiangsu Province (No.BK20232014)the Programs Foundation for Leading Talents in National Administration of Traditional Chinese Medicine of China“Qihuang scholars”Projectthe Tianjin Administration for Market Regulation Science and Technology Key Projects (No.2022-W35)。
文摘The research and development of new traditional Chinese medicine(TCM)drugs have progressively established a novel system founded on the integration of TCM theory,human experience,and clinical trials(termed the“Three Combinations”).However,considering TCM's distinctive features of“syndrome differentiation and treatment”and“multicomponent formulations and complex mechanisms”,current TCM drug development faces challenges such as insufficient understanding of the material basis and the overall mechanism of action and an incomplete evidence chain system.Moreover,significant obstacles persist in gathering human experience data,evaluating clinical efficacy,and controlling the quality of active ingredients,which impede the innovation process in TCM drug development.Network pharmacology,centered on the“network targets”theory,transcends the limitations of the conventional“single target”reductionist research model.It emphasizes the comprehensive effects of disease or syndrome biological networks as targets to elucidate the overall regulatory mechanism of TCM prescriptions.This approach aligns with the holistic perspective of TCM,offering a novel method consistent with TCM's holistic view for investigating the complex mechanisms of TCM and developing new TCM drugs.It is internationally recognized as a“next-generation drug research model”.To advance the research of new tools,methods,and standards for TCM evaluation and to overcome fundamental,critical,and cutting-edge technical challenges in TCM regulation,this consensus aims to explore the characteristics,progress,challenges,applicable pathways,and specific applications of network pharmacology as a new theory,method,and tool in TCM drug development.The goal is to enhance the quality of TCM drug research and development and accelerate the efficiency of developing new TCM products.
基金supported by the National Natural Science Foundation of China(82071143,82371000,82270361)Key Research and Development Program of Jiangsu Province(BE2022795)+2 种基金the Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX22_1801)the Jiangsu Province Capability Improvement Project through the Science,Technology and Education-Jiangsu Provincial Research Hospital Cultivation Unit(YJXYYJSDW4)Jiangsu Provincial Medical Innovation Center(CXZX202227).
文摘Corticotomy is a clinical procedure to accelerate orthodontic tooth movement characterized by the regional acceleratory phenomenon(RAP).Despite its therapeutic effects,the surgical risk and unclear mechanism hamper the clinical application.Numerous evidences support macrophages as the key immune cells during bone remodeling.Our study discovered that the monocyte-derived macrophages primarily exhibited a pro-inflammatory phenotype that dominated bone remodeling in corticotomy by CX3CR1CreERT2;R26GFP lineage tracing system.Fluorescence staining,flow cytometry analysis,and western blot determined the significantly enhanced expression of binding immunoglobulin protein(BiP)and emphasized the activation of sensor activating transcription factor 6(ATF6)in macrophages.Then,we verified that macrophage specific ATF6 deletion(ATF6f/f;CX3CR1CreERT2 mice)decreased the proportion of pro-inflammatory macrophages and therefore blocked the acceleration effect of corticotomy.In contrast,macrophage ATF6 overexpression exaggerated the acceleration of orthodontic tooth movement.In vitro experiments also proved that higher proportion of pro-inflammatory macrophages was positively correlated with higher expression of ATF6.At the mechanism level,RNA-seq and CUT&Tag analysis demonstrated that ATF6 modulated the macrophage-orchestrated inflammation through interacting with Tnfαpromotor and augmenting its transcription.Additionally,molecular docking simulation and dual-luciferase reporter system indicated the possible binding sites outside of the traditional endoplasmic reticulum-stress response element(ERSE).Taken together,ATF6 may aggravate orthodontic bone remodeling by promoting Tnfαtranscription in macrophages,suggesting that ATF6 may represent a promising therapeutic target for non-invasive accelerated orthodontics.
基金supported by grants from the Spanish Ministry of Health-PNSD(2019-I039 and 2023-I024)(to MP)FEDER/Ministerio de Ciencia e Innovación-Agencia Estatal de Investigación PID2021-1243590B-I100(to VMM)+2 种基金GVA(CIAICO/2021/203)(to MP)the Primary Addiction Care Research Network(RD21/0009/0005)(to MP)a predoctoral fellowship from the Generalitat Valenciana(ACIF/2021/338)(to CPC).
文摘Our previous studies have reported that activation of the NLRP3(NOD-,LRR-and pyrin domain-containing protein 3)-inflammasome complex in ethanol-treated astrocytes and chronic alcohol-fed mice could be associated with neuroinflammation and brain damage.Mesenchymal stem cell-derived extracellular vesicles(MSC-EVs)have been shown to restore the neuroinflammatory response,along with myelin and synaptic structural alterations in the prefrontal cortex,and alleviate cognitive and memory dysfunctions induced by binge-like ethanol treatment in adolescent mice.Considering the therapeutic role of the molecules contained in mesenchymal stem cell-derived extracellular vesicles,the present study analyzed whether the administration of mesenchymal stem cell-derived extracellular vesicles isolated from adipose tissue,which inhibited the activation of the NLRP3 inflammasome,was capable of reducing hippocampal neuroinflammation in adolescent mice treated with binge drinking.We demonstrated that the administration of mesenchymal stem cell-derived extracellular vesicles ameliorated the activation of the hippocampal NLRP3 inflammasome complex and other NLRs inflammasomes(e.g.,pyrin domain-containing 1,caspase recruitment domain-containing 4,and absent in melanoma 2,as well as the alterations in inflammatory genes(interleukin-1β,interleukin-18,inducible nitric oxide synthase,nuclear factor-kappa B,monocyte chemoattractant protein-1,and C–X3–C motif chemokine ligand 1)and miRNAs(miR-21a-5p,miR-146a-5p,and miR-141-5p)induced by binge-like ethanol treatment in adolescent mice.Bioinformatic analysis further revealed the involvement of miR-21a-5p and miR-146a-5p with inflammatory target genes and NOD-like receptor signaling pathways.Taken together,these findings provide novel evidence of the therapeutic potential of MSC-derived EVs to ameliorate the hippocampal neuroinflammatory response associated with NLRP3 inflammasome activation induced by binge drinking in adolescence.
基金supported by the National Natural Science Foundation of China(Grant Nos.:32271292,31872723,32200778,and 22377089)the Jiangsu Students Innovation and Entrepre-neurship Training Program,China(Program No.:202210285081Z)+6 种基金the Project of MOE Key Laboratory of Geriatric Diseases and Immunology,China(Project No.:JYN202404)Proj-ect Funded by the Priority Academic Program Development(PAPD)of Jiangsu Higher Education Institutions,Natural Science Foundation of Jiangsu Province,China(Project No.:BK20220494)Suzhou Medical and Health Technology Innovation Project,China(Grant No.:SKY2022107)the Clinical Research Center of Neuro-logical Disease in The Second Affiliated Hospital of Soochow University,China(Grant No.:ND2022A04)State Key Laboratory of Drug Research(Grant No.:SKLDR-2023-KF-05)Jiangsu Shuang-chuang Program for Doctor,Young Science Talents Promotion Project of Jiangsu Science and Technology Association(Program No.:TJ-2023-019)Young Science Talents Promotion Project of Suzhou Science and Technology Association,Suzhou International Joint Laboratory for Diagnosis and Treatment of Brain Diseases,and startup funding(Grant Nos.:NH21500221,NH21500122,and NH21500123)to Qifei Cong.
文摘Combined with elastic network model(ENM),the perturbation response scanning(PRS)has emerged as a robust technique for pinpointing allosteric interactions within proteins.Here,we proposed the PRS analysis of drug-target networks(DTNs),which could provide a promising avenue in network medicine.We demonstrated the utility of the method by introducing a deep learning and network perturbation-based framework,for drug repurposing of multiple sclerosis(MS).First,the MS comorbidity network was constructed by performing a random walk with restart algorithm based on shared genes between MS and other diseases as seed nodes.Then,based on topological analysis and functional annotation,the neurotransmission module was identified as the“therapeutic module”of MS.Further,perturbation scores of drugs on the module were calculated by constructing the DTN and introducing the PRS analysis,giving a list of repurposable drugs for MS.Mechanism of action analysis both at pathway and structural levels screened dihydroergocristine as a candidate drug of MS by targeting a serotonin receptor of se-rotonin 2B receptor(HTR2B).Finally,we established a cuprizone-induced chronic mouse model to evaluate the alteration of HTR2B in mouse brain regions and observed that HTR2B was significantly reduced in the cuprizone-induced mouse cortex.These findings proved that the network perturbation modeling is a promising avenue for drug repurposing of MS.As a useful systematic method,our approach can also be used to discover the new molecular mechanism and provide effective candidate drugs for other complex diseases.
基金The lab of AK obtained support from the Interdisciplinary Center for Clinical Research(IZKF)Jena(MSPProject ID:MSP09)+2 种基金DG and MJA B were supported by the Circular Vision project,which has received funding from the European Union's Horizon 2020 research and innovation program(Grant agreement No.899417)the Ministerio de Ciencia e Innovoción,Spain(Grant No.PID2020-119715GB-I00/AEI/10.13039/501100011033)the Instituto de Salud CarlosⅢ,Infrastructure of Precision Medicine associated with Science and Technology(IMPaCT)of the Strategic Action in Health(iDATAMP)(to MJAB)。
文摘Comprehensive studies identify motor neuron spectrum disorders including amyotrophic lateral sclerosis(ALS)as globally rising fatal disorders with the highest prevalence in aging populations,influenced by ethnicity and ancestry(GBD 2016 Motor Neuron Disease Colla borators,2018).While~10% of diagnoses involve a family history(fALS),most cases are considered sporadic(sALS).However,population-based studies suggest that even cases without a common index mutation impart heritability(Ryan et al.,2019),indicating a crucial role of rare and as yet unknown genetic denominators.
基金supported in part by the National Institutes of Health of the United States(Grant Nos.UL1 RR024139 to Yale Clinical and Translational Science Award,1S10OD018034-01 to 6500 QTrap Mass Spectrometer for Yale University,1S10RR026707-01 to 5500QTrap Mass Spectrometer for Yale University,P30DA018343 to Yale/NIDA Neuroproteomics Center and NIDDK-K01DK089006 awarded to JR)
文摘We report a significantly-enhanced bioinformatics suite and database for proteomics research called Yale Protein Expression Database(YPED) that is used by investigators at more than 300 institutions worldwide. YPED meets the data management, archival, and analysis needs of a high-throughput mass spectrometry-based proteomics research ranging from a singlelaboratory, group of laboratories within and beyond an institution, to the entire proteomics community. The current version is a significant improvement over the first version in that it contains new modules for liquid chromatography–tandem mass spectrometry(LC–MS/MS) database search results, label and label-free quantitative proteomic analysis, and several scoring outputs for phosphopeptide site localization. In addition, we have added both peptide and protein comparative analysis tools to enable pairwise analysis of distinct peptides/proteins in each sample and of overlapping peptides/proteins between all samples in multiple datasets. We have also implemented a targeted proteomics module for automated multiple reaction monitoring(MRM)/selective reaction monitoring(SRM) assay development. We have linked YPED's database search results and both label-based and label-free fold-change analysis to the Skyline Panorama repository for online spectra visualization. In addition, we have built enhanced functionality to curate peptide identifications into an MS/MS peptide spectral library for all of our protein database search identification results.
基金the National Key R&D Plan of the Ministry of Science and Technology of China(2022YFE0122400)National Natural Science Foundation of China(52002238,22102207)+1 种基金Science and Technology Commission of Shanghai Municipality(22ZR1423800,21ZR1465200,23ZR1423600)Shanghai Municipal Education Commission and the NSRF via the Program Management Unit for Human Resources&Institutional Development,Research and Innovation(B49G680115).
文摘Silicon stands as a key anode material in lithium-ion battery ascribing to its high energy density.Nevertheless,the poor rate performance and limited cycling life remain unresolved through conventional approaches that involve carbon composites or nanostructures,primarily due to the un-controllable effects arising from the substantial formation of a solid electrolyte interphase(SEI)during the cycling.Here,an ultra-thin and homogeneous Ti doping alumina oxide catalytic interface is meticulously applied on the porous Si through a synergistic etching and hydrolysis process.This defect-rich oxide interface promotes a selective adsorption of fluoroethylene carbonate,leading to a catalytic reaction that can be aptly described as“molecular concentration-in situ conversion”.The resultant inorganic-rich SEI layer is electrochemical stable and favors ion-transport,particularly at high-rate cycling and high temperature.The robustly shielded porous Si,with a large surface area,achieves a high initial Coulombic efficiency of 84.7%and delivers exceptional high-rate performance at 25 A g^(−1)(692 mAh g^(−1))and a high Coulombic efficiency of 99.7%over 1000 cycles.The robust SEI constructed through a precious catalytic layer promises significant advantages for the fast development of silicon-based anode in fast-charging batteries.
基金funded by the Construction Project of the"Flagship"Department of Chinese and Western Medicine Coordination(LiuL/2024-221)the 2024 Medical Service and Security Capacity Improvement Project(National Clinical Key Specialty Construction)(LiuL/Huwei Medical/2024-65)+5 种基金the Shanghai Traditional Chinese Medicine Standardization Project(LiuL/No.2023JSP03)the Shanghai Key Discipline Construction Project of Traditional Chinese Medicine(Clinical)(LiuL/2024-No.3)the Shanghai Technical Standardization Management and Promotion Project(LiuL/No.SHDC22023212)the Shanghai Municipal Health Commission Traditional Chinese Medicine Research Project(2022)(LiuL/No.2022Cx004)Clinical research project of Shanghai Health Commission-Youth Project(LW/No.20214Y0056)Shanghai Institute of Traditional Chinese Medicine for Mental Health(LW/No.SZB2023201).
文摘INTRODUCTION.Depressive disorders are mental illnesses that seriously affect public health.There are approximately 320 million patients with depression worldwide,accounting for 4.4% of the total disease burden.1Depression leads to social and occupational impairment,diminished quality of life and an elevated risk of death by suicide.
文摘Traditional Chinese medicine(TCM)demonstrates distinctive advantages in disease prevention and treatment.However,analyzing its biological mechanisms through the modern medical research paradigm of“single drug,single target”presents significant challenges due to its holistic approach.Network pharmacology and its core theory of network targets connect drugs and diseases from a holistic and systematic perspective based on biological networks,overcoming the limitations of reductionist research models and showing considerable value in TCM research.Recent integration of network target computational and experimental methods with artificial intelligence(AI)and multi-modal multi-omics technologies has substantially enhanced network pharmacology methodology.The advancement in computational and experimental techniques provides complementary support for network target theory in decoding TCM principles.This review,centered on network targets,examines the progress of network target methods combined with AI in predicting disease molecular mechanisms and drug-target relationships,alongside the application of multi-modal multi-omics technologies in analyzing TCM formulae,syndromes,and toxicity.Looking forward,network target theory is expected to incorporate emerging technologies while developing novel approaches aligned with its unique characteristics,potentially leading to significant breakthroughs in TCM research and advancing scientific understanding and innovation in TCM.