Non-O1/non-O139 Vibrio(V.)cholerae(NOVC)has emerged as a potential pathogen in patients with compromised health conditions[1].We report the whole genome sequencing(WGS)of a rare NOVC sepsis isolate(GenBank Accession:G...Non-O1/non-O139 Vibrio(V.)cholerae(NOVC)has emerged as a potential pathogen in patients with compromised health conditions[1].We report the whole genome sequencing(WGS)of a rare NOVC sepsis isolate(GenBank Accession:GCF_051906115.1)from an 89-year-old male admitted to the Intensive Care Unit(ICU)with septic shock(lactate 6.61 mmol/L)digestive illness.展开更多
Structural variations(SVs≥50 bp)are a critical but underexplored source of genetic diversity in cattle,shaping traits vital for productivity,adaptability,and health.Advances in long-read sequencing,pangenome graph co...Structural variations(SVs≥50 bp)are a critical but underexplored source of genetic diversity in cattle,shaping traits vital for productivity,adaptability,and health.Advances in long-read sequencing,pangenome graph construction,and near-complete genome assemblies now allow accurate SV detection and genotyping.These innovations overcome the limitations of single-reference genomes,enabling the discovery of complex SVs,including nested and overlapping variants,and providing access to previously inaccessible genomic regions such as centromeres and telomeres.This review highlights the current landscape of cattle SV research,with emphasis on integrating longread sequencing and pangenome frameworks to uncover breed-specific and population-level variation.While many SVs are linked to economically important traits such as feed efficiency and disease resistance,their broader regulatory impacts remain an active area of investigation.Emerging functional genomics approaches,including transcriptomics,epigenomics,and genome editing,will clarify how SVs influence gene regulation and phenotype.Looking forward,the integration of SV catalogs with multi-omics data,imputation resources,and artificial intelligence-driven models will be essential for translating discoveries into breeding and conservation applications.Integrating structural variants into breeding pipelines promises to revolutionize livestock genomics,enabling precision selection and sustainable agriculture despite challenges in cost,data sharing,and functional validation.展开更多
Iris domestica,a perennial herb of the Iridaceae family,is widely recognized for its rich isoflavone content and broad therapeutic properties.To elucidate the biosynthetic pathway of these medicinally significant comp...Iris domestica,a perennial herb of the Iridaceae family,is widely recognized for its rich isoflavone content and broad therapeutic properties.To elucidate the biosynthetic pathway of these medicinally significant compounds,we constructed a haplotype-resolved genome assembly of this species.Transcriptomic and metabolomic analyses revealed tissue-specific accumulation of isoflavone,particularly in rhizomes and roots.Functional characterization identified two candidate isoflavone synthase genes,among which IdIFS was confirmed to promote the biosynthesis of key compounds tectorigenin and irisflorentin.The high-quality genome assembly presented here provides a foundational resource for further research into the evolution,secondary metabolite,and environmental adaptation of I.domestica.展开更多
IR64 is an elite Xian/indica variety developed by International Rice Research Institute(IRRl)in 1985,which has been the most widely grown variety and core breeding parent in South/Southeast Asia(Mackill and Khush,2018...IR64 is an elite Xian/indica variety developed by International Rice Research Institute(IRRl)in 1985,which has been the most widely grown variety and core breeding parent in South/Southeast Asia(Mackill and Khush,2018).IR64 has been utilized to develop stress-tolerant(such as drought-adapted and submergenceresistant)near-isogenic lines,underscoring its great potential in agricultural genomics(Tanaka et al.,2020).展开更多
Radish(Raphanus sativus L.)is an important cruciferous root vegetable,with bolting regulated by multiple genes.However,the genetic mechanisms underlying bolting regulation remain unclear.Here,the genome of the cultiva...Radish(Raphanus sativus L.)is an important cruciferous root vegetable,with bolting regulated by multiple genes.However,the genetic mechanisms underlying bolting regulation remain unclear.Here,the genome of the cultivar C60213 is assembled into a high-quality,gap-free telomere-to-telomere structure,spanning nine chromosomes and totaling 472.71 Mb,using a combination of Oxford Nanopore,PacBio,and Hi-C sequencing technologies.It identifies 49,768 protein-coding genes,97.38%of which are functionally annotated.Repetitive sequences constitute 59.72%of the genome,primarily comprising long terminal repeats.A high-density genetic linkage map is constructed using an F2 population derived from a cross between early-and late-bolting radishes,identifying seven major quantitative trait loci associated with bolting and flowering.RNA-seq and quantitative real-time PCR analysis reveal that the RsMIPS3 gene is found to be associated with bolting,with its expression decreasing during this process.Notably,RsMIPS3 overexpression in Arabidopsis delays bolting,confirming its role in regulating bolting time.These findings advance radish genome research and provide a valuable target for breeding late-bolting varieties.展开更多
Soybean(Glycine max L.)is a globally vital crop for oil production and food security.High-quality genomic resources are instrumental for both functional genomics and breeding.Here,we report a near-complete,high-qualit...Soybean(Glycine max L.)is a globally vital crop for oil production and food security.High-quality genomic resources are instrumental for both functional genomics and breeding.Here,we report a near-complete,high-quality genome assembly of the elite cultivar Tianlong 1(TL1),featuring fully resolved telomeres and centromeres,as well as a gap-free assembly of 14 of its 20 chromosomes.On the basis of the genome assembly,we generate an ethyl methanesulfonate(EMS)-mutagenized population comprising 2555 M7 plants.Whole-genome resequencing of 288 EMS mutants uncovers 1,163,869 high-confidence single-nucleotide polymorphisms(SNPs)and 542,709 insertions/deletions(InDels),achieving 91.89%coverage of predicted protein-coding genes.Phenotypic screening demonstrates robust genotype–phenotype associations,with two nonsynonymous mutants displaying pronounced defects in seed and leaf development.Collectively,the chromosome-scale TL1 genome assembly and the extensively characterized mutant population establish valuable resources for functional genomics and precision breeding in soybean and related legume species.展开更多
Emerging and powerful genome editing tools,particularly CRISPR/Cas9,are facilitating functional genomics research and accelerating crop improvement(Jiang et al.2021;Cao et al.2023;Chen C et al.2023;Liu et al.2023a).Ho...Emerging and powerful genome editing tools,particularly CRISPR/Cas9,are facilitating functional genomics research and accelerating crop improvement(Jiang et al.2021;Cao et al.2023;Chen C et al.2023;Liu et al.2023a).However,the detection and screening of transgenic lines remain major bottlenecks,being time-consuming,labor-intensive,and inefficient during transformation and subsequent mutation identification.A simple and efficient visual marker system plays a critical role in addressing these challenges.Recent studies demonstrated that the GmW1 and RUBY reporter systems were used to obtain visual transgenic soybean(Glycine max) plants(Chen L et al.2023;Chen et al.2024).展开更多
Natural hybridization is known to play a vital role in speciation;however,the mechanisms underlying the early stages of natural hybridization remain unclear.Where two plant species come into contact,two driving forces...Natural hybridization is known to play a vital role in speciation;however,the mechanisms underlying the early stages of natural hybridization remain unclear.Where two plant species come into contact,two driving forces may balance the dynamic consequences of hybridization:fusion by hybridization-mediated gene flow,and separation by reproductive isolation(RI)(Ma et al.,2010a,b;Chang et al.,2022).展开更多
The advantages of genome selection(GS) in animal and plant breeding are self-evident.Traditional parametric models have disadvantage in better fit the increasingly large sequencing data and capture complex effects acc...The advantages of genome selection(GS) in animal and plant breeding are self-evident.Traditional parametric models have disadvantage in better fit the increasingly large sequencing data and capture complex effects accurately.Machine learning models have demonstrated remarkable potential in addressing these challenges.In this study,we introduced the concept of mixed kernel functions to explore the performance of support vector machine regression(SVR) in GS.Six single kernel functions(SVR_L,SVR_C,SVR_G,SVR_P,SVR_S,SVR_L) and four mixed kernel functions(SVR_GS,SVR_GP,SVR_LS,SVR_LP) were used to predict genome breeding values.The prediction accuracy,mean squared error(MSE) and mean absolute error(MAE) were used as evaluation indicators to compare with two traditional parametric models(GBLUP,BayesB) and two popular machine learning models(RF,KcRR).The results indicate that in most cases,the performance of the mixed kernel function model significantly outperforms that of GBLUP,BayesB and single kernel function.For instance,for T1 in the pig dataset,the predictive accuracy of SVR_GS is improved by 10% compared to GBLUP,and by approximately 4.4 and 18.6% compared to SVR_G and SVR_S respectively.For E1 in the wheat dataset,SVR_GS achieves 13.3% higher prediction accuracy than GBLUP.Among single kernel functions,the Laplacian and Gaussian kernel functions yield similar results,with the Gaussian kernel function performing better.The mixed kernel function notably reduces the MSE and MAE when compared to all single kernel functions.Furthermore,regarding runtime,SVR_GS and SVR_GP mixed kernel functions run approximately three times faster than GBLUP in the pig dataset,with only a slight increase in runtime compared to the single kernel function model.In summary,the mixed kernel function model of SVR demonstrates speed and accuracy competitiveness,and the model such as SVR_GS has important application potential for GS.展开更多
This study examined the potential response mechanisms of Ligilactobacillus salivarius AR612 to glucose stress through whole-genome and comparative transcriptome analysis.We obtained the basic genome information of L.s...This study examined the potential response mechanisms of Ligilactobacillus salivarius AR612 to glucose stress through whole-genome and comparative transcriptome analysis.We obtained the basic genome information of L.salivarius AR612.The full genome length of L.salivarius AR612 was 1970245 bp,with a GC content of 33.01%and 1894 coding genes.Moreover,we identified many genes associated with genetic adaptations to various stress factors,including temperature,p H,osmotic pressure,bile salts,and oxidative stress.Physiological analysis revealed that the growth and morphology of AR612 changed significantly under glucose stress,with a decrease in the maximum growth and irregular cell morphology.Furthermore,a comparison of transcriptome data indicated that glucose stress induced changes in the number of differential genes.Moreover,AR612 could respond to extracellular glucose stress by changing the expression of genes related to cell morphology,carbohydrate metabolism,amino acid metabolism,fatty acid synthesis,and nucleotide metabolism.This study provides valuable theoretical insights for future research on the adaptation of L.salivarius AR612 to nutritional stress and its application in industrial processes.展开更多
Brain organoids are artificial neural tissues derived in vitro,containing a variety of cell types,as well as structural and/or functional brain regions.They can partially mimic brain physiological activities and disea...Brain organoids are artificial neural tissues derived in vitro,containing a variety of cell types,as well as structural and/or functional brain regions.They can partially mimic brain physiological activities and diseased processes.Owing to their operability and sample accessibility,brain organoids serve as a bridge between in vitro monolayer cell culture models and in vivo animal models.An increasing number of induction protocols for brain organoids have been developed over the preceding decade.A key future research direction will focus on ensuring the complexity and quality of brain organoids.The integration of powerful technologies,such as the CRISP R/Cas9 genome editing and lineage tra cing systems,shall precipitate practical and broad applications of brain organoids.In this review,we discuss the generation and application of brain organoids,as well as their integration with genome editing technologies,in the study of neural development,disease modeling,and mechanistic investigations.The innovative combination of these two technologies may offer a fresh perspective for exploring the fundamental aspects of the human nervous system and related diseases.展开更多
Previous investigations into gut microbiota dysbiosis in patients with Parkinson’s disease have relied on 16S rRNA amplicon sequencing and assembly-free metagenomic approaches.However,there is an urgent need to study...Previous investigations into gut microbiota dysbiosis in patients with Parkinson’s disease have relied on 16S rRNA amplicon sequencing and assembly-free metagenomic approaches.However,there is an urgent need to study the function of the gut microbiome at the genome level using metagenome-assembled genomes.Here,we conducted single-sample metagenomic binning analysis using shotgun metagenomic sequencing data and retrieved 2837 metagenome-assembled genomes to explore the gut microbiota profile at the genome level.Reconstructing microbial genomes from metagenomic sequences greatly enriched the diversity and number of microbial genomes,especially those of uncultivable strains.By integrating the analysis of metagenome-assembled genomes with clinical parameters,we observed higherα-diversity indexes and a very different composition of microbial communities in patients with Parkinson’s disease.We also identified microbial species and metagenome-assembled genomes that were significantly associated with clinical characteristics,including disease severity,medication,motor complications,and non-motor symptoms.The genes of Parkinson’s disease severity-associated metagenome-assembled genomes were distributed across multiple pathways,such as carbon metabolism,phosphonate metabolism,carbohydrate metabolism,amino acid metabolism,fatty acid metabolism,bile acid metabolism,metabolism of cofactors and vitamins,neuroprotective molecules,immunogenic components,toxic metabolites,translation,and bacterial secretion.Our work provides a comprehensive resource for investigating the gut microbiota-Parkinson’s disease relationship at the genome level,which may enhance our comprehension of the underlying mechanisms of this disease.展开更多
Amborella trichopoda(Amborellaceae;hereafter simply Amborella)(Fig.1A)is a shrub endemic to New Caledonia in the Southwest Pacific that represents the sole sister species of all other extant angiosperms(Qiu et al.,199...Amborella trichopoda(Amborellaceae;hereafter simply Amborella)(Fig.1A)is a shrub endemic to New Caledonia in the Southwest Pacific that represents the sole sister species of all other extant angiosperms(Qiu et al.,1999;One Thousand Plant Transcriptomes Initiative,2019).Due to its unique phylogenetic status,it holds tremendous interest for botanists.The nuclear and mitochondrial genomes of Amborella were first published in 2013,providing valuable resources for studies on genome and gene family evolution,phylogenomics,and flower development,despite the fact that the assembly is heavily fragmented(Amborella Genome Project,2013;Rice et al.,2013).In 2024,a haplotype-resolved Amborella genome assembly was published,showing significant improvement in quality and completeness(Carey et al.,2024).展开更多
Fig.1.The GenomeSyn tool for visualizing genome synteny and characterizing structural variations.A:The first synteny visualization map showed the detailed information of two or three genomes and can display structural...Fig.1.The GenomeSyn tool for visualizing genome synteny and characterizing structural variations.A:The first synteny visualization map showed the detailed information of two or three genomes and can display structural variations and other annotation information.B:The second type of visualization map was simple and only showed the synteny relationship between the chromosomes of two or three genomes.C:Multiplatform general GenomeSyn submission page,applicable to Windows,MAC and web platforms;other analysis files can be entered in the"other"option.The publisher would like to apologise for any inconvenience caused.展开更多
Finger millet(Eleusine coracana Gaertn.),a nutritionally rich and drought-resilient C4 cereal,possesses exceptional grain storage longevity(up to 50 years).Here,we report a high-quality genome assembly of the allotetr...Finger millet(Eleusine coracana Gaertn.),a nutritionally rich and drought-resilient C4 cereal,possesses exceptional grain storage longevity(up to 50 years).Here,we report a high-quality genome assembly of the allotetraploid cultivar C142,revealing extensive structural rearrangements between its two subgenomes(subA and subB),which are associated with asymmetric gene expression and subgenome dominance favoring subA.SubB diverged from subA and E.indica approximately 6.8 million years ago.Subsequently,two whole-genome duplication events shaped the current genome architecture,contributing to gene redundancy and adaptive potential.Notably,expansion of stress-related gene families,such as aldo-keto reductases,suggests a role in oxidative stress response and drought adaptation.Using genome-wide association studies,we identify several candidate genes associated with key agronomic traits.Among them,EcMDHAR,encoding monodehydroascorbate reductase,plays a critical role in enhancing drought tolerance.Different EcMDHAR haplotypes exhibit distinct expression profiles,supporting their functional relevance in drought adaptation.This genomic resource not only advances our understanding of polyploid genome evolution in millets,but also provides a foundation for genome-assisted improvement of drought resistance and nutritional quality in finger millet.展开更多
Trentepohliales is a completely terrestrial order within Ulvophyceae(the core Chlorophyta),and its closely related lineages are mainly marine macroalgae(green seaweeds).Despite the considerable interest in their biote...Trentepohliales is a completely terrestrial order within Ulvophyceae(the core Chlorophyta),and its closely related lineages are mainly marine macroalgae(green seaweeds).Despite the considerable interest in their biotechnological potential,little is known about their adaptations to challenging terrestrial habitats.Here,we assemble the high-quality reference genome of Trentepohlia odorata.This alga shows duplications of key genes associated with lipid metabolism and carotenoid synthesis,potentially facilitating intracellular accumulation of lipid droplets and carotenoids.We further reveal positive selection and expansion of gene families involved in vesicle trafficking and cell division regulation in T.odorata compared with other algae(cleavage furrow-mediated cell division)in Ulvophyceae,providing a genetic foundation for the evolution of phragmoplast-mediated cell division.The combined C_(4)-like and biophysical CO_(2)-concentrating mechanisms(CCMs)of T.odorata enable adaptation to fluctuating CO_(2) environments,and support efficient photosynthesis under CO_(2)-limited conditions.Adaptive strategies of T.odorata to terrestrial stressors,such as drought,intense light,and UV-B radiation,include horizontally acquired genes involved in cell wall synthesis and remodeling,homeostasis of aldehydes,and expanded genes associated with reactive oxygen species(ROS),DNA repair,and photoprotection.Our study provides a valuable genomic resource for studying aerial algae and improves understanding of plant terrestrialization.展开更多
The demand for extended electric vehicle(EV)range necessitates advanced lightweighting strategies.This study introduces a materials genome approach,augmented by machine learning(ML),for optimizing lightweight composit...The demand for extended electric vehicle(EV)range necessitates advanced lightweighting strategies.This study introduces a materials genome approach,augmented by machine learning(ML),for optimizing lightweight composite designs for EVs.A comprehensive materials genome database was developed,encompassing composites based on carbon,glass,and natural fibers.This database systematically records critical parameters such as mechanical properties,density,cost,and environmental impact.Machine learning models,including Random Forest,Support Vector Machines,and Artificial Neural Networks,were employed to construct a predictive system for material performance.Subsequent material composition optimization was performed using amulti-objective genetic algorithm.Experimental validation demonstrated that an optimized carbon fiber/bio-based resin composite achieved a 45%weight reduction compared to conventional steel,while maintaining equivalent structural strength.The predictive accuracy of the models reached 94.2%.A cost-benefit analysis indicated that despite a 15%increase in material cost,the overall vehicle energy consumption decreased by 12%,leading to an 18%total cost saving over a five-year operational lifecycle,under a representative mid-size battery electric vehicle(BEV)operational scenario.展开更多
The genetic basis for Gossypium hirsutum race latifolium,the putative ancestor of cultivated upland cotton,emerging from the semi-wild races to be domesticated into cultivated upland cotton is unknown.Here,we reported...The genetic basis for Gossypium hirsutum race latifolium,the putative ancestor of cultivated upland cotton,emerging from the semi-wild races to be domesticated into cultivated upland cotton is unknown.Here,we reported a high-quality genome assembly of G.latifolium.Comparative genome analyses revealed substantial variations in both gene group composition and genomic sequences across 13 cotton genomes,including the expansion of photosynthesis-related gene groups in G.latifolium compared with other races and the pivotal contribution of structural variations(SVs)to G.hirsutum domestication.Based on the resequencing reads and constructed pan-genome of upland cotton,co-selection regions and SVs with significant frequency differences among different populations were identified.Genes located in these regions or affected by these variations may characterize the differences between G.latifolium and other races,and could be involved in maintenance of upland cotton domestication phenotypes.These findings may assist in mining genes for upland cotton improvement and improving the understanding of the genetic basis of upland cotton domestication.展开更多
Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the...Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the genomic landscape of Indian cattle.To comprehensively characterize the genomic variation within Bos indicus and,specifically,dairy breeds,we aim to identify non-reference sequences and construct a comprehensive pangenome.Results Five representative genomes of prominent dairy breeds,including Gir,Kankrej,Tharparkar,Sahiwal,and Red Sindhi,were sequenced using 10X Genomics‘linked-read’technology.Assemblies generated from these linked-reads ranged from 2.70 Gb to 2.77 Gb,comparable to the Bos indicus Brahman reference genome.A pangenome of Bos indicus cattle was constructed by comparing the newly assembled genomes with the reference using alignment and graph-based methods,revealing 8 Mb and 17.7 Mb of novel sequence respectively.A confident set of 6,844 Non-reference Unique Insertions(NUIs)spanning 7.57 Mb was identified through both methods,representing the pange-nome of Indian Bos indicus breeds.Comparative analysis with previously published pangenomes unveiled 2.8 Mb(37%)commonality with the Chinese indicine pangenome and only 1%commonality with the Bos taurus pange-nome.Among these,2,312 NUIs encompassing~2 Mb,were commonly found in 98 samples of the 5 breeds and des-ignated as Bos indicus Common Insertions(BICIs)in the population.Furthermore,926 BICIs were identified within 682 protein-coding genes,54 long non-coding RNAs(lncRNA),and 18 pseudogenes.These protein-coding genes were enriched for functions such as chemical synaptic transmission,cell junction organization,cell-cell adhesion,and cell morphogenesis.The protein-coding genes were found in various prominent quantitative trait locus(QTL)regions,suggesting potential roles of BICIs in traits related to milk production,reproduction,exterior,health,meat,and carcass.Notably,63.21%of the bases within the BICIs call set contained interspersed repeats,predominantly Long Inter-spersed Nuclear Elements(LINEs).Additionally,70.28%of BICIs are shared with other domesticated and wild species,highlighting their evolutionary significance.Conclusions This is the first report unveiling a robust set of NUIs defining the pangenome of Bos indicus breeds of India.The analyses contribute valuable insights into the genomic landscape of desi cattle breeds.展开更多
Selaginella moellendorffii Hieron.,a lycophyte of significant medicinal and evolutionary importance,is recognized as one of the earliest vascular plants.However,the absence of a high-quality reference genome has hinde...Selaginella moellendorffii Hieron.,a lycophyte of significant medicinal and evolutionary importance,is recognized as one of the earliest vascular plants.However,the absence of a high-quality reference genome has hindered the comprehensive exploration of its unique phylogenetic position and therapeutic potential,thereby limiting our understanding of its genomic structure and metabolic capabilities.In this study,we present the first chromosome-level,telomere-to-telomere(T2T)genome assembly of S.moellendorffii,constructed utilizing PacBio HiFi,Oxford Nanopore(ONT),and Hi-C technologies.The assembled genome,spanning 112.83 Mb across 10 chromosomes with a contig N50 of 11.11 Mb,exhibited exceptional completeness(BUSCO score:95.7%)and accuracy(QV=48.11).Comparative genomic analysis identified 3515 gene families unique to S.moellendorffii,with significant enrichment in secondary metabolismpathways,including those related to flavonoid biosynthesis.Phylogenetic analysis revealed that S.moellendorffii diverged from Isoetes approximately 339.6 million years ago(MYA),representing a key evolutionary transition in early vascular plants.By integrating tissue-specific transcriptome and metabolome analyses,we uncovered the molecular basis of biflavone biosynthesis,identifying key enzymes and regulatory networks that govern the production of these bioactive compounds.We observed a correlation between the tissue-specific accumulation patterns of six major biflavones,including amentoflavone and ginkgetin,and the expression of their biosynthetic genes.This high-quality genome assembly,coupled with multi-omics analyses,offers unprecedented insights into the evolution of early vascular plants and elucidates the molecular mechanisms behind their specialized metabolism.展开更多
文摘Non-O1/non-O139 Vibrio(V.)cholerae(NOVC)has emerged as a potential pathogen in patients with compromised health conditions[1].We report the whole genome sequencing(WGS)of a rare NOVC sepsis isolate(GenBank Accession:GCF_051906115.1)from an 89-year-old male admitted to the Intensive Care Unit(ICU)with septic shock(lactate 6.61 mmol/L)digestive illness.
基金supported in part by AFRI grant numbers 2019-7015-29321 and 2021-67015-33409 from the USDA National Institute of Food and Agriculture(NIFA)the SCINet project of the USDA ARS project number 0500-00093-001-00-D。
文摘Structural variations(SVs≥50 bp)are a critical but underexplored source of genetic diversity in cattle,shaping traits vital for productivity,adaptability,and health.Advances in long-read sequencing,pangenome graph construction,and near-complete genome assemblies now allow accurate SV detection and genotyping.These innovations overcome the limitations of single-reference genomes,enabling the discovery of complex SVs,including nested and overlapping variants,and providing access to previously inaccessible genomic regions such as centromeres and telomeres.This review highlights the current landscape of cattle SV research,with emphasis on integrating longread sequencing and pangenome frameworks to uncover breed-specific and population-level variation.While many SVs are linked to economically important traits such as feed efficiency and disease resistance,their broader regulatory impacts remain an active area of investigation.Emerging functional genomics approaches,including transcriptomics,epigenomics,and genome editing,will clarify how SVs influence gene regulation and phenotype.Looking forward,the integration of SV catalogs with multi-omics data,imputation resources,and artificial intelligence-driven models will be essential for translating discoveries into breeding and conservation applications.Integrating structural variants into breeding pipelines promises to revolutionize livestock genomics,enabling precision selection and sustainable agriculture despite challenges in cost,data sharing,and functional validation.
文摘Iris domestica,a perennial herb of the Iridaceae family,is widely recognized for its rich isoflavone content and broad therapeutic properties.To elucidate the biosynthetic pathway of these medicinally significant compounds,we constructed a haplotype-resolved genome assembly of this species.Transcriptomic and metabolomic analyses revealed tissue-specific accumulation of isoflavone,particularly in rhizomes and roots.Functional characterization identified two candidate isoflavone synthase genes,among which IdIFS was confirmed to promote the biosynthesis of key compounds tectorigenin and irisflorentin.The high-quality genome assembly presented here provides a foundational resource for further research into the evolution,secondary metabolite,and environmental adaptation of I.domestica.
基金supported by the Natural Science Foundation of Anhui Province(2408085MC058 and 2308085QC91)National Natural Science Foundation of China(32301783and U21A20214)+5 种基金Innovation Program of the Chinese Academy of Agricultural Sciences(CAAS CSIAF-202303)Nanfan special project,CAAS(YYLH2309,YBXM2322,YYLH2401)Scientific Innovation 2030 Project(2022ZD0401703)CAAS Innovative Team Award,Science and Technology of Innovative research program of Anhui Province(202423m1005002)National Key Research and Development Program of China(2023YFD1200900)the Natural Science Foundation General Program of Hebei Province(C2024204242).
文摘IR64 is an elite Xian/indica variety developed by International Rice Research Institute(IRRl)in 1985,which has been the most widely grown variety and core breeding parent in South/Southeast Asia(Mackill and Khush,2018).IR64 has been utilized to develop stress-tolerant(such as drought-adapted and submergenceresistant)near-isogenic lines,underscoring its great potential in agricultural genomics(Tanaka et al.,2020).
基金supported by the Joint Funds of the National Natural Science Foundation of China(Grant No.U22A20494)the 1+9 Open Competition Project of Sichuan Academy of Agricultural Sciences(1+9KJGGo02)+4 种基金the National Key R&D Program of China(2024YFA130670O)the“5+1”Agricultural Frontier Technology Research Initiative of Sichuan Academy of Agricultural Sciences(5+1QYGG003)the Project of Sichuan Province Engineering Technology Research Center of Vegetables(2023PZSC0303)the 14th Five-Year Plan Vegetable Breeding Project of Sichuan Province(2021YFYZ0022)the Experts of Sichuan Vegetable Innovation Team(SCCXTD-2025-05).
文摘Radish(Raphanus sativus L.)is an important cruciferous root vegetable,with bolting regulated by multiple genes.However,the genetic mechanisms underlying bolting regulation remain unclear.Here,the genome of the cultivar C60213 is assembled into a high-quality,gap-free telomere-to-telomere structure,spanning nine chromosomes and totaling 472.71 Mb,using a combination of Oxford Nanopore,PacBio,and Hi-C sequencing technologies.It identifies 49,768 protein-coding genes,97.38%of which are functionally annotated.Repetitive sequences constitute 59.72%of the genome,primarily comprising long terminal repeats.A high-density genetic linkage map is constructed using an F2 population derived from a cross between early-and late-bolting radishes,identifying seven major quantitative trait loci associated with bolting and flowering.RNA-seq and quantitative real-time PCR analysis reveal that the RsMIPS3 gene is found to be associated with bolting,with its expression decreasing during this process.Notably,RsMIPS3 overexpression in Arabidopsis delays bolting,confirming its role in regulating bolting time.These findings advance radish genome research and provide a valuable target for breeding late-bolting varieties.
基金supported by the National Natural Science Foundation of China(31970344)Joint Funds of the Natural Science Foundation of Hainan Province,China(2021JJLH0065).
文摘Soybean(Glycine max L.)is a globally vital crop for oil production and food security.High-quality genomic resources are instrumental for both functional genomics and breeding.Here,we report a near-complete,high-quality genome assembly of the elite cultivar Tianlong 1(TL1),featuring fully resolved telomeres and centromeres,as well as a gap-free assembly of 14 of its 20 chromosomes.On the basis of the genome assembly,we generate an ethyl methanesulfonate(EMS)-mutagenized population comprising 2555 M7 plants.Whole-genome resequencing of 288 EMS mutants uncovers 1,163,869 high-confidence single-nucleotide polymorphisms(SNPs)and 542,709 insertions/deletions(InDels),achieving 91.89%coverage of predicted protein-coding genes.Phenotypic screening demonstrates robust genotype–phenotype associations,with two nonsynonymous mutants displaying pronounced defects in seed and leaf development.Collectively,the chromosome-scale TL1 genome assembly and the extensively characterized mutant population establish valuable resources for functional genomics and precision breeding in soybean and related legume species.
基金supported by the Jilin Science and Technology Development Program,China (20240602032RC)the Jilin Agricultural Science and Technology Innovation Project,China (CXGC2024ZD001)+1 种基金the Jilin Agricultural Science and Technology Innovation Project,China (CXGC2024ZY012)the Jilin Province Development and Reform Commission-Project for Improving the Independent Innovation Capacity of Major Grain Crops,China (2024C002)。
文摘Emerging and powerful genome editing tools,particularly CRISPR/Cas9,are facilitating functional genomics research and accelerating crop improvement(Jiang et al.2021;Cao et al.2023;Chen C et al.2023;Liu et al.2023a).However,the detection and screening of transgenic lines remain major bottlenecks,being time-consuming,labor-intensive,and inefficient during transformation and subsequent mutation identification.A simple and efficient visual marker system plays a critical role in addressing these challenges.Recent studies demonstrated that the GmW1 and RUBY reporter systems were used to obtain visual transgenic soybean(Glycine max) plants(Chen L et al.2023;Chen et al.2024).
基金supported by the National Natural Science Foundation of China(U23A20160,32360336)Guizhou Provincial Key Technology R&D Program(Qian KeHe ZhiCheng[2023]YiBan035).
文摘Natural hybridization is known to play a vital role in speciation;however,the mechanisms underlying the early stages of natural hybridization remain unclear.Where two plant species come into contact,two driving forces may balance the dynamic consequences of hybridization:fusion by hybridization-mediated gene flow,and separation by reproductive isolation(RI)(Ma et al.,2010a,b;Chang et al.,2022).
基金supported by the China Agriculture Research System of MOF and MARAthe National Natural Science Foundation of China (31872337 and 31501919)the Agricultural Science and Technology Innovation Project,China (ASTIP-IAS02)。
文摘The advantages of genome selection(GS) in animal and plant breeding are self-evident.Traditional parametric models have disadvantage in better fit the increasingly large sequencing data and capture complex effects accurately.Machine learning models have demonstrated remarkable potential in addressing these challenges.In this study,we introduced the concept of mixed kernel functions to explore the performance of support vector machine regression(SVR) in GS.Six single kernel functions(SVR_L,SVR_C,SVR_G,SVR_P,SVR_S,SVR_L) and four mixed kernel functions(SVR_GS,SVR_GP,SVR_LS,SVR_LP) were used to predict genome breeding values.The prediction accuracy,mean squared error(MSE) and mean absolute error(MAE) were used as evaluation indicators to compare with two traditional parametric models(GBLUP,BayesB) and two popular machine learning models(RF,KcRR).The results indicate that in most cases,the performance of the mixed kernel function model significantly outperforms that of GBLUP,BayesB and single kernel function.For instance,for T1 in the pig dataset,the predictive accuracy of SVR_GS is improved by 10% compared to GBLUP,and by approximately 4.4 and 18.6% compared to SVR_G and SVR_S respectively.For E1 in the wheat dataset,SVR_GS achieves 13.3% higher prediction accuracy than GBLUP.Among single kernel functions,the Laplacian and Gaussian kernel functions yield similar results,with the Gaussian kernel function performing better.The mixed kernel function notably reduces the MSE and MAE when compared to all single kernel functions.Furthermore,regarding runtime,SVR_GS and SVR_GP mixed kernel functions run approximately three times faster than GBLUP in the pig dataset,with only a slight increase in runtime compared to the single kernel function model.In summary,the mixed kernel function model of SVR demonstrates speed and accuracy competitiveness,and the model such as SVR_GS has important application potential for GS.
基金supported by the Natural Science Foundation of China(32272364)the Shanghai Education Committee Scientific Research Innovation Projects,China(2101070007800120)+2 种基金National Science Foundation for Distinguished Young Scholars(32025029)Shanghai Key Project in Synthetic Biology(23HC1400900)the Shanghai Engineering Research Center of 460 Food Microbiology Program(19DZ2281100).
文摘This study examined the potential response mechanisms of Ligilactobacillus salivarius AR612 to glucose stress through whole-genome and comparative transcriptome analysis.We obtained the basic genome information of L.salivarius AR612.The full genome length of L.salivarius AR612 was 1970245 bp,with a GC content of 33.01%and 1894 coding genes.Moreover,we identified many genes associated with genetic adaptations to various stress factors,including temperature,p H,osmotic pressure,bile salts,and oxidative stress.Physiological analysis revealed that the growth and morphology of AR612 changed significantly under glucose stress,with a decrease in the maximum growth and irregular cell morphology.Furthermore,a comparison of transcriptome data indicated that glucose stress induced changes in the number of differential genes.Moreover,AR612 could respond to extracellular glucose stress by changing the expression of genes related to cell morphology,carbohydrate metabolism,amino acid metabolism,fatty acid synthesis,and nucleotide metabolism.This study provides valuable theoretical insights for future research on the adaptation of L.salivarius AR612 to nutritional stress and its application in industrial processes.
基金Special Projectfor Clinical Research of Shanghai Municipal Health Commission,No.202140403Key Disciplines Group Construction Project of Pudong Health Bureau of Shanghai,No.PWZxq2022-05+2 种基金Natural Science Foundation of Ningxia Hui Autonomous Region,No.2024AAC05084Ningxia Hui Autonomous Region Key Research and Development Program,No.2021BEG03084National Natural Science Foundation of China,Nos.32370895,32070862。
文摘Brain organoids are artificial neural tissues derived in vitro,containing a variety of cell types,as well as structural and/or functional brain regions.They can partially mimic brain physiological activities and diseased processes.Owing to their operability and sample accessibility,brain organoids serve as a bridge between in vitro monolayer cell culture models and in vivo animal models.An increasing number of induction protocols for brain organoids have been developed over the preceding decade.A key future research direction will focus on ensuring the complexity and quality of brain organoids.The integration of powerful technologies,such as the CRISP R/Cas9 genome editing and lineage tra cing systems,shall precipitate practical and broad applications of brain organoids.In this review,we discuss the generation and application of brain organoids,as well as their integration with genome editing technologies,in the study of neural development,disease modeling,and mechanistic investigations.The innovative combination of these two technologies may offer a fresh perspective for exploring the fundamental aspects of the human nervous system and related diseases.
基金supported by the National Key R&D Program of China,No.2022YFE0210100(to XY)the Shanghai Rising-Star Program,No.22QA1405700(to XY)the National Natural Science Foundation of China,Nos.82301418(to YZ),82171246(to QX),and 82371251(to QX).
文摘Previous investigations into gut microbiota dysbiosis in patients with Parkinson’s disease have relied on 16S rRNA amplicon sequencing and assembly-free metagenomic approaches.However,there is an urgent need to study the function of the gut microbiome at the genome level using metagenome-assembled genomes.Here,we conducted single-sample metagenomic binning analysis using shotgun metagenomic sequencing data and retrieved 2837 metagenome-assembled genomes to explore the gut microbiota profile at the genome level.Reconstructing microbial genomes from metagenomic sequences greatly enriched the diversity and number of microbial genomes,especially those of uncultivable strains.By integrating the analysis of metagenome-assembled genomes with clinical parameters,we observed higherα-diversity indexes and a very different composition of microbial communities in patients with Parkinson’s disease.We also identified microbial species and metagenome-assembled genomes that were significantly associated with clinical characteristics,including disease severity,medication,motor complications,and non-motor symptoms.The genes of Parkinson’s disease severity-associated metagenome-assembled genomes were distributed across multiple pathways,such as carbon metabolism,phosphonate metabolism,carbohydrate metabolism,amino acid metabolism,fatty acid metabolism,bile acid metabolism,metabolism of cofactors and vitamins,neuroprotective molecules,immunogenic components,toxic metabolites,translation,and bacterial secretion.Our work provides a comprehensive resource for investigating the gut microbiota-Parkinson’s disease relationship at the genome level,which may enhance our comprehension of the underlying mechanisms of this disease.
基金supported by the National Natural Science Foundation of China(32270217,31970205,31770211)Metasequoia funding of Nanjing Forestry University to YY。
文摘Amborella trichopoda(Amborellaceae;hereafter simply Amborella)(Fig.1A)is a shrub endemic to New Caledonia in the Southwest Pacific that represents the sole sister species of all other extant angiosperms(Qiu et al.,1999;One Thousand Plant Transcriptomes Initiative,2019).Due to its unique phylogenetic status,it holds tremendous interest for botanists.The nuclear and mitochondrial genomes of Amborella were first published in 2013,providing valuable resources for studies on genome and gene family evolution,phylogenomics,and flower development,despite the fact that the assembly is heavily fragmented(Amborella Genome Project,2013;Rice et al.,2013).In 2024,a haplotype-resolved Amborella genome assembly was published,showing significant improvement in quality and completeness(Carey et al.,2024).
文摘Fig.1.The GenomeSyn tool for visualizing genome synteny and characterizing structural variations.A:The first synteny visualization map showed the detailed information of two or three genomes and can display structural variations and other annotation information.B:The second type of visualization map was simple and only showed the synteny relationship between the chromosomes of two or three genomes.C:Multiplatform general GenomeSyn submission page,applicable to Windows,MAC and web platforms;other analysis files can be entered in the"other"option.The publisher would like to apologise for any inconvenience caused.
基金supported by the National Crop Germplasm Resources Center(NCGRC-2024-056)the National Natural Science Foundation of China(32301813).
文摘Finger millet(Eleusine coracana Gaertn.),a nutritionally rich and drought-resilient C4 cereal,possesses exceptional grain storage longevity(up to 50 years).Here,we report a high-quality genome assembly of the allotetraploid cultivar C142,revealing extensive structural rearrangements between its two subgenomes(subA and subB),which are associated with asymmetric gene expression and subgenome dominance favoring subA.SubB diverged from subA and E.indica approximately 6.8 million years ago.Subsequently,two whole-genome duplication events shaped the current genome architecture,contributing to gene redundancy and adaptive potential.Notably,expansion of stress-related gene families,such as aldo-keto reductases,suggests a role in oxidative stress response and drought adaptation.Using genome-wide association studies,we identify several candidate genes associated with key agronomic traits.Among them,EcMDHAR,encoding monodehydroascorbate reductase,plays a critical role in enhancing drought tolerance.Different EcMDHAR haplotypes exhibit distinct expression profiles,supporting their functional relevance in drought adaptation.This genomic resource not only advances our understanding of polyploid genome evolution in millets,but also provides a foundation for genome-assisted improvement of drought resistance and nutritional quality in finger millet.
基金supported by the National Natural Science Foundation of China(W2511024,32370228,32470232)the Natural Science Foundation of Jiangsu Province(BK20250004)+3 种基金the Collaborative Innovation Center for Modern Crop Production co-sponsored by Province and Ministry,the Priority Academic Program Development of Jiangsu Higher Education Institutions(PAPD)the fund of Taxonomy Scientist Program'of the Chinese Academy of Sciences(CAS-TAX-24-038)the Youth Innovation Promotion Association CAS(2023355)Postgraduate Research&Practice Innovation Program of Jiangsu Province(KYCX24_1846).
文摘Trentepohliales is a completely terrestrial order within Ulvophyceae(the core Chlorophyta),and its closely related lineages are mainly marine macroalgae(green seaweeds).Despite the considerable interest in their biotechnological potential,little is known about their adaptations to challenging terrestrial habitats.Here,we assemble the high-quality reference genome of Trentepohlia odorata.This alga shows duplications of key genes associated with lipid metabolism and carotenoid synthesis,potentially facilitating intracellular accumulation of lipid droplets and carotenoids.We further reveal positive selection and expansion of gene families involved in vesicle trafficking and cell division regulation in T.odorata compared with other algae(cleavage furrow-mediated cell division)in Ulvophyceae,providing a genetic foundation for the evolution of phragmoplast-mediated cell division.The combined C_(4)-like and biophysical CO_(2)-concentrating mechanisms(CCMs)of T.odorata enable adaptation to fluctuating CO_(2) environments,and support efficient photosynthesis under CO_(2)-limited conditions.Adaptive strategies of T.odorata to terrestrial stressors,such as drought,intense light,and UV-B radiation,include horizontally acquired genes involved in cell wall synthesis and remodeling,homeostasis of aldehydes,and expanded genes associated with reactive oxygen species(ROS),DNA repair,and photoprotection.Our study provides a valuable genomic resource for studying aerial algae and improves understanding of plant terrestrialization.
文摘The demand for extended electric vehicle(EV)range necessitates advanced lightweighting strategies.This study introduces a materials genome approach,augmented by machine learning(ML),for optimizing lightweight composite designs for EVs.A comprehensive materials genome database was developed,encompassing composites based on carbon,glass,and natural fibers.This database systematically records critical parameters such as mechanical properties,density,cost,and environmental impact.Machine learning models,including Random Forest,Support Vector Machines,and Artificial Neural Networks,were employed to construct a predictive system for material performance.Subsequent material composition optimization was performed using amulti-objective genetic algorithm.Experimental validation demonstrated that an optimized carbon fiber/bio-based resin composite achieved a 45%weight reduction compared to conventional steel,while maintaining equivalent structural strength.The predictive accuracy of the models reached 94.2%.A cost-benefit analysis indicated that despite a 15%increase in material cost,the overall vehicle energy consumption decreased by 12%,leading to an 18%total cost saving over a five-year operational lifecycle,under a representative mid-size battery electric vehicle(BEV)operational scenario.
基金supported by the National Natural Science Foundation of China(32201873)the Key Research and Development Plan of Hubei Province(2023BBB050)。
文摘The genetic basis for Gossypium hirsutum race latifolium,the putative ancestor of cultivated upland cotton,emerging from the semi-wild races to be domesticated into cultivated upland cotton is unknown.Here,we reported a high-quality genome assembly of G.latifolium.Comparative genome analyses revealed substantial variations in both gene group composition and genomic sequences across 13 cotton genomes,including the expansion of photosynthesis-related gene groups in G.latifolium compared with other races and the pivotal contribution of structural variations(SVs)to G.hirsutum domestication.Based on the resequencing reads and constructed pan-genome of upland cotton,co-selection regions and SVs with significant frequency differences among different populations were identified.Genes located in these regions or affected by these variations may characterize the differences between G.latifolium and other races,and could be involved in maintenance of upland cotton domestication phenotypes.These findings may assist in mining genes for upland cotton improvement and improving the understanding of the genetic basis of upland cotton domestication.
基金the project “Genomics for Conservation of Indigenous Cattle Breeds and for Enhancing Milk Yield, Phase-I” [BT/ PR26466/AAQ/1/704/2017], funded by the Department of Biotechnology (DBT ), Indiathe project “Identification of key molecular factors involved in resistance/susceptibility to paratuberculosis infection in indigenous breeds of cows” [BT/PR32758/AAQ/1/760/2019], which was also funded by Department of Biotechnology (DBT ), India。
文摘Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the genomic landscape of Indian cattle.To comprehensively characterize the genomic variation within Bos indicus and,specifically,dairy breeds,we aim to identify non-reference sequences and construct a comprehensive pangenome.Results Five representative genomes of prominent dairy breeds,including Gir,Kankrej,Tharparkar,Sahiwal,and Red Sindhi,were sequenced using 10X Genomics‘linked-read’technology.Assemblies generated from these linked-reads ranged from 2.70 Gb to 2.77 Gb,comparable to the Bos indicus Brahman reference genome.A pangenome of Bos indicus cattle was constructed by comparing the newly assembled genomes with the reference using alignment and graph-based methods,revealing 8 Mb and 17.7 Mb of novel sequence respectively.A confident set of 6,844 Non-reference Unique Insertions(NUIs)spanning 7.57 Mb was identified through both methods,representing the pange-nome of Indian Bos indicus breeds.Comparative analysis with previously published pangenomes unveiled 2.8 Mb(37%)commonality with the Chinese indicine pangenome and only 1%commonality with the Bos taurus pange-nome.Among these,2,312 NUIs encompassing~2 Mb,were commonly found in 98 samples of the 5 breeds and des-ignated as Bos indicus Common Insertions(BICIs)in the population.Furthermore,926 BICIs were identified within 682 protein-coding genes,54 long non-coding RNAs(lncRNA),and 18 pseudogenes.These protein-coding genes were enriched for functions such as chemical synaptic transmission,cell junction organization,cell-cell adhesion,and cell morphogenesis.The protein-coding genes were found in various prominent quantitative trait locus(QTL)regions,suggesting potential roles of BICIs in traits related to milk production,reproduction,exterior,health,meat,and carcass.Notably,63.21%of the bases within the BICIs call set contained interspersed repeats,predominantly Long Inter-spersed Nuclear Elements(LINEs).Additionally,70.28%of BICIs are shared with other domesticated and wild species,highlighting their evolutionary significance.Conclusions This is the first report unveiling a robust set of NUIs defining the pangenome of Bos indicus breeds of India.The analyses contribute valuable insights into the genomic landscape of desi cattle breeds.
基金funded by the National Natural Science Foundation of China(Grant No.81903921)the Key project at central government level:The ability establishment of sustainable use for valuable Chinese medicine resources(2060302)the Distinguished Young Scholars of Hubei University of Chinese Medicine(Grant No.2022ZZXJ002).
文摘Selaginella moellendorffii Hieron.,a lycophyte of significant medicinal and evolutionary importance,is recognized as one of the earliest vascular plants.However,the absence of a high-quality reference genome has hindered the comprehensive exploration of its unique phylogenetic position and therapeutic potential,thereby limiting our understanding of its genomic structure and metabolic capabilities.In this study,we present the first chromosome-level,telomere-to-telomere(T2T)genome assembly of S.moellendorffii,constructed utilizing PacBio HiFi,Oxford Nanopore(ONT),and Hi-C technologies.The assembled genome,spanning 112.83 Mb across 10 chromosomes with a contig N50 of 11.11 Mb,exhibited exceptional completeness(BUSCO score:95.7%)and accuracy(QV=48.11).Comparative genomic analysis identified 3515 gene families unique to S.moellendorffii,with significant enrichment in secondary metabolismpathways,including those related to flavonoid biosynthesis.Phylogenetic analysis revealed that S.moellendorffii diverged from Isoetes approximately 339.6 million years ago(MYA),representing a key evolutionary transition in early vascular plants.By integrating tissue-specific transcriptome and metabolome analyses,we uncovered the molecular basis of biflavone biosynthesis,identifying key enzymes and regulatory networks that govern the production of these bioactive compounds.We observed a correlation between the tissue-specific accumulation patterns of six major biflavones,including amentoflavone and ginkgetin,and the expression of their biosynthetic genes.This high-quality genome assembly,coupled with multi-omics analyses,offers unprecedented insights into the evolution of early vascular plants and elucidates the molecular mechanisms behind their specialized metabolism.