Wheat(Triticum aestivum)faces significant threats from diseases such as powdery mildew(Blumeria graminis)and Fusarium head blight(FHB;caused by Fusarium graminearum),which cause severe yield losses.Moreover,the antago...Wheat(Triticum aestivum)faces significant threats from diseases such as powdery mildew(Blumeria graminis)and Fusarium head blight(FHB;caused by Fusarium graminearum),which cause severe yield losses.Moreover,the antagonism between yield-related traits and disease resistance makes yield resistance coordination a major challenge in wheat breeding.The lack of genetic resources combining both disease resistance and high yield constrains the elucidation of underlying resistance-yield trade-off mechanisms,thereby hindering the development of high-yield and disease-resistant wheat cultivars.Remarkably,Yangmai 33(YM33),a notable wheat cultivar with resistance to both powdery mildew and FHB as well as high-yield performance,was recently developed.It offers a unique opportunity to dissect the genomic architecture underlying the coordination between disease resistance and yield.展开更多
Emerging and powerful genome editing tools,particularly CRISPR/Cas9,are facilitating functional genomics research and accelerating crop improvement(Jiang et al.2021;Cao et al.2023;Chen C et al.2023;Liu et al.2023a).Ho...Emerging and powerful genome editing tools,particularly CRISPR/Cas9,are facilitating functional genomics research and accelerating crop improvement(Jiang et al.2021;Cao et al.2023;Chen C et al.2023;Liu et al.2023a).However,the detection and screening of transgenic lines remain major bottlenecks,being time-consuming,labor-intensive,and inefficient during transformation and subsequent mutation identification.A simple and efficient visual marker system plays a critical role in addressing these challenges.Recent studies demonstrated that the GmW1 and RUBY reporter systems were used to obtain visual transgenic soybean(Glycine max) plants(Chen L et al.2023;Chen et al.2024).展开更多
Iris domestica,a perennial herb of the Iridaceae family,is widely recognized for its rich isoflavone content and broad therapeutic properties.To elucidate the biosynthetic pathway of these medicinally significant comp...Iris domestica,a perennial herb of the Iridaceae family,is widely recognized for its rich isoflavone content and broad therapeutic properties.To elucidate the biosynthetic pathway of these medicinally significant compounds,we constructed a haplotype-resolved genome assembly of this species.Transcriptomic and metabolomic analyses revealed tissue-specific accumulation of isoflavone,particularly in rhizomes and roots.Functional characterization identified two candidate isoflavone synthase genes,among which IdIFS was confirmed to promote the biosynthesis of key compounds tectorigenin and irisflorentin.The high-quality genome assembly presented here provides a foundational resource for further research into the evolution,secondary metabolite,and environmental adaptation of I.domestica.展开更多
Natural hybridization is known to play a vital role in speciation;however,the mechanisms underlying the early stages of natural hybridization remain unclear.Where two plant species come into contact,two driving forces...Natural hybridization is known to play a vital role in speciation;however,the mechanisms underlying the early stages of natural hybridization remain unclear.Where two plant species come into contact,two driving forces may balance the dynamic consequences of hybridization:fusion by hybridization-mediated gene flow,and separation by reproductive isolation(RI)(Ma et al.,2010a,b;Chang et al.,2022).展开更多
Rice, a global staple food, is critical for food security. The cultivated Oryza sativa, domesticated from wild O. rufipogon, derives~80%of its 993 identified domestication-related genes from O. rufipogon and 20%from S...Rice, a global staple food, is critical for food security. The cultivated Oryza sativa, domesticated from wild O. rufipogon, derives~80%of its 993 identified domestication-related genes from O. rufipogon and 20%from South/Southeast Asian wild O. nivara(Jing et al., 2023). Genes like An-1, BH4, PROG1,SH4, Rc, Rd, and GS3—which regulate awn length, hull color,til er angle, seed shattering, pericarp color, seed length, and thousand-grain weight, respectively—were selected against during domestication to form modern O. sativa(Yu et al., 2021).However, domestication and yield-focused breeding eliminated wild rice's valuable genes(e.g., for disease resistance, stress tolerance, nutrition), narrowing genetic diversity and impeding efforts to meet growing societal demands.展开更多
The advantages of genome selection(GS) in animal and plant breeding are self-evident.Traditional parametric models have disadvantage in better fit the increasingly large sequencing data and capture complex effects acc...The advantages of genome selection(GS) in animal and plant breeding are self-evident.Traditional parametric models have disadvantage in better fit the increasingly large sequencing data and capture complex effects accurately.Machine learning models have demonstrated remarkable potential in addressing these challenges.In this study,we introduced the concept of mixed kernel functions to explore the performance of support vector machine regression(SVR) in GS.Six single kernel functions(SVR_L,SVR_C,SVR_G,SVR_P,SVR_S,SVR_L) and four mixed kernel functions(SVR_GS,SVR_GP,SVR_LS,SVR_LP) were used to predict genome breeding values.The prediction accuracy,mean squared error(MSE) and mean absolute error(MAE) were used as evaluation indicators to compare with two traditional parametric models(GBLUP,BayesB) and two popular machine learning models(RF,KcRR).The results indicate that in most cases,the performance of the mixed kernel function model significantly outperforms that of GBLUP,BayesB and single kernel function.For instance,for T1 in the pig dataset,the predictive accuracy of SVR_GS is improved by 10% compared to GBLUP,and by approximately 4.4 and 18.6% compared to SVR_G and SVR_S respectively.For E1 in the wheat dataset,SVR_GS achieves 13.3% higher prediction accuracy than GBLUP.Among single kernel functions,the Laplacian and Gaussian kernel functions yield similar results,with the Gaussian kernel function performing better.The mixed kernel function notably reduces the MSE and MAE when compared to all single kernel functions.Furthermore,regarding runtime,SVR_GS and SVR_GP mixed kernel functions run approximately three times faster than GBLUP in the pig dataset,with only a slight increase in runtime compared to the single kernel function model.In summary,the mixed kernel function model of SVR demonstrates speed and accuracy competitiveness,and the model such as SVR_GS has important application potential for GS.展开更多
This study examined the potential response mechanisms of Ligilactobacillus salivarius AR612 to glucose stress through whole-genome and comparative transcriptome analysis.We obtained the basic genome information of L.s...This study examined the potential response mechanisms of Ligilactobacillus salivarius AR612 to glucose stress through whole-genome and comparative transcriptome analysis.We obtained the basic genome information of L.salivarius AR612.The full genome length of L.salivarius AR612 was 1970245 bp,with a GC content of 33.01%and 1894 coding genes.Moreover,we identified many genes associated with genetic adaptations to various stress factors,including temperature,p H,osmotic pressure,bile salts,and oxidative stress.Physiological analysis revealed that the growth and morphology of AR612 changed significantly under glucose stress,with a decrease in the maximum growth and irregular cell morphology.Furthermore,a comparison of transcriptome data indicated that glucose stress induced changes in the number of differential genes.Moreover,AR612 could respond to extracellular glucose stress by changing the expression of genes related to cell morphology,carbohydrate metabolism,amino acid metabolism,fatty acid synthesis,and nucleotide metabolism.This study provides valuable theoretical insights for future research on the adaptation of L.salivarius AR612 to nutritional stress and its application in industrial processes.展开更多
Amborella trichopoda(Amborellaceae;hereafter simply Amborella)(Fig.1A)is a shrub endemic to New Caledonia in the Southwest Pacific that represents the sole sister species of all other extant angiosperms(Qiu et al.,199...Amborella trichopoda(Amborellaceae;hereafter simply Amborella)(Fig.1A)is a shrub endemic to New Caledonia in the Southwest Pacific that represents the sole sister species of all other extant angiosperms(Qiu et al.,1999;One Thousand Plant Transcriptomes Initiative,2019).Due to its unique phylogenetic status,it holds tremendous interest for botanists.The nuclear and mitochondrial genomes of Amborella were first published in 2013,providing valuable resources for studies on genome and gene family evolution,phylogenomics,and flower development,despite the fact that the assembly is heavily fragmented(Amborella Genome Project,2013;Rice et al.,2013).In 2024,a haplotype-resolved Amborella genome assembly was published,showing significant improvement in quality and completeness(Carey et al.,2024).展开更多
Fig.1.The GenomeSyn tool for visualizing genome synteny and characterizing structural variations.A:The first synteny visualization map showed the detailed information of two or three genomes and can display structural...Fig.1.The GenomeSyn tool for visualizing genome synteny and characterizing structural variations.A:The first synteny visualization map showed the detailed information of two or three genomes and can display structural variations and other annotation information.B:The second type of visualization map was simple and only showed the synteny relationship between the chromosomes of two or three genomes.C:Multiplatform general GenomeSyn submission page,applicable to Windows,MAC and web platforms;other analysis files can be entered in the"other"option.The publisher would like to apologise for any inconvenience caused.展开更多
Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the...Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the genomic landscape of Indian cattle.To comprehensively characterize the genomic variation within Bos indicus and,specifically,dairy breeds,we aim to identify non-reference sequences and construct a comprehensive pangenome.Results Five representative genomes of prominent dairy breeds,including Gir,Kankrej,Tharparkar,Sahiwal,and Red Sindhi,were sequenced using 10X Genomics‘linked-read’technology.Assemblies generated from these linked-reads ranged from 2.70 Gb to 2.77 Gb,comparable to the Bos indicus Brahman reference genome.A pangenome of Bos indicus cattle was constructed by comparing the newly assembled genomes with the reference using alignment and graph-based methods,revealing 8 Mb and 17.7 Mb of novel sequence respectively.A confident set of 6,844 Non-reference Unique Insertions(NUIs)spanning 7.57 Mb was identified through both methods,representing the pange-nome of Indian Bos indicus breeds.Comparative analysis with previously published pangenomes unveiled 2.8 Mb(37%)commonality with the Chinese indicine pangenome and only 1%commonality with the Bos taurus pange-nome.Among these,2,312 NUIs encompassing~2 Mb,were commonly found in 98 samples of the 5 breeds and des-ignated as Bos indicus Common Insertions(BICIs)in the population.Furthermore,926 BICIs were identified within 682 protein-coding genes,54 long non-coding RNAs(lncRNA),and 18 pseudogenes.These protein-coding genes were enriched for functions such as chemical synaptic transmission,cell junction organization,cell-cell adhesion,and cell morphogenesis.The protein-coding genes were found in various prominent quantitative trait locus(QTL)regions,suggesting potential roles of BICIs in traits related to milk production,reproduction,exterior,health,meat,and carcass.Notably,63.21%of the bases within the BICIs call set contained interspersed repeats,predominantly Long Inter-spersed Nuclear Elements(LINEs).Additionally,70.28%of BICIs are shared with other domesticated and wild species,highlighting their evolutionary significance.Conclusions This is the first report unveiling a robust set of NUIs defining the pangenome of Bos indicus breeds of India.The analyses contribute valuable insights into the genomic landscape of desi cattle breeds.展开更多
The genetic basis for Gossypium hirsutum race latifolium,the putative ancestor of cultivated upland cotton,emerging from the semi-wild races to be domesticated into cultivated upland cotton is unknown.Here,we reported...The genetic basis for Gossypium hirsutum race latifolium,the putative ancestor of cultivated upland cotton,emerging from the semi-wild races to be domesticated into cultivated upland cotton is unknown.Here,we reported a high-quality genome assembly of G.latifolium.Comparative genome analyses revealed substantial variations in both gene group composition and genomic sequences across 13 cotton genomes,including the expansion of photosynthesis-related gene groups in G.latifolium compared with other races and the pivotal contribution of structural variations(SVs)to G.hirsutum domestication.Based on the resequencing reads and constructed pan-genome of upland cotton,co-selection regions and SVs with significant frequency differences among different populations were identified.Genes located in these regions or affected by these variations may characterize the differences between G.latifolium and other races,and could be involved in maintenance of upland cotton domestication phenotypes.These findings may assist in mining genes for upland cotton improvement and improving the understanding of the genetic basis of upland cotton domestication.展开更多
Selaginella moellendorffii Hieron.,a lycophyte of significant medicinal and evolutionary importance,is recognized as one of the earliest vascular plants.However,the absence of a high-quality reference genome has hinde...Selaginella moellendorffii Hieron.,a lycophyte of significant medicinal and evolutionary importance,is recognized as one of the earliest vascular plants.However,the absence of a high-quality reference genome has hindered the comprehensive exploration of its unique phylogenetic position and therapeutic potential,thereby limiting our understanding of its genomic structure and metabolic capabilities.In this study,we present the first chromosome-level,telomere-to-telomere(T2T)genome assembly of S.moellendorffii,constructed utilizing PacBio HiFi,Oxford Nanopore(ONT),and Hi-C technologies.The assembled genome,spanning 112.83 Mb across 10 chromosomes with a contig N50 of 11.11 Mb,exhibited exceptional completeness(BUSCO score:95.7%)and accuracy(QV=48.11).Comparative genomic analysis identified 3515 gene families unique to S.moellendorffii,with significant enrichment in secondary metabolismpathways,including those related to flavonoid biosynthesis.Phylogenetic analysis revealed that S.moellendorffii diverged from Isoetes approximately 339.6 million years ago(MYA),representing a key evolutionary transition in early vascular plants.By integrating tissue-specific transcriptome and metabolome analyses,we uncovered the molecular basis of biflavone biosynthesis,identifying key enzymes and regulatory networks that govern the production of these bioactive compounds.We observed a correlation between the tissue-specific accumulation patterns of six major biflavones,including amentoflavone and ginkgetin,and the expression of their biosynthetic genes.This high-quality genome assembly,coupled with multi-omics analyses,offers unprecedented insights into the evolution of early vascular plants and elucidates the molecular mechanisms behind their specialized metabolism.展开更多
Common bean(Phaseolus vulgaris L.)is a vital source of protein and essential nutrients for human consumption and plays a key role in sustainable agriculture due to its nitrogen-fixing ability(Nadeem et al.,2021).Kidne...Common bean(Phaseolus vulgaris L.)is a vital source of protein and essential nutrients for human consumption and plays a key role in sustainable agriculture due to its nitrogen-fixing ability(Nadeem et al.,2021).Kidney beans,a subcategory of dry common beans,are highly valued for their rich protein,dietary fiber,low fat content,and various trace elements(Garcia-Cordero et al.,2021).Despite the release of several de novo genome assemblies(Goodstein et al.,2012;Schmutz et al.,2014;Vlasova et al.,2016;Cortinovis et al.,2024),existing common bean genomes remain incomplete,particularly in complex regions such as centromeres and telomeres,limiting a comprehensive understanding of the genomic landscape.展开更多
Sechium edule(chayote)is an important vegetable crop belonging to the Cucurbitaceae family.To decipher the chayote genome,a highquality chromosome-level chayote genome was obtained by genome sequencing and bioinformat...Sechium edule(chayote)is an important vegetable crop belonging to the Cucurbitaceae family.To decipher the chayote genome,a highquality chromosome-level chayote genome was obtained by genome sequencing and bioinformatic analysis.The total length was612.91 Mb,and 25755 genes were detected in the chayote genome.The contig N50 was more than 20.01 Mb,and the scaffold N50 was over47.11 Mb.Of the genome,60.35%were composed of repetitive sequences,and 31.18%of genome sequences belonged to long-terminal repeats.A global alignment of homologous regions in chayote and other Cucurbitaceae plant genomes was constructed using grape as a reference.Based on this genome-wide and global alignment map,researchers can easily identify homologous collinear genes of the studied genomes in most Cucurbitaceae species.Twenty-five chayote accessions were divided into two subgroups based on phylogenetic tree,population structure analysis,and principal component analysis using genome re-sequencing data.The chayote genome,re-sequencing dataset,and comprehensive genomic analysis will accelerate comparative and functional genomic analysis of chayote and other Cucurbitaceae species in the future.展开更多
The genus Oryza consists of two cultivated species (O. sativa L. and O. glaberrima Steud.) and approximately 20 wild relative species widely distributed in the pan-tropics. These species have been classified into four...The genus Oryza consists of two cultivated species (O. sativa L. and O. glaberrima Steud.) and approximately 20 wild relative species widely distributed in the pan-tropics. These species have been classified into four complexes following the Vaughan's taxonomic system([1]). The O. officinalis complex is the largest complex in the genus, which includes ten species, having BE, CC, on, and EE genomes in the diploids as well as BBCC and CCDD genomes in the tetraploids. The relationships among the BE, CC, and EE genomes still remain unclear, although previous studies have indicated certain affinities of these genomes([2-4]). Genomic in situ hybridization (GISH) is a powerful technique to detect the relationships among the related genomes at chromosome and DNA levels. The objective of the present study was to investigate the relationships among the BE, CC and EE genomes in the genus Oryza by the two-probe GISH.展开更多
Juglans sigillata is an economically valuable nut crop renowned for its nutritional richness,including essential nutrients,antioxidants,and healthy fats,which boost human cardial,brain and gut health.Despite its impor...Juglans sigillata is an economically valuable nut crop renowned for its nutritional richness,including essential nutrients,antioxidants,and healthy fats,which boost human cardial,brain and gut health.Despite its importance,the lack of a complete genome assembly has been a stumbling block in its biological breeding process.Therefore,we generated deep coverage ultralong Oxford Nanopore Technology(ONT)and PacBio HiFi reads to construct a telomere-to-telomere(T2T)genome assembly.The final assembly spans 537.27 Mb with no gaps,demonstrating a remarkable completeness of 98.1%.We utilized a combination of transcriptome data and homologous proteins to annotate the genome,identifying 36018 protein-coding genes.Furthermore,we profiled global cytosine DNA methylations using ONT sequencing data.Global methylome analysis revealed high methylation levels in transposable element(TE)-rich chromosomal regions juxtaposed with comparatively lower methylation in gene-rich areas.By integrating a detailed multi-omics data analysis,we obtained valuable insights into the mechanism underlying endopleura coloration.This investigation led to the identification of eight candidate genes(e.g.ANR)involved in anthocyanin biosynthesis pathways,which are crucial for the development of color in plants.The comprehensive genome assembly and the understanding of the genetic basis of important traits like endopleura coloration will open avenues for more efficient breeding programs and improved crop quality.展开更多
In-depth knowledge of the microbes responsible for biogenic amine(BA)production during soy sauce fermentation remains limited.Herein,the variations in the BA profiles,microbial communities,and microbes involved in BA ...In-depth knowledge of the microbes responsible for biogenic amine(BA)production during soy sauce fermentation remains limited.Herein,the variations in the BA profiles,microbial communities,and microbes involved in BA production during the fermentation of soy sauce through Japanese-type(JP)and Cantonese-type(CP)processes were compared.BA analysis revealed that the most abundant BA species were putrescine,tyramine,and histamine in the later three stages(1187.68,785.16,and 193.20 mg/kg on average,respectively).The BA profiles differed significantly,with CP samples containing higher contents of putrescine,tyramine,and histamine(P<0.05)at the end of fermentation.Metagenomic analysis indicated that BA-producing genes exhibited different abundance profiles,with most genes,including spe A,spe B,arg,spe E,and tyr DC,having higher abundances in microbial communities during the CP process.In total,15 high-quality metagenome-assembled genomes(MAGs)were retrieved,of which 10 encoded at BA production-related genes.Enterococcus faecium(MAG10)and Weissella paramesenteroides(MAG5)might be the major tyramine producers.The high putrescine content in CP might be associated with the high abundance of Staphylococcus gallinarum(MAG8).This study provides a comprehensive understanding of the diversity and abundance of genes involved in BA synthesis,especially at the species level,during food fermentation.展开更多
Genetic information has been instrumental in elucidating the relationship between the East Asian Summer Monsoon(EASM)and subtropical evergreen broad-leaved forests(EBLFs).However,how the genomic insights of EBLFs’spe...Genetic information has been instrumental in elucidating the relationship between the East Asian Summer Monsoon(EASM)and subtropical evergreen broad-leaved forests(EBLFs).However,how the genomic insights of EBLFs’species correspond to environmental shifts induced by the EASM remains limited.In this study,we investigated the adaptive mechanisms of evergreen Engelhardia species in response to the EASM through genome sequencing and comparative genomic analyses from the de novo genome assemblies of fiveclosely related Engelhardia taxa and one Rhoiptelea species.Our findingsrevealed that the divergence of evergreen trees from their sister deciduous species is closely associated with the onset and intensification of the EASM.This genomic transitionmayhave coincided with a significantexpansion of the terpene synthase(TPS)gene family in E.fenzelii,driven by four distinct modes of gene duplication.This expansion enhances the biosynthesis of terpene volatiles,providing a defensive mechanism against potential herbivory in EASM affected environments.We also identifieda shared whole-genome duplication(WGD)event across Engelhardia,along with substantial differences in transposable element(TE)composition and activity,which contributed to genome size variation between E.fenzelii and E.roxburghiana.In addition,demographic analyses revealed a continuous population decline over the past 10 million years,further exacerbated by recenthumandisturbance,underscoring the conservation urgency for these species.These results not only provide preliminary insights into the complex evolutionary dynamics within the Engelhardia genus from genomic insights(e.g.,the intricate relationships between genomic variations,environmental changes,and adaptive responses driven by significantclimatic events such as the EASM),but also provides valuable insights into the conservation significance of EBLFs.展开更多
Precise chromosome engineering has traditionally relied on the Cre-Lox recombination system-an approach in which the enzyme Cre functions like molecular scissors,cutting and rejoining DNA at specific“Lox”sites to ad...Precise chromosome engineering has traditionally relied on the Cre-Lox recombination system-an approach in which the enzyme Cre functions like molecular scissors,cutting and rejoining DNA at specific“Lox”sites to add,remove,or flip genomic DNA segments inside living cells.展开更多
基金supported by the National Key R&D Program of China(2024YFD1201100)the research program from the Zhongshan Biological Breeding Laboratory(ZSBBL-KY2023-02)the National Natural Science Foundation of China(32341037).
文摘Wheat(Triticum aestivum)faces significant threats from diseases such as powdery mildew(Blumeria graminis)and Fusarium head blight(FHB;caused by Fusarium graminearum),which cause severe yield losses.Moreover,the antagonism between yield-related traits and disease resistance makes yield resistance coordination a major challenge in wheat breeding.The lack of genetic resources combining both disease resistance and high yield constrains the elucidation of underlying resistance-yield trade-off mechanisms,thereby hindering the development of high-yield and disease-resistant wheat cultivars.Remarkably,Yangmai 33(YM33),a notable wheat cultivar with resistance to both powdery mildew and FHB as well as high-yield performance,was recently developed.It offers a unique opportunity to dissect the genomic architecture underlying the coordination between disease resistance and yield.
基金supported by the Jilin Science and Technology Development Program,China (20240602032RC)the Jilin Agricultural Science and Technology Innovation Project,China (CXGC2024ZD001)+1 种基金the Jilin Agricultural Science and Technology Innovation Project,China (CXGC2024ZY012)the Jilin Province Development and Reform Commission-Project for Improving the Independent Innovation Capacity of Major Grain Crops,China (2024C002)。
文摘Emerging and powerful genome editing tools,particularly CRISPR/Cas9,are facilitating functional genomics research and accelerating crop improvement(Jiang et al.2021;Cao et al.2023;Chen C et al.2023;Liu et al.2023a).However,the detection and screening of transgenic lines remain major bottlenecks,being time-consuming,labor-intensive,and inefficient during transformation and subsequent mutation identification.A simple and efficient visual marker system plays a critical role in addressing these challenges.Recent studies demonstrated that the GmW1 and RUBY reporter systems were used to obtain visual transgenic soybean(Glycine max) plants(Chen L et al.2023;Chen et al.2024).
文摘Iris domestica,a perennial herb of the Iridaceae family,is widely recognized for its rich isoflavone content and broad therapeutic properties.To elucidate the biosynthetic pathway of these medicinally significant compounds,we constructed a haplotype-resolved genome assembly of this species.Transcriptomic and metabolomic analyses revealed tissue-specific accumulation of isoflavone,particularly in rhizomes and roots.Functional characterization identified two candidate isoflavone synthase genes,among which IdIFS was confirmed to promote the biosynthesis of key compounds tectorigenin and irisflorentin.The high-quality genome assembly presented here provides a foundational resource for further research into the evolution,secondary metabolite,and environmental adaptation of I.domestica.
基金supported by the National Natural Science Foundation of China(U23A20160,32360336)Guizhou Provincial Key Technology R&D Program(Qian KeHe ZhiCheng[2023]YiBan035).
文摘Natural hybridization is known to play a vital role in speciation;however,the mechanisms underlying the early stages of natural hybridization remain unclear.Where two plant species come into contact,two driving forces may balance the dynamic consequences of hybridization:fusion by hybridization-mediated gene flow,and separation by reproductive isolation(RI)(Ma et al.,2010a,b;Chang et al.,2022).
基金supported by the Biological BreedingMajor Projects(2023ZD04076)the National Natural Science Foundation of China(32300312)+2 种基金the Innovation Program of Chinses Academy of Agricultural Sciences(CAAS-CSIAF-202303)the Guangdong Basic and Applied Basic Research Foundation(2020B1515120086)the KeyArea Research and Development Program of Guangdong Province(2021B0707010006)。
文摘Rice, a global staple food, is critical for food security. The cultivated Oryza sativa, domesticated from wild O. rufipogon, derives~80%of its 993 identified domestication-related genes from O. rufipogon and 20%from South/Southeast Asian wild O. nivara(Jing et al., 2023). Genes like An-1, BH4, PROG1,SH4, Rc, Rd, and GS3—which regulate awn length, hull color,til er angle, seed shattering, pericarp color, seed length, and thousand-grain weight, respectively—were selected against during domestication to form modern O. sativa(Yu et al., 2021).However, domestication and yield-focused breeding eliminated wild rice's valuable genes(e.g., for disease resistance, stress tolerance, nutrition), narrowing genetic diversity and impeding efforts to meet growing societal demands.
基金supported by the China Agriculture Research System of MOF and MARAthe National Natural Science Foundation of China (31872337 and 31501919)the Agricultural Science and Technology Innovation Project,China (ASTIP-IAS02)。
文摘The advantages of genome selection(GS) in animal and plant breeding are self-evident.Traditional parametric models have disadvantage in better fit the increasingly large sequencing data and capture complex effects accurately.Machine learning models have demonstrated remarkable potential in addressing these challenges.In this study,we introduced the concept of mixed kernel functions to explore the performance of support vector machine regression(SVR) in GS.Six single kernel functions(SVR_L,SVR_C,SVR_G,SVR_P,SVR_S,SVR_L) and four mixed kernel functions(SVR_GS,SVR_GP,SVR_LS,SVR_LP) were used to predict genome breeding values.The prediction accuracy,mean squared error(MSE) and mean absolute error(MAE) were used as evaluation indicators to compare with two traditional parametric models(GBLUP,BayesB) and two popular machine learning models(RF,KcRR).The results indicate that in most cases,the performance of the mixed kernel function model significantly outperforms that of GBLUP,BayesB and single kernel function.For instance,for T1 in the pig dataset,the predictive accuracy of SVR_GS is improved by 10% compared to GBLUP,and by approximately 4.4 and 18.6% compared to SVR_G and SVR_S respectively.For E1 in the wheat dataset,SVR_GS achieves 13.3% higher prediction accuracy than GBLUP.Among single kernel functions,the Laplacian and Gaussian kernel functions yield similar results,with the Gaussian kernel function performing better.The mixed kernel function notably reduces the MSE and MAE when compared to all single kernel functions.Furthermore,regarding runtime,SVR_GS and SVR_GP mixed kernel functions run approximately three times faster than GBLUP in the pig dataset,with only a slight increase in runtime compared to the single kernel function model.In summary,the mixed kernel function model of SVR demonstrates speed and accuracy competitiveness,and the model such as SVR_GS has important application potential for GS.
基金supported by the Natural Science Foundation of China(32272364)the Shanghai Education Committee Scientific Research Innovation Projects,China(2101070007800120)+2 种基金National Science Foundation for Distinguished Young Scholars(32025029)Shanghai Key Project in Synthetic Biology(23HC1400900)the Shanghai Engineering Research Center of 460 Food Microbiology Program(19DZ2281100).
文摘This study examined the potential response mechanisms of Ligilactobacillus salivarius AR612 to glucose stress through whole-genome and comparative transcriptome analysis.We obtained the basic genome information of L.salivarius AR612.The full genome length of L.salivarius AR612 was 1970245 bp,with a GC content of 33.01%and 1894 coding genes.Moreover,we identified many genes associated with genetic adaptations to various stress factors,including temperature,p H,osmotic pressure,bile salts,and oxidative stress.Physiological analysis revealed that the growth and morphology of AR612 changed significantly under glucose stress,with a decrease in the maximum growth and irregular cell morphology.Furthermore,a comparison of transcriptome data indicated that glucose stress induced changes in the number of differential genes.Moreover,AR612 could respond to extracellular glucose stress by changing the expression of genes related to cell morphology,carbohydrate metabolism,amino acid metabolism,fatty acid synthesis,and nucleotide metabolism.This study provides valuable theoretical insights for future research on the adaptation of L.salivarius AR612 to nutritional stress and its application in industrial processes.
基金supported by the National Natural Science Foundation of China(32270217,31970205,31770211)Metasequoia funding of Nanjing Forestry University to YY。
文摘Amborella trichopoda(Amborellaceae;hereafter simply Amborella)(Fig.1A)is a shrub endemic to New Caledonia in the Southwest Pacific that represents the sole sister species of all other extant angiosperms(Qiu et al.,1999;One Thousand Plant Transcriptomes Initiative,2019).Due to its unique phylogenetic status,it holds tremendous interest for botanists.The nuclear and mitochondrial genomes of Amborella were first published in 2013,providing valuable resources for studies on genome and gene family evolution,phylogenomics,and flower development,despite the fact that the assembly is heavily fragmented(Amborella Genome Project,2013;Rice et al.,2013).In 2024,a haplotype-resolved Amborella genome assembly was published,showing significant improvement in quality and completeness(Carey et al.,2024).
文摘Fig.1.The GenomeSyn tool for visualizing genome synteny and characterizing structural variations.A:The first synteny visualization map showed the detailed information of two or three genomes and can display structural variations and other annotation information.B:The second type of visualization map was simple and only showed the synteny relationship between the chromosomes of two or three genomes.C:Multiplatform general GenomeSyn submission page,applicable to Windows,MAC and web platforms;other analysis files can be entered in the"other"option.The publisher would like to apologise for any inconvenience caused.
基金the project “Genomics for Conservation of Indigenous Cattle Breeds and for Enhancing Milk Yield, Phase-I” [BT/ PR26466/AAQ/1/704/2017], funded by the Department of Biotechnology (DBT ), Indiathe project “Identification of key molecular factors involved in resistance/susceptibility to paratuberculosis infection in indigenous breeds of cows” [BT/PR32758/AAQ/1/760/2019], which was also funded by Department of Biotechnology (DBT ), India。
文摘Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the genomic landscape of Indian cattle.To comprehensively characterize the genomic variation within Bos indicus and,specifically,dairy breeds,we aim to identify non-reference sequences and construct a comprehensive pangenome.Results Five representative genomes of prominent dairy breeds,including Gir,Kankrej,Tharparkar,Sahiwal,and Red Sindhi,were sequenced using 10X Genomics‘linked-read’technology.Assemblies generated from these linked-reads ranged from 2.70 Gb to 2.77 Gb,comparable to the Bos indicus Brahman reference genome.A pangenome of Bos indicus cattle was constructed by comparing the newly assembled genomes with the reference using alignment and graph-based methods,revealing 8 Mb and 17.7 Mb of novel sequence respectively.A confident set of 6,844 Non-reference Unique Insertions(NUIs)spanning 7.57 Mb was identified through both methods,representing the pange-nome of Indian Bos indicus breeds.Comparative analysis with previously published pangenomes unveiled 2.8 Mb(37%)commonality with the Chinese indicine pangenome and only 1%commonality with the Bos taurus pange-nome.Among these,2,312 NUIs encompassing~2 Mb,were commonly found in 98 samples of the 5 breeds and des-ignated as Bos indicus Common Insertions(BICIs)in the population.Furthermore,926 BICIs were identified within 682 protein-coding genes,54 long non-coding RNAs(lncRNA),and 18 pseudogenes.These protein-coding genes were enriched for functions such as chemical synaptic transmission,cell junction organization,cell-cell adhesion,and cell morphogenesis.The protein-coding genes were found in various prominent quantitative trait locus(QTL)regions,suggesting potential roles of BICIs in traits related to milk production,reproduction,exterior,health,meat,and carcass.Notably,63.21%of the bases within the BICIs call set contained interspersed repeats,predominantly Long Inter-spersed Nuclear Elements(LINEs).Additionally,70.28%of BICIs are shared with other domesticated and wild species,highlighting their evolutionary significance.Conclusions This is the first report unveiling a robust set of NUIs defining the pangenome of Bos indicus breeds of India.The analyses contribute valuable insights into the genomic landscape of desi cattle breeds.
基金supported by the National Natural Science Foundation of China(32201873)the Key Research and Development Plan of Hubei Province(2023BBB050)。
文摘The genetic basis for Gossypium hirsutum race latifolium,the putative ancestor of cultivated upland cotton,emerging from the semi-wild races to be domesticated into cultivated upland cotton is unknown.Here,we reported a high-quality genome assembly of G.latifolium.Comparative genome analyses revealed substantial variations in both gene group composition and genomic sequences across 13 cotton genomes,including the expansion of photosynthesis-related gene groups in G.latifolium compared with other races and the pivotal contribution of structural variations(SVs)to G.hirsutum domestication.Based on the resequencing reads and constructed pan-genome of upland cotton,co-selection regions and SVs with significant frequency differences among different populations were identified.Genes located in these regions or affected by these variations may characterize the differences between G.latifolium and other races,and could be involved in maintenance of upland cotton domestication phenotypes.These findings may assist in mining genes for upland cotton improvement and improving the understanding of the genetic basis of upland cotton domestication.
基金funded by the National Natural Science Foundation of China(Grant No.81903921)the Key project at central government level:The ability establishment of sustainable use for valuable Chinese medicine resources(2060302)the Distinguished Young Scholars of Hubei University of Chinese Medicine(Grant No.2022ZZXJ002).
文摘Selaginella moellendorffii Hieron.,a lycophyte of significant medicinal and evolutionary importance,is recognized as one of the earliest vascular plants.However,the absence of a high-quality reference genome has hindered the comprehensive exploration of its unique phylogenetic position and therapeutic potential,thereby limiting our understanding of its genomic structure and metabolic capabilities.In this study,we present the first chromosome-level,telomere-to-telomere(T2T)genome assembly of S.moellendorffii,constructed utilizing PacBio HiFi,Oxford Nanopore(ONT),and Hi-C technologies.The assembled genome,spanning 112.83 Mb across 10 chromosomes with a contig N50 of 11.11 Mb,exhibited exceptional completeness(BUSCO score:95.7%)and accuracy(QV=48.11).Comparative genomic analysis identified 3515 gene families unique to S.moellendorffii,with significant enrichment in secondary metabolismpathways,including those related to flavonoid biosynthesis.Phylogenetic analysis revealed that S.moellendorffii diverged from Isoetes approximately 339.6 million years ago(MYA),representing a key evolutionary transition in early vascular plants.By integrating tissue-specific transcriptome and metabolome analyses,we uncovered the molecular basis of biflavone biosynthesis,identifying key enzymes and regulatory networks that govern the production of these bioactive compounds.We observed a correlation between the tissue-specific accumulation patterns of six major biflavones,including amentoflavone and ginkgetin,and the expression of their biosynthetic genes.This high-quality genome assembly,coupled with multi-omics analyses,offers unprecedented insights into the evolution of early vascular plants and elucidates the molecular mechanisms behind their specialized metabolism.
基金supported by the National Natural Science Foundation of China(32241045,32241046,32241038)the Major Special Science and Technology Projects in Shanxi Province(202101140601027)+3 种基金Shanxi Provincial Agricultural Key Technologies Breakthrough Project(NYGG01)Doctoral Research Starting Project at Shanxi Agricultural University(2024BQ77)the National Key Research and Development Program of China(2023YFD1202705/2023YFD120270503,2023YFD1202703/2023YFD1202703-4)Shanxi HouJi Laboratory Self-proposed Research Project(202304010930003/202304010930003-03).
文摘Common bean(Phaseolus vulgaris L.)is a vital source of protein and essential nutrients for human consumption and plays a key role in sustainable agriculture due to its nitrogen-fixing ability(Nadeem et al.,2021).Kidney beans,a subcategory of dry common beans,are highly valued for their rich protein,dietary fiber,low fat content,and various trace elements(Garcia-Cordero et al.,2021).Despite the release of several de novo genome assemblies(Goodstein et al.,2012;Schmutz et al.,2014;Vlasova et al.,2016;Cortinovis et al.,2024),existing common bean genomes remain incomplete,particularly in complex regions such as centromeres and telomeres,limiting a comprehensive understanding of the genomic landscape.
基金supported by the National Natural Science Foundation of China Project(Grant No.32260097)the National Guidance Foundation for Local Science and Technology Development of China(Grant No.[2023]009)the Natural Science Foundation for Distinguished Young Scholars of Hebei(Grant No.C2022209010)。
文摘Sechium edule(chayote)is an important vegetable crop belonging to the Cucurbitaceae family.To decipher the chayote genome,a highquality chromosome-level chayote genome was obtained by genome sequencing and bioinformatic analysis.The total length was612.91 Mb,and 25755 genes were detected in the chayote genome.The contig N50 was more than 20.01 Mb,and the scaffold N50 was over47.11 Mb.Of the genome,60.35%were composed of repetitive sequences,and 31.18%of genome sequences belonged to long-terminal repeats.A global alignment of homologous regions in chayote and other Cucurbitaceae plant genomes was constructed using grape as a reference.Based on this genome-wide and global alignment map,researchers can easily identify homologous collinear genes of the studied genomes in most Cucurbitaceae species.Twenty-five chayote accessions were divided into two subgroups based on phylogenetic tree,population structure analysis,and principal component analysis using genome re-sequencing data.The chayote genome,re-sequencing dataset,and comprehensive genomic analysis will accelerate comparative and functional genomic analysis of chayote and other Cucurbitaceae species in the future.
文摘The genus Oryza consists of two cultivated species (O. sativa L. and O. glaberrima Steud.) and approximately 20 wild relative species widely distributed in the pan-tropics. These species have been classified into four complexes following the Vaughan's taxonomic system([1]). The O. officinalis complex is the largest complex in the genus, which includes ten species, having BE, CC, on, and EE genomes in the diploids as well as BBCC and CCDD genomes in the tetraploids. The relationships among the BE, CC, and EE genomes still remain unclear, although previous studies have indicated certain affinities of these genomes([2-4]). Genomic in situ hybridization (GISH) is a powerful technique to detect the relationships among the related genomes at chromosome and DNA levels. The objective of the present study was to investigate the relationships among the BE, CC and EE genomes in the genus Oryza by the two-probe GISH.
基金supported by the Yunnan Seed Laboratory,China(202205AR070001-15)the National Natural Science Foundation of China,China(Grant No.32160697)。
文摘Juglans sigillata is an economically valuable nut crop renowned for its nutritional richness,including essential nutrients,antioxidants,and healthy fats,which boost human cardial,brain and gut health.Despite its importance,the lack of a complete genome assembly has been a stumbling block in its biological breeding process.Therefore,we generated deep coverage ultralong Oxford Nanopore Technology(ONT)and PacBio HiFi reads to construct a telomere-to-telomere(T2T)genome assembly.The final assembly spans 537.27 Mb with no gaps,demonstrating a remarkable completeness of 98.1%.We utilized a combination of transcriptome data and homologous proteins to annotate the genome,identifying 36018 protein-coding genes.Furthermore,we profiled global cytosine DNA methylations using ONT sequencing data.Global methylome analysis revealed high methylation levels in transposable element(TE)-rich chromosomal regions juxtaposed with comparatively lower methylation in gene-rich areas.By integrating a detailed multi-omics data analysis,we obtained valuable insights into the mechanism underlying endopleura coloration.This investigation led to the identification of eight candidate genes(e.g.ANR)involved in anthocyanin biosynthesis pathways,which are crucial for the development of color in plants.The comprehensive genome assembly and the understanding of the genetic basis of important traits like endopleura coloration will open avenues for more efficient breeding programs and improved crop quality.
基金supported by the Natural Science Foundation of Guangdong Province(2022A1515012158)the National Science Foundation of China(41977138)+3 种基金the Construction Project of Teaching Quality and Teaching Reform in Guangdong Province(SJD202001)the General University Project of Guangdong Provincial Department of Education(2021KCXTD070 and 2021ZDZX4072)the Key Project of Social Welfare and Basic Research of Zhongshan City(2020B2010)the Start-up Fund from the Zhongshan Institute at the University of Electronic Science and Technology in China(419YKQN12)。
文摘In-depth knowledge of the microbes responsible for biogenic amine(BA)production during soy sauce fermentation remains limited.Herein,the variations in the BA profiles,microbial communities,and microbes involved in BA production during the fermentation of soy sauce through Japanese-type(JP)and Cantonese-type(CP)processes were compared.BA analysis revealed that the most abundant BA species were putrescine,tyramine,and histamine in the later three stages(1187.68,785.16,and 193.20 mg/kg on average,respectively).The BA profiles differed significantly,with CP samples containing higher contents of putrescine,tyramine,and histamine(P<0.05)at the end of fermentation.Metagenomic analysis indicated that BA-producing genes exhibited different abundance profiles,with most genes,including spe A,spe B,arg,spe E,and tyr DC,having higher abundances in microbial communities during the CP process.In total,15 high-quality metagenome-assembled genomes(MAGs)were retrieved,of which 10 encoded at BA production-related genes.Enterococcus faecium(MAG10)and Weissella paramesenteroides(MAG5)might be the major tyramine producers.The high putrescine content in CP might be associated with the high abundance of Staphylococcus gallinarum(MAG8).This study provides a comprehensive understanding of the diversity and abundance of genes involved in BA synthesis,especially at the species level,during food fermentation.
基金supported by the National Natural Science Foundation of China(No.42171063)Southeast Asia Biodiversity Research Institute,Chinese Academy of Sciences(No.Y4ZK111B01)+6 种基金the Special Fund for ScientificResearch of Shanghai Landscaping&City Appearance Administrative Bureau(G242414,G242416)the“Yunnan Revitalization Talent Support Program”in Yunnan Province(XDYC-QNRC-2022-0028)Yunnan Revitalization Talent Support Program“Innovation Team”Project(202405AS350019)the CAS“Light of West China”Programthe 14th Five-Year Plan of Xishuangbanna Tropical Botanical Garden,Chinese Academy Sciences(XTBG-1450303)the European Research Council(ERC)under the European Union's Horizon 2020 research and innovation program(No.833522)GhentUniversity(Methusalem funding,BOF.MET.2021.0005.01).
文摘Genetic information has been instrumental in elucidating the relationship between the East Asian Summer Monsoon(EASM)and subtropical evergreen broad-leaved forests(EBLFs).However,how the genomic insights of EBLFs’species correspond to environmental shifts induced by the EASM remains limited.In this study,we investigated the adaptive mechanisms of evergreen Engelhardia species in response to the EASM through genome sequencing and comparative genomic analyses from the de novo genome assemblies of fiveclosely related Engelhardia taxa and one Rhoiptelea species.Our findingsrevealed that the divergence of evergreen trees from their sister deciduous species is closely associated with the onset and intensification of the EASM.This genomic transitionmayhave coincided with a significantexpansion of the terpene synthase(TPS)gene family in E.fenzelii,driven by four distinct modes of gene duplication.This expansion enhances the biosynthesis of terpene volatiles,providing a defensive mechanism against potential herbivory in EASM affected environments.We also identifieda shared whole-genome duplication(WGD)event across Engelhardia,along with substantial differences in transposable element(TE)composition and activity,which contributed to genome size variation between E.fenzelii and E.roxburghiana.In addition,demographic analyses revealed a continuous population decline over the past 10 million years,further exacerbated by recenthumandisturbance,underscoring the conservation urgency for these species.These results not only provide preliminary insights into the complex evolutionary dynamics within the Engelhardia genus from genomic insights(e.g.,the intricate relationships between genomic variations,environmental changes,and adaptive responses driven by significantclimatic events such as the EASM),but also provides valuable insights into the conservation significance of EBLFs.
文摘Precise chromosome engineering has traditionally relied on the Cre-Lox recombination system-an approach in which the enzyme Cre functions like molecular scissors,cutting and rejoining DNA at specific“Lox”sites to add,remove,or flip genomic DNA segments inside living cells.