Emerging and powerful genome editing tools,particularly CRISPR/Cas9,are facilitating functional genomics research and accelerating crop improvement(Jiang et al.2021;Cao et al.2023;Chen C et al.2023;Liu et al.2023a).Ho...Emerging and powerful genome editing tools,particularly CRISPR/Cas9,are facilitating functional genomics research and accelerating crop improvement(Jiang et al.2021;Cao et al.2023;Chen C et al.2023;Liu et al.2023a).However,the detection and screening of transgenic lines remain major bottlenecks,being time-consuming,labor-intensive,and inefficient during transformation and subsequent mutation identification.A simple and efficient visual marker system plays a critical role in addressing these challenges.Recent studies demonstrated that the GmW1 and RUBY reporter systems were used to obtain visual transgenic soybean(Glycine max) plants(Chen L et al.2023;Chen et al.2024).展开更多
Natural hybridization is known to play a vital role in speciation;however,the mechanisms underlying the early stages of natural hybridization remain unclear.Where two plant species come into contact,two driving forces...Natural hybridization is known to play a vital role in speciation;however,the mechanisms underlying the early stages of natural hybridization remain unclear.Where two plant species come into contact,two driving forces may balance the dynamic consequences of hybridization:fusion by hybridization-mediated gene flow,and separation by reproductive isolation(RI)(Ma et al.,2010a,b;Chang et al.,2022).展开更多
The advantages of genome selection(GS) in animal and plant breeding are self-evident.Traditional parametric models have disadvantage in better fit the increasingly large sequencing data and capture complex effects acc...The advantages of genome selection(GS) in animal and plant breeding are self-evident.Traditional parametric models have disadvantage in better fit the increasingly large sequencing data and capture complex effects accurately.Machine learning models have demonstrated remarkable potential in addressing these challenges.In this study,we introduced the concept of mixed kernel functions to explore the performance of support vector machine regression(SVR) in GS.Six single kernel functions(SVR_L,SVR_C,SVR_G,SVR_P,SVR_S,SVR_L) and four mixed kernel functions(SVR_GS,SVR_GP,SVR_LS,SVR_LP) were used to predict genome breeding values.The prediction accuracy,mean squared error(MSE) and mean absolute error(MAE) were used as evaluation indicators to compare with two traditional parametric models(GBLUP,BayesB) and two popular machine learning models(RF,KcRR).The results indicate that in most cases,the performance of the mixed kernel function model significantly outperforms that of GBLUP,BayesB and single kernel function.For instance,for T1 in the pig dataset,the predictive accuracy of SVR_GS is improved by 10% compared to GBLUP,and by approximately 4.4 and 18.6% compared to SVR_G and SVR_S respectively.For E1 in the wheat dataset,SVR_GS achieves 13.3% higher prediction accuracy than GBLUP.Among single kernel functions,the Laplacian and Gaussian kernel functions yield similar results,with the Gaussian kernel function performing better.The mixed kernel function notably reduces the MSE and MAE when compared to all single kernel functions.Furthermore,regarding runtime,SVR_GS and SVR_GP mixed kernel functions run approximately three times faster than GBLUP in the pig dataset,with only a slight increase in runtime compared to the single kernel function model.In summary,the mixed kernel function model of SVR demonstrates speed and accuracy competitiveness,and the model such as SVR_GS has important application potential for GS.展开更多
Amborella trichopoda(Amborellaceae;hereafter simply Amborella)(Fig.1A)is a shrub endemic to New Caledonia in the Southwest Pacific that represents the sole sister species of all other extant angiosperms(Qiu et al.,199...Amborella trichopoda(Amborellaceae;hereafter simply Amborella)(Fig.1A)is a shrub endemic to New Caledonia in the Southwest Pacific that represents the sole sister species of all other extant angiosperms(Qiu et al.,1999;One Thousand Plant Transcriptomes Initiative,2019).Due to its unique phylogenetic status,it holds tremendous interest for botanists.The nuclear and mitochondrial genomes of Amborella were first published in 2013,providing valuable resources for studies on genome and gene family evolution,phylogenomics,and flower development,despite the fact that the assembly is heavily fragmented(Amborella Genome Project,2013;Rice et al.,2013).In 2024,a haplotype-resolved Amborella genome assembly was published,showing significant improvement in quality and completeness(Carey et al.,2024).展开更多
Fig.1.The GenomeSyn tool for visualizing genome synteny and characterizing structural variations.A:The first synteny visualization map showed the detailed information of two or three genomes and can display structural...Fig.1.The GenomeSyn tool for visualizing genome synteny and characterizing structural variations.A:The first synteny visualization map showed the detailed information of two or three genomes and can display structural variations and other annotation information.B:The second type of visualization map was simple and only showed the synteny relationship between the chromosomes of two or three genomes.C:Multiplatform general GenomeSyn submission page,applicable to Windows,MAC and web platforms;other analysis files can be entered in the"other"option.The publisher would like to apologise for any inconvenience caused.展开更多
Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the...Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the genomic landscape of Indian cattle.To comprehensively characterize the genomic variation within Bos indicus and,specifically,dairy breeds,we aim to identify non-reference sequences and construct a comprehensive pangenome.Results Five representative genomes of prominent dairy breeds,including Gir,Kankrej,Tharparkar,Sahiwal,and Red Sindhi,were sequenced using 10X Genomics‘linked-read’technology.Assemblies generated from these linked-reads ranged from 2.70 Gb to 2.77 Gb,comparable to the Bos indicus Brahman reference genome.A pangenome of Bos indicus cattle was constructed by comparing the newly assembled genomes with the reference using alignment and graph-based methods,revealing 8 Mb and 17.7 Mb of novel sequence respectively.A confident set of 6,844 Non-reference Unique Insertions(NUIs)spanning 7.57 Mb was identified through both methods,representing the pange-nome of Indian Bos indicus breeds.Comparative analysis with previously published pangenomes unveiled 2.8 Mb(37%)commonality with the Chinese indicine pangenome and only 1%commonality with the Bos taurus pange-nome.Among these,2,312 NUIs encompassing~2 Mb,were commonly found in 98 samples of the 5 breeds and des-ignated as Bos indicus Common Insertions(BICIs)in the population.Furthermore,926 BICIs were identified within 682 protein-coding genes,54 long non-coding RNAs(lncRNA),and 18 pseudogenes.These protein-coding genes were enriched for functions such as chemical synaptic transmission,cell junction organization,cell-cell adhesion,and cell morphogenesis.The protein-coding genes were found in various prominent quantitative trait locus(QTL)regions,suggesting potential roles of BICIs in traits related to milk production,reproduction,exterior,health,meat,and carcass.Notably,63.21%of the bases within the BICIs call set contained interspersed repeats,predominantly Long Inter-spersed Nuclear Elements(LINEs).Additionally,70.28%of BICIs are shared with other domesticated and wild species,highlighting their evolutionary significance.Conclusions This is the first report unveiling a robust set of NUIs defining the pangenome of Bos indicus breeds of India.The analyses contribute valuable insights into the genomic landscape of desi cattle breeds.展开更多
The genetic basis for Gossypium hirsutum race latifolium,the putative ancestor of cultivated upland cotton,emerging from the semi-wild races to be domesticated into cultivated upland cotton is unknown.Here,we reported...The genetic basis for Gossypium hirsutum race latifolium,the putative ancestor of cultivated upland cotton,emerging from the semi-wild races to be domesticated into cultivated upland cotton is unknown.Here,we reported a high-quality genome assembly of G.latifolium.Comparative genome analyses revealed substantial variations in both gene group composition and genomic sequences across 13 cotton genomes,including the expansion of photosynthesis-related gene groups in G.latifolium compared with other races and the pivotal contribution of structural variations(SVs)to G.hirsutum domestication.Based on the resequencing reads and constructed pan-genome of upland cotton,co-selection regions and SVs with significant frequency differences among different populations were identified.Genes located in these regions or affected by these variations may characterize the differences between G.latifolium and other races,and could be involved in maintenance of upland cotton domestication phenotypes.These findings may assist in mining genes for upland cotton improvement and improving the understanding of the genetic basis of upland cotton domestication.展开更多
Selaginella moellendorffii Hieron.,a lycophyte of significant medicinal and evolutionary importance,is recognized as one of the earliest vascular plants.However,the absence of a high-quality reference genome has hinde...Selaginella moellendorffii Hieron.,a lycophyte of significant medicinal and evolutionary importance,is recognized as one of the earliest vascular plants.However,the absence of a high-quality reference genome has hindered the comprehensive exploration of its unique phylogenetic position and therapeutic potential,thereby limiting our understanding of its genomic structure and metabolic capabilities.In this study,we present the first chromosome-level,telomere-to-telomere(T2T)genome assembly of S.moellendorffii,constructed utilizing PacBio HiFi,Oxford Nanopore(ONT),and Hi-C technologies.The assembled genome,spanning 112.83 Mb across 10 chromosomes with a contig N50 of 11.11 Mb,exhibited exceptional completeness(BUSCO score:95.7%)and accuracy(QV=48.11).Comparative genomic analysis identified 3515 gene families unique to S.moellendorffii,with significant enrichment in secondary metabolismpathways,including those related to flavonoid biosynthesis.Phylogenetic analysis revealed that S.moellendorffii diverged from Isoetes approximately 339.6 million years ago(MYA),representing a key evolutionary transition in early vascular plants.By integrating tissue-specific transcriptome and metabolome analyses,we uncovered the molecular basis of biflavone biosynthesis,identifying key enzymes and regulatory networks that govern the production of these bioactive compounds.We observed a correlation between the tissue-specific accumulation patterns of six major biflavones,including amentoflavone and ginkgetin,and the expression of their biosynthetic genes.This high-quality genome assembly,coupled with multi-omics analyses,offers unprecedented insights into the evolution of early vascular plants and elucidates the molecular mechanisms behind their specialized metabolism.展开更多
Common bean(Phaseolus vulgaris L.)is a vital source of protein and essential nutrients for human consumption and plays a key role in sustainable agriculture due to its nitrogen-fixing ability(Nadeem et al.,2021).Kidne...Common bean(Phaseolus vulgaris L.)is a vital source of protein and essential nutrients for human consumption and plays a key role in sustainable agriculture due to its nitrogen-fixing ability(Nadeem et al.,2021).Kidney beans,a subcategory of dry common beans,are highly valued for their rich protein,dietary fiber,low fat content,and various trace elements(Garcia-Cordero et al.,2021).Despite the release of several de novo genome assemblies(Goodstein et al.,2012;Schmutz et al.,2014;Vlasova et al.,2016;Cortinovis et al.,2024),existing common bean genomes remain incomplete,particularly in complex regions such as centromeres and telomeres,limiting a comprehensive understanding of the genomic landscape.展开更多
Sechium edule(chayote)is an important vegetable crop belonging to the Cucurbitaceae family.To decipher the chayote genome,a highquality chromosome-level chayote genome was obtained by genome sequencing and bioinformat...Sechium edule(chayote)is an important vegetable crop belonging to the Cucurbitaceae family.To decipher the chayote genome,a highquality chromosome-level chayote genome was obtained by genome sequencing and bioinformatic analysis.The total length was612.91 Mb,and 25755 genes were detected in the chayote genome.The contig N50 was more than 20.01 Mb,and the scaffold N50 was over47.11 Mb.Of the genome,60.35%were composed of repetitive sequences,and 31.18%of genome sequences belonged to long-terminal repeats.A global alignment of homologous regions in chayote and other Cucurbitaceae plant genomes was constructed using grape as a reference.Based on this genome-wide and global alignment map,researchers can easily identify homologous collinear genes of the studied genomes in most Cucurbitaceae species.Twenty-five chayote accessions were divided into two subgroups based on phylogenetic tree,population structure analysis,and principal component analysis using genome re-sequencing data.The chayote genome,re-sequencing dataset,and comprehensive genomic analysis will accelerate comparative and functional genomic analysis of chayote and other Cucurbitaceae species in the future.展开更多
The genus Oryza consists of two cultivated species (O. sativa L. and O. glaberrima Steud.) and approximately 20 wild relative species widely distributed in the pan-tropics. These species have been classified into four...The genus Oryza consists of two cultivated species (O. sativa L. and O. glaberrima Steud.) and approximately 20 wild relative species widely distributed in the pan-tropics. These species have been classified into four complexes following the Vaughan's taxonomic system([1]). The O. officinalis complex is the largest complex in the genus, which includes ten species, having BE, CC, on, and EE genomes in the diploids as well as BBCC and CCDD genomes in the tetraploids. The relationships among the BE, CC, and EE genomes still remain unclear, although previous studies have indicated certain affinities of these genomes([2-4]). Genomic in situ hybridization (GISH) is a powerful technique to detect the relationships among the related genomes at chromosome and DNA levels. The objective of the present study was to investigate the relationships among the BE, CC and EE genomes in the genus Oryza by the two-probe GISH.展开更多
Juglans sigillata is an economically valuable nut crop renowned for its nutritional richness,including essential nutrients,antioxidants,and healthy fats,which boost human cardial,brain and gut health.Despite its impor...Juglans sigillata is an economically valuable nut crop renowned for its nutritional richness,including essential nutrients,antioxidants,and healthy fats,which boost human cardial,brain and gut health.Despite its importance,the lack of a complete genome assembly has been a stumbling block in its biological breeding process.Therefore,we generated deep coverage ultralong Oxford Nanopore Technology(ONT)and PacBio HiFi reads to construct a telomere-to-telomere(T2T)genome assembly.The final assembly spans 537.27 Mb with no gaps,demonstrating a remarkable completeness of 98.1%.We utilized a combination of transcriptome data and homologous proteins to annotate the genome,identifying 36018 protein-coding genes.Furthermore,we profiled global cytosine DNA methylations using ONT sequencing data.Global methylome analysis revealed high methylation levels in transposable element(TE)-rich chromosomal regions juxtaposed with comparatively lower methylation in gene-rich areas.By integrating a detailed multi-omics data analysis,we obtained valuable insights into the mechanism underlying endopleura coloration.This investigation led to the identification of eight candidate genes(e.g.ANR)involved in anthocyanin biosynthesis pathways,which are crucial for the development of color in plants.The comprehensive genome assembly and the understanding of the genetic basis of important traits like endopleura coloration will open avenues for more efficient breeding programs and improved crop quality.展开更多
Genetic information has been instrumental in elucidating the relationship between the East Asian Summer Monsoon(EASM)and subtropical evergreen broad-leaved forests(EBLFs).However,how the genomic insights of EBLFs’spe...Genetic information has been instrumental in elucidating the relationship between the East Asian Summer Monsoon(EASM)and subtropical evergreen broad-leaved forests(EBLFs).However,how the genomic insights of EBLFs’species correspond to environmental shifts induced by the EASM remains limited.In this study,we investigated the adaptive mechanisms of evergreen Engelhardia species in response to the EASM through genome sequencing and comparative genomic analyses from the de novo genome assemblies of fiveclosely related Engelhardia taxa and one Rhoiptelea species.Our findingsrevealed that the divergence of evergreen trees from their sister deciduous species is closely associated with the onset and intensification of the EASM.This genomic transitionmayhave coincided with a significantexpansion of the terpene synthase(TPS)gene family in E.fenzelii,driven by four distinct modes of gene duplication.This expansion enhances the biosynthesis of terpene volatiles,providing a defensive mechanism against potential herbivory in EASM affected environments.We also identifieda shared whole-genome duplication(WGD)event across Engelhardia,along with substantial differences in transposable element(TE)composition and activity,which contributed to genome size variation between E.fenzelii and E.roxburghiana.In addition,demographic analyses revealed a continuous population decline over the past 10 million years,further exacerbated by recenthumandisturbance,underscoring the conservation urgency for these species.These results not only provide preliminary insights into the complex evolutionary dynamics within the Engelhardia genus from genomic insights(e.g.,the intricate relationships between genomic variations,environmental changes,and adaptive responses driven by significantclimatic events such as the EASM),but also provides valuable insights into the conservation significance of EBLFs.展开更多
Precise chromosome engineering has traditionally relied on the Cre-Lox recombination system-an approach in which the enzyme Cre functions like molecular scissors,cutting and rejoining DNA at specific“Lox”sites to ad...Precise chromosome engineering has traditionally relied on the Cre-Lox recombination system-an approach in which the enzyme Cre functions like molecular scissors,cutting and rejoining DNA at specific“Lox”sites to add,remove,or flip genomic DNA segments inside living cells.展开更多
Prunus spinulosa(2n=4x=32)is an evergreen species of significantmedicinal usage and ecological value.However,the lacking of a high-quality genome of P.spinulosa has obstructed further exploration of its ecological stu...Prunus spinulosa(2n=4x=32)is an evergreen species of significantmedicinal usage and ecological value.However,the lacking of a high-quality genome of P.spinulosa has obstructed further exploration of its ecological study and phylogenetic relationship of Prunus.In this study,we present the first haplotype-resolved genome assembly of Prunus s.l.subgenus Laurocerasus,the tetraploid genome of P.spinulosa was phased into 32 pseudochromosomes with 4 haplotypes,the genome size of each haplotype ranged from 249.82 Mb to 259.69 Mb,and N50 fluctuatedfrom 31.35 Mb to 33.25 Mb,the protein-coding genes vary from 21,272 to 22,668.Different evaluation methods showed that the P.spinulosa genome assembly has high quality of completeness,continuity and accuracy.Being the firstcomplete genome of P.spinulosa,it provides a valuable genetic resource for the Prunus tetraploid species database and supports further functional genomic study of this species.展开更多
Background As an important genetic resource of chickens,gamecock has unique morphological and behavioral characteristics such as large size,muscular development and strong aggression,making it a good model for studyin...Background As an important genetic resource of chickens,gamecock has unique morphological and behavioral characteristics such as large size,muscular development and strong aggression,making it a good model for studying muscle development and behavior patterns,as well as an excellent breeding material.Gamecocks are distributed worldwide,forming different breeds and strains.However,the single or multiple origin of global gamecocks has not been fully established and much remains unknown about genetic introgression events between gamecocks and other chickens.Therefore,in this study,based on whole genome data of gamecocks,Chinese indigenous chickens,commercial chickens and wild jungle fowls,we performed population structure analysis,selection signal analysis,and genomic introgression analysis of gamecocks.Results The population structure analysis revealed that gamecocks have multiple origins.In addition,we used Fst,π-Ratio and XP-EHH methods to explore the candidate signatures of positive selection in gamecocks.A total number of fifteen shared candidate genes were identified using the three different detection strategies.Among these candidate genes,ETV1,DGKB,AGMO,MEOX2,ISPD and PLCB4 are related to the growth and muscle development,fighting performance and neurodevelopment of gamecocks.Furthermore,we detected the introgression event at the MYBPHL region from the Gallus sonneratii into Euramerican gamecocks and at the CPZ gene region from the Gallus gallus gallus into multiple gamecock populations,respectively,indicating the introgression from the wild junglefowl may impact the skeletal and muscle development as well as aggressive behavior of gamecocks.Conclusions In summary,we conducted a genome-wide exploration of gamecocks from multiple regions worldwide.Our analysis confirmed multiple origins of gamecocks and detected several candidate genes that are possibly related to important traits and characteristics in gamecocks.More importantly,this is the first study that has detected introgression events and genes from wild jungle fowls to gamecocks,which provides evidence of the wild jungle fowls contributing to the genetic diversity of gamecocks.Our findings offer new perspectives on the impact of introgression on gamecocks,and provide a basis for further resource conservation and utilization of gamecock,as well as provide excellent material for the genetic improvement of domestic chickens.展开更多
The black wolfberry(Lycium ruthenicum;2n=2x=24)is an important medicinal plant with ecological and economic value.Its fruits have numerous beneficial pharmacological activities,especially those of anthocyanins,polysac...The black wolfberry(Lycium ruthenicum;2n=2x=24)is an important medicinal plant with ecological and economic value.Its fruits have numerous beneficial pharmacological activities,especially those of anthocyanins,polysaccharides,and alkaloids,and have high nutritional value.However,the lack of available genomic resources for this species has hindered research on its medicinal and evolutionary mechanisms.In this study,we developed the telomere-to-telomere(T2T)nearly gapless genome of L.ruthenicum(2.26 Gb)by integrating PacBio HiFi,Nanopore Ultra-Long,and Hi-C technologies.The assembled genome comprised 12 chromosomes with 37,149 protein-coding genes functionally annotated.Approximately 80%of the repetitive sequences were identified,of which long terminal repeats(LTRs)were the most abundant,accounting for 73.01%.The abundance of LTRs might be the main reason for the larger genome of this species compared to that of other Lycium species.The species-specific genes of L.ruthenicum were related to defense mechanisms,salt tolerance,drought resistance,and oxidative stress,further demonstrating their superior adaptability to arid environments.Based on the assembled genome and fruit transcriptome data,we further constructed an anthocyanin biosynthesis pathway and identified 19 candidate structural genes and seven transcription factors that regulate anthocyanin biosynthesis in the fruit developmental stage of L.ruthenicum,most of which were highly expressed at a later stage in fruit development.Furthermore,154 potential disease resistance-related nucleotidebinding genes have been identified in the L.ruthenicum genome.The whole-genome and proximal,dispersed,and tandem duplication genes in the L.ruthenicum genome enriched the number of genes involved in anthocyanin synthesis and resistance-related pathways.These results provide an important genetic basis for understanding genome evolution and biosynthesis of pharmacologically active components in the Lycium genus.展开更多
Hybridization and polyploidy are key drivers of species diversity and genome variation in Lycoris,but their cytological and evolutionary consequences remain poorly understood.Here,we investigated chromosome numbers an...Hybridization and polyploidy are key drivers of species diversity and genome variation in Lycoris,but their cytological and evolutionary consequences remain poorly understood.Here,we investigated chromosome numbers and genome sizes in 64 accessions representing the morphological diversity across the genus.Chromosome numbers ranged from 12 to 33,with seven accessions newly identified,including L.chunxiaoensis(2n=33),two putative L.guangxiensis(2n=19),and fivenatural hybrids(2n=16,18,29,33).Genome sizes varied from 18.03 Gb(L.wulingensis)to 32.62 Gb(L.caldwellii).Although no significantcorrelationwas found between genome size and chromosome number across all accessions,a strong correlation within ploidy-level groups(i.e.,diploid or aneuploid)suggested roles for post-polyploid diploidization,aneuploidy,and dysploidy in speciation.Phylogenetic analyses based on chloroplast genomes and nuclear DNA sequences revealed significantdiscordance,indicating a complex reticulate evolution and historical hybridization,which may complicate morphological classification.Chromosome number aligned more closely with morphological groups,underscoring the necessity of integrating cytological,molecular,and morphological data for accurate taxonomy,particularly in largegenome taxa.Based on this evidence,we propose a putative speciation pathway involving multiple hybridization and polyploidization events,with allopolyploidy playing a predominant role.Furthermore,our results indicate that the species L.insularis and L.longifolia are geographic populations of L.sprengeri and L.aurea,respectively,and confirmedthe distribution of L.traubii and L.albiflora in China' Mainland.These findingsoffer new insights into the mechanisms underlying speciation,interspecificrelationships,and the evolutionary history of Lycoris.展开更多
The soil-resident pathogen, Plasmodiophora brassicae, infects cruciferous crops, causing obligate parasitic clubroot disease and posing a significant threat to the Brassica vegetable industry in China. To learn more a...The soil-resident pathogen, Plasmodiophora brassicae, infects cruciferous crops, causing obligate parasitic clubroot disease and posing a significant threat to the Brassica vegetable industry in China. To learn more about its pathogenesis, we reported a Nanopore sequencing-derived25.3 Mb high-quality genome sequence of P. brassicae pathotype 4 strain(P.b 4). Comparing the P.b 4 genome with that of the published P.brassicae e3 genome(P.b e3) identified single nucleotide polymorphisms, structural variations, and small insertions and deletions. We then carried out RNA-sequencing of root samples from a clubroot-susceptible line at 5, 14, and 28 days after inoculation(DAI), and classified genes into five categories based on their expression patterns. Interestingly, 158 genes were highly expressed at 14 DAI, which were enriched in budding cell isotropic bud growth, ascospore wall assembly, spore wall assembly, spore wall biogenesis, and ascospore wall biogenesis.Subsequently, we bioinformatically predicted 555 secreted effector candidates, among which only 125 were expressed during infection and had amino acid lengths less than 400. The putative effector Pb010018, which was highly expressed at 14 DAI, was validated to have a signal peptide using a yeast secretion system. Luciferase activity and co-immunoprecipitation assays demonstrated that Pb010018 interacts with serine hydroxymethyltransferase BrSHMT1, and expression analysis showed that SHMT1 was upregulated in both Arabidopsis and B. rapa during infection. Furthermore, after infection, the Arabidopsis shmt1 mutant(atshmt1) showed reduced severity of clubroot disease, together with downregulated expression of Pb010018. Our results offer new insights into plant-pathogen interaction mechanisms, and provide the possibility for improving Brassica resistance to clubroot disease.展开更多
基金supported by the Jilin Science and Technology Development Program,China (20240602032RC)the Jilin Agricultural Science and Technology Innovation Project,China (CXGC2024ZD001)+1 种基金the Jilin Agricultural Science and Technology Innovation Project,China (CXGC2024ZY012)the Jilin Province Development and Reform Commission-Project for Improving the Independent Innovation Capacity of Major Grain Crops,China (2024C002)。
文摘Emerging and powerful genome editing tools,particularly CRISPR/Cas9,are facilitating functional genomics research and accelerating crop improvement(Jiang et al.2021;Cao et al.2023;Chen C et al.2023;Liu et al.2023a).However,the detection and screening of transgenic lines remain major bottlenecks,being time-consuming,labor-intensive,and inefficient during transformation and subsequent mutation identification.A simple and efficient visual marker system plays a critical role in addressing these challenges.Recent studies demonstrated that the GmW1 and RUBY reporter systems were used to obtain visual transgenic soybean(Glycine max) plants(Chen L et al.2023;Chen et al.2024).
基金supported by the National Natural Science Foundation of China(U23A20160,32360336)Guizhou Provincial Key Technology R&D Program(Qian KeHe ZhiCheng[2023]YiBan035).
文摘Natural hybridization is known to play a vital role in speciation;however,the mechanisms underlying the early stages of natural hybridization remain unclear.Where two plant species come into contact,two driving forces may balance the dynamic consequences of hybridization:fusion by hybridization-mediated gene flow,and separation by reproductive isolation(RI)(Ma et al.,2010a,b;Chang et al.,2022).
基金supported by the China Agriculture Research System of MOF and MARAthe National Natural Science Foundation of China (31872337 and 31501919)the Agricultural Science and Technology Innovation Project,China (ASTIP-IAS02)。
文摘The advantages of genome selection(GS) in animal and plant breeding are self-evident.Traditional parametric models have disadvantage in better fit the increasingly large sequencing data and capture complex effects accurately.Machine learning models have demonstrated remarkable potential in addressing these challenges.In this study,we introduced the concept of mixed kernel functions to explore the performance of support vector machine regression(SVR) in GS.Six single kernel functions(SVR_L,SVR_C,SVR_G,SVR_P,SVR_S,SVR_L) and four mixed kernel functions(SVR_GS,SVR_GP,SVR_LS,SVR_LP) were used to predict genome breeding values.The prediction accuracy,mean squared error(MSE) and mean absolute error(MAE) were used as evaluation indicators to compare with two traditional parametric models(GBLUP,BayesB) and two popular machine learning models(RF,KcRR).The results indicate that in most cases,the performance of the mixed kernel function model significantly outperforms that of GBLUP,BayesB and single kernel function.For instance,for T1 in the pig dataset,the predictive accuracy of SVR_GS is improved by 10% compared to GBLUP,and by approximately 4.4 and 18.6% compared to SVR_G and SVR_S respectively.For E1 in the wheat dataset,SVR_GS achieves 13.3% higher prediction accuracy than GBLUP.Among single kernel functions,the Laplacian and Gaussian kernel functions yield similar results,with the Gaussian kernel function performing better.The mixed kernel function notably reduces the MSE and MAE when compared to all single kernel functions.Furthermore,regarding runtime,SVR_GS and SVR_GP mixed kernel functions run approximately three times faster than GBLUP in the pig dataset,with only a slight increase in runtime compared to the single kernel function model.In summary,the mixed kernel function model of SVR demonstrates speed and accuracy competitiveness,and the model such as SVR_GS has important application potential for GS.
基金supported by the National Natural Science Foundation of China(32270217,31970205,31770211)Metasequoia funding of Nanjing Forestry University to YY。
文摘Amborella trichopoda(Amborellaceae;hereafter simply Amborella)(Fig.1A)is a shrub endemic to New Caledonia in the Southwest Pacific that represents the sole sister species of all other extant angiosperms(Qiu et al.,1999;One Thousand Plant Transcriptomes Initiative,2019).Due to its unique phylogenetic status,it holds tremendous interest for botanists.The nuclear and mitochondrial genomes of Amborella were first published in 2013,providing valuable resources for studies on genome and gene family evolution,phylogenomics,and flower development,despite the fact that the assembly is heavily fragmented(Amborella Genome Project,2013;Rice et al.,2013).In 2024,a haplotype-resolved Amborella genome assembly was published,showing significant improvement in quality and completeness(Carey et al.,2024).
文摘Fig.1.The GenomeSyn tool for visualizing genome synteny and characterizing structural variations.A:The first synteny visualization map showed the detailed information of two or three genomes and can display structural variations and other annotation information.B:The second type of visualization map was simple and only showed the synteny relationship between the chromosomes of two or three genomes.C:Multiplatform general GenomeSyn submission page,applicable to Windows,MAC and web platforms;other analysis files can be entered in the"other"option.The publisher would like to apologise for any inconvenience caused.
基金the project “Genomics for Conservation of Indigenous Cattle Breeds and for Enhancing Milk Yield, Phase-I” [BT/ PR26466/AAQ/1/704/2017], funded by the Department of Biotechnology (DBT ), Indiathe project “Identification of key molecular factors involved in resistance/susceptibility to paratuberculosis infection in indigenous breeds of cows” [BT/PR32758/AAQ/1/760/2019], which was also funded by Department of Biotechnology (DBT ), India。
文摘Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the genomic landscape of Indian cattle.To comprehensively characterize the genomic variation within Bos indicus and,specifically,dairy breeds,we aim to identify non-reference sequences and construct a comprehensive pangenome.Results Five representative genomes of prominent dairy breeds,including Gir,Kankrej,Tharparkar,Sahiwal,and Red Sindhi,were sequenced using 10X Genomics‘linked-read’technology.Assemblies generated from these linked-reads ranged from 2.70 Gb to 2.77 Gb,comparable to the Bos indicus Brahman reference genome.A pangenome of Bos indicus cattle was constructed by comparing the newly assembled genomes with the reference using alignment and graph-based methods,revealing 8 Mb and 17.7 Mb of novel sequence respectively.A confident set of 6,844 Non-reference Unique Insertions(NUIs)spanning 7.57 Mb was identified through both methods,representing the pange-nome of Indian Bos indicus breeds.Comparative analysis with previously published pangenomes unveiled 2.8 Mb(37%)commonality with the Chinese indicine pangenome and only 1%commonality with the Bos taurus pange-nome.Among these,2,312 NUIs encompassing~2 Mb,were commonly found in 98 samples of the 5 breeds and des-ignated as Bos indicus Common Insertions(BICIs)in the population.Furthermore,926 BICIs were identified within 682 protein-coding genes,54 long non-coding RNAs(lncRNA),and 18 pseudogenes.These protein-coding genes were enriched for functions such as chemical synaptic transmission,cell junction organization,cell-cell adhesion,and cell morphogenesis.The protein-coding genes were found in various prominent quantitative trait locus(QTL)regions,suggesting potential roles of BICIs in traits related to milk production,reproduction,exterior,health,meat,and carcass.Notably,63.21%of the bases within the BICIs call set contained interspersed repeats,predominantly Long Inter-spersed Nuclear Elements(LINEs).Additionally,70.28%of BICIs are shared with other domesticated and wild species,highlighting their evolutionary significance.Conclusions This is the first report unveiling a robust set of NUIs defining the pangenome of Bos indicus breeds of India.The analyses contribute valuable insights into the genomic landscape of desi cattle breeds.
基金supported by the National Natural Science Foundation of China(32201873)the Key Research and Development Plan of Hubei Province(2023BBB050)。
文摘The genetic basis for Gossypium hirsutum race latifolium,the putative ancestor of cultivated upland cotton,emerging from the semi-wild races to be domesticated into cultivated upland cotton is unknown.Here,we reported a high-quality genome assembly of G.latifolium.Comparative genome analyses revealed substantial variations in both gene group composition and genomic sequences across 13 cotton genomes,including the expansion of photosynthesis-related gene groups in G.latifolium compared with other races and the pivotal contribution of structural variations(SVs)to G.hirsutum domestication.Based on the resequencing reads and constructed pan-genome of upland cotton,co-selection regions and SVs with significant frequency differences among different populations were identified.Genes located in these regions or affected by these variations may characterize the differences between G.latifolium and other races,and could be involved in maintenance of upland cotton domestication phenotypes.These findings may assist in mining genes for upland cotton improvement and improving the understanding of the genetic basis of upland cotton domestication.
基金funded by the National Natural Science Foundation of China(Grant No.81903921)the Key project at central government level:The ability establishment of sustainable use for valuable Chinese medicine resources(2060302)the Distinguished Young Scholars of Hubei University of Chinese Medicine(Grant No.2022ZZXJ002).
文摘Selaginella moellendorffii Hieron.,a lycophyte of significant medicinal and evolutionary importance,is recognized as one of the earliest vascular plants.However,the absence of a high-quality reference genome has hindered the comprehensive exploration of its unique phylogenetic position and therapeutic potential,thereby limiting our understanding of its genomic structure and metabolic capabilities.In this study,we present the first chromosome-level,telomere-to-telomere(T2T)genome assembly of S.moellendorffii,constructed utilizing PacBio HiFi,Oxford Nanopore(ONT),and Hi-C technologies.The assembled genome,spanning 112.83 Mb across 10 chromosomes with a contig N50 of 11.11 Mb,exhibited exceptional completeness(BUSCO score:95.7%)and accuracy(QV=48.11).Comparative genomic analysis identified 3515 gene families unique to S.moellendorffii,with significant enrichment in secondary metabolismpathways,including those related to flavonoid biosynthesis.Phylogenetic analysis revealed that S.moellendorffii diverged from Isoetes approximately 339.6 million years ago(MYA),representing a key evolutionary transition in early vascular plants.By integrating tissue-specific transcriptome and metabolome analyses,we uncovered the molecular basis of biflavone biosynthesis,identifying key enzymes and regulatory networks that govern the production of these bioactive compounds.We observed a correlation between the tissue-specific accumulation patterns of six major biflavones,including amentoflavone and ginkgetin,and the expression of their biosynthetic genes.This high-quality genome assembly,coupled with multi-omics analyses,offers unprecedented insights into the evolution of early vascular plants and elucidates the molecular mechanisms behind their specialized metabolism.
基金supported by the National Natural Science Foundation of China(32241045,32241046,32241038)the Major Special Science and Technology Projects in Shanxi Province(202101140601027)+3 种基金Shanxi Provincial Agricultural Key Technologies Breakthrough Project(NYGG01)Doctoral Research Starting Project at Shanxi Agricultural University(2024BQ77)the National Key Research and Development Program of China(2023YFD1202705/2023YFD120270503,2023YFD1202703/2023YFD1202703-4)Shanxi HouJi Laboratory Self-proposed Research Project(202304010930003/202304010930003-03).
文摘Common bean(Phaseolus vulgaris L.)is a vital source of protein and essential nutrients for human consumption and plays a key role in sustainable agriculture due to its nitrogen-fixing ability(Nadeem et al.,2021).Kidney beans,a subcategory of dry common beans,are highly valued for their rich protein,dietary fiber,low fat content,and various trace elements(Garcia-Cordero et al.,2021).Despite the release of several de novo genome assemblies(Goodstein et al.,2012;Schmutz et al.,2014;Vlasova et al.,2016;Cortinovis et al.,2024),existing common bean genomes remain incomplete,particularly in complex regions such as centromeres and telomeres,limiting a comprehensive understanding of the genomic landscape.
基金supported by the National Natural Science Foundation of China Project(Grant No.32260097)the National Guidance Foundation for Local Science and Technology Development of China(Grant No.[2023]009)the Natural Science Foundation for Distinguished Young Scholars of Hebei(Grant No.C2022209010)。
文摘Sechium edule(chayote)is an important vegetable crop belonging to the Cucurbitaceae family.To decipher the chayote genome,a highquality chromosome-level chayote genome was obtained by genome sequencing and bioinformatic analysis.The total length was612.91 Mb,and 25755 genes were detected in the chayote genome.The contig N50 was more than 20.01 Mb,and the scaffold N50 was over47.11 Mb.Of the genome,60.35%were composed of repetitive sequences,and 31.18%of genome sequences belonged to long-terminal repeats.A global alignment of homologous regions in chayote and other Cucurbitaceae plant genomes was constructed using grape as a reference.Based on this genome-wide and global alignment map,researchers can easily identify homologous collinear genes of the studied genomes in most Cucurbitaceae species.Twenty-five chayote accessions were divided into two subgroups based on phylogenetic tree,population structure analysis,and principal component analysis using genome re-sequencing data.The chayote genome,re-sequencing dataset,and comprehensive genomic analysis will accelerate comparative and functional genomic analysis of chayote and other Cucurbitaceae species in the future.
文摘The genus Oryza consists of two cultivated species (O. sativa L. and O. glaberrima Steud.) and approximately 20 wild relative species widely distributed in the pan-tropics. These species have been classified into four complexes following the Vaughan's taxonomic system([1]). The O. officinalis complex is the largest complex in the genus, which includes ten species, having BE, CC, on, and EE genomes in the diploids as well as BBCC and CCDD genomes in the tetraploids. The relationships among the BE, CC, and EE genomes still remain unclear, although previous studies have indicated certain affinities of these genomes([2-4]). Genomic in situ hybridization (GISH) is a powerful technique to detect the relationships among the related genomes at chromosome and DNA levels. The objective of the present study was to investigate the relationships among the BE, CC and EE genomes in the genus Oryza by the two-probe GISH.
基金supported by the Yunnan Seed Laboratory,China(202205AR070001-15)the National Natural Science Foundation of China,China(Grant No.32160697)。
文摘Juglans sigillata is an economically valuable nut crop renowned for its nutritional richness,including essential nutrients,antioxidants,and healthy fats,which boost human cardial,brain and gut health.Despite its importance,the lack of a complete genome assembly has been a stumbling block in its biological breeding process.Therefore,we generated deep coverage ultralong Oxford Nanopore Technology(ONT)and PacBio HiFi reads to construct a telomere-to-telomere(T2T)genome assembly.The final assembly spans 537.27 Mb with no gaps,demonstrating a remarkable completeness of 98.1%.We utilized a combination of transcriptome data and homologous proteins to annotate the genome,identifying 36018 protein-coding genes.Furthermore,we profiled global cytosine DNA methylations using ONT sequencing data.Global methylome analysis revealed high methylation levels in transposable element(TE)-rich chromosomal regions juxtaposed with comparatively lower methylation in gene-rich areas.By integrating a detailed multi-omics data analysis,we obtained valuable insights into the mechanism underlying endopleura coloration.This investigation led to the identification of eight candidate genes(e.g.ANR)involved in anthocyanin biosynthesis pathways,which are crucial for the development of color in plants.The comprehensive genome assembly and the understanding of the genetic basis of important traits like endopleura coloration will open avenues for more efficient breeding programs and improved crop quality.
基金supported by the National Natural Science Foundation of China(No.42171063)Southeast Asia Biodiversity Research Institute,Chinese Academy of Sciences(No.Y4ZK111B01)+6 种基金the Special Fund for ScientificResearch of Shanghai Landscaping&City Appearance Administrative Bureau(G242414,G242416)the“Yunnan Revitalization Talent Support Program”in Yunnan Province(XDYC-QNRC-2022-0028)Yunnan Revitalization Talent Support Program“Innovation Team”Project(202405AS350019)the CAS“Light of West China”Programthe 14th Five-Year Plan of Xishuangbanna Tropical Botanical Garden,Chinese Academy Sciences(XTBG-1450303)the European Research Council(ERC)under the European Union's Horizon 2020 research and innovation program(No.833522)GhentUniversity(Methusalem funding,BOF.MET.2021.0005.01).
文摘Genetic information has been instrumental in elucidating the relationship between the East Asian Summer Monsoon(EASM)and subtropical evergreen broad-leaved forests(EBLFs).However,how the genomic insights of EBLFs’species correspond to environmental shifts induced by the EASM remains limited.In this study,we investigated the adaptive mechanisms of evergreen Engelhardia species in response to the EASM through genome sequencing and comparative genomic analyses from the de novo genome assemblies of fiveclosely related Engelhardia taxa and one Rhoiptelea species.Our findingsrevealed that the divergence of evergreen trees from their sister deciduous species is closely associated with the onset and intensification of the EASM.This genomic transitionmayhave coincided with a significantexpansion of the terpene synthase(TPS)gene family in E.fenzelii,driven by four distinct modes of gene duplication.This expansion enhances the biosynthesis of terpene volatiles,providing a defensive mechanism against potential herbivory in EASM affected environments.We also identifieda shared whole-genome duplication(WGD)event across Engelhardia,along with substantial differences in transposable element(TE)composition and activity,which contributed to genome size variation between E.fenzelii and E.roxburghiana.In addition,demographic analyses revealed a continuous population decline over the past 10 million years,further exacerbated by recenthumandisturbance,underscoring the conservation urgency for these species.These results not only provide preliminary insights into the complex evolutionary dynamics within the Engelhardia genus from genomic insights(e.g.,the intricate relationships between genomic variations,environmental changes,and adaptive responses driven by significantclimatic events such as the EASM),but also provides valuable insights into the conservation significance of EBLFs.
文摘Precise chromosome engineering has traditionally relied on the Cre-Lox recombination system-an approach in which the enzyme Cre functions like molecular scissors,cutting and rejoining DNA at specific“Lox”sites to add,remove,or flip genomic DNA segments inside living cells.
基金supported by the National Natural Science Foundation of China(Grant No.32570239)Key Technology Research and Development Program of Zhejiang Province(Grant No.2023C03138).
文摘Prunus spinulosa(2n=4x=32)is an evergreen species of significantmedicinal usage and ecological value.However,the lacking of a high-quality genome of P.spinulosa has obstructed further exploration of its ecological study and phylogenetic relationship of Prunus.In this study,we present the first haplotype-resolved genome assembly of Prunus s.l.subgenus Laurocerasus,the tetraploid genome of P.spinulosa was phased into 32 pseudochromosomes with 4 haplotypes,the genome size of each haplotype ranged from 249.82 Mb to 259.69 Mb,and N50 fluctuatedfrom 31.35 Mb to 33.25 Mb,the protein-coding genes vary from 21,272 to 22,668.Different evaluation methods showed that the P.spinulosa genome assembly has high quality of completeness,continuity and accuracy.Being the firstcomplete genome of P.spinulosa,it provides a valuable genetic resource for the Prunus tetraploid species database and supports further functional genomic study of this species.
基金supported by the National Key Research and Development Program of China(2024YFF1001400)the Open Project of State Key Laboratory of Animal Biotech Breeding(2024SKLAB6-106)Fundamental Research Funds for Central University(7110100870).
文摘Background As an important genetic resource of chickens,gamecock has unique morphological and behavioral characteristics such as large size,muscular development and strong aggression,making it a good model for studying muscle development and behavior patterns,as well as an excellent breeding material.Gamecocks are distributed worldwide,forming different breeds and strains.However,the single or multiple origin of global gamecocks has not been fully established and much remains unknown about genetic introgression events between gamecocks and other chickens.Therefore,in this study,based on whole genome data of gamecocks,Chinese indigenous chickens,commercial chickens and wild jungle fowls,we performed population structure analysis,selection signal analysis,and genomic introgression analysis of gamecocks.Results The population structure analysis revealed that gamecocks have multiple origins.In addition,we used Fst,π-Ratio and XP-EHH methods to explore the candidate signatures of positive selection in gamecocks.A total number of fifteen shared candidate genes were identified using the three different detection strategies.Among these candidate genes,ETV1,DGKB,AGMO,MEOX2,ISPD and PLCB4 are related to the growth and muscle development,fighting performance and neurodevelopment of gamecocks.Furthermore,we detected the introgression event at the MYBPHL region from the Gallus sonneratii into Euramerican gamecocks and at the CPZ gene region from the Gallus gallus gallus into multiple gamecock populations,respectively,indicating the introgression from the wild junglefowl may impact the skeletal and muscle development as well as aggressive behavior of gamecocks.Conclusions In summary,we conducted a genome-wide exploration of gamecocks from multiple regions worldwide.Our analysis confirmed multiple origins of gamecocks and detected several candidate genes that are possibly related to important traits and characteristics in gamecocks.More importantly,this is the first study that has detected introgression events and genes from wild jungle fowls to gamecocks,which provides evidence of the wild jungle fowls contributing to the genetic diversity of gamecocks.Our findings offer new perspectives on the impact of introgression on gamecocks,and provide a basis for further resource conservation and utilization of gamecock,as well as provide excellent material for the genetic improvement of domestic chickens.
基金supported by the National Natural Science Foundation of China(32360058)the Central Government Guides Local Science and Technology Development Projects,China(2023ZYZX1224)Xinjiang University Excellent Doctoral Student Innovation Project(XJU2022BS051)。
文摘The black wolfberry(Lycium ruthenicum;2n=2x=24)is an important medicinal plant with ecological and economic value.Its fruits have numerous beneficial pharmacological activities,especially those of anthocyanins,polysaccharides,and alkaloids,and have high nutritional value.However,the lack of available genomic resources for this species has hindered research on its medicinal and evolutionary mechanisms.In this study,we developed the telomere-to-telomere(T2T)nearly gapless genome of L.ruthenicum(2.26 Gb)by integrating PacBio HiFi,Nanopore Ultra-Long,and Hi-C technologies.The assembled genome comprised 12 chromosomes with 37,149 protein-coding genes functionally annotated.Approximately 80%of the repetitive sequences were identified,of which long terminal repeats(LTRs)were the most abundant,accounting for 73.01%.The abundance of LTRs might be the main reason for the larger genome of this species compared to that of other Lycium species.The species-specific genes of L.ruthenicum were related to defense mechanisms,salt tolerance,drought resistance,and oxidative stress,further demonstrating their superior adaptability to arid environments.Based on the assembled genome and fruit transcriptome data,we further constructed an anthocyanin biosynthesis pathway and identified 19 candidate structural genes and seven transcription factors that regulate anthocyanin biosynthesis in the fruit developmental stage of L.ruthenicum,most of which were highly expressed at a later stage in fruit development.Furthermore,154 potential disease resistance-related nucleotidebinding genes have been identified in the L.ruthenicum genome.The whole-genome and proximal,dispersed,and tandem duplication genes in the L.ruthenicum genome enriched the number of genes involved in anthocyanin synthesis and resistance-related pathways.These results provide an important genetic basis for understanding genome evolution and biosynthesis of pharmacologically active components in the Lycium genus.
基金supported by the ScientificFund of Nanjing Botanical Garden Men.Sun Yat-Sen(JSPKLB202519)Jiangsu Provincial Crop Germplasm Resource Bank(Lycoris)(JS-ZW-K04)Forestry Science and Technology Popularization Demonstration Project of the Central Finance[Su(2024)TG06].
文摘Hybridization and polyploidy are key drivers of species diversity and genome variation in Lycoris,but their cytological and evolutionary consequences remain poorly understood.Here,we investigated chromosome numbers and genome sizes in 64 accessions representing the morphological diversity across the genus.Chromosome numbers ranged from 12 to 33,with seven accessions newly identified,including L.chunxiaoensis(2n=33),two putative L.guangxiensis(2n=19),and fivenatural hybrids(2n=16,18,29,33).Genome sizes varied from 18.03 Gb(L.wulingensis)to 32.62 Gb(L.caldwellii).Although no significantcorrelationwas found between genome size and chromosome number across all accessions,a strong correlation within ploidy-level groups(i.e.,diploid or aneuploid)suggested roles for post-polyploid diploidization,aneuploidy,and dysploidy in speciation.Phylogenetic analyses based on chloroplast genomes and nuclear DNA sequences revealed significantdiscordance,indicating a complex reticulate evolution and historical hybridization,which may complicate morphological classification.Chromosome number aligned more closely with morphological groups,underscoring the necessity of integrating cytological,molecular,and morphological data for accurate taxonomy,particularly in largegenome taxa.Based on this evidence,we propose a putative speciation pathway involving multiple hybridization and polyploidization events,with allopolyploidy playing a predominant role.Furthermore,our results indicate that the species L.insularis and L.longifolia are geographic populations of L.sprengeri and L.aurea,respectively,and confirmedthe distribution of L.traubii and L.albiflora in China' Mainland.These findingsoffer new insights into the mechanisms underlying speciation,interspecificrelationships,and the evolutionary history of Lycoris.
基金supported by the Youth Foundation of Beijing Academy of Agriculture and Forestry Sciences[Grant No.QNJJ202242]the Excellent Young Scholars of Beijing Academy of Agriculture and Forestry Sciences[Grant No.YXQN202205]+3 种基金the Beijing Nova Program[Grant No.20220484052]the National Natural Science Foundation of China[Grant No.31801852]the Collaborative Innovation Center of Beijing Academy of Agriculture and Forestry Sciences[Grant No.KJCX201907-2]the Earmarked Fund for China Agriculture Research System[Grant No.CARS-23-A-05].
文摘The soil-resident pathogen, Plasmodiophora brassicae, infects cruciferous crops, causing obligate parasitic clubroot disease and posing a significant threat to the Brassica vegetable industry in China. To learn more about its pathogenesis, we reported a Nanopore sequencing-derived25.3 Mb high-quality genome sequence of P. brassicae pathotype 4 strain(P.b 4). Comparing the P.b 4 genome with that of the published P.brassicae e3 genome(P.b e3) identified single nucleotide polymorphisms, structural variations, and small insertions and deletions. We then carried out RNA-sequencing of root samples from a clubroot-susceptible line at 5, 14, and 28 days after inoculation(DAI), and classified genes into five categories based on their expression patterns. Interestingly, 158 genes were highly expressed at 14 DAI, which were enriched in budding cell isotropic bud growth, ascospore wall assembly, spore wall assembly, spore wall biogenesis, and ascospore wall biogenesis.Subsequently, we bioinformatically predicted 555 secreted effector candidates, among which only 125 were expressed during infection and had amino acid lengths less than 400. The putative effector Pb010018, which was highly expressed at 14 DAI, was validated to have a signal peptide using a yeast secretion system. Luciferase activity and co-immunoprecipitation assays demonstrated that Pb010018 interacts with serine hydroxymethyltransferase BrSHMT1, and expression analysis showed that SHMT1 was upregulated in both Arabidopsis and B. rapa during infection. Furthermore, after infection, the Arabidopsis shmt1 mutant(atshmt1) showed reduced severity of clubroot disease, together with downregulated expression of Pb010018. Our results offer new insights into plant-pathogen interaction mechanisms, and provide the possibility for improving Brassica resistance to clubroot disease.