期刊文献+
共找到81篇文章
< 1 2 5 >
每页显示 20 50 100
Graph-based pangenome provides insights into structural variations and genetic basis of metabolic traits in potato
1
作者 Xiaoling Zhu Rui Yang +16 位作者 Qiqi Liang Yuye Yu Tingting Wang Li Meng Ping Wang Shaoyang Wang Xianping Li Qiongfen Yang Huachun Guo Qijun Sui Qiang Wang Hai Du Qin Chen Zhe Liang Xuewei Wu Qian Zeng Binquan Huang 《Molecular Plant》 2025年第4期590-602,共13页
Potato is the world’s most important nongrain crop.In this study,to assess genetic diversity within the Petota section,29 genomes from Petota and Etuberosum sections were newly de novo assembled and 248 accessions of... Potato is the world’s most important nongrain crop.In this study,to assess genetic diversity within the Petota section,29 genomes from Petota and Etuberosum sections were newly de novo assembled and 248 accessions of wild potatoes,landraces,and modern cultivars were re-sequenced at>253 depth.Subsequently,a graph-based pangenome was constructed using DM8.1 as the backbone,integrating194,330 nonredundant structural variants.To characterize the metabolome of tubers and illuminate the genomic basis of metabolic traits,LC-MS/MS was employed to obtain the metabolome of 157 accessions,and 9,321 structural variants(SVs)were detected to be significantly associated with 1,258 distinct metabolites via PAV(presence and absence variations)-based metabolomics-GWAS analysis,including metabolites of flavonoids,phenolic acids,and phospholipids.To facilitate the utilization of pangenome resources,a comprehensive platform,the Potato Pangenome Database(PPDB),was developed.Our study provides a comprehensive genomic resource for dissecting the genomic basis of agronomic and metabolic traits in potato,which will accelerate functional genomics studies and genetic improvements in potato. 展开更多
关键词 Genome assembly structural variants graph-based pangenome PAV-based metabolomics-GWAS
原文传递
Advancing the Indian cattle pangenome: characterizing non-reference sequences in Bos indicus 被引量:1
2
作者 Sarwar Azam Abhisek Sahu +6 位作者 Naveen Kumar Pandey Mahesh Neupane Curtis P Van Tassell Benjamin D Rosen Ravi Kumar Gandham Subha Narayan Rath Subeer S Majumdar 《Journal of Animal Science and Biotechnology》 2025年第2期497-516,共20页
Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the... Background India harbors the world’s largest cattle population,encompassing over 50 distinct Bos indicus breeds.This rich genetic diversity underscores the inadequacy of a single reference genome to fully capture the genomic landscape of Indian cattle.To comprehensively characterize the genomic variation within Bos indicus and,specifically,dairy breeds,we aim to identify non-reference sequences and construct a comprehensive pangenome.Results Five representative genomes of prominent dairy breeds,including Gir,Kankrej,Tharparkar,Sahiwal,and Red Sindhi,were sequenced using 10X Genomics‘linked-read’technology.Assemblies generated from these linked-reads ranged from 2.70 Gb to 2.77 Gb,comparable to the Bos indicus Brahman reference genome.A pangenome of Bos indicus cattle was constructed by comparing the newly assembled genomes with the reference using alignment and graph-based methods,revealing 8 Mb and 17.7 Mb of novel sequence respectively.A confident set of 6,844 Non-reference Unique Insertions(NUIs)spanning 7.57 Mb was identified through both methods,representing the pange-nome of Indian Bos indicus breeds.Comparative analysis with previously published pangenomes unveiled 2.8 Mb(37%)commonality with the Chinese indicine pangenome and only 1%commonality with the Bos taurus pange-nome.Among these,2,312 NUIs encompassing~2 Mb,were commonly found in 98 samples of the 5 breeds and des-ignated as Bos indicus Common Insertions(BICIs)in the population.Furthermore,926 BICIs were identified within 682 protein-coding genes,54 long non-coding RNAs(lncRNA),and 18 pseudogenes.These protein-coding genes were enriched for functions such as chemical synaptic transmission,cell junction organization,cell-cell adhesion,and cell morphogenesis.The protein-coding genes were found in various prominent quantitative trait locus(QTL)regions,suggesting potential roles of BICIs in traits related to milk production,reproduction,exterior,health,meat,and carcass.Notably,63.21%of the bases within the BICIs call set contained interspersed repeats,predominantly Long Inter-spersed Nuclear Elements(LINEs).Additionally,70.28%of BICIs are shared with other domesticated and wild species,highlighting their evolutionary significance.Conclusions This is the first report unveiling a robust set of NUIs defining the pangenome of Bos indicus breeds of India.The analyses contribute valuable insights into the genomic landscape of desi cattle breeds. 展开更多
关键词 BICIs Bos indicus CATTLE Genome assembly Linked-reads NUIs pangenome
在线阅读 下载PDF
Structural variation-based and gene-based pangenome construction reveals untapped diversity of hexaploid wheat
3
作者 Hong Cheng Lingpeng Kong +7 位作者 Kun Zhu Hang Zhao Xiuli Li Yanwen Zhang Weidong Ning Mei Jiang Bo Song Shifeng Cheng 《Journal of Genetics and Genomics》 2025年第6期774-785,共12页
Increasing number of structural variations(SVs)have been identified as causative mutations for diverse agronomic traits.However,the systematic exploration of SVs quantity,distribution,and contribution in wheat was lac... Increasing number of structural variations(SVs)have been identified as causative mutations for diverse agronomic traits.However,the systematic exploration of SVs quantity,distribution,and contribution in wheat was lacking.Here,we report high-quality gene-based and SV-based pangenomes comprising 22 hexaploid wheat assemblies showing a wide range of chromosome size,gene number,and TE component,which indicates their representativeness of wheat genetic diversity.Pan-gene analyses uncover 140,261 distinct gene families,of which only 23.2%are shared in all accessions.Moreover,we build a∼16.15 Gb graph pangenome containing 695,897 bubbles,intersecting 5132 genes and 230,307 cis-regulatory regions.Pairwise genome comparisons identify∼1,978,221 non-redundant SVs and 497 SV hotspots.Notably,the density of bubbles as well as SVs show remarkable aggregation in centromeres,which probably play an important role in chromosome plasticity and stability.As for functional SVs exploration,we identify 2769 SVs with absolute relative frequency differences exceeding 0.7 between spring and winter growth habit groups.Additionally,several reported functional genes in wheat display complex structural graphs,for example,PPD-A1,VRT-A2,and TaNAAT2-A.These findings deepen our understanding of wheat genetic diversity,providing valuable graphical pangenome and variation resources to improve the efficiency of genome-wide association mapping in wheat. 展开更多
关键词 Wheat pangenome Structural variation Centromere plasticity Growth habit
原文传递
Graph-Based Transform and Dual Graph Laplacian Regularization for Depth Map Denoising
4
作者 MENG Yaqun GE Huayong +2 位作者 HOU Xinxin JI Yukai LI Sisi 《Journal of Donghua University(English Edition)》 2025年第5期534-542,共9页
Owing to the constraints of depth sensing technology,images acquired by depth cameras are inevitably mixed with various noises.For depth maps presented in gray values,this research proposes a novel denoising model,ter... Owing to the constraints of depth sensing technology,images acquired by depth cameras are inevitably mixed with various noises.For depth maps presented in gray values,this research proposes a novel denoising model,termed graph-based transform(GBT)and dual graph Laplacian regularization(DGLR)(DGLR-GBT).This model specifically aims to remove Gaussian white noise by capitalizing on the nonlocal self-similarity(NSS)and the piecewise smoothness properties intrinsic to depth maps.Within the group sparse coding(GSC)framework,a combination of GBT and DGLR is implemented.Firstly,within each group,the graph is constructed by using estimates of the true values of the averaged blocks instead of the observations.Secondly,the graph Laplacian regular terms are constructed based on rows and columns of similar block groups,respectively.Lastly,the solution is obtained effectively by combining the alternating direction multiplication method(ADMM)with the weighted thresholding method within the domain of GBT. 展开更多
关键词 depth map graph signal processing dual graph Laplacian regularization(DGLR) graph-based transform(GBT) group sparse coding(GSC)
在线阅读 下载PDF
Multi-resolution graph-based clustering analysis for lithofacies identifi cation from well log data: Case study of intraplatform bank gas fi elds, Amu Darya Basin 被引量:15
5
作者 Tian Yu Xu Hong +4 位作者 Zhang Xing-Yang Wang Hong-Jun Guo Tong-Cui Zhang Liang-Jie Gong Xing-Lin 《Applied Geophysics》 SCIE CSCD 2016年第4期598-607,736,共11页
In this study, we used the multi-resolution graph-based clustering (MRGC) method for determining the electrofacies (EF) and lithofacies (LF) from well log data obtained from the intraplatform bank gas fields loc... In this study, we used the multi-resolution graph-based clustering (MRGC) method for determining the electrofacies (EF) and lithofacies (LF) from well log data obtained from the intraplatform bank gas fields located in the Amu Darya Basin. The MRGC could automatically determine the optimal number of clusters without prior knowledge about the structure or cluster numbers of the analyzed data set and allowed the users to control the level of detail actually needed to define the EF. Based on the LF identification and successful EF calibration using core data, an MRGC EF partition model including five clusters and a quantitative LF interpretation chart were constructed. The EF clusters 1 to 5 were interpreted as lagoon, anhydrite flat, interbank, low-energy bank, and high-energy bank, and the coincidence rate in the cored interval could reach 85%. We concluded that the MRGC could be accurately applied to predict the LF in non-cored but logged wells. Therefore, continuous EF clusters were partitioned and corresponding LF were characteristics &different LF were analyzed interpreted, and the distribution and petrophysical in the framework of sequence stratigraphy. 展开更多
关键词 Multi-resolution graph-based clustering method electrofacies lithofacies intraplatform bank gas fields Amu Darya Basin
在线阅读 下载PDF
A review of the pangenome:how it affects our understanding of genomic variation,selection and breeding in domestic animals? 被引量:6
6
作者 Ying Gong Yefang Li +2 位作者 Xuexue Liu Yuehui Ma Lin Jiang 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2023年第5期1815-1833,共19页
As large-scale genomic studies have progressed,it has been revealed that a single reference genome pattern cannot represent genetic diversity at the species level.While domestic animals tend to have complex routes of ... As large-scale genomic studies have progressed,it has been revealed that a single reference genome pattern cannot represent genetic diversity at the species level.While domestic animals tend to have complex routes of origin and migration,suggesting a possible omission of some population-specific sequences in the current reference genome.Conversely,the pangenome is a collection of all DNA sequences of a species that contains sequences shared by all individuals(core genome)and is also able to display sequence information unique to each individual(variable genome).The progress of pangenome research in humans,plants and domestic animals has proved that the missing genetic components and the identification of large structural variants(SVs)can be explored through pangenomic studies.Many individual specific sequences have been shown to be related to biological adaptability,phenotype and important economic traits.The maturity of technologies and methods such as third-generation sequencing,Tel-omere-to-telomere genomes,graphic genomes,and reference-free assembly will further promote the development of pangenome.In the future,pangenome combined with long-read data and multi-omics will help to resolve large SVs and their relationship with the main economic traits of interest in domesticated animals,providing better insights into animal domestication,evolution and breeding.In this review,we mainly discuss how pangenome analysis reveals genetic variations in domestic animals(sheep,cattle,pigs,chickens)and their impacts on phenotypes and how this can contribute to the understanding of species diversity.Additionally,we also go through potential issues and the future perspectives of pangenome research in livestock and poultry. 展开更多
关键词 BREEDING Domestic animals pangenome Structural variations
在线阅读 下载PDF
A graph-based approach for the structural analysis of road and building layouts 被引量:3
7
作者 Mathieu Domingo Rémy Thibaud Christophe Claramunt 《Geo-Spatial Information Science》 SCIE CSCD 2019年第1期59-72,共14页
A better understanding of the relationship between the structure and functions of urban and suburban spaces is one of the avenues of research still open for geographical information science.The research presented in t... A better understanding of the relationship between the structure and functions of urban and suburban spaces is one of the avenues of research still open for geographical information science.The research presented in this paper develops several graph-based metrics whose objective is to characterize some local and global structural properties that reflect the way the overall building layout can be cross-related to the one of the road layout.Such structural properties are modeled as an aggregation of parcels,buildings,and road networks.We introduce several computational measures(Ratio Minimum Distance,Minimum Ratio Minimum Distance,and Metric Compactness)that respectively evaluate the capability for a given road to be connected with the whole road network.These measures reveal emerging sub-network structures and point out differences between less-connective and moreconnective parts of the network.Based on these local and global properties derived from the topological and graph-based representation,and on building density metrics,this paper proposes an analysis of road and building layouts at different levels of granularity.The metrics developed are applied to a case study in which the derived properties reveal coherent as well as incoherent neighborhoods that illustrate the potential of the approach and the way buildings and roads can be relatively connected in a given urban environment.Overall,and by integrating the parcels and buildings layouts,this approach complements other previous and related works that mainly retain the configurational structure of the urban network as well as morphological studies whose focus is generally limited to the analysis of the building layout. 展开更多
关键词 Urban and suburban spaces graph-based modeling structural analysis GIS
原文传递
Pig pangenome graph reveals functional features of non‑reference sequences 被引量:2
8
作者 Jian Miao Xingyu Wei +6 位作者 Caiyun Cao Jiabao Sun Yuejin Xu Zhe Zhang Qishan Wang Yuchun Pan Zhen Wang 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2024年第3期956-970,共15页
Background The reliance on a solitary linear reference genome has imposed a significant constraint on our compre-hensive understanding of genetic variation in animals.This constraint is particularly pronounced for non... Background The reliance on a solitary linear reference genome has imposed a significant constraint on our compre-hensive understanding of genetic variation in animals.This constraint is particularly pronounced for non-reference sequences(NRSs),which have not been extensively studied.Results In this study,we constructed a pig pangenome graph using 21 pig assemblies and identified 23,831 NRSs with a total length of 105 Mb.Our findings revealed that NRSs were more prevalent in breeds exhibiting greater genetic divergence from the reference genome.Furthermore,we observed that NRSs were rarely found within coding sequences,while NRS insertions were enriched in immune-related Gene Ontology terms.Notably,our investigation also unveiled a close association between novel genes and the immune capacity of pigs.We observed substantial differences in terms of frequencies of NRSs between Eastern and Western pigs,and the heat-resistant pigs exhibited a substantial number of NRS insertions in an 11.6 Mb interval on chromosome X.Additionally,we discovered a 665 bp insertion in the fourth intron of the TNFRSF19 gene that may be associated with the ability of heat tolerance in South-ern Chinese pigs.Conclusions Our findings demonstrate the potential of a graph genome approach to reveal important functional features of NRSs in pig populations. 展开更多
关键词 Heat tolerance Immune ability Non-reference sequences Pig pangenome
在线阅读 下载PDF
Low Data Overlab Rate Graph-Based SLAM with Distributed Submap Strategy 被引量:3
9
作者 XIANGjiawei ZHANG Jinyi +1 位作者 WANG Bin MA Yongbin 《Journal of Shanghai Jiaotong university(Science)》 EI 2020年第5期650-658,共9页
Simultaneous localization and mapping(SLAM)is widely used in many robot applications to acquire the unknown environment's map and the robots location.Graph-based SLAM is demonstrated to be effective in large-scale... Simultaneous localization and mapping(SLAM)is widely used in many robot applications to acquire the unknown environment's map and the robots location.Graph-based SLAM is demonstrated to be effective in large-scale scenarios,and it intuitively performs the SLAM as a pose graph.But because of the high data overlap rate,traditional graph-based SLAM is not efficient in some respects,such as real time performance and memory usage.To reduce1 data overlap rate,a graph-based SLAM with distributed submap strategy(DSS)is presented.In its front-end,submap based scan matching is processed and loop closing detection is conducted.Moreover in its back-end,pose graph is updated for global optimization and submap merging.From a series of experiments,it is demonstrated that graph-based SLAM with DSS reduces 51.79%data overlap rate,decreases 39.70%runtime and 24.60%memory usage.The advantages over other low overlap rate method is also proved in runtime,memory usage,accuracy and robustness performance. 展开更多
关键词 graph-based SLAM distributed submap strategy data overlap rate
原文传递
Pangenome and multi-tissue gene atlas provide new insights into the domestication and highland adaptation of yaks 被引量:1
10
作者 Daoliang Lan Wei Fu +10 位作者 Wenhui Ji Tserang‑Donko Mipam Xianrong Xiong Shi Ying Yan Xiong Peng Sheng Jiangping Ni Lijun Bai Tongling Shan Xiangdong Kong Jian Li 《Journal of Animal Science and Biotechnology》 SCIE CAS CSCD 2024年第5期1832-1850,共19页
Background The genetic diversity of yak,a key domestic animal on the Qinghai-Tibetan Plateau(QTP),is a vital resource for domestication and breeding efforts.This study presents the first yak pangenome obtained through... Background The genetic diversity of yak,a key domestic animal on the Qinghai-Tibetan Plateau(QTP),is a vital resource for domestication and breeding efforts.This study presents the first yak pangenome obtained through the de novo assembly of 16 yak genomes.Results We discovered 290 Mb of nonreference sequences and 504 new genes.Our pangenome-wide presence and absence variation(PAV)analysis revealed 5,120 PAV-related genes,highlighting a wide range of variety-specific genes and genes with varying frequencies across yak populations.Principal component analysis(PCA)based on binary gene PAV data classified yaks into three new groups:wild,domestic,and Jinchuan.Moreover,we pro-posed a‘two-haplotype genomic hybridization model'for understanding the hybridization patterns among breeds by integrating gene frequency,heterozygosity,and gene PAV data.A gene PAV-GWAS identified a novel gene(Bos-Gru3G009179)that may be associated with the multirib trait in Jinchuan yaks.Furthermore,an integrated transcrip-tome and pangenome analysis highlighted the significant differences in the expression of core genes and the muta-tional burden of differentially expressed genes between yaks from high and low altitudes.Transcriptome analysis across multiple species revealed that yaks have the most unique differentially expressed m RNAs and lnc RNAs(between high-and low-altitude regions),especially in the heart and lungs,when comparing high-and low-altitude adaptations.Conclusions The yak pangenome offers a comprehensive resource and new insights for functional genomic studies,supporting future biological research and breeding strategies. 展开更多
关键词 High-and low-altitude Novel genes pangenome PAV-GWAS YAK
在线阅读 下载PDF
BotSward: Centrality Measures for Graph-Based Bot Detection Using Machine Learning
11
作者 Khlood Shinan Khalid Alsubhi M.Usman Ashraf 《Computers, Materials & Continua》 SCIE EI 2023年第1期693-714,共22页
The number of botnet malware attacks on Internet devices has grown at an equivalent rate to the number of Internet devices that are connected to the Internet.Bot detection using machine learning(ML)with flow-based fea... The number of botnet malware attacks on Internet devices has grown at an equivalent rate to the number of Internet devices that are connected to the Internet.Bot detection using machine learning(ML)with flow-based features has been extensively studied in the literature.Existing flow-based detection methods involve significant computational overhead that does not completely capture network communication patterns that might reveal other features ofmalicious hosts.Recently,Graph-Based Bot Detection methods using ML have gained attention to overcome these limitations,as graphs provide a real representation of network communications.The purpose of this study is to build a botnet malware detection system utilizing centrality measures for graph-based botnet detection and ML.We propose BotSward,a graph-based bot detection system that is based on ML.We apply the efficient centrality measures,which are Closeness Centrality(CC),Degree Centrality(CC),and PageRank(PR),and compare them with others used in the state-of-the-art.The efficiency of the proposed method is verified on the available Czech Technical University 13 dataset(CTU-13).The CTU-13 dataset contains 13 real botnet traffic scenarios that are connected to a command-and-control(C&C)channel and that cause malicious actions such as phishing,distributed denial-of-service(DDoS)attacks,spam attacks,etc.BotSward is robust to zero-day attacks,suitable for large-scale datasets,and is intended to produce better accuracy than state-of-the-art techniques.The proposed BotSward solution achieved 99%accuracy in botnet attack detection with a false positive rate as low as 0.0001%. 展开更多
关键词 Network security botnet detection graph-based features machine learning measure centrality
在线阅读 下载PDF
A Novel Method for Node Connectivity with Adaptive Dragonfly Algorithm and Graph-Based m-Connection Establishment in MANET
12
作者 S.B.Manoojkumaar C.Poongodi 《Computers, Materials & Continua》 SCIE EI 2020年第11期1649-1670,共22页
Maximizing network lifetime is measured as the primary issue in Mobile Ad-hoc Networks(MANETs).In geographically routing based models,packet transmission seems to be more appropriate in dense circumstances.The involve... Maximizing network lifetime is measured as the primary issue in Mobile Ad-hoc Networks(MANETs).In geographically routing based models,packet transmission seems to be more appropriate in dense circumstances.The involvement of the Heuristic model directly is not appropriate to offer an effectual solution as it becomes NP-hard issues;therefore investigators concentrate on using Meta-heuristic approaches.Dragonfly Optimization(DFO)is an effective meta-heuristic approach to resolve these problems by providing optimal solutions.Moreover,Meta-heuristic approaches(DFO)turn to be slower in convergence problems and need proper computational time while expanding network size.Thus,DFO is adaptively improved as Adaptive Dragonfly Optimization(ADFO)to fit this model and re-formulated using graph-based m-connection establishment(G-𝑚𝑚CE)to overcome computational time and DFO’s convergence based problems,considerably enhancing DFO performance.In(G-𝑚𝑚CE),Connectivity Zone(CZ)is chosen among source to destination in which optimality should be under those connected regions and ADFO is used for effective route establishment in CZ indeed of complete networking model.To measure complementary features of ADFO and(G-𝑚𝑚CE),hybridization of DFO-(G-𝑚𝑚CE)is anticipated over dense circumstances with reduced energy consumption and delay to enhance network lifetime.The simulation was performed in MATLAB environment. 展开更多
关键词 Routing connectivity zone ADFO mobile ad-hoc network graph-based m-connection establishment
在线阅读 下载PDF
Model Change Active Learning in Graph-Based Semi-supervised Learning
13
作者 Kevin S.Miller Andrea L.Bertozzi 《Communications on Applied Mathematics and Computation》 EI 2024年第2期1270-1298,共29页
Active learning in semi-supervised classification involves introducing additional labels for unlabelled data to improve the accuracy of the underlying classifier.A challenge is to identify which points to label to bes... Active learning in semi-supervised classification involves introducing additional labels for unlabelled data to improve the accuracy of the underlying classifier.A challenge is to identify which points to label to best improve performance while limiting the number of new labels."Model Change"active learning quantifies the resulting change incurred in the classifier by introducing the additional label(s).We pair this idea with graph-based semi-supervised learning(SSL)methods,that use the spectrum of the graph Laplacian matrix,which can be truncated to avoid prohibitively large computational and storage costs.We consider a family of convex loss functions for which the acquisition function can be efficiently approximated using the Laplace approximation of the posterior distribution.We show a variety of multiclass examples that illustrate improved performance over prior state-of-art. 展开更多
关键词 Active learning graph-based methods Semi-supervised learning(SSL) Graph Laplacian
在线阅读 下载PDF
Graph-Based Replication and Two Factor Authentication in Cloud Computing
14
作者 S.Lavanya N.M.Saravanakumar 《Computer Systems Science & Engineering》 SCIE EI 2023年第6期2869-2883,共15页
Many cutting-edge methods are now possible in real-time commercial settings and are growing in popularity on cloud platforms.By incorporating new,cutting-edge technologies to a larger extent without using more infrast... Many cutting-edge methods are now possible in real-time commercial settings and are growing in popularity on cloud platforms.By incorporating new,cutting-edge technologies to a larger extent without using more infrastructures,the information technology platform is anticipating a completely new level of devel-opment.The following concepts are proposed in this research paper:1)A reliable authentication method Data replication that is optimised;graph-based data encryp-tion and packing colouring in Redundant Array of Independent Disks(RAID)sto-rage.At the data centre,data is encrypted using crypto keys called Key Streams.These keys are produced using the packing colouring method in the web graph’s jump graph.In order to achieve space efficiency,the replication is carried out on optimised many servers employing packing colours.It would be thought that more connections would provide better authentication.This study provides an innovative architecture with robust security,enhanced authentication,and low cost. 展开更多
关键词 graph-based encryption REPLICATION ENCRYPTION packing coloring jump graph web graph stream cipher key stream
在线阅读 下载PDF
Varigraph: An accurate and widely applicable pangenome graph-based variant genotyper for diploid and polyploid genomes
15
作者 Ze-Zhen Du Jia-Bao He +3 位作者 Pei-Xuan Xiao Jianbing Hu Ning Yang Wen-Biao Jiao 《Molecular Plant》 2025年第9期1587-1601,共15页
Accurate variant genotyping is crucial for genomics-assisted breeding.Graph pangenome references can address single-reference bias,thereby enhancing the performance of variant genotyping and empowering downstream appl... Accurate variant genotyping is crucial for genomics-assisted breeding.Graph pangenome references can address single-reference bias,thereby enhancing the performance of variant genotyping and empowering downstream applications in population genetics and quantitative genetics.However,existing pangenome-based genotyping methods are ineffective in handling large or complex pangenome graphs,particularly in polyploid genomes.Here,we introduce Varigraph,an algorithm that leverages the comparison of unique and repetitive k-mers between variant sites and short reads for genotyping both small and large variants.We evaluated Varigraph on a diverse set of representative plant genomes as well as human genomes.Vari-graph outperforms current state-of-the-art linear and graph-based genotypers across non-human ge-nomes while maintaining comparable genotyping performance in human genomes.By employing efficient data structures including counting Bloom filter and bitmap storage,as well as GPU models,Varigraph achieves improved precision and robustness in repetitive regions while managing computational costs for large datasets.Its wide applicability extends to highly repetitive or large genomes,such as those of maize and wheat.Significantly,Varigraph can handle extensive pangenome graphs,as demonstrated by its performance on a dataset containing 252 rice genomes,for which it achieved a precision exceeding 0.9 for both small and large variants.Notably,Varigraph is capable of effectively utilizing pangenome graphs for genotyping autopolyploids,enabling precise determination of allele dosage.In summary,this work provides a robust and accurate solution for genotyping plant genomes and will advance plant genomic studiesandgenomics-assistedbreeding. 展开更多
关键词 pangenome graph variant genotyping structural variant POLYPLOID
原文传递
Plant graph-based pangenomics:techniques,applications,and challenges
16
作者 Ze-Zhen Du Jia-Bao He Wen-Biao Jiao 《aBIOTECH》 2025年第2期361-376,共16页
Innovations in DNA sequencing technologies have greatly boosted population-level genomic studies in plants,facilitating the identification of key genetic variations for investigating population diversity and accelerat... Innovations in DNA sequencing technologies have greatly boosted population-level genomic studies in plants,facilitating the identification of key genetic variations for investigating population diversity and accelerating the molecular breeding of crops.Conventional methods for genomic analysis typically rely on small variants,such as SNPs and indels,and use single linear reference genomes,which introduces biases and reduces performance in highly divergent genomic regions.By integrating the population level of sequences,pangenomes,particularly graph pangenomes,offer a promising solution to these challenges.To date,numerous algorithms have been developed for constructing pangenome graphs,aligning reads to these graphs,and performing variant genotyping based on these graphs.As demonstrated in various plant pangenomic studies,these advancements allow for the detection of previously hidden variants,especially structural variants,thereby enhancing applications such as genetic mapping of agronomically important genes.However,noteworthy challenges remain to be overcome in applying pangenome graph approaches to plants.Addressing these issues will require the development of more sophisticated algorithms tailored specifically to plants.Such improvements will contribute to the scalability of this approach,facilitating the production of super-pangenomes,in which hundreds or even thousands of de novo–assembled genomes from one species or genus can be integrated.This,in turn,will promote broader pan-omic studies,further advancing our understanding of genetic diversity and driving innovations in crop breeding. 展开更多
关键词 Crop breeding Genome graph GENOTYPING pangenome Structural variation
原文传递
A pangenomic study of Bacillus thuringiensis 被引量:1
17
作者 Yongjun Fang Zhaolong Li +10 位作者 Jiucheng Liu Changlong Shu Xumin Wang Xiaowei Zhang Xiaoguang Yu Duojun Zhao Guiming Liu Songnian Hu Jie Zhang Ibrahim Al-Mssallem Jun Yu 《Journal of Genetics and Genomics》 SCIE CAS CSCD 2011年第12期567-576,共10页
Bacillus thuringiensis(B.thuringiensis) is a soil-dwelling Gram-positive bacterium and its plasmid-encoded toxins(Cry) are commonly used as biological alternatives to pesticides.In a pangenomic study,we sequenced ... Bacillus thuringiensis(B.thuringiensis) is a soil-dwelling Gram-positive bacterium and its plasmid-encoded toxins(Cry) are commonly used as biological alternatives to pesticides.In a pangenomic study,we sequenced seven B.thuringiensis isolates in both high coverage and base-quality using the next-generation sequencing platform.The B.thuringiensis pangenome was extrapolated to have 4196 core genes and an asymptotic value of 558 unique genes when a new genome is added.Compared to the pangenomes of its closely related species of the same genus,B.thuringiensis pangenome shows an open characteristic,similar to B.cereus but not to B.anthracis;the latter has a closed pangenome. We also found extensive divergence among the seven B.thuringiensis genome assemblies,which harbor ample repeats and single nucleotide polymorphisms(SNPs).The identities among orthologous genes are greater than 84.5%and the hotspots for the genome variations were discovered in genomic regions of 2.3-2.8 Mb and 5.0-5.6 Mb.We concluded that high-coverage sequence assemblies from multiple strains, before all the gaps are closed,are very useful for pangenomic studies. 展开更多
关键词 Bacillus thuringiensis(B.thuringiensis) Pseudo-chromosome pangenome
原文传递
作物泛基因组研究进展与展望 被引量:4
18
作者 王晖 丁保朋 +4 位作者 李彧贤 任泉如 周海 赵均良 胡海飞 《中国农业科学》 北大核心 2025年第11期2045-2061,共17页
全球人口的持续增长和气候变化给粮食供给带来了严峻挑战,粮食安全问题因此愈发突出。为了满足不断增长的人口对粮食的需求,提升作物产量并增强其对环境的适应性已成为农业领域的重要目标。在此背景下,基因组学被视为加速作物育种进程... 全球人口的持续增长和气候变化给粮食供给带来了严峻挑战,粮食安全问题因此愈发突出。为了满足不断增长的人口对粮食的需求,提升作物产量并增强其对环境的适应性已成为农业领域的重要目标。在此背景下,基因组学被视为加速作物育种进程的重要手段。通过深入挖掘和利用作物优异功能基因信息,不仅能有效提高作物产量,还能增强其抗逆性和适应性,为保障全球粮食安全和实现农业可持续发展提供有力支撑。然而,传统的单一参考基因组往往无法全面反映作物在驯化和改良过程中所累积的所有基因组变异,导致研究者对功能基因及其调控网络的认识存在局限。随着高通量测序技术的不断发展,基因组学研究开始迈入泛基因组学时代。通过整合多个高质量基因组,构建涵盖物种基因序列全集的泛基因组,能精准地鉴定包括单核苷酸多态性(SNPs)及结构变异(SVs)在内的多种遗传变异,全面地捕获物种在不同品种、亚种及野生亲缘种中广泛存在的遗传多样性,为系统挖掘优异功能基因提供更完善的分析框架。通过结合多组学数据(如转录组、蛋白质组、表观组等),泛基因组研究能在更精细的水平上挖掘优异功能基因,进而为分子育种提供更具针对性和准确性的基因靶标。同时,借助CRISPR-Cas9等基因编辑技术,可进一步对重要基因位点进行定向改造,剔除影响作物生长的不利性状或强化其对环境胁迫的抗性,从而为培育兼具高产、优质和抗逆特性的新一代作物品种奠定坚实基础。本文阐述了目前泛基因组的主要构建方法及展现形式的研究进展,并系统地梳理了作物泛基因组的发展及其在育种改良中的应用,深入探讨了泛基因组在未来作物育种中面临的挑战,对如何更好地应用泛基因组进行作物遗传改良进行了讨论,为未来精准分子育种改良提供了新的思路和策略。 展开更多
关键词 分子育种 泛基因组 结构变异 新质生产力 遗传多样性
在线阅读 下载PDF
上一页 1 2 5 下一页 到第
使用帮助 返回顶部