Human leukocyte antigen(HLA)genes in the major histocompatibility complex(MHC)region are crucial for immunity and are associated with numerous diseases and phenotypes.The MHC region’s complexity and high genetic dive...Human leukocyte antigen(HLA)genes in the major histocompatibility complex(MHC)region are crucial for immunity and are associated with numerous diseases and phenotypes.The MHC region’s complexity and high genetic diversity make it challenging to analyze using short-read sequencing(SRS)technology.We sequence the MHC region of 100 Han Chinese individuals using both long-read sequencing(LRS)and SRS platforms at approximately 30X coverage to study genetic alterations and their potential functional impacts.LRS provides significantly greater coverage of the MHC region and eight classical HLA genes,particularly at the HLA-DRB1 locus,compared with SRS.We detect 78,249 single nucleotide polymorphisms(SNPs)using LRS,with 26.0%undetectable by SRS.Based on SNP and inferred HLA allele types,we construct an LRS-based MHC reference panel for the Han Chinese,containing approximately 2.6 times more genetic variants than the SRS-based Han-MHC reference panel.A phenome-wide association study assessing 26,024 phenotypes across 15 categories identifies significant associations for 7,879 independent variants(including 809 LRS-specific SNPs)with 409 phenotypes in nine categories.This analysis reveals 24 unreported HLA allele associations in the bioelectric and cellular categories.The conditional analysis identifies 530 independent signals across the 409 phenotypes,including 28 previously unreported signals of eight classical HLA genes associated with 33 phenotypes.Of the top-associated SNPs,191 are detected by LRS only.Fine-mapping identifies 126 independent candidate causal SNPs for three immune-related cellular phenotypes,with 17 detected exclusively by LRS.Our study reveals previously unreported variants and their functional impacts in the MHC region,enhancing our understanding of genetic diversity and its potential biological implications in the Han Chinese population.展开更多
Neurodegenerative diseases cause great medical and economic burdens for both patients and society;however, the complex molecular mechanisms thereof are not yet well understood. With the development of high-coverage se...Neurodegenerative diseases cause great medical and economic burdens for both patients and society;however, the complex molecular mechanisms thereof are not yet well understood. With the development of high-coverage sequencing technology, researchers have started to notice that genomic repeat regions, previously neglected in search of disease culprits, are active contributors to multiple neurodegenerative diseases. In this review, we describe the association between repeat element variants and multiple degenerative diseases through genome-wide association studies and targeted sequencing. We discuss the identification of disease-relevant repeat element variants, further powered by the advancement of long-read sequencing technologies and their related tools, and summarize recent findings in the molecular mechanisms of repeat element variants in brain degeneration, such as those causing transcriptional silencing or RNA-mediated gain of toxic function. Furthermore, we describe how in silico predictions using innovative computational models, such as deep learning language models, could enhance and accelerate our understanding of the functional impact of repeat element variants. Finally, we discuss future directions to advance current findings for a better understanding of neurodegenerative diseases and the clinical applications of genomic repeat elements.展开更多
Over the past decade,nanopore sequencing has experienced significant advancements and changes,transitioning from an initially emerging technology to a significant instrument in the field of genomic sequencing.However,...Over the past decade,nanopore sequencing has experienced significant advancements and changes,transitioning from an initially emerging technology to a significant instrument in the field of genomic sequencing.However,as advancements in next-generation sequencing technology persist,nanopore sequencing also improves.This paper reviews the developments,applications,and outlook on nanopore sequencing technology.Currently,nanopore sequencing supports both DNA and RNA sequencing,making it widely applicable in areas such as telomere-to-telomere(T2T)genome assembly,direct RNA sequencing(DRS),and metagenomics.The openness and versatility of nanopore sequencing have established it as a preferred option for an increasing number of research teams,signaling a transformative influence on life science research.As the nanopore sequencing technology advances,it provides a faster,more costeffective approach with extended read lengths,demonstrating the significant potential for complex genome assembly,pathogen detection,environmental monitoring,and human disease research,offering a fresh perspective in sequencing technologies.展开更多
基金supported by the National Natural Science Foundation of China(32370686)the National Key Research and Development Program of China(2021YFC2500202)+2 种基金the 111 Project(B13016)Shanghai Municipal Science and Technology(2017SHZDZX01)supported by the Human Phenome Data Center at Fudan University.
文摘Human leukocyte antigen(HLA)genes in the major histocompatibility complex(MHC)region are crucial for immunity and are associated with numerous diseases and phenotypes.The MHC region’s complexity and high genetic diversity make it challenging to analyze using short-read sequencing(SRS)technology.We sequence the MHC region of 100 Han Chinese individuals using both long-read sequencing(LRS)and SRS platforms at approximately 30X coverage to study genetic alterations and their potential functional impacts.LRS provides significantly greater coverage of the MHC region and eight classical HLA genes,particularly at the HLA-DRB1 locus,compared with SRS.We detect 78,249 single nucleotide polymorphisms(SNPs)using LRS,with 26.0%undetectable by SRS.Based on SNP and inferred HLA allele types,we construct an LRS-based MHC reference panel for the Han Chinese,containing approximately 2.6 times more genetic variants than the SRS-based Han-MHC reference panel.A phenome-wide association study assessing 26,024 phenotypes across 15 categories identifies significant associations for 7,879 independent variants(including 809 LRS-specific SNPs)with 409 phenotypes in nine categories.This analysis reveals 24 unreported HLA allele associations in the bioelectric and cellular categories.The conditional analysis identifies 530 independent signals across the 409 phenotypes,including 28 previously unreported signals of eight classical HLA genes associated with 33 phenotypes.Of the top-associated SNPs,191 are detected by LRS only.Fine-mapping identifies 126 independent candidate causal SNPs for three immune-related cellular phenotypes,with 17 detected exclusively by LRS.Our study reveals previously unreported variants and their functional impacts in the MHC region,enhancing our understanding of genetic diversity and its potential biological implications in the Han Chinese population.
基金supported by the National Natural Science Foundation of China, No.61932008Natural Science Foundation of Shanghai, No.21ZR1403200 (both to JC)。
文摘Neurodegenerative diseases cause great medical and economic burdens for both patients and society;however, the complex molecular mechanisms thereof are not yet well understood. With the development of high-coverage sequencing technology, researchers have started to notice that genomic repeat regions, previously neglected in search of disease culprits, are active contributors to multiple neurodegenerative diseases. In this review, we describe the association between repeat element variants and multiple degenerative diseases through genome-wide association studies and targeted sequencing. We discuss the identification of disease-relevant repeat element variants, further powered by the advancement of long-read sequencing technologies and their related tools, and summarize recent findings in the molecular mechanisms of repeat element variants in brain degeneration, such as those causing transcriptional silencing or RNA-mediated gain of toxic function. Furthermore, we describe how in silico predictions using innovative computational models, such as deep learning language models, could enhance and accelerate our understanding of the functional impact of repeat element variants. Finally, we discuss future directions to advance current findings for a better understanding of neurodegenerative diseases and the clinical applications of genomic repeat elements.
基金financially supported by the Natural Science Foundation of China(32470055 and U23A20148)the China Postdoctoral Science Foundation(2024M753580)the Agricultural Science and Technology Innovation Program(CAAS-ZDRW202308)。
文摘Over the past decade,nanopore sequencing has experienced significant advancements and changes,transitioning from an initially emerging technology to a significant instrument in the field of genomic sequencing.However,as advancements in next-generation sequencing technology persist,nanopore sequencing also improves.This paper reviews the developments,applications,and outlook on nanopore sequencing technology.Currently,nanopore sequencing supports both DNA and RNA sequencing,making it widely applicable in areas such as telomere-to-telomere(T2T)genome assembly,direct RNA sequencing(DRS),and metagenomics.The openness and versatility of nanopore sequencing have established it as a preferred option for an increasing number of research teams,signaling a transformative influence on life science research.As the nanopore sequencing technology advances,it provides a faster,more costeffective approach with extended read lengths,demonstrating the significant potential for complex genome assembly,pathogen detection,environmental monitoring,and human disease research,offering a fresh perspective in sequencing technologies.