Intergenic subset organization within a set of geographically-defined viral sequences from the 2009 H1N1 influenza A pandemic

Intergenic subset organization within a set of geographically-defined viral sequences from the 2009 H1N1 influenza A pandemic

暂未订购

导出

摘要 We report a bioinformatic analysis of the datasets of sequences of all ten genes from the 2009 H1N1 influenza A pandemic in the state of Wisconsin. The gene with the greatest summed information entropy was found to be the hemagglutinin (HA) gene. Based upon the viral ID identifier of the HA gene sequence, the sequences of all of the genes were sorted into two subsets, depending upon whether the nucleotide occupying the position of maximum entropy, position 658 of the HA sequence, was either A or U. It was found that the information entropy (H) distributions of subsets differed significantly from each other, from H distributions of randomly generated subsets and from the H distributions of the complete datasets of each gene. Mutual information (MI) values facilitated identification of nine nucleotide positions, distributed over seven of the influenza genes, at which the nucleotide subsets were disjoint, or almost disjoint. Nucleotide frequencies at these nine positions were used to compute mutual information values that subsequently served as weighting factors for edges in a graph net-work. Seven of the nucleotide positions in the graph network are sites of synonymous mutations. Three of these sites of synonymous mutation are within a single gene, the M1 gene, which occupied the position of greatest graph centrality. It is proposed that these bioinformatic and network graph results may reflect alterations in M1-mediated viral packaging and exteriorization, known to be susceptible to synonymous mutations. We report a bioinformatic analysis of the datasets of sequences of all ten genes from the 2009 H1N1 influenza A pandemic in the state of Wisconsin. The gene with the greatest summed information entropy was found to be the hemagglutinin (HA) gene. Based upon the viral ID identifier of the HA gene sequence, the sequences of all of the genes were sorted into two subsets, depending upon whether the nucleotide occupying the position of maximum entropy, position 658 of the HA sequence, was either A or U. It was found that the information entropy (H) distributions of subsets differed significantly from each other, from H distributions of randomly generated subsets and from the H distributions of the complete datasets of each gene. Mutual information (MI) values facilitated identification of nine nucleotide positions, distributed over seven of the influenza genes, at which the nucleotide subsets were disjoint, or almost disjoint. Nucleotide frequencies at these nine positions were used to compute mutual information values that subsequently served as weighting factors for edges in a graph net-work. Seven of the nucleotide positions in the graph network are sites of synonymous mutations. Three of these sites of synonymous mutation are within a single gene, the M1 gene, which occupied the position of greatest graph centrality. It is proposed that these bioinformatic and network graph results may reflect alterations in M1-mediated viral packaging and exteriorization, known to be susceptible to synonymous mutations.

作者 William A. Thompson Joel K. Weltman

机构地区 Department of Medicine Division of Applied Mathematics and Center for Computational Molecular Biology

出处《American Journal of Molecular Biology》 2012年第1期32-41,共10页 美国分子生物学期刊（英文）

关键词 Influenza A H1N1 Bioinformatics Genes PANDEMIC Epidemic Information Entropy MutualInFormation Graph Network CENTRALITY SUBSETS Influenza A H1N1 Bioinformatics Genes Pandemic Epidemic Information Entropy MutualInFormation Graph Network Centrality Subsets

分类号 R73 [医药卫生—肿瘤]

引文网络
相关文献

1Hao Lin,Qian-Zhong Li,Cui-Xia Chen.Analysis and prediction of exon, intron, intergenic region and splice sites for A. thaliana and C. elegans genomes[J].Journal of Biomedical Science and Engineering,2009,2(6):367-373.
2Sarbottam Piya,Madhav P. Nepal.Characterization of Nuclear and Chloroplast Microsatellite Markers for <i>Falcaria vulgaris</i>(Apiaceae)[J].American Journal of Plant Sciences,2013,4(3):590-595.
3Donghui Song,Jing Li,Xiaoxu Hu,Bo Xi.Construction of a Shuttle Vector for Heterologous Gene Expression in Escherichia coli and Microalgae Anabaena[J].Engineering（科研）,2013,5(10):540-544. 被引量：2
4周原世,徐书婉,关沧海,胡增涛,姜兴明.恶性肿瘤中GAPLINC的调控作用及其与患者预后的关系[J].中华病理学杂志,2019,48(11):902-905. 被引量：3
5Xuejun Jiao,Jing Bai,Shanguang Chen,Qijie Li.Monitoring Mental Fatigue in Analog Space Environment Using Optical Brain Imaging[J].Engineering（科研）,2013,5(5):53-57. 被引量：3
6Kathryn J. Pederson,M. Nawal Lutfiyya,Laura C. Palombi,David R. Simmons,Darin J. Steenerson,Kenzie G. Hohman,Krista L. Huot.Cross-sectional population based study ascertaining the characteristics of US rural adults with mental health concerns who perceived a stigma regarding mental health issues[J].Health,2013,5(4):695-702.
7Wei Hu.Mutations in Hemagglutinin of H5N1 Influenza That Switch Receptor Specificity from Avian to Human Types[J].Computational Molecular Bioscience,2013,3(2):32-37. 被引量：1
8Maciste H. Macías-Cervantes,Juan M. Guzmán-Flores,Katya Vargas-Ortiz,Francisco J. Díaz-Cisneros,Joel Ramírez-Emiliano,Victoriano Pérez-Vázquez.Effect of Aerobic Exercise on Protein Expression in Muscle of Obese Mexican Adolescents: A Proteomic and Bioinformatic Analysis[J].Natural Science,2014,6(9):641-650.
9Zhiben Zhuang,Jing Wang,Jingyi Liu,Dingding Yang,Shiqiang Chen.A New Digital Image Encryption Algorithm Based on Improved Logistic Mapping and Josephus Circle[J].Journal of Computer and Communications,2018,6(6):31-44. 被引量：1
10Branko Blagojevic,Kalman Ziha.Probabilistic Indicators of Structural Redundancy in Mechanics[J].World Journal of Mechanics,2012,2(5):229-238.

American Journal of Molecular Biology

2012年第1期

浏览历史

内容加载中请稍等...

Intergenic subset organization within a set of geographically-defined viral sequences from the 2009 H1N1 influenza A pandemic

相关作者

相关机构

相关主题

浏览历史