
Overview of DNA Sequence Data Compression Techniques (cited by: 9)
Abstract: DNA sequence data compression techniques are data-processing methods that construct encoding algorithms tailored to the characteristics of DNA data in order to improve overall compression efficiency. This paper introduces the basic concepts and data characteristics of DNA sequences, gives a general description of DNA sequence compression algorithms, reviews typical DNA sequence compression algorithms, and presents the key metrics for evaluating their performance. It closes with an outlook on future trends in DNA sequence compression algorithms.
Source: Acta Electronica Sinica (《电子学报》; indexed in EI, CAS, CSCD, and the Peking University Core Journals list), 2010, No. 5, pp. 1113-1121 (9 pages)
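Funding: National Natural Science Foundation of China (No. 60872125); Fok Ying Tung Education Foundation for Young Teachers in Higher Education Institutions (basic research project)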
Keywords: DNA data compression; DNA compression algorithms; compression encoding; compression evaluation standard
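The abstract describes encoders built around the characteristics of DNA data and metrics for judging them. As a minimal illustration, not taken from the paper, the sketch below packs a sequence over the four-letter alphabet A/C/G/T into 2 bits per base and reports bits per base, a figure commonly used to compare DNA compressors. The names `pack`, `unpack`, and `bits_per_base` are hypothetical, and the sketch assumes the input contains only uppercase A, C, G, and T.

```python
# Illustrative sketch only; the function names are hypothetical and the
# approach (fixed 2-bit packing) is a simple baseline, not the paper's method.
CODES = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
BASES = "ACGT"


def pack(seq: str) -> bytes:
    """Pack an A/C/G/T string into bytes, four bases per byte."""
    out = bytearray()
    for i in range(0, len(seq), 4):
        chunk = seq[i:i + 4]
        byte = 0
        for base in chunk:
            byte = (byte << 2) | CODES[base]
        # Left-align a short final chunk so unpack() reads it from the top bits.
        byte <<= 2 * (4 - len(chunk))
        out.append(byte)
    return bytes(out)


def unpack(data: bytes, n_bases: int) -> str:
    """Recover the first n_bases bases from packed data."""
    bases = []
    for byte in data:
        for shift in (6, 4, 2, 0):
            bases.append(BASES[(byte >> shift) & 0b11])
    return "".join(bases[:n_bases])


def bits_per_base(compressed_size_bytes: int, n_bases: int) -> float:
    """Average number of output bits spent on each input base."""
    return 8 * compressed_size_bytes / n_bases
```

Fixed 2-bit packing exploits nothing beyond the alphabet size, so it is only a reference point: an encoder that models the data well would need to score below 2 bits per base to be worthwhile.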


