期刊文献+

基于Pearson系数的芯片数据预处理方法 被引量:1

在线阅读 下载PDF
导出
摘要 数据预处理可以大大降低数据挖掘算法的成本和提高数据挖掘的效率,尤其对于海量和高维的基因表达数据更为重要。针对K-means算法对数据预处理手段敏感的问题,文章提出了一种以管家基因法初始化数据、Pearson系数度量芯片数据相似性的预处理方法。具体的实验数据证明了该方法能很好地解决上述问题并有效地提高k-means算法的收敛速度。
出处 《计算机时代》 2006年第11期37-38,共2页 Computer Era
基金 国家863计划资助项目(2003AA116060)
  • 相关文献

参考文献7

  • 1许海洋,汪国安,王万森.模糊聚类分析在数据挖掘中的应用研究[J].计算机工程与应用,2005,41(17):177-179. 被引量:26
  • 2Eli Eisenberg.Human housekeeping genes are compact [J].TRENDS in Genetics,2003.19(7):362-364
  • 3刘章文,任天怡,古天祥.3维数据的置信区间及异常数据的修复[J].计算机集成制造系统,2005,11(4):597-600. 被引量:5
  • 4贺宪民 ,武建虎 ,贺佳 ,XIANG Zhaoying .小样本情况下差异表达基因鉴别的参数统计分析[J].中国卫生统计,2005,22(3):141-145. 被引量:10
  • 5Christelle Hennequet-Antier. A set of SAS macros for the analysis of variance of gene expression data[J].BMC Bioinformatics,2005.6:150
  • 6Joaquin Dopazo. Methods and approaches in the analysis of gene expression data [J]. Journal of immunological methods,2001.250(1-2):93-112
  • 7Ka Yee Yeung. Details of the adjusted rand index and clustering algorithms supplement to the paper "An empirical study on principal component analysis for clustering gene expression data" (to appear in bioinformatics) [DB/OL]. http://faculty.washington.edu/. May 3,2001.

二级参考文献16

  • 1MehmedKantardzic.数据挖掘[M].清华大学出版社,2003..
  • 2JALKIS J A. Three dimensional in spection using multisoript structured light [J]. Optical Engineering, 1985, 24(6): 966-1002.
  • 3MURALI S, TAE C. Accurate recovery of three-dimensional shape from image focus [J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 1995, 17(3): 266-274.
  • 4TORRIERI D J. Methods, statistics, and theory of delay calibration [J]. IEEE Transactions on Instrumentation and Measurement, 1975, 24(2): 96-105.
  • 5FERDINAND V D H. Edge and line feature extraction based on covaiance models [J]. IEEE Transactions on Pattern Analysis Machine Intelligence, 1995, 17(1): 16-33.
  • 6Tanaka TS, Jaradat SA, Lira MK, et al. Genome-wide expression profiling of mid-gestation placenta and embryo using a 15,000 mouse developmental cDNA microarray. Proc Natl Acad Sci USA, 2000, 97 (16) :9127-9132.
  • 7Arfin SM, Long AD, Ito ET, et al. Global gene expression profiling in Escherichia coli K12. The effects of integration host factor. J Biol Chem, 2000,275 (38) : 29672-29684.
  • 8Yang Ⅳ, Chen E, Hasseman Jp, et al. Within the fold: assessing differential expression measures and reproducibility in microarray assays-Genome Biology, 2002, 3( 11 ) :research0062.1-0062.12.
  • 9Wright GW, Simon RM. A random variance model for detection of differential gene expression in small microarray experiments. Bioinformatics,2003, 19(18) : 2448-55.
  • 10Cui X, Churchill GA. Statistical tests for differential expression in cDNA microarray experiments. Genome Biol, 2003, 4(4) : 210 (1-10).

共引文献35

同被引文献7

引证文献1

二级引证文献9

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部