期刊文献+

临床行为模式挖掘的数据预处理 被引量:4

Data preprocess for mining clinical behavior patterns
在线阅读 下载PDF
导出
摘要 临床行为数据经清理后仍然存在时间关系噪音,直接用于序列挖掘算法难以发现高质量的模式。提出了一种时间规范化模型,该模型定义了时序行为的顺序和并列关系,针对所给出的关系进行相交系数的计算,根据计算结果确定行为时间关系中的噪音,遵循规范后的所有行为相互之间既无噪音又保持原正确关系不变的准则,进行噪音清除。针对模型进行了算法实现,对样本数据的测试结果表明,经处理后的数据满足了后续的模式挖掘的要求。 The time relationship noises still exist even after the clinical behavior data are cleaned, so it is difficult to discover high quality patterns from such data using sequential mining algorithms. A model for normalization is proposed, which defines ordinal and parallel relationships of the temporal behaviors. The intersection coefficient is worked out using the given relationships, according to the calculated results, the noises in relatioships is determined, and then the work of eliminating noises is carried out complying with the guideline that no noises exist and original correct relationships are kept among the normalized behaviors. To test the sampling data, an algorithm for the model is implemented. The testing results show the clinical data processed by the algorithm can fulfil following data mining needs.
作者 王珏 杨鹤标
出处 《计算机工程与设计》 CSCD 北大核心 2009年第2期374-377,共4页 Computer Engineering and Design
基金 国家自然科学基金项目(60572112) 江苏省高技术研究基金项目(BG2007028)
关键词 临床行为 数据预处理 数据清理 时间规范化 时间关系噪音 clinical behavior data preprocessing data cleaning time normalization time-relationship noise
  • 相关文献

参考文献8

二级参考文献28

  • 1毕方明,张永平.数据挖掘技术研究[J].计算机工程与设计,2004,25(12):2242-2244. 被引量:28
  • 2JiaweiHan MichelineKambr.数据挖掘--概念与技术(影印版)[M].北京:高等教育出版社,2001..
  • 3Jiawei Hah.Data mining concepts and techniques [M].USA:Morgan Kaufmann Publishers,2000.
  • 4Hong T P,Chen J B.Processing individual fuzzy attributes for fuzzy rules induction [J].Fuzzy Sets and Systems,2000,(1):127-140.
  • 5Gonzalez A.A learning methodology in uncertain and imprecise environments [J].Intemat J Intell Systems,1995,(10):57-371.
  • 6Ordonez C,Santana C A,de Braal L.Discovery interesting association rules in medical data[EB/OL].http://citeseer.nj.nec.com/ordonez00 discovering.html.
  • 7Hu X.DB-HReduction:A data preprocessing algorithm for data mining applications[J].Applied Mathematics Letters, 2003 ; 16 (6) : 889-895.
  • 8Debuse D,Rayward-Smith V.Feature subset selection within a simulated annealing data mining algorithm[J].Joumal of Intelligent Information Systems, 1997 ; (9) : 57-81.
  • 9Pawlak Z et al.Rough sets:probabilistic versus deterministic approach[J].Intemational Journal of Man-Machine Studies, 1988 ; (29) : 81-95.
  • 10Witten l,Frank E.Data Mining:Practical machine learning tools with Java implementations[M].San Francisco:Morgan Kaufmann,2000.

共引文献22

同被引文献20

引证文献4

二级引证文献10

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部