期刊文献+

超级计算机错误预测模型研究

Analysis on failure prediction model of supercomputer
在线阅读 下载PDF
导出
摘要 错误预测对于提高计算机系统的运算稳定性有重要意义,日志分析是建立错误预测模型的有效方法。在同类型错误的时间预测模型的基础之上,通过日志分析建立了不同类型错误之间的关联模式,并在此基础上建立了基于关联模式的错误预测模型,填补了时间预测模型在错误发生后的短时间内无能为力的缺陷,提高了预测率,并在IBM的BlueGene/L的系统日志数据上验证了关联模式错误预测模型的有效性。 Failure prediction is of great significance for improving the stability of the computer system and log analysis is an effective way of establishing the failure prediction model.On the basis of time prediction model for the same kind of failure,this article establishes the associated mode for different kinds of failure by the log analysis and the failure prediction model based on the associated mode,which fills the disfigurement of time prediction model’s powerlessness within a short time after the occurrence of failure and improves the rate of predication greatly.The validity of the failure prediction model based on associated mode is tested by the data of BlueGene/L’s system log of IBM.
出处 《计算机工程与应用》 CSCD 北大核心 2010年第20期126-128,141,共4页 Computer Engineering and Applications
基金 国家自然科学基金No.60873031 the National Natural Science Foundation of China under Grant No.60873031
关键词 关联模式 错误预测 日志分析 BlueGene/L associated mode failure prediction log analysis BlueGene/L
  • 相关文献

参考文献15

  • 1Sandia[EB/OL].http://www.cs.sandia.gov/-jrstear/logs.
  • 2Oliner A J,Stearley J.What supercomputers say:A study of five system logs[C] //Proceedings of the 37th Annual IEEE/IFIP International Conference on Dependable Systems and Networks.[S.l.] :IEEE Computer Society Press,2007:575-584.
  • 3Adiga N R,BlueGene/L Team.An overview of the BlueGene/L supercomputer[C] //Proceedings of the 2002 ACM/IEEE Conference on Supercomputing.[S.l.] :IEEE Computer Society Press,2002:1-22.
  • 4BG/L control system software[EB/OL].http://www.mcs.anl.gov/-beckman/bluegene/SSW-Utah-2005/BGL-SSW07-ControlSys.pdf.
  • 5Liang Y,Zhang Y,Sivasusubramanian A,et al.Filtering failure logs for a BlueGene/L prototype[C] //Proceedings of the 2005International Conference on Dependable System and Networks.[S.l.] :IEEE Computer Society Press,2005:476-485.
  • 6Liang Y,Zhang Y,Jette M,et al.BlueGene/L failure analysis and prediction models[C] //Procecdings of the 2006 International Conference on Dependable System and Networks.[S.l.] :IEEE Computer Society Press,2006:425-434.
  • 7Liang Y,Zhang Y,Xiong H,et al.Failure prediction in IBM BlueGene/L event logs[C] //Proccedings of Seventh IEEE International Conference on Data Mining.[S.l.] :IEEE Computer Society Press,2007:583-588.
  • 8Stearicy J,Oliner A J.Bad words:Finding faults in spirit's syslogs[C] //Proceedings of the Eighth IEEE International Symposium on Cluster Computing and the Grid.[S.l.] :IEEE Computer Society Press,2008:765-770.
  • 9Oliner A J,Stearley J.Alert detection in system logs[C] //Proceedings of the Eighth IEEE International Conference on Data Mining.[S.l.] :IEEE Computer Society Press,2008:959-964.
  • 10周琪锋.基于网络日志的安全审计系统的研究与设计[J].计算机技术与发展,2009,19(11):139-142. 被引量:15

二级参考文献24

共引文献39

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部