期刊文献+

基于SBWS__GPR预测模型的不确定性多数据流异常检测方法 被引量:9

Outlier detection of uncertainty multiple data stream based on SBWS__GPR prediction model
在线阅读 下载PDF
导出
摘要 针对实际系统中采集的数据流的不确定性,给异常点检测与修正带来了现实挑战。因此,根据滑动基本窗口采样算法(sliding basic windows sampling,SBWB)与高斯过程回归(Gaussian process regression,GPR)模型的特性,提出了基于SBWS_GPR预测模型的不确定性多数据流的异常检测方法。在基于时间序列采集的历史数据集中,引入索引号,对历史数据集进行聚类,分析数据集与索引号的映射关系,将实时获得的输入数据流通过滑动窗口匹配,实现对单数据流的异常点检测与修正。再利用输入、输出数据间的相关性,基于GPR建立预测模型,比较实时观察的输出数据流与预测模型的输出数据流,最终从输入、输出两种不同通道实现多数据流的异常检测与修正。 The uncertainty of collecting data stream in practical system brings a serious challenge for outlier detection and correction. Based on the characteristic of sliding basic windows sampling (SBWS) and Ganssian process regression (GPR), this paper proposed the outlier detection method of uncertainty multiple data stream based on SBWS_GPR prediction model. By collecting historical data set based on time series and introducing index number, cluster and analysis historical data set and got the mapping relation between the data set and index number. The real-time input data stream obtained was to realize outlier detection and correction by the sliding window pattern. And then based on the correlation between the input and output data and the GPR, set up prediction model and compared the real-time output data stream data with the prediction output data stream, to realize outlier detection and correction from two different input and output channels.
出处 《计算机应用研究》 CSCD 北大核心 2018年第2期381-385,共5页 Application Research of Computers
基金 国家自然科学基金资助项目(61702228) 江苏省自然科学基金资助项目(BK20170198) 江苏省博士后科研项目(1601012A) 江苏省"六大人才高峰"计划资助项目(DZXX-026)
关键词 不确定性 数据流 高斯过程回归 索引号 滑动窗口 uncertainty data stream GPR index number sliding window
  • 相关文献

参考文献6

二级参考文献118

  • 1常建龙,曹锋,周傲英+.基于滑动窗口的进化数据流聚类[J].软件学报,2007,18(4):905-918. 被引量:61
  • 2苏国韶,燕柳斌,张小飞,江权.基坑位移时间序列预测的高斯过程方法[J].广西大学学报(自然科学版),2007,32(2):223-226. 被引量:25
  • 3Babcock B,Babu S,Datar M,Motwani R,Widom J.Models and issues data stream systems.In:Proc.of the 21st ACM SIGACT-SIGMOD-SIGART Symp.on Principles of Database Systems.Madison:ACM,2002.1-16.
  • 4Aggarwal CC,Han JW,Yu PS.A framework for clustering evolving data streams.In:Proc.of the 29th Int'l Conf.on Very Large Data Bases.Berlin:Morgan Kaufmann Publishers,2003.81-92.
  • 5Aggarwal CC,Yu PS.A framework for clustering uncertain data streams.In:Proc.of the 24th Int'l Conf.on Data Engineering.Cancún:IEEE,2008.150-159.
  • 6Callaghan LO,Mishra N,Meyerson A,Guha S,Motwani R.Streaming-Data algorithms for high-quality clustering.In:Proc.of the 18th Int'l Conf.on Data Engineering.San Jose:IEEE,2002.685-694.
  • 7Zhu WH,Yin J,Xie YH.Arbitrary shape cluster algorithm for clustering data stream.Journal of Software,2006,17(3):379-387 (in Chinese with English abstract).http://www.jos.org.cn/1000-9825/17/379.htm[doi:10.1360/jos170379].
  • 8Datar M,Gionis A,Indyk P,Motwani R.Maintaining stream statistics over sliding windows.In:Proc.of the 13th Annual ACM-SIAM Symp.on Discrete Algorithms.San Francisco:ACM,2002.635-644.
  • 9Babcock B,Datar M,Motwani R,Callaghan LO.Maintaining variance and k-medians over data stream windows.In:Proc.of the 22nd ACM SIGACT-SIGMOD-SIGART Symp.on Principles of Database Systems.San Diego:ACM,2003.234-243.
  • 10Cao F,Estery M,Qian WN,Zhou AY.Density-Based clustering over an evolving data stream with noise.In:Proc.of the 6th SIAM Int'l Conf.on Data Mining.Bethesda:SIAM,2006.326-337.

共引文献329

同被引文献63

引证文献9

二级引证文献38

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部