摘要
时间序列数据的高维性是影响数据查询代价的主要因素,降维技术是时间序列数据查询优化的有效手段。原有降维技术近似体积的无界性,造成索引阶段不能充分实现点过滤,影响了查询效率。首先,利用近似体积有界的非线性降维技术解决了点过滤问题。然后,引入提前终止技术,减少了原始序列距离计算阶段的冗余计算。在此基础上,提出排序的子序列相似查询算法。实验结果表明,排序子序列相似查询方法具有较高的效率。
High dimensionality of time series data cause high query cost. Dimensionality reduction on the data is an effective way ofqueryoptimization. The approximation volume ofproposed dimensionality reduction is unbounded, so the algorithm can not effectively filter the point in the index. First, a new non-linear dimensionality reduction is used to resolve the problem. Second, Early abandon technique is introduced to further improve efficiency and reduce redundant computation. Then on these basis, ranked subsequence similarity search algorithm is proposed. The experimental results show that ranked algorithm has higher efficiency.
出处
《燕山大学学报》
CAS
2009年第5期417-421,共5页
Journal of Yanshan University
基金
国家自然科学基金资助项目(60773100)
国家"十一五"科技支撑计划资助项目(2006BAK05BO2)
河北省自然科学基金资助项目(F2009000475)
关键词
排序
非线性降维
提前终止技术
紧密下限
rank
non-linear dimensionality reduction
early abandon
tight bounds