Efficient Processing Algorithms for Change Queries on Time Series

FAST QUERY METHODS OF FINDING SUBSEQUENCES OF SATISFIED CHANGES ON TIME SERIES


Authors: Shi Wenjun, Wang Peng [1], Wang Wei [1] (School of Computer Science, Fudan University, Shanghai 200438, China)

Affiliation: [1] School of Computer Science, Fudan University, Shanghai 200438, China

Source: Computer Applications and Software (《计算机应用与软件》), 2024, No. 7, pp. 25-33 (9 pages)

Abstract: Changes in the values of a time series often signal that an event has occurred. A change query on time series data finds the subsequences that, within a given length, increase or decrease by at least a given threshold. Such queries can be used to mine events and are of practical importance. Existing methods cannot answer them efficiently. To address this, a method is proposed that segments the series and builds a segment relationship graph. Experiments show that the method still returns results within 100 milliseconds on time series of a million points, and that the storage overhead of the segment relationship graph is small: for data sets with little fluctuation, the graph occupies less than 30% of the original data set size. Two further optimizations are proposed that reduce the storage overhead by about another 50% without substantially affecting query efficiency.
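To make the query definition in the abstract concrete, the following is a minimal brute-force sketch of a change query. It is not the segment relationship graph method the paper proposes; it is only the naive baseline that such a method would aim to outperform. The function name `change_query_naive`, the parameters `max_span` and `threshold`, and the choice to measure the change between the two endpoints of a subsequence are all assumptions made for illustration, since the abstract does not fix these details.

```python
from typing import List, Sequence, Tuple


def change_query_naive(
    series: Sequence[float], max_span: int, threshold: float
) -> List[Tuple[int, int]]:
    """Brute-force change query (illustrative baseline, not the paper's method).

    Returns every index pair (start, end) with 1 <= end - start <= max_span
    whose endpoint difference series[end] - series[start] satisfies:
      - >= threshold when threshold >= 0 (an increase query), or
      - <= threshold when threshold < 0 (a decrease query).
    The endpoint-difference semantics are an assumption of this sketch.
    """
    results: List[Tuple[int, int]] = []
    n = len(series)
    for start in range(n):
        # Only look at end points within max_span steps of start.
        last_end = min(start + max_span, n - 1)
        for end in range(start + 1, last_end + 1):
            delta = series[end] - series[start]
            if threshold >= 0:
                matched = delta >= threshold
            else:
                matched = delta <= threshold
            if matched:
                results.append((start, end))
    return results


if __name__ == "__main__":
    data = [1.0, 2.0, 5.0, 4.0, 0.0]
    # Rises of at least 3 within 2 steps.
    print(change_query_naive(data, max_span=2, threshold=3.0))   # [(0, 2), (1, 2)]
    # Drops of at least 4 within 2 steps.
    print(change_query_naive(data, max_span=2, threshold=-4.0))  # [(2, 4), (3, 4)]
```

This scan costs on the order of n × max_span comparisons per query, which illustrates why a precomputed structure is attractive for series with millions of points.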

Keywords: time series; change query; subsequence query; segmentation

Classification code: TP301.6 [Automation and Computer Technology: Computer System Architecture]

 
