检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]华东师范大学计算机科学技术系,上海200241
出 处:《计算机工程》2016年第1期163-167,共5页Computer Engineering
基 金:上海市国际科技合作基金资助项目(13430710100);上海市科委科技创新行动计划基金资助项目(13511506201)
摘 要:传染病预测是时间序列预测中的一个重要应用领域,针对常用传染病预测算法准确率较低的问题,提出一种基于数据分割的最近邻算法,对相同月份的数据进行相似度计算。将传染病数据按照月份进行分割,得到不同年份、相同月份的时间序列数据,运用K-最近邻(KNN)的方法对时间序列数据进行相似度计算,得出最相似的时间序列的预测序列预测值。利用上海市疾病预防控制中心腹泻数据进行实验,结果表明,该方法能够充分考虑到月份对腹泻人数的影响,与改进前的基于KNN的连续时间序列预测算法相比,平均绝对误差值、平均百分比误差值、均方根误差值分别降低38.52,0.07,47.86,与传统的预测方法 ARIMA相比,平均绝对误差、平均百分比误差值、均方根误差值分别降低23.04,0.07,28.12。Infectious disease prediction is an important field in time series prediction,owing to the problem of lower disease forecast precision,a method that calculates the similarity of the data in the same month by K-nearest Neighbor(KNN) algorithm based on data division is proposed.This method divides diarrhea data by month,and gets the same month time series data in different years.It uses KNN method to calculate similarity of time series,and gets the most similar time series.Forecast sequence is obtained by the most similar time series.An experiment is done based on the data of diarrhea in Shanghai,experimental result shows that the method can fully take an impact on the number of diarrhea of season into account,compared with the continuous time series prediction algorithm based on KNN,MAE is less than38.52,MPE is less than 0.07 and RMSE is less than 47.86,and compared with the traditional forecasting methods of ARIMA,MAE is less than 23.04,MPE is less than 0.07 and RMSE is less than 28.12.
关 键 词:预测 传染病预测 K-最近邻算法 时间序列 相似性计算
分 类 号:TP18[自动化与计算机技术—控制理论与控制工程]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.28