检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:毛冬 张辰 陈又咏 刘永清 焦艳斌 MAO Dong;ZHANG Chen;CHEN Youyong;LIU Yongqing;JIAO Yanbin(Information Communication Branch of State Grid Zhejiang Electric Power Co.,Ltd,Hangzhou Zhejiang 310007;State Grid Information&Telecommunication Group Co.,Ltd,Beijing 102211,China;Fujian Yirong Information Technology Co.,Ltd,Fuzhou Fujian 350001,China)
机构地区:[1]国网浙江省电力有限公司信息通信分公司,浙江杭州310007 [2]国网信息通信产业集团有限公司,北京102211 [3]福建亿榕信息技术有限公司,福建福州350001
出 处:《太赫兹科学与电子信息学报》2024年第10期1154-1160,共7页Journal of Terahertz Science and Electronic Information Technology
基 金:国网科技基金资助项目(5700‒202219187A‒1‒1‒ZN)。
摘 要:针对目前数据迁移方法存在数据迁移耗时长、存储空间最大占用率较高、迁移学习错误率高和被访问数据在线概率低的问题,开展基于国产CPU环境的国产数据库历史数据迁移技术的研究。首先在国产CPU环境中集群部署系统软硬件,提高历史数据在国产数据库之间的迁移速率。其次建立孤立森林模型,将历史数据输入孤立森林模型中展开趋势预测,剔除国产数据库中存在的异常数据,减少待迁移的数据量。最后,构建数据迁移模型,并采用交替优化策略求取模型最优解,完成国产数据库历史数据的迁移。实验结果表明,该方法的数据迁移时间为18 min,储存空间最大占用率在10%~25%之间,ALC指标值为0.78~0.95,被访问数据在线概率能够始终保持在97%以上,证明该方法数据迁移耗时较短,存储空间最大占用率较低,迁移学习的错误率低,访问效率高,具有较好的应用效果。In view of the problems of long data migration,high maximum occupancy rate of storage space,high error rate of transfer learning and low online probability of visited data,the historical data migration technology of domestic database based on domestic Central Processing Unit(CPU)environment is studied.Firstly,the system software and hardware are clustered and deployed in the domestic CPU environment to improve the migration rate of historical data between domestic databases.Secondly,an isolation forest model is established,and the historical data is input into the isolation forest model for trend prediction,thereby eliminating the abnormal data in the domestic database,and reducing the amount of data to be migrated.Finally,a data migration model is constructed,and an alternating optimization strategy is adopted to find the optimal solution of the model,thus completing the migration of historical data in domestic databases.The experimental results show that the data migration time of this method is 18 minutes,and the maximum occupancy rate of storage space is between 10%and 25%,the ALC(Area under the Learning Curve)index value is 0.78~0.95,and the online probability of the accessed data can always be maintained at more than 97%,proving that this method has a short data migration time,a low maximum occupancy rate of storage space,a low error rate of migration learning,and high access efficiency,demonstrating good application effects.
关 键 词:国产CPU 国产数据库 孤立森林模型 交替优化策略 数据迁移技术
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:3.144.147.211