检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:许伟 胡婷 XU Wei;HU Ting(Suqian City Tobacco Monopoly Administration(Company),Suqian 223800 China)
机构地区:[1]宿迁市烟草专卖局(公司),江苏宿迁223800
出 处:《自动化技术与应用》2023年第6期87-90,共4页Techniques of Automation and Applications
摘 要:为提高分布式数据库数据的查询效率和准确率,设计一种考虑样本数据重复性的分布式数据库自动化查询方法。提取数据信息的主要特征,确定主要信息方程的权衡函数,整合重复样本信息;分段数据库内的数据,明确数据特征,自适应分解得出自动化查询聚类中心和目标函数;采用ICTCLAS分词系统计算关键词在文本中出现频率,根据灰狼优化算法求得最优函数集获取最优参数集;结合Shingle检测并标记样本信息匹配度,完成最终数据查询。实验结果表明方法查询准确率高于90%,平均耗时低于35 s,可被广泛推广使用。In order to improve the query efficiency and accuracy of distributed database data,an automatic query method of distributed database based on repeatability of sample data is designed.It extracts the main features of data information,determines the trade-off function of the main information equation,and integrates the repeated sample information;It segments the data in the database,clarifies the data characteristics,and adaptively decomposes to obtain the automatic query clustering center and objective function;The ICTCLAS word segmentation system is used to calculate the frequency of keywords in the text,and the optimal function set and the optimal parameter set are obtained according to the gray wolf optimization algorithm;Combined with shingle,detect and mark the matching degree of sample information to complete the final data query.The experimental results show that the query accuracy of the proposed method is higher than 90% and the average time is less than 35 s,which can be widely used.
关 键 词:分布式数据库 模糊聚类分析 F-Measure方法
分 类 号:TP274[自动化与计算机技术—检测技术与自动化装置]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.224.64.24