Automatic Query Method of Distributed Database Based on Repeatability of Sample Data

Cited by: 2


Authors: XU Wei; HU Ting (Suqian City Tobacco Monopoly Administration (Company), Suqian 223800, China)

Affiliation: [1] Suqian City Tobacco Monopoly Administration (Company), Suqian, Jiangsu 223800

Source: Techniques of Automation and Applications (《自动化技术与应用》), 2023, No. 6, pp. 87-90 (4 pages)

Abstract: To improve the efficiency and accuracy of queries over distributed databases, an automatic query method that accounts for the repeatability of sample data is designed. The main features of the data are extracted, the trade-off function of the main information equation is determined, and repeated sample information is consolidated. The data in the database are segmented, their characteristics are identified, and adaptive decomposition yields the clustering centers and objective function for automatic querying. The ICTCLAS word segmentation system is used to compute the frequency of keywords in the text, and the grey wolf optimization algorithm is used to find the optimal function set and obtain the optimal parameter set. Shingle detection is then applied to detect and mark the matching degree of sample information, completing the final data query. Experimental results show that the method achieves a query accuracy above 90% with an average time below 35 s, and can be widely applied.
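The abstract does not give the details of the Shingle step, so the following is only a minimal sketch of how shingle-based matching is commonly done: each record is broken into overlapping runs of w tokens, and two records are compared by the Jaccard resemblance of their shingle sets. Whitespace tokenization (standing in for ICTCLAS segmentation), the shingle width `w=3`, the threshold `0.8`, and the function names `shingles`, `resemblance`, and `mark_duplicates` are all assumptions made for illustration, not values or names taken from the paper.

```python
from __future__ import annotations


def shingles(text: str, w: int = 3) -> set[str]:
    """Return the set of contiguous w-token shingles of a text.

    Tokens are split on whitespace here (an assumption); the paper
    uses ICTCLAS for word segmentation.
    """
    tokens = text.split()
    if not tokens:
        return set()
    if len(tokens) < w:
        # Text shorter than one shingle: treat the whole text as one shingle.
        return {" ".join(tokens)}
    return {" ".join(tokens[i:i + w]) for i in range(len(tokens) - w + 1)}


def resemblance(a: str, b: str, w: int = 3) -> float:
    """Jaccard resemblance of the two shingle sets, in [0, 1]."""
    sa, sb = shingles(a, w), shingles(b, w)
    if not sa and not sb:
        return 1.0  # two empty texts are considered identical
    return len(sa & sb) / len(sa | sb)


def mark_duplicates(
    records: list[str], threshold: float = 0.8, w: int = 3
) -> list[tuple[int, int, float]]:
    """Return (i, j, score) for every pair whose resemblance meets the threshold.

    The threshold is a hypothetical value chosen for illustration.
    """
    marked = []
    for i in range(len(records)):
        for j in range(i + 1, len(records)):
            score = resemblance(records[i], records[j], w)
            if score >= threshold:
                marked.append((i, j, score))
    return marked
```

The pairwise loop is quadratic in the number of records, which is acceptable for a sketch but would normally be replaced by a hashing or sampling scheme on a large distributed database.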

Keywords: distributed database; fuzzy cluster analysis; F-Measure method

Classification code: TP274 [Automation and Computer Technology: Detection Technology and Automatic Equipment]

 
