基于PyQt的全文搜索引擎平台开发  被引量:2

Development of Full-Text Search Engine Platform Based on PyQt

在线阅读下载全文

作  者:张文超 胡玉兰[1] ZHANG Wen-chao;HU Yu-lan(Institute of Information Science and Technology,Shengyang Ligong University,Shengyang 110159,China)

机构地区:[1]沈阳理工大学信息科学与工程学院,辽宁沈阳110159

出  处:《软件导刊》2018年第9期132-135,共4页Software Guide

基  金:国家自然科学基金项目(61373089;61672360)

摘  要:网络信息数量的日益增加,对人们从中获取有效信息的能力提出了更高要求。为了更好地响应用户需求,提高信息处理效率并降低人力成本,基于PyQt进行全文搜索引擎平台开发。采用模块化思想设计网络信息采集功能,然后将获取的信息经数据处理后建立索引库,采用PageRank算法对查询响应结果进行排序,实现检索器功能,并通过用户的点击决策,利用神经网络对排序结果进行二次修正。最后,在界面输入查询字符串后,便可快速得到已排序的链接响应,从而能更好地反映用户对检索结果的感兴趣程度,并提供个性化服务。With the increasing of network information,people also have higher requirements on their ability to obtain effective information.In order to better respond to users'needs,improve the efficiency of information processing and reduce human resources,the function of network information collection is designed with the idea of modularizationfocusing on the hot technology of full-text search engine,and the index database after the data is established and processed,then we use PageRank algorithm to implement the retriever function in the query response,and the ranking results are secondarily corrected by using the neural network through the user's click decision.At last, after the completion of the development of full-text search engine system platform by using of PyQt, the query string is inputted in the interface and the sorted link response can be quickly obtained,which can better reflect the users' interest in the search results and provide personalized service.

关 键 词:全文搜索引擎 网络信息采集 PAGERANK PyQt 

分 类 号:TP319[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象