检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]中南大学信息科学与工程学院,长沙410083 [2]南京理工大学高维信息智能感知与系统教育部重点实验室,南京210094
出 处:《计算机工程与应用》2018年第6期37-43,共7页Computer Engineering and Applications
基 金:国家自然科学基金(No.61672535);高维信息智能感知与系统教育部重点实验室创新基金(No.JYB201502);湖南省普通高校青年教师培养计划;中南大学中央高校基本科研业务费专项(No.2016zzts351);中南大学创新驱动计划(No.2015CXS010);中南大学升华育英计划专项
摘 要:针对个性化站点较少考虑用户检索意图的问题,提出结合交叉信息熵和词语特征信息的关键词提取方法以及结合余弦相似度和加权海明距离的文本排序方法,旨在不需要用户任何反馈的条件下,为用户推荐更满意的检索结果。通过过滤用户请求个性化站点时的访问地址,获取用户浏览的网页文本内容,从中提取能够表示用户检索意图的关键词集进行重新检索后对检索结果排序,最后将排序后的结果作为推荐模块返回给用户。实验表明,利用该方法获得的查询推荐结果能够更加符合用户检索意图,提供更好的用户体验。Personalized website rarely considers user's search intention in retrieval process. To recommend more satisfactory results without any user feedback in personalized website retrieval, this paper proposes a keyword extraction method combining the cross entropy with word feature information, and a text ranking method assembling the cosine similarity with weighted Hamming distance. Firstly, web page text content is obtained from the requested personalized website by filtering the web page address. Secondly, based on the obtained text content, keywords which can reflect user's retrieval intention are extracted. Thirdly, user's intention vector model is constructed and a re-retrieval process is performed by calling the main search engine. Finally, the similarity between the user's intention model and the re-retrieved records is computed, and the results sorted by the similarity values are returned to user. Experimental results show that the proposed method can reflect the user's query intention and provide a notably convenient user experience.
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:13.58.242.216