检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]兰州文理学院电子与信息学院,甘肃兰州730000 [2]兰州理工大学计算机与通信学院,甘肃兰州730050
出 处:《计算机应用与软件》2014年第3期7-10,共4页Computer Applications and Software
基 金:兰州文理学院科研能力提升计划项目(2013YBTS03)
摘 要:个性化信息检索系统的实时性关键在于如何动态更新用户兴趣模型。针对原有方法的不足,改进用户兴趣模型的描述与更新方式。首先根据网页文档的特征改进TF-IDF(Term Frequency-Inverse Document Frequency)算法,以此作为用户兴趣特征词的权重,同时通过引入领域本体,将用户兴趣特征项进行语义扩展,并根据用户浏览行为,改进其用户兴趣主题计算方式,并在此基础上提出用户兴趣模型的更新与遗忘机制。实验对比结果表明,该方法能够捕捉用户兴趣的变化,进一步提高个性化信息检索的准确度与用户满意度。It is essential for the real-timeliness of personalized information retrieval system how to dynamically update the user interest model. Aiming at the deficiency of the existing method, the paper improves the user interest model description and updating mode. Firstly, ac- cording to the characteristics of the web document, TF-IDF algorithm is improved, whose results are taken as user interest feature words' weight;meanwhile, by introducing domain ontology, the user interest feature items are semantically extended;then, according to user browsing behavior, the user interest theme calculation method is improved, based on which the updating and forgetting mechanism of user interest mod- el is proposed. Results from comparative experiments indicate that the method can capture changes of the user interest and further improve the precision of personalized information retrieval and user satisfaction.
关 键 词:个性化信息检索 本体 TF—IDF用户兴趣模型
分 类 号:TP391[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.198