一种基于标签和协同过滤的并行推荐算法  被引量:2

A parallel recommendation algorithm based on tagging and collaborative filtering

在线阅读下载全文

作  者:祝晓斌[1] 蔡强[1] 白璐[1] 李海生[1] 

机构地区:[1]北京工商大学计算机与信息工程学院,北京100048

出  处:《高技术通讯》2015年第3期307-312,共6页Chinese High Technology Letters

基  金:国家自然科学基金(61402023);北京市自然科学基金(4132025);北京市教师队伍建设青年英才计划(YETP1448)资助项目

摘  要:针对基于用户打分的传统协同过滤推荐算法存在准确率较低以及计算延时的问题,提出了一种基于标签与协同过滤的并行混合推荐算法。该算法通过计算标签的词频-逆文档频率(TF-IDF)值降低流行标签的权重,根据用户的历史行为预测用户对其他资源的偏好值,最后依据预测偏好值排序产生Top-N推荐结果。对该算法的计算效率与复杂度进行了理论分析,并且通过并行编程模型MapReduce使其得到了实现,最后在实验中进行了它与Apache软件基金会项目Mahout的协同过滤算法的对比分析。实验结果表明该算法有较高的准确性,能有效地提高推荐效率。The study focused attention on the problems of lower precision and computing latency of traditional collabora- tive filtering recommendation algorithms, and proposed a parallel hybrid recommendation algorithm based on tagging and collaborative filtering. The algorithm reduces the weight of prevalent tags by calculating the TF-IDF ( time fre- quency-inverse document frequency) value of tags on predicts user preference based on the user historical behav- iors, and finally recommends the Top-N of the predictions. The algorithm' s computation efficiency and complexity were theoretically analyzed, and it was implemented by using the parallel programing model of MapRedce. The ana- lytical comparison of the algorithm with the collaborative filtering algorithm applied to the Mahout, an item of the A- pache Software Foundation, was conducted, and the result showed its higher accuracy, so it can effectively improve the recommendation efficiency.

关 键 词:协同过滤 推荐 标签 TF-IDF MAPREDUCE 

分 类 号:TP391.3[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象