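Author: YU Fang (于芳)[1]
Affiliation: [1] Library, Harbin Institute of Technology in Weihai, Weihai 264209, Shandong, China
Source: Microcomputer Applications (《微型电脑应用》), 2021, No. 5, pp. 48-51 (4 pages)
Funding: Humanities and Social Sciences Research Program of Higher Education Institutions of Shandong Province (j11w151).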
Abstract: Traditional literature recommendation is prone to problems such as difficulty in finding documents and users getting lost while browsing. Based on the characteristics of big data, and taking advantage of the strong fault tolerance and real-time computation of the Spark in-memory computing framework, this paper proposes a "mixed association" recommendation algorithm for libraries. Spark RDD supports "string matching" and Spark MLlib supports "similarity matching". The TF-IDF algorithm is used to obtain the TF/IDF value of each segmented word as its weight, and triples of documents and mixed weights are built in Spark. Recommendation lists of different lengths are then built by ranking on the mixed weight, and the performance of the recommendation algorithm is evaluated by accuracy. The results show that the algorithm still achieves very high recommendation accuracy in a very large library system and can meet users' needs when searching for literature of interest.
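The pipeline described in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' Spark implementation, and the abstract does not say how the two scores are combined. Here the string-match score is assumed to be the fraction of query tokens found verbatim in a document, the similarity score is assumed to be the cosine similarity of TF-IDF vectors, and the mixed weight is assumed to be a linear blend controlled by a hypothetical parameter `alpha`. "Accuracy" is read as precision over the top-N list. The function names and the toy documents are likewise illustrative.

```python
import math
from collections import Counter


def tf_idf_vectors(docs):
    """Return ({doc_id: {token: tf*idf}}, idf) for docs = {doc_id: [tokens]}."""
    n = len(docs)
    df = Counter()
    for tokens in docs.values():
        df.update(set(tokens))
    # Smoothed IDF (an assumption): common terms keep a small nonzero weight.
    idf = {t: math.log((1 + n) / (1 + c)) + 1.0 for t, c in df.items()}
    vectors = {}
    for doc_id, tokens in docs.items():
        tf = Counter(tokens)
        vectors[doc_id] = {t: (c / len(tokens)) * idf[t] for t, c in tf.items()}
    return vectors, idf


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def string_match(query_tokens, doc_tokens):
    """Fraction of distinct query tokens that occur verbatim in the document."""
    wanted = set(query_tokens)
    return len(wanted & set(doc_tokens)) / len(wanted) if wanted else 0.0


def recommend(query_tokens, docs, top_n, alpha=0.5):
    """Return the top_n (query, doc_id, mixed_weight) triples, best first."""
    vectors, idf = tf_idf_vectors(docs)
    q_tf = Counter(query_tokens)
    # Query tokens unseen in the corpus have no IDF and are skipped.
    q_vec = {t: (c / len(query_tokens)) * idf[t] for t, c in q_tf.items() if t in idf}
    query = " ".join(query_tokens)
    triples = []
    for doc_id, tokens in docs.items():
        mixed = alpha * string_match(query_tokens, tokens) \
            + (1 - alpha) * cosine(q_vec, vectors[doc_id])
        triples.append((query, doc_id, mixed))
    # Sort by descending weight; break ties by doc_id for determinism.
    triples.sort(key=lambda t: (-t[2], t[1]))
    return triples[:top_n]


def precision_at_n(triples, relevant):
    """Share of recommended documents that are in the relevant set."""
    if not triples:
        return 0.0
    return sum(1 for _, doc_id, _ in triples if doc_id in relevant) / len(triples)


# Toy corpus (hypothetical data, already tokenized).
docs = {
    "d1": ["spark", "library", "recommendation"],
    "d2": ["spark", "streaming", "fault", "tolerance"],
    "d3": ["cooking", "recipes"],
}
top2 = recommend(["spark", "recommendation"], docs, top_n=2)
precision = precision_at_n(top2, relevant={"d1"})
```

Varying `top_n` gives the recommendation lists of different lengths mentioned in the abstract. In a Spark setting, the per-document scoring loop would map naturally onto an RDD transformation, but that mapping is not shown here.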