基于关联规则的渔业信息推荐系统设计与实现  被引量:5

Design and implementation of fishery information recommendation system based on association rules

在线阅读下载全文

作  者:王立华[1] 肖慧[1,2] 徐硕[1] 刘树[1] 杜卫利[1] 黄其泉[1] 王宇[1] 

机构地区:[1]中国水产科学研究院渔业工程研究所,中国水产科学研究院渔业信息工程研究中心,中国水产科学研究院渔业信息工程功能实验室,北京100141 [2]上海海洋大学信息学院,上海201306

出  处:《农业工程学报》2013年第7期124-130,共7页Transactions of the Chinese Society of Agricultural Engineering

基  金:国家基础条件平台建设项目渔业科学数据平台建设项目(2007DKA31800-03)

摘  要:为了快速便捷的获取渔业科学数据信息,基于Web日志挖掘技术对渔业科学数据共享平台用户频繁访问模式进行分析,用于发现用户访问规则,进行信息推荐服务。对分析挖掘中涉及的算法进行了分析与改进,提出了IASR(IP agent session referrer)用户识别算法和关联规则Apriori算法的改进算法,试验研究表明,IASR算法将用户识别准确性提高了13%,速度是通用算法的2倍。当事务数目大于500时,改进算法的执行效率远远优于Apriori算法,速度提高6倍以上。在此基础上,讨论了系统的关键设计与实现方法,开发了渔业信息推荐系统。系统采用JAVA、AJAX开发,数据库SQL Server2005,操作系统为Windows XP。应用结果表明,系统可使用户方便快捷地获取自己感兴趣的渔业数据信息,从而提高信息服务的质量。In order to obtain fishery scientific data quickly and easily, this article analyzed the user’s interests in visiting fishery scientific data platforms based on data mining, mined rules and gave information recommendations according to the rules. The association rule technique, one of the commonly used algorithms for mining data analysis, attempts to find some relation of transaction items to the mass data. Based on the design of the Fisheries Information Recommendation System, the association rules mining technique was used to access the web log data of the Fishery Scientific Data Platform to find the user access pattern by data processing, pattern discovery and pattern recognition analysis procedures. Researchers analyzed and improved the algorithms involved in the mining analysis, proposed the IASR (IP Agent Session Referrer) algorithm for user identification, and introduced the technology of rewrite URL, IASR used four key informations: IP, Agent, Session and Referrer, and added session identification mechanism to recognized users in order to improve the accuracy of user identification. In the light of algorithm research on association rules, the paper proposed an improved Apriori algorithm for solving the problems of a large number of candidate set data generated in the connected computing in the process, by judging the effectiveness of the joint operation in advance to reduce the item sets joint operations, the number of the candidate item sets and iterative operations for increasing the efficiency of computation. Comparative testing was made including IASR, Apriori algorithm and improved Apriori algorithm by using the web logging data of the Fisheries Science Data Platform as the experimental object. The result of experimental research showed that IASR algorithm can improve the accuracy of user identification by 13%, and that the speed is twice as fast as that of the traditional one. The improved Apriori algorithm can greatly improve the calculation efficiency. When the transaction number is greate

关 键 词:渔业 信息系统 数据挖掘 推荐 关联规则 个性化服务 

分 类 号:TP311[自动化与计算机技术—计算机软件与理论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象