检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:盛逍遥 吴友邦 王翔 李丽[1] SHENG Xiaoyao;WU Youbang;WANG Xiang;LI Li(Tianjin Binhai New Area PLRG Information Center,Tianjin 300450,China)
机构地区:[1]天津市滨海新区规划和国土资源地理信息中心,天津300450
出 处:《天津科技》2018年第9期73-76,共4页Tianjin Science & Technology
摘 要:规国房系统是辅助政府和企业实现审批、办公的高效协同办公软件,多数情况在内网部署,使得用户获取行业外部资讯困难,现存系统也存在资讯更新慢、行业信息聚合性弱、海量资讯筛查困难等问题。本文利用网络爬虫技术有效解决内网用户获取外部信息渠道和时效性问题,同时融合互联网思维,根据用户行为数据建立用户兴趣模型,采用热度值倒排的方式解决用户冷启动和内容库数据量大的问题,利用TF-IDF关键字提取技术和余弦相似度算法实现用户兴趣和内容精准匹配,最终实现个性化资讯推荐。The planning land and housing management system is an efficient and cooperative office software used for assist-ing the government and enterprises to approve and work. In most cases,the system is deployed on the Internet,which makes users have difficulty in accessing to outside information. Existing systems also have problems such as updating infor-mation slowly,polymerizing industrial information weakly,and seeking large amounts of information difficultly. The pa-per proposes a method to solve the problem that users obtaining external information and timeliness by the technology of Web crawler. It solves the problem that users are inactive and the data is large by combining Internet thinking and establish-ing users interest model according to users’ behavior data and using the way of heat value inversion. Moreover,the paper achieves accurate matching of user interest and content with the purpose of personalized recommendation of information by the technology of TF-IDF keyword extraction and the algorithm of cosine similarity.
关 键 词:个性化资讯推荐 规国房系统 网络爬虫 TF-IDF 余弦相似度
分 类 号:TP311.5[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.112