Location-awareness Publication Subscription System Based on Topic Model (基于主题模型的位置感知订阅发布系统)  Cited by: 3

Authors: Xian Xuefeng[1], Cui Zhiming[1,2], Zhao Pengpeng[2], Liu Zhaobin[1], Gu Caidong[1]

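Affiliations: [1] Jiangsu Province Support Software Engineering R&D Center for Modern Information Technology Application in Enterprise (江苏省现代企业信息化应用支撑软件工程技术研发中心), Suzhou, Jiangsu 215104; [2] Institute of Intelligent Information Processing and Application, Soochow University (苏州大学智能信息处理及应用研究所), Suzhou, Jiangsu 215006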

Source: Computer Science (《计算机科学》), 2018, No. 3, pp. 165-170 (6 pages)

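Funding: National Natural Science Foundation of China (61672372; 61440053; 61472268; 61472211); Jiangsu Province "Qing Lan Project" for training outstanding young backbone teachers in universities; Jiangsu higher education outstanding science and technology innovation team program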

Abstract: With the rapid development of the mobile Internet and the spread of smartphones, location-aware publish/subscribe systems have attracted wide attention in both industry and academia. Existing systems mainly address the matching of subscriptions against events over massive spatial data, and their matching models rely chiefly on the similarity of spatial keywords; few studies consider semantic relevance. To explore and realize semantic querying and matching in publish/subscribe systems, this paper proposes a location-aware publish/subscribe system based on a topic model. First, the system uses a topic model to map the keywords in the publish/subscribe system to topics. Then, a two-step partition index structure, RP^(TM)-trees, is designed and used to index the topic sets and spatial information of subscriptions. RP^(TM)-trees partitions subscriptions in two steps, first by the number of topics in each topic set and then by key topic, which gives it stronger partitioning power over subscriptions and significantly improves matching efficiency. Finally, experiments on high-rate event streams and a dataset of tens of millions of subscriptions show that the proposed scheme is stable and efficient.
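The abstract gives only an outline of the index, so the following Python sketch is an illustration of the general idea rather than the authors' RP^(TM)-trees. It assumes that a subscription matches an event when the subscription's topic set is a subset of the event's topic set and the event location lies inside the subscription's rectangular region; that the "key topic" is simply the smallest topic id; and that a small hand-written table (`KEYWORD_TOPIC`) can stand in for a trained topic model. The names `Subscription`, `TwoStepIndex`, `map_topics`, and `KEYWORD_TOPIC` are all hypothetical, and nested dictionaries replace the tree structure of the paper.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Subscription:
    sub_id: str
    topics: frozenset  # topic ids produced by the topic mapping step
    region: tuple      # (min_x, min_y, max_x, max_y); rectangular region is an assumption


# Hypothetical keyword-to-topic table standing in for a trained topic model.
KEYWORD_TOPIC = {
    "coffee": 0, "latte": 0, "espresso": 0,
    "shoes": 1, "sneakers": 1,
    "sale": 2, "discount": 2,
}


def map_topics(keywords):
    """Map keywords to a set of topic ids, ignoring unknown keywords."""
    return frozenset(KEYWORD_TOPIC[k] for k in keywords if k in KEYWORD_TOPIC)


class TwoStepIndex:
    """Partition subscriptions by topic count, then by key topic."""

    def __init__(self):
        # topic count -> key topic -> list of subscriptions
        self.partitions = defaultdict(lambda: defaultdict(list))

    def insert(self, sub):
        if not sub.topics:
            return  # no known topic, so nothing to index in this sketch
        key_topic = min(sub.topics)  # assumed rule for choosing the key topic
        self.partitions[len(sub.topics)][key_topic].append(sub)

    def match(self, event_topics, x, y):
        hits = []
        for count, by_key in self.partitions.items():
            # Step 1: a subscription with more topics than the event cannot be a subset.
            if count > len(event_topics):
                continue
            for key_topic, subs in by_key.items():
                # Step 2: skip partitions whose key topic the event lacks.
                if key_topic not in event_topics:
                    continue
                for sub in subs:
                    min_x, min_y, max_x, max_y = sub.region
                    inside = min_x <= x <= max_x and min_y <= y <= max_y
                    if inside and sub.topics <= event_topics:
                        hits.append(sub.sub_id)
        return sorted(hits)


index = TwoStepIndex()
index.insert(Subscription("s1", map_topics(["coffee", "sale"]), (0, 0, 10, 10)))
index.insert(Subscription("s2", map_topics(["shoes"]), (0, 0, 10, 10)))
index.insert(Subscription("s3", map_topics(["coffee"]), (20, 20, 30, 30)))

# The event shares no keyword with s1, but it maps to the same topics.
print(index.match(map_topics(["latte", "discount"]), 5, 5))  # → ['s1']
```

In this sketch the event "latte discount" reaches subscription "coffee sale" through shared topics even though no keyword is identical, which is the kind of semantic matching the abstract contrasts with keyword similarity. The two pruning steps discard whole partitions before any per-subscription check is made.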

Keywords: publish/subscribe; probabilistic topic model; topic mapping; index

Classification: TP391 [Automation and Computer Technology: Computer Application Technology]

 
