Location and Trajectory Identification from Microblogs  


Authors: Na Ta [1], Guo-Liang Li [2], Jun Hu [3], Jian-Hua Feng [2]

Affiliations: [1] School of Journalism and Communication, Renmin University of China, Beijing 100872, China; [2] Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China; [3] Airbnb Network (Beijing) Inc., Beijing 100020, China

Source: Journal of Computer Science & Technology (计算机科学技术学报, English edition), 2019, Issue 4, pp. 727-746 (20 pages)

Funding: The National Natural Science Foundation of China under Grant Nos. 61802414, 61632016, 61521002 and 61661166012; the National Basic Research 973 Program of China under Grant No. 2015CB358700; the Social Science Foundation of Beijing under Grant No. 18XCC011; the Humanities and Social Sciences Base Foundation of Ministry of Education of China under Grant No. 16JJD860008; Huawei; and TAL (Tomorrow Advancing Life) education.

Abstract: The rapid development of social networks has resulted in a proliferation of user-generated content (UGC), which can benefit many applications. In this paper, we study the problem of identifying a user's locations from microblogs, to facilitate effective location-based advertisement and recommendation. Since the location information in a single microblog is incomplete, we cannot get an accurate location from that microblog alone. As such, we propose a global location identification method, Glitter. Glitter combines multiple microblogs of a user and utilizes them to identify the user's locations. Glitter not only improves the quality of identifying a user's location but also supplements the location of a microblog so as to obtain an accurate location for that microblog. To facilitate location identification, Glitter organizes points of interest (POIs) into a tree structure where leaf nodes are POIs and non-leaf nodes are segments of POIs, e.g., countries, cities, and streets. Using the tree structure, Glitter first extracts candidate locations from each microblog of a user; these candidates correspond to tree nodes. Then Glitter aggregates these candidate locations and identifies the top-κ locations of the user. Using the identified top-κ user locations, Glitter refines the candidate locations and computes the top-κ locations of each microblog. To achieve high recall, we enable fuzzy matching between locations and microblogs. We propose an incremental algorithm to support dynamic updates of microblogs. We also study how to identify users' trajectories based on the extracted locations, and we propose an effective algorithm to extract high-quality trajectories. Experimental results on real-world datasets show that our method achieves high quality and good performance, and scales well.
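Illustrative sketch (not from the paper): the abstract's pipeline of a POI tree, per-microblog candidate extraction, aggregation into top-κ user locations, and refinement can be pictured with the toy Python code below. Everything in it is an assumption made for illustration: the class `PoiTree`, the functions `top_k_user_locations` and `refine`, the place names, and the scoring rule (count of microblogs mentioning a node). It uses exact substring matching only, so it does not reproduce Glitter's fuzzy matching, ranking function, or incremental algorithm.

```python
from collections import Counter


class PoiTree:
    """Toy POI hierarchy. A node is a tuple path, e.g. (country, city, street, POI)."""

    def __init__(self):
        # Lowercased segment name -> set of tree nodes ending in that segment.
        self.by_name = {}

    def add_poi(self, segments):
        # Register the leaf POI and every non-leaf prefix (country, city, street).
        for depth in range(1, len(segments) + 1):
            node = tuple(segments[:depth])
            self.by_name.setdefault(node[-1].lower(), set()).add(node)

    def candidates(self, text):
        # Assumption: exact, case-insensitive substring match (no fuzzy matching).
        lowered = text.lower()
        found = set()
        for name, nodes in self.by_name.items():
            if name in lowered:
                found |= nodes
        return found


def top_k_user_locations(tree, microblogs, k):
    # Hypothetical score: number of the user's microblogs that mention the node.
    scores = Counter()
    for text in microblogs:
        scores.update(tree.candidates(text))
    # Sort by descending score, then by node path, so ties are deterministic.
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [node for node, _ in ranked[:k]]


def refine(tree, text, user_locations):
    # Keep candidates lying at or below one of the user's top locations;
    # fall back to all candidates when none is consistent.
    cands = tree.candidates(text)
    consistent = {
        c for c in cands if any(c[: len(u)] == u for u in user_locations)
    }
    return sorted(consistent or cands)


tree = PoiTree()
tree.add_poi(["China", "Beijing", "Zhongguancun Street", "Cafe A"])
tree.add_poi(["China", "Shanghai", "Nanjing Road", "Cafe A"])

blogs = ["Morning run in Beijing", "Back in Beijing again", "Coffee at Cafe A"]
user_top = top_k_user_locations(tree, blogs, k=1)
print(user_top)
print(refine(tree, blogs[2], user_top))
```

In this toy run the third microblog alone is ambiguous, because the made-up "Cafe A" exists under two cities. The user's other microblogs point to Beijing, so the refinement step keeps only the Beijing candidate. This mirrors the abstract's point that a single microblog carries incomplete location information.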

Keywords: location identification; microblog; trajectory identification

Classification: TP [Automation and Computer Technology]

 
