TKES:A Novel System for Extracting Trendy Keywords from Online News Sites  

在线阅读下载全文

作  者:Tham Vo Phuc Do 

机构地区:[1]Lac Hong University,Dong Nai,71000,Vietnam [2]Thu Dau Mot University,Binh Duong,72000,Vietnam [3]University of Information Technology,VNU-HCM,Ho Chi Minh,7000,Vietnam

出  处:《Journal of the Operations Research Society of China》2022年第4期801-816,共16页中国运筹学会会刊(英文)

基  金:The work of Tham Vo is supported by Lac Hong University,and funded by Thu Dau Mot University(No.DT.20-031);The work of Phuc Do is funded by Vietnam National University,Ho Chi Minh City(No.DS2020-26-01).

摘  要:As the Smart city trend especially artificial intelligence,data science,and the internet of things has attracted lots of attention,many researchers have created various smart applications for improving people’s life quality.As it is very essential to automatically collect and exploit information in the era of industry 4.0,a variety of models have been proposed for storage problem solving and efficient data mining.In this paper,we present our proposed system,Trendy Keyword Extraction System(TKES),which is designed for extracting trendy keywords from text streams.The system also supports storing,analyzing,and visualizing documents coming from text streams.The system first automatically collects daily articles,then it ranks the importance of keywords by calculating keywords’frequency of existence in order to find trendy keywords by using the Burst Detection Algorithm which is proposed in this paper based on the idea of Kleinberg.This method is used for detecting bursts.A burst is defined as a period of time when a keyword is continuously and unusually popular over the text stream and the identification of bursts is known as burst detection procedure.The results from user requests could be displayed visually.Furthermore,we create a method in order to find a trendy keyword set which is defined as a set of keywords that belong to the same burst.This work also describes the datasets used for our experiments,processing speed tests of our two proposed algorithms.

关 键 词:Event detection Burst detection Keyword extraction Kleinberg Burst ranking TKES Text stream 

分 类 号:TP393[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象