The BBC News Hunter:A Novel Crawler for BBC News  

在线阅读下载全文

作  者:Mingxin Wang Ning Wang Boran Wang Can Tian Yanchun Liang Guozhong Zhao Xiaosong Han 

机构地区:[1]College of Software,Jilin University,Changchun 130012,China [2]Key Laboratory for Symbol Computation and Knowledge Engineering of National Education Ministry,College of Computer Science and Technology,Jilin University,Changchun 130012,China [3]Zhuhai Laboratory of Key Laboratory for Symbol Computation and Knowledge Engineering of Ministry of Education,Zhuhai College of Jilin University,Zhuhai 519041,China [4]Daqing Oilfield Personnel Development Institute,CNPC,Daqing 163000,China

出  处:《国际计算机前沿大会会议论文集》2016年第2期63-64,共2页International Conference of Pioneering Computer Scientists, Engineers and Educators(ICPCSEE)

摘  要:In order to distinguish and extract the topic information from other interferential information on the BBC news website for the study in social computing,the BBC News Hunter was proposed in this paper.The whole system consists of 6 subsystems,respectively named:UI,Control,Download,Analysis,Storage and Log.Numerical experiments show that satisfactory results can be obtained from the BBC news website,whose average accuracy as well as efficiency are acceptable.

关 键 词:BBC CRAWLER NEWS HTML PARSER Multithread 

分 类 号:C5[社会学]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象