Research and Application of Blog Search Engine System Based on Mobile Terminal

Cited by: 2

Authors: Chen Jianxia (陈建峡)[1], Li Zhipeng (李志鹏)[2]

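Affiliations: [1] School of Computer Science, Hubei University of Technology, Wuhan, Hubei 430068; [2] School of Electrical and Electronic Engineering, Hubei University of Technology, Wuhan, Hubei 430068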

Source: Journal of Hubei University of Technology, 2015, No. 2, pp. 89-94 (6 pages)

Funding: National Natural Science Foundation of China (Grant No. 41301371)

Abstract: Blogs are a widely shared carrier of information on the network, and with the rapid development of mobile Internet technologies, publishing them from a variety of mobile terminals has become a mainstream form of online entertainment. However, retrieving RSS-format blog information with traditional search engines suffers from problems such as low efficiency, slow updates, and restrictions on the search terminal. Based on the characteristics of blog information in RSS/XML text format, this paper studies text parsing, Chinese word segmentation, index construction, and search ranking based on the PageRank algorithm. An RSS blog search engine was developed using the Heritrix crawler and the Lucene full-text indexing and retrieval toolkit, and the system was deployed on mobile phones running Android. Experiments show that the system can search blogs on a mobile terminal efficiently and in real time, giving users a better experience than traditional blog retrieval.

Keywords: RSS; web crawler; Lucene; PageRank; Android

Classification: TP311 [Automation and Computer Technology: Computer Software and Theory]
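
The abstract names PageRank as the basis for search ranking but this record does not give the paper's formulation. As a rough illustration only, the following is a minimal sketch of textbook PageRank by power iteration, not the authors' implementation. The function name `pagerank`, the link-graph format, the damping factor of 0.85, and the sample blog graph are all assumptions made for this example.

```python
def pagerank(links, damping=0.85, tol=1e-10, max_iter=200):
    """Textbook PageRank by power iteration (illustrative sketch).

    links: dict mapping a page to the list of pages it links to.
    Returns a dict mapping each page to its rank; ranks sum to 1.
    """
    # Collect every page that appears as a source or as a target.
    pages = set(links)
    for targets in links.values():
        pages.update(targets)
    pages = sorted(pages)
    n = len(pages)

    rank = {p: 1.0 / n for p in pages}
    for _ in range(max_iter):
        # Rank held by pages without out-links is spread evenly over all pages.
        dangling = sum(rank[p] for p in pages if not links.get(p))
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {p: base for p in pages}

        # Each page passes an equal share of its damped rank to its targets.
        for src, targets in links.items():
            if targets:
                share = damping * rank[src] / len(targets)
                for dst in targets:
                    new_rank[dst] += share

        # Stop once the total change between iterations is negligible.
        delta = sum(abs(new_rank[p] - rank[p]) for p in pages)
        rank = new_rank
        if delta < tol:
            break
    return rank


# Hypothetical link graph between four blogs.
blog_links = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "d": ["c"]}
ranks = pagerank(blog_links)
ordered = sorted(ranks, key=ranks.get, reverse=True)
```

In a search engine of the kind the abstract describes, a link score like this would typically be combined with the text relevance score from the index when ordering results; how the paper combines them is not stated in this record.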

 
