基于标记树的WEB页面净化技术研究被引量：3

Web Page Distillation Based on the Tag Tree

机构地区：[1]重庆教育学院信息中心,重庆400067 [2]西南大学计算机与信息科学学院,重庆400715

出　　处：《西南师范大学学报（自然科学版）》2006年第5期128-131,共4页Journal of Southwest China Normal University(Natural Science Edition)

摘　　要：根据Web页面标记建立标记树,通过分析,保留有用信息的标记子树,达到获取页面主要内容,净化页面的效果.It＇s the key problem that how to get the information people need of the internet through the computer. An arithmetic is put forward to solve this problem. At first a tag tree of the web page is constructed, then the authors divide the web page into several parts as Main part, Site flag, Navigation bar, Communication part, Copyrights, and the tag tree tells the relationship of these parts. The authors can parse the tag tree, get the child tag tree that only tells the Main part. So the main part is obtained and the web page is distilled.

关键词：标记树标记树模式页面净化

分类号：TP393[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于标记树的WEB页面净化技术研究被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于标记树的WEB页面净化技术研究 被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于标记树的WEB页面净化技术研究被引量：3