Graphical-text abstract algorithm based on building OWL ontology (基于OWL本体构建的网页图文摘要算法)

Cited by: 1

Source: Computer Engineering and Design (《计算机工程与设计》), 2014, No. 5, pp. 1833-1839 (7 pages)

Funding: Fundamental Research Funds for the Central Universities (XDJK2013C005)

Abstract: To improve the practicality and accuracy of webpage main-text extraction, an image selection algorithm for webpages is proposed on top of existing main-text extraction algorithms, and the two are integrated into a new graphical-text abstract method for webpages. A model of the graphical-text abstract method is constructed and the image selection algorithm is designed. The algorithm uses the Web Ontology Language (OWL) to build a page ontology, extracts the semantic properties of images and of the other elements of the webpage, and takes full account of the images' align attributes. These impact factors are then optimized so that a representative image can be selected from the page and combined with the extracted main text. Experimental results show that the method effectively enriches and improves existing webpage main-text extraction.

Keywords: graphical-text abstract; ontology modeling language; semantic properties; image selection; page ontology

Classification code: TP391 [Automation and Computer Technology: Computer Application Technology]
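
The abstract describes image selection only at a high level, so the following is a minimal sketch of the general idea and not the paper's method. It replaces the OWL page ontology with a plain keyword-overlap heuristic, and the names `ALIGN_WEIGHT`, `score_image`, and `select_image`, as well as all numeric weights, are assumptions made for illustration. Each `<img>` element is scored from its `align` attribute and from how many page keywords appear in its `alt` text, and the highest-scoring image is returned.

```python
from html.parser import HTMLParser

# Hypothetical weights for the align attribute; these values are
# illustrative assumptions and do not come from the paper.
ALIGN_WEIGHT = {"center": 1.0, "middle": 1.0, "left": 0.6, "right": 0.6}
DEFAULT_ALIGN_WEIGHT = 0.3


class ImageCollector(HTMLParser):
    """Collects the attributes of every <img> tag in a page."""

    def __init__(self):
        super().__init__()
        self.images = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            # Attributes without a value arrive as None; normalize to "".
            self.images.append({name: (value or "") for name, value in attrs})


def score_image(img, keywords):
    """Combine an align-based weight with alt-text keyword overlap."""
    align = ALIGN_WEIGHT.get(img.get("align", "").lower(), DEFAULT_ALIGN_WEIGHT)
    alt_words = set(img.get("alt", "").lower().split())
    overlap = len(alt_words & keywords) / len(keywords) if keywords else 0.0
    # Equal weighting of the two factors is an arbitrary choice.
    return 0.5 * align + 0.5 * overlap


def select_image(html, keywords):
    """Return the src of the highest-scoring image, or None if there is none."""
    collector = ImageCollector()
    collector.feed(html)
    if not collector.images:
        return None
    wanted = {word.lower() for word in keywords}
    best = max(collector.images, key=lambda img: score_image(img, wanted))
    return best.get("src")
```

A real implementation along the lines of the abstract would derive the semantic factors from the page ontology and tune the weights by optimization; here they are fixed constants so that the scoring step stays easy to follow.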
