本体驱动的文本虚拟样本构造方法研究被引量：4

Research on Ontology-driven Text Virtual Sample Constructing

出　　处：《计算机科学》2008年第3期142-145,共4页Computer Science

基　　金：国家自然科学基金资助项目(60675015)

摘　　要：构造虚拟样本能够为机器学习中的训练集融入先验知识,从而改善标注瓶颈问题。提出了一种本体驱动的文本虚拟样本构造方法。在确保类别不变性的前提下,该方法依据领域相关本体所明晰表达的领域知识,基于本体树的点、边、子树,从同义、父子、语义同构的多个词义关系角度实现了文本虚拟样本的构造。初步实验表明,该方法与原分类及类似方法相比具有更好的分类精度和推广能力。Constructing virtual examples can incorporate prior knowledge into training set in machine learning, so as to alleviate the labeling bottleneck. An Ontology-driven scheme to construct text virtual sample is proposed. Under the precondition of label invariability, the proposal constructs virtual samples according to the domain knowledge explicitly formalized by domain-specific Ontology. Based on the different Ontology tree structures, namely nodes, edges, and sub-trees, various lexical-semantic relations, including synonymy, paternity, and semantic isomorphs, are applied into text virtual example constructing. The primary experimental results show the scheme outperforms original text catego- rizations and other similar ones in precision and generalization ability.

关键词：虚拟样本文本分类本体本体树领域知识

分类号：TP391.41[自动化与计算机技术—计算机应用技术] O157[自动化与计算机技术—计算机科学与技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

本体驱动的文本虚拟样本构造方法研究被引量：4

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

本体驱动的文本虚拟样本构造方法研究 被引量：4

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

本体驱动的文本虚拟样本构造方法研究被引量：4