基于改进SVM和HMM的文本信息抽取算法被引量：6

TEXT INFORMATION EXTRACTION ALGORITHM BASED ON IMPROVED SVM AND HMM

出　　处：《计算机应用与软件》2015年第11期281-284,292,共5页Computer Applications and Software

摘　　要：传统的文本信息抽取算法通常基于词典、规则或其他模型实现,但由于词典建立困难、规则设定模糊或模型结构单一等原因,信息抽取的准确性通常较低。针对传统的文本信息抽取算法存在的多种不足,提出一种基于混合模型的文本信息抽取算法。该算法融合了多种信息抽取方法,引入支持向量机对信息进行分类,利用S型函数拟合调整模型参数,并采用数据平滑技术优化模型概率空间。实验结果表明,与传统的文本信息抽取算法相比,该算法信息抽取的精确度和召回率明显提高,具有较好的可行性。Traditional text information extraction algorithm is usually implemented based on dictionary, rules or other models. However due to the difficulty in dictionary constructing, unclarity in rules setting and single model structure, etc., the precision of information extraction is usually low. In light of the deficiencies existed in traditional text information extraction algorithm, we proposed a hybrid model- based text information extraction algorithm. The algorithm incorporates a variety of information extraction methods, and introduces SVM to classify the information. At the same time, it uses S function to fit adjustment model parameters, and optimises probability space of model by using data smoothing technique. Experimental result indicated that compared with traditional text information extraction algorithm, this algorithm improved obviously the precision and recall rate of the information extraction and had good feasibility.

关键词：信息抽取支持向量机隐马尔可夫模型机器学习

分类号：TP391[自动化与计算机技术—计算机应用技术]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于改进SVM和HMM的文本信息抽取算法被引量：6

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于改进SVM和HMM的文本信息抽取算法 被引量：6

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于改进SVM和HMM的文本信息抽取算法被引量：6