检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]北京大学经济学院,北京100871 [2]大连理工大学人文社会科学学院,大连116024 [3]大连理工大学电子与信息工程学院,大连116024
出 处:《情报学报》2008年第3期418-424,共7页Journal of the China Society for Scientific and Technical Information
基 金:本文得到国家自然科学基金项目(编号:60373095,60673039)的资助.
摘 要:已有的经济关系研究大都采用实证的或单纯的计量学的方法来实现的。本文则针对非结构化的文本特点,采用信息抽取和文本挖掘方法挖掘用户感兴趣的区域经济关系是具有十分重大应用价值的研究课题。本文在探讨了基于实体关系的文本挖掘机制的基础上,对31个省、市、自治区的区域经济关系进行了分析。运用文本挖掘技术对经济关系的挖掘包括两种方式:一是基于属性的经济关系挖掘,利用信息抽取获取各个实体属性,采用聚类方法分析经济实体关系;二是基于相互引用的经济关系挖掘,首先构造经济实体关系分类词典,提出了实体关系标注算法,利用信息抽取获得实体之间的引用情况,然后构造关系有向图,从中挖掘区域经济之间的关系。研究表明,运用文本挖掘技术,既可以对各个区域经济发展状况进行分析和评价,也可以发现特定区域经济之间的内在关系。Text mining plays an important role in knowledge acquisition, and it is valuable issue to apply information extraction and text mining to mine relations among entities from non-structure texts in the internet. In this paper, the approach of text mining for relations between named entities is presented, and it includes two mining schemes. One is based on the attributes of entities. It applies the approach of information extraction to collect their attributes, and then adopt the clustering algorithm to analyze the relations between named entities. The other is based on the reference between entities. It constructs the relation dictionary and presents the algorithm of annotating relations. It set up the vector-graph based on the references between entities, and it derives several interesting information patterns from the vector-graph. As a result, it shows a better effect on mining the relationship between named entities from a specific domain.
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.104