Affiliation: [1] Department of Computer Science and Technology, Xi'an Jiaotong University, Xi'an 710049
Source: Journal of Computer Research and Development (《计算机研究与发展》), 2005, No. 3, pp. 478-485 (8 pages)
Funding: National Natural Science Foundation of China (60373105); "Tenth Five-Year Plan" Major Science and Technology Project Fund (2001BA101A01)
Abstract: Concept semantic networks were proposed to address the vocabulary mismatch problem in information retrieval and are one of the basic means of improving retrieval effectiveness. With natural-language-oriented online question answering as the application background, this paper proposes an algorithm for automatically generating a concept semantic network from a semi-structured corpus. Based on an analysis of how the corpus is composed, a different template is used to extract documents for each type of concept relation, and a different window unit is set for computing the relatedness between concepts. Threshold filtering and role switching then yield the various types of concept relations, after which the semantic network is optimized and adjusted. Experimental results show that the concept semantic network obtained by the algorithm effectively improves question retrieval.
English abstract: Recent literature in computational terminology has shown increasing interest in identifying semantic relations between concepts, which are important for large-scale natural language applications such as question answering (QA), information retrieval (IR), and machine translation (MT). Taking a natural-language-oriented Web answer system, NL-WAS, as the application background, a novel approach is proposed for generating a concept semantic network from a semi-structured corpus. According to the characteristics of the corpus, suitable document extraction templates are adopted for four kinds of relations between concepts: synonymy, hyponymy, hypernymy, and parataxis. Different window sizes are used to calculate the degree of relatedness between concepts; the relations of each kind are then obtained by applying thresholds chosen from experimental results and by switching roles. Finally, the concept semantic network is optimized using a set of rules. The proposed algorithm has been implemented and applied in NL-WAS, and the results show that the concept semantic network effectively improves question search in that system.
Classification: TP391 [Automation and Computer Technology: Computer Application Technology]