检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:唐朝霞[1]
机构地区:[1]淮阴工学院计算机工程学院,江苏淮安223003
出 处:《贵州大学学报(自然科学版)》2011年第5期80-83,共4页Journal of Guizhou University:Natural Sciences
基 金:江苏省高校自然科学基金(06KJD520024)
摘 要:随着互联网的迅速发展和Web2.0概念的提出,问答系统以直接返回给用户精确的答案而逐渐成为一种新的信息检索技术。由于问句都是自然语言的形式,涉及到对问句的语义理解及相似度的判断。本文提出了一种基于问句的表层和语义相似度计算方法,通过聚类去除冗余信息,再通过熵的特征计算权值,最后融合多种特征计算问句相似度,进行答案抽取。实验证明,这种方法能够有效地提高答案抽取的精度和效率。With the rapid development and the appearance of concept of Web2.0, because of the exact answer directly, QA system has become a new information retrieval technology. As the questions are in the form of natural language ,Questions relate to the semantics understanding and similarity judgments. In this paper, based on question's surface and semantic similarity calculation method, removal of redundant information was carried out by clustering,the weight of features was calculated by entropy,finally question's similarity was calculated by integration of multiple features for answer extraction. Experiments show that this method can effectively improve the answer extraction accuracy and efficiency.
分 类 号:TP311[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:18.117.90.244