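Authors: Yu Wei (余伟)[1], Wang Mingwen (王明文)[1], Wan Jianyi (万剑怡)[1], Zuo Jiali (左家莉)[2]
Affiliations: [1] School of Computer Information Engineering, Jiangxi Normal University, Nanchang 330027; [2] School of Elementary Education, Jiangxi Normal University, Nanchang 330027
Source: Acta Scientiarum Naturalium Universitatis Pekinensis (《北京大学学报(自然科学版)》), 2013, No. 2, pp. 203-212 (10 pages)
Funding: Supported by the National Natural Science Foundation of China (60963014; 61163006)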
Abstract: Positional language models do not consider the semantic relationships between words at different positions. To address this, the authors propose a positional language model with semantic information. First, a Gaussian kernel function is used to measure the positional relationship between words. Second, a technique called "smoothed mutual information" is proposed to measure the semantic relationship between words, and the authors prove that it effectively solves the problem that, for a large number of word pairs, the transition probability cannot be computed from mutual information alone. The authors also prove that the positional language model is a special case of the positional language model with semantic information. Finally, the new model is applied to information retrieval, yielding a retrieval model based on it. Experimental results show that this retrieval model outperforms the retrieval model based on positional language models.
Classification: TP391.1 [Automation and Computer Technology - Computer Application Technology]
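The abstract mentions that a Gaussian kernel measures the positional relationship between words but gives no formula. The sketch below illustrates one common formulation of a positional language model, in which each occurrence of a word at position j contributes a count to position i weighted by exp(-(i - j)^2 / (2 * sigma^2)). This formulation, the function names `gaussian_kernel` and `positional_lm`, and the default `sigma` are assumptions made for illustration and are not taken from the paper. The smoothed mutual information step is not sketched, since the abstract does not define it.

```python
import math
from collections import defaultdict


def gaussian_kernel(i, j, sigma):
    """Weight that a word at position j contributes to position i (assumed form)."""
    return math.exp(-((i - j) ** 2) / (2.0 * sigma ** 2))


def positional_lm(tokens, position, sigma=2.0):
    """Estimate a word distribution at `position` from kernel-propagated counts.

    Hypothetical helper: every token occurrence adds its kernel weight to the
    count of its word, and the counts are then normalised to sum to 1.
    """
    counts = defaultdict(float)
    for j, word in enumerate(tokens):
        counts[word] += gaussian_kernel(position, j, sigma)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


tokens = ["language", "model", "for", "information", "retrieval"]
plm = positional_lm(tokens, position=0, sigma=1.0)
```

With this toy input, words near position 0 receive more probability mass than words farther away, which is the positional effect the abstract refers to.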