基于N-Level VSM在Web信息检索中的研究被引量：3

Study of Web Information Retrieval Based on N-Level Vector Space Model

出　　处：《计算机工程与应用》2006年第19期158-160,179,共4页Computer Engineering and Applications

基　　金：国家自然科学基金资助项目(编号:60373095)

摘　　要：分析了传统向量空间检索模型在Web信息检索中的不足,给出了基于N-Level向量空间模型,这种模型是将一篇文档从逻辑上划分为N个相对独立的文本段,然后按照文本段的内容建立文本特征向量以及文本权值向量,在此基础上可以更加精确地定义特征值向量和相似度的计算方法,使之能比较好地适应文档集合的动态扩充。同时进行了两种模型算法时间的复杂度的比较分析。理论分析和实验结果表明,基于此模型实现的信息检索算法具有较快的查找速度和较高的查准率。Based on the analysis of the deficiency of the traditional vector space retrieval model,the N-level vector model is proposed.The N-level vector model partitions a document into N level text paragraphs.The text feature vectors and the text weight vectors are defined according to the text paragraphs＇ context.The calculation method of the feature vectors and the similarity are defined much more precisely such that the algorithm can adapt the dynamic extension of the document set.Meanwhile the time complexity of the algorithm is analyzed between the models.The theoretic analysis and the experimental results show that the new algorithm has higher precision and faster computation speed.

关键词：向量空间模型查全率查准率相似性时间复杂度

分类号：TP311[自动化与计算机技术—计算机软件与理论]

参考文献：

正在载入数据...

二级参考文献：

正在载入数据...

耦合文献：

正在载入数据...

引证文献：

正在载入数据...

二级引证文献：

正在载入数据...

同被引文献：

正在载入数据...

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于N-Level VSM在Web信息检索中的研究被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

高级检索检索式检索

时间限定

期刊范围

学科限定全选

高级检索 检索式检索

时间限定

期刊范围

学科限定全选

基于N-Level VSM在Web信息检索中的研究 被引量：3

我的收藏

参考文献：

二级参考文献：

耦合文献：

引证文献：

二级引证文献：

同被引文献：

相关期刊文献：

相关的主题

相关的作者对象

相关的机构对象

下载全文

用户登录

高级检索检索式检索

基于N-Level VSM在Web信息检索中的研究被引量：3