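Affiliation: [1] College of Computer and Communication Engineering, China University of Petroleum, Dongying, Shandong 257061
Source: Computer Applications and Software (《计算机应用与软件》), 2009, No. 6, pp. 175-177 (3 pages)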
Abstract: To improve the retrieval precision and efficiency of a Lucene-based Chinese retrieval system, the authors analyze the structure of Lucene and add two modules to the system: a Chinese word segmentation module and an index-document preprocessing module. The paper describes the experimental method and procedure and analyzes both the principle behind the improvement and the experimental data. The results indicate that adding the Chinese word segmentation module, and having the index preprocessing module replace each document with a specific number of extracted feature words, effectively improves the efficiency and precision of the Lucene retrieval system and strengthens its performance on Chinese text.
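The two added modules can be illustrated with a minimal sketch. The abstract does not say which segmentation algorithm or which feature-word criterion the authors used, so everything below is an assumption: forward maximum matching stands in for the segmentation module, raw term frequency stands in for feature-word selection, and the function names, the toy dictionary, and the parameter `k` are hypothetical. It does not use Lucene itself.

```python
from collections import Counter


def forward_max_match(text, vocab, max_len=4):
    """Greedy forward maximum matching (assumed stand-in for the segmenter).

    At each position, take the longest dictionary word of up to max_len
    characters; fall back to a single character when nothing matches.
    """
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in vocab:
                tokens.append(piece)
                i += size
                break
    return tokens


def top_feature_words(tokens, k, stopwords=frozenset()):
    """Keep the k most frequent multi-character tokens (assumed criterion).

    The returned list is what would be indexed in place of the full document.
    """
    counts = Counter(t for t in tokens if len(t) > 1 and t not in stopwords)
    return [word for word, _ in counts.most_common(k)]


# Toy dictionary and document, for illustration only.
vocab = {"中文", "检索", "系统", "分词"}
tokens = forward_max_match("中文检索系统的中文分词", vocab)
features = top_feature_words(tokens, k=1)
```

Indexing only `features` instead of the full token stream shrinks the index, which is one plausible reading of how the preprocessing module improves efficiency; the paper's actual selection method may differ.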