检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
机构地区:[1]广西师范大学计算机科学与信息工程学院,广西桂林541004
出 处:《计算机工程与设计》2014年第6期2218-2223,共6页Computer Engineering and Design
基 金:国家自然科学基金项目(11162002)
摘 要:提出了一种基于中心实体逻辑分组的XML关键字查询算法。以实体对象作为最小的基本语义单元,使用熵值赋权法找出表示文档主题的中心实体,以该实体为中心将XML文档纵向划分成语义信息相对完整的逻辑分组,比较有效地解决了结果类型层次混乱、有意义结果丢失以及返回无意义结果等问题。最后结合逻辑分组的结构信息以及赋权增强区分度的思想,对返回候选查询结果进行排序,能较好地将与用户查询意图最相关的结果优先返回给用户。实验结果表明,该算法与SLCA、MLCEA、XReal相比,具有较好的查询质量和排序效果。A XML keyword search algorithm based on central entity logical grouping was proposed.Taking about the semantic information of each node in the XML document fully,the entity object was regarded as a minimum basic semantic unit,and entropy weighting method was used to determine the central entity of the theme of a XML document,and then the XML document was cut into some relatively complete semantic logical grouping regarding the central entity as a center.The algorithm could solve some problems effectively,such as the confusion of result type hierarchy,the loss of meaningful result and the return of meaningless results.At last the results were sorted by combining the structure information of a logical grouping and the weight,the results which were the most relevant to the user's query intent were returned to the user preferentially.The experiment shows that the proposed algorithm had better query quality and sort results comparing to SLCA,MLCEA and XReal.
关 键 词:可扩展标记语言 关键字查询 中心实体 熵值赋权法 逻辑分组 排序
分 类 号:TP311.13[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.28