检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:霍欢[1] 王国仁[2] 陈庆奎[1] 彭敦陆[1]
机构地区:[1]上海理工大学光电信息与计算机工程学院,上海200093 [2]东北大学信息科学与工程学院,沈阳110004
出 处:《计算机研究与发展》2010年第5期886-892,共7页Journal of Computer Research and Development
基 金:国家自然科学基金项目(60970012);上海市重点学科建设基金项目(S30501);上海市高校优秀青年教师科研专项基金项目(SLG08012);上海市教委科技创新基金项目(08YZ98)~~
摘 要:与传统数据库对XML数据的处理不同,对XML数据流的处理不仅受实时性的约束,还受存储空间的限制.在XML片段无序传送的广播模型中,考虑在XML数据流上进行高效的关键字查询,进而首次提出近似SLCA算法.SLCA算法利用结构Hash表和LCA表对关键字进行匹配并计算SLCA,从而避免冗余操作.同时,SLCA算法可以对匹配结果立即输出而不必等到数据流传输结束.实验结果表明,基于Hole-Filler模型的XML数据流上的SLCA算法在节省时间和空间开销方面均表现出较好的性能.Unlike in traditional databases,queries on XML streams are bounded not only by memory but also by real time processing.A novel technique for keyword search over streamed XML fragments is presented,which adopts broadcast model and hole-filler model for XML fragments dissemination,addressing the problem of disordered fragment transmission and considering the quality of searching results due to either keyword mismatch or data absence.Two efficient indexes for candidate elements are developed to further improve the performance:Hierarchical hash table and LCA table.The former indexes structure keywords which act as the structure of result,while the latter indexes the condition keywords which refine the keyword search condition.SLCA computing algorithm,which is triggered by condition keywords,only computes the candidate fragments that involve keywords,thus avoiding redundant operations that will not contribute to the final result.The algorithm produces part of the matched answers continuously without having to wait for the end of the stream.The experiments evaluate the performance of the SLCA algorithm with different types of keywords,different document fragmentation and different keyword frequencies,and compare the SLCA algorithm with other XML keyword matching algorithms.The experiment study shows that the SLCA algorithm performs well on saving processing power and memory space.
关 键 词:XML 数据流 查询 最小最近公共祖先(SLCA) Hole-Filler模型
分 类 号:TP311.13[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222