Authors: Zhang Min [1,4]; Li Wei; Fan Qing [3]
Affiliations: [1] Wuhan Library, Chinese Academy of Sciences, Wuhan 430071, China; [2] Wuhan Vocational College of Software and Engineering (Wuhan Open University), Wuhan 430205, China; [3] National Cultural Industry Research Center, Central China Normal University, Wuhan 430079, China; [4] Hubei Key Laboratory of Big Data in Science and Technology, Wuhan 430071, China
Source: Journal of Modern Information (《现代情报》), 2024, No. 2, pp. 107-114, 129 (9 pages)
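Funding: National Social Science Fund of China, Art Studies Project, "Research on the Internal Mechanism and Advancement Path of Intelligent Communication of Intangible Cultural Heritage" (Grant No. 22CH188); Open Fund Project of the Hubei Key Laboratory of Big Data in Science and Technology, "Construction of an Open Platform for Big Data Resources in the Field of Science Culture Communication" (Grant No. E3KF291001).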
Abstract: [Purpose/Significance] Researchers urgently need efficient querying of fine-grained semantic information within scientific and technological literature. Previous studies proposed a multidimensional semantic indexing system for such literature, but the common HashMap-based inverted index leads to inefficient querying. This paper aims to improve semantic query performance by building hybrid inverted indexes tailored to the semantic features of different dimensions. [Method/Process] Using Treap, B+ tree, and other data structures, this paper explores inverted index construction methods suited to different semantic dimensions and combines them into several hybrid inverted index construction methods for the multidimensional semantic organization of scientific and technological literature. Comparative experiments then analyze and verify the query performance of the different construction methods under Top-k query and Boolean query conditions. [Result/Conclusion] The experimental results show that, among the eight hybrid inverted index construction methods formed by combination, C3 (HHHB), shown in Table 2, is the most efficient under Top-k query conditions, while C4 (TTTB) is the most efficient under Boolean query conditions. The proposed method effectively addresses the query efficiency problem caused by a single index structure.
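The abstract does not describe how the hybrid indexes are built, so the following is only a minimal Python sketch of the general idea: one inverted index per semantic dimension, each backed by a different dictionary structure. Everything in it is an assumption made for illustration, not the authors' method: the class names, the dimension names `method` and `year`, the weights, and the use of a sorted key list with `bisect` as a simple stand-in for a B+ tree. No Treap is implemented here.

```python
import bisect
import heapq
from collections import defaultdict


class HashDictionary:
    """Term dictionary backed by a hash map: fast exact-match lookup."""

    def __init__(self):
        self._postings = defaultdict(dict)  # term -> {doc_id: weight}

    def add(self, term, doc_id, weight=1.0):
        self._postings[term][doc_id] = weight

    def lookup(self, term):
        return self._postings.get(term, {})


class SortedDictionary:
    """Term dictionary with sorted keys (hypothetical stand-in for a B+ tree)."""

    def __init__(self):
        self._keys = []      # distinct terms, kept in sorted order
        self._postings = {}  # term -> {doc_id: weight}

    def add(self, term, doc_id, weight=1.0):
        if term not in self._postings:
            bisect.insort(self._keys, term)
            self._postings[term] = {}
        self._postings[term][doc_id] = weight

    def lookup(self, term):
        return self._postings.get(term, {})

    def range_lookup(self, low, high):
        """Merge postings of every term with low <= term <= high."""
        start = bisect.bisect_left(self._keys, low)
        end = bisect.bisect_right(self._keys, high)
        merged = {}
        for key in self._keys[start:end]:
            merged.update(self._postings[key])
        return merged


class HybridInvertedIndex:
    """One dictionary per semantic dimension; structures may differ."""

    def __init__(self, dictionaries):
        self.dims = dictionaries  # dimension name -> dictionary object

    def add(self, dim, term, doc_id, weight=1.0):
        self.dims[dim].add(term, doc_id, weight)

    def boolean_and(self, *clauses):
        """Intersect the posting lists of (dimension, term) clauses."""
        result = None
        for dim, term in clauses:
            docs = set(self.dims[dim].lookup(term))
            result = docs if result is None else result & docs
        return result or set()

    def top_k(self, dim, term, k):
        """Return the k highest-weighted (doc_id, weight) pairs for a term."""
        postings = self.dims[dim].lookup(term)
        return heapq.nlargest(k, postings.items(), key=lambda item: item[1])


# Hypothetical two-dimension index with made-up documents and weights.
index = HybridInvertedIndex({"method": HashDictionary(), "year": SortedDictionary()})
index.add("method", "treap", "d1", 0.9)
index.add("method", "treap", "d2", 0.4)
index.add("method", "b+tree", "d3", 0.7)
index.add("year", 2021, "d1")
index.add("year", 2023, "d2")
index.add("year", 2024, "d3")

print(index.boolean_and(("method", "treap"), ("year", 2023)))
print(index.top_k("method", "treap", 1))
print(sorted(index.dims["year"].range_lookup(2022, 2024)))
```

In this sketch the hash-backed dimension suits exact term matching, while the sorted dimension also supports range lookups. That is one plausible reason to mix structures across dimensions, though the abstract itself reports only the measured outcome for the C3 and C4 combinations.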