检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
作 者:王玲 刘晓清 何震瀛[2] 奚军庆 项焱[4] WANG Ling;LIU Xiaoqing;HE Zhenying;XI Junqing;XIANG Yan(School of Software,Fudan University,Shanghai 200438,China;School of Computer Science and Technology,Fudan University,Shanghai 200438,China;Ministry of Justice Information Center,Beijing 100020,China;School of Low,Wuhan University,Wuhan,Hubei Province 430000China)
机构地区:[1]复旦大学软件学院,上海200438 [2]复旦大学计算机科学技术学院,上海200438 [3]司法部信息中心,北京100020 [4]武汉大学法学院,武汉430000
出 处:《智能计算机与应用》2021年第9期28-34,41,共8页Intelligent Computer and Applications
基 金:国家重点研发计划(2018YFC0830900)。
摘 要:随着大数据技术的发展,加强司法大数据应用成为推进司法现代化建设的重要手段,如何处理司法大数据中的非结构化数据亟待解决。为此,本文提出了面向司法大数据的文本主题OLAP系统。在离线数据处理模块中,设计了Span数据模型,并定义了多种针对该模型的操作符;设计了基于规则的文本行政区划归类方法,并构建了主题立方体。在线上查询模块中,实现了基于倒排索引的关键词搜索方法和最大独特主题范围查询,提供了上卷、下钻、切片等功能。通过在大规模的真实数据集上对系统进行测试,实验结果证明了该系统的合理性和实用性。With the development of big data technology,strengthening the application of judicial big data has become an important means to promote judicial modernization.How to deal with unstructured data in judicial big data needs to be solved urgently.To this end,a text topic OLAP system oriented to judicial big data is proposed,which includes offline data processing and online query parts.In the offline data processing module,a Span data model is designed and a variety of operators for this model are defined.In addition,a rule-based text administrative division classification method is designed,and a topic cube is constructed.In the online query module,the keyword search method based on the inverted index and the largest unique subject range query are realized,and functions such as scrolling,drilling,and slicing are provided.The system is tested on a real large-scale data set,and the experimental results proves the rationality and practicability of the system.
分 类 号:TP311[自动化与计算机技术—计算机软件与理论]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.200