基于场景理论的STAC课程数据库自动检索系统  被引量:1

Design of Stac Course Database Automatic Retrieval System Based on Scene Theory

在线阅读下载全文

作  者:李曙军 张宏杰 王海棠 王秋爽[4] LI Shujun;ZHANG Hongjie;WANG Haitang;WANG Qiushuang(Party School Department,training Center,Hebei Electric Power Co.,Ltd.,Shijiazhuang 050023;Training Center of Hebei Electric Power Co.,Ltd.,Shijiazhuang 050023;Beijing Minxing Pioneering International Management Consulting CoLtd,Beijing 101100,China;College of Computer Science and Technology,Jilin University,Changchun 130012,China)

机构地区:[1]国网河北省电力有限公司培训中心党校工作部,石家庄050023 [2]国网河北省电力有限公司培训中心,石家庄050023 [3]北京敏行创业国际管理咨询有限公司,北京101100 [4]吉林大学计算机科学与技术学院,长春130012

出  处:《吉林大学学报(信息科学版)》2019年第4期457-462,共6页Journal of Jilin University(Information Science Edition)

基  金:吉林大学本科教学改革研究基金资助项目(2017XYB070)

摘  要:由于传统课程数据库检索系统查全效果较差,同时受到噪声影响,导致检索精准度较低,不能满足用户对Stac(Statistical Analysis)课程数据库检索的需求。为此,提出基于场景理论的Stac课程数据库自动检索系统设计。在场景理论下,对数据库自动检索系统进行总体设计,添加分词模块,采用组合型歧义统计方式,区分Stac课程数据库中同义或多义词;使用网络蜘蛛寻找网页链接地址,读取内容,进行全部目标地址检索;当采集量达到一定规模时,调用数个独立的搜索引擎,相互合作,以此建立索引库,根据Stac课程资源数据规范标准进行数据采集,利用索引引擎,将采集结果全部输入到系统中。通过辨认情景特点,建立光盘数据库,设计检索流程,严密监视各个机器行为,避免噪声干扰,经过中心DB Server(Data Base Senver)处理,将地址列表合并,形成新资源列表,供用户检索。由实验结果可知,该系统检索精准度最高可达到98%,为多图像检索提供系统支持。Due to the poor performance of the traditional course database retrieval system and the influence of noise,the retrieval accuracy is low,which can not meet the user’s demand for Stac(Statistical Analysis) course database retrieval. To this end,the design of the automatic retrieval system of Stac course database based on scene theory is proposed. Under the scene theory,the design of the database automatic retrieval system adding the word segmentation module,uses the combined ambiguity statistical method to distinguish synonymous or polysemous words in the Stac course database;and uses the web spider to find the web link address,reading the content,and performing all the goals address retrieval. When the collection volume reaches a certain scale,several independent search engines are called and cooperated with each other to establish an index library,collect data according to the Stac curriculum resource data specification standard,and use the index engine to input all the collection results into the system. By identifying the characteristics of the scene,the CD-ROM database is created,the retrieval process is designed,the behavior of each machine is closely monitored,and noise interference is avoided. After processing by the central DB(Data Base) Server,the address lists are merged to form a new resource list for the user to retrieve. Experimental results show that the retrieval accuracy of the system can reach up to 98%,providing systematic support for multi-image retrieval.

关 键 词:场景理论 Stac课程 数据库 自动检索 索引 引擎 

分 类 号:TP391.1[自动化与计算机技术—计算机应用技术]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象