检索规则说明:AND代表“并且”;OR代表“或者”;NOT代表“不包含”;(注意必须大写,运算符两边需空一格)
检 索 范 例 :范例一: (K=图书馆学 OR K=情报学) AND A=范并思 范例二:J=计算机应用与软件 AND (U=C++ OR U=Basic) NOT M=Visual
出 处:《计算机与应用化学》2005年第8期659-666,共8页Computers and Applied Chemistry
摘 要:Internet上的化字数据库是重要的专业资源,基于超链按分析的搜索引擎还不能索引这类资源。本论文以充分利用In- ternet上的化学数据库数据为目标,将“一个查询发动多个同级检索引擎,并以结构化的方式组织信息”的方案应用于以化合物标识信息为检索入口的Web化学数据库,建立了一个基于多站点集成检索的Web数据库定向查询引擎。该引擎是一个包括用户交互层、中间检索层、数据提供层的三层Web模型。各层在系统内部分别对应于响应用户检索请求的客户端代理模块、集成远程Web信息的服务器端代理模块,以及提供缓存和检索的关系数据库模块。模型采用JSP+Java组件的开发方式, 在HTTP协议标准发送方法的基础上,采用XML技术对检索返回文档进行结构化数据的提取和表示,利用XML-DBMS实现XML数据的存储和检索,建立了一套针对深层Web数据交换的解决方案。依此方案所建立的ChemDB Portal Search实现了四个分布式Web化学数据库的有效加入、同时检索和统一显示。该系统是针对深层Web信息的挖掘和集成检索的一次尝试, 它可为其它领域建立类似的系统提供借鉴。The data in Internet Chemical databases are a class of valuable resources, which couldn't be indexed by search engines based on hyperlink analysis. The major purpose of this paper is to take good advantage of these resources. This is an approach that one query launches several search engines at host sites of distributed chemical databases with compound identifications as entry points in a cascading fashion, and searching results organized in a structural way to form a ChemDB Portal Search, a Chemical Directed Query Engine. A three-tier model is designed for the approach. , including the user interface as a Client Agent responding users'queries, the searching middle-tier ware as a Server Agent integrating data from the target sites, and the Web sites and local database as the data managers providing retrieval of the data. Combining with HTTP to send queries, the model is implemented with JSP + JavaBean fashion using XML technology to wrap structural data from the returned pages and XML-DBMS to store and retrieve XML documents in local databases. Simultaneous searching of five distributed chemical databases by one query is now possible to the ChemDB Portal Search, which can display the hits from different sources in a unified form. In conclusion, the thesis is an attempt to mine and integrate data from Deep Web. It may provide a practicable approach for building similar systems in other fields.
关 键 词:定向查询引擎 深层网 WEB数据挖掘 分布式数据库 集成检索 XML
分 类 号:TP392[自动化与计算机技术—计算机应用技术]
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在载入数据...
正在链接到云南高校图书馆文献保障联盟下载...
云南高校图书馆联盟文献共享服务平台 版权所有©
您的IP:216.73.216.222