Development of a New System for Integration and Data Mining of Heterogeneous Information  Cited by: 2


Authors: Qi Yanhong [1], Liu Yali [2]

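Affiliations: [1] Sun Yat-sen University Library, Guangzhou, Guangdong 510275; [2] School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong 510275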

Source: Information Science (《情报科学》), 2009, No. 4, pp. 538-543 (6 pages)

Funding: Ministry of Education Humanities and Social Sciences Planning Project (06JA870013)

Abstract: JavaXML, a system for integrating and analyzing heterogeneous data, was built on XML and Java technologies. JavaXML combines a relational database with a native XML database as its back end, which stores and manages the heterogeneous data collected from the internet. A web database built with JSP publishes information dynamically as semi-structured data in XML form. JavaXML also includes a data mining and analysis platform that provides a range of numerical and statistical methods, such as cluster analysis with significance marking, correlation analysis with significance marking, and pattern recognition algorithms. The platform shows clear advantages in dynamic database access, computational methods, runtime environment, real-time updating, and cross-platform support. JavaXML is flexible, portable, efficient, and lightweight. It supports quick and convenient integration and analysis of heterogeneous data resources, and it offers a new solution for storing, retrieving, transmitting, and analyzing heterogeneous data.
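As an illustration of the idea of publishing a stored record as semi-structured XML, the following is a minimal sketch only. It is not the JavaXML implementation: the class name `RecordToXml`, the method `toXml`, and the element names are hypothetical, and the standard JAXP DOM and Transformer APIs are assumed here in place of whatever the authors actually used.

```java
import java.io.StringWriter;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.transform.OutputKeys;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.dom.DOMSource;
import javax.xml.transform.stream.StreamResult;
import org.w3c.dom.Document;
import org.w3c.dom.Element;

// Hypothetical helper: turns one record (column name -> value) into an XML string.
class RecordToXml {

    static String toXml(String recordName, Map<String, String> fields) {
        try {
            // Build an in-memory DOM tree with one child element per field.
            Document doc = DocumentBuilderFactory.newInstance()
                    .newDocumentBuilder().newDocument();
            Element root = doc.createElement(recordName);
            doc.appendChild(root);
            for (Map.Entry<String, String> field : fields.entrySet()) {
                Element child = doc.createElement(field.getKey());
                child.setTextContent(field.getValue()); // special characters are escaped on output
                root.appendChild(child);
            }

            // Serialize the tree to text without the XML declaration.
            Transformer transformer = TransformerFactory.newInstance().newTransformer();
            transformer.setOutputProperty(OutputKeys.OMIT_XML_DECLARATION, "yes");
            StringWriter out = new StringWriter();
            transformer.transform(new DOMSource(doc), new StreamResult(out));
            return out.toString();
        } catch (Exception e) {
            throw new IllegalStateException("Could not serialize record to XML", e);
        }
    }

    public static void main(String[] args) {
        // Made-up sample values; a real system would read these from the database.
        Map<String, String> row = new LinkedHashMap<>();
        row.put("title", "Sample record");
        row.put("year", "2009");
        System.out.println(toXml("record", row));
    }
}
```

In a JSP-based setup like the one the abstract describes, a page or servlet could write such a string to the HTTP response with an XML content type. Field names must be valid XML element names, so column names from a real database may need to be sanitized first.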

Keywords: heterogeneous data; integration; system; data mining

Classification: G350.7 [Cultural Science: Information Science]

 
