Web Database Sampling Approach Based on Attribute Correlation  

Web Database Sampling Approach Based on Attribute Correlation

在线阅读下载全文

作  者:TIAN Jianwei, LI Shijun, TANG Xiaoyue School of Computer, Wuhan University, Wuhan 430072, Hubei, China 

出  处:《Wuhan University Journal of Natural Sciences》2010年第4期297-302,共6页武汉大学学报(自然科学英文版)

基  金:Supported by the National Natural Science Foundation of China (60970018)

摘  要:In this paper,we present a novel approach utilizing attributes correlation for the sampling task on nonuniform hidden databases. We propose the method of calculating the attributes dependency and construct the sampling template according to the attributes dependency. Then,we use the sampling template to gen-erate initial sampling queries and propose a bottom-up algorithm to search the sampling template. We also conduct extensive ex-periments over real deep Web sites and controlled databases to illustrate that our sampling method has good performance both on the quality and efficiency.In this paper,we present a novel approach utilizing attributes correlation for the sampling task on nonuniform hidden databases. We propose the method of calculating the attributes dependency and construct the sampling template according to the attributes dependency. Then,we use the sampling template to gen-erate initial sampling queries and propose a bottom-up algorithm to search the sampling template. We also conduct extensive ex-periments over real deep Web sites and controlled databases to illustrate that our sampling method has good performance both on the quality and efficiency.

关 键 词:attributes correlation hidden database sampling template mutual information 

分 类 号:N[自然科学总论]

 

参考文献:

正在载入数据...

 

二级参考文献:

正在载入数据...

 

耦合文献:

正在载入数据...

 

引证文献:

正在载入数据...

 

二级引证文献:

正在载入数据...

 

同被引文献:

正在载入数据...

 

相关期刊文献:

正在载入数据...

相关的主题
相关的作者对象
相关的机构对象