Enhancing Storage Efficiency and Performance: A Survey of Data Partitioning Techniques  


Authors: Peng-Ju Liu (刘鹏举); Cui-Ping Li (李翠平); Hong Chen (陈红) (Distinguished Member, CCF; 1. School of Information, Renmin University of China, Beijing 100872, China; Key Laboratory of Data Engineering and Knowledge Engineering of the Ministry of Education, Beijing 100872, China)

Affiliations: [1] Distinguished Member, CCF; School of Information, Renmin University of China, Beijing 100872, China [2] Key Laboratory of Data Engineering and Knowledge Engineering of the Ministry of Education, Beijing 100872, China

Source: Journal of Computer Science & Technology, 2024, No. 2, pp. 346-368 (23 pages); 计算机科学技术学报(英文版) (Journal of Computer Science and Technology, English edition)

Funding: Supported by the National Key Research and Development Program of China under Grant No. 2023YFB4503603; the National Natural Science Foundation of China under Grant Nos. 62072460, 62076245, and 62172424; and the Beijing Natural Science Foundation under Grant No. 4212022.

Abstract: Data partitioning techniques are pivotal for optimal data placement across storage devices, thereby enhancing resource utilization and overall system throughput. However, the design of effective partition schemes faces multiple challenges, including considerations of the cluster environment, storage device characteristics, optimization objectives, and the balance between partition quality and computational efficiency. Furthermore, dynamic environments necessitate robust partition detection mechanisms. This paper presents a comprehensive survey structured around partition deployment environments, outlining the distinguishing features and applicability of various partitioning strategies and examining how these challenges are addressed. We discuss partitioning features pertaining to database schema, table data, workload, and runtime metrics. We then examine the partition generation process, dividing it into initialization and optimization stages. A comparative analysis of partition generation and update algorithms is provided, emphasizing their suitability for different scenarios and optimization objectives. Additionally, we illustrate the applications of partitioning in prevalent database products and suggest potential future research directions and solutions. This survey aims to foster the implementation, deployment, and updating of high-quality partitions for specific system scenarios.
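Illustrative note: the abstract divides partition generation into an initialization stage and an optimization stage. The Python sketch below is only one hypothetical way to picture that split and is not taken from the survey. It assumes integer keys, positive per-key access weights standing in for workload statistics, modulo placement as the initial scheme, and a simple greedy rebalancing step. The function names `initialize`, `partition_loads`, and `optimize` are invented for this example.

```python
def initialize(keys, num_partitions):
    """Initialization stage (assumed): place each integer key by modulo hashing."""
    return {key: key % num_partitions for key in keys}


def partition_loads(assignment, weights, num_partitions):
    """Sum the access weight of the keys assigned to each partition."""
    loads = [0] * num_partitions
    for key, part in assignment.items():
        loads[part] += weights[key]
    return loads


def optimize(assignment, weights, num_partitions, max_moves=100):
    """Optimization stage (assumed): greedily move one key at a time from the
    heaviest partition to the lightest one, as long as the move narrows the gap.
    Weights are assumed to be positive."""
    assignment = dict(assignment)  # work on a copy, leave the input untouched
    for _ in range(max_moves):
        loads = partition_loads(assignment, weights, num_partitions)
        heavy = loads.index(max(loads))
        light = loads.index(min(loads))
        gap = loads[heavy] - loads[light]
        # Moving a key of weight w helps only if w is smaller than the gap.
        candidates = [k for k, p in assignment.items()
                      if p == heavy and weights[k] < gap]
        if not candidates:
            break
        # Prefer the key whose weight is closest to half the gap.
        best = min(candidates, key=lambda k: abs(weights[k] - gap / 2))
        assignment[best] = light
    return assignment


if __name__ == "__main__":
    # Hypothetical workload: even keys are accessed far more often than odd keys.
    weights = {0: 10, 1: 1, 2: 8, 3: 1, 4: 6, 5: 1, 6: 4, 7: 1}
    initial = initialize(weights.keys(), 2)
    tuned = optimize(initial, weights, 2)
    print("initial loads:", partition_loads(initial, weights, 2))
    print("tuned loads:  ", partition_loads(tuned, weights, 2))
```

In this toy workload, modulo placement puts every heavily accessed key in the same partition, and the greedy pass narrows that imbalance. Real systems surveyed in the paper may optimize for other objectives, such as data skipping or cross-partition transactions, which this sketch does not model.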

Keywords: data partitioning; survey; partitioning feature; partition generation; partition update

Classification code: TP333 [Automation and Computer Technology: Computer System Architecture]

 
