Data Temperature Informed Streaming for Optimising Large-Scale Multi-Tiered Storage  


Authors: Dominic Davies-Tagg, Ashiq Anjum, Ali Zahir, Lu Liu, Muhammad Usman Yaseen, Nick Antonopoulos

Affiliations: [1] Department of Computing, University of Derby, Derby, DE22 1GB, UK; [2] Department of Informatics, University of Leicester, Leicester, LE1 7RH, UK; [3] Department of Computer Science, COMSATS University Islamabad, Islamabad 45550, Pakistan; [4] Edinburgh Napier University, Edinburgh, EH11 4BN, UK

Source: Big Data Mining and Analytics (大数据挖掘与分析(英文)), 2024, Issue 2, pp. 371-398, 28 pages

Abstract: Data temperature is a response to the ever-growing amount of data. These data have to be stored, but it has been observed that only a small portion of the data is accessed frequently at any one time. This leads to the concept of hot and cold data. Cold data can be migrated away from high-performance nodes to free up performance for higher-priority data. Existing studies classify hot and cold data primarily on the basis of data age and usage frequency. We present this as a limitation in the current implementation of data temperature, because age automatically assumes that all new data have priority and usage is purely reactive. We propose new variables and conditions that enable smarter decisions about which data are hot or cold and allow greater user control over data location and movement. We identify new metadata variables and user-defined variables to extend the current data temperature value. We further establish rules and conditions for limiting unnecessary movement of data, which helps to prevent wasted input/output (I/O) costs. We also propose a hybrid algorithm that combines existing variables with the new variables and conditions into a single data temperature. The proposed system provides higher accuracy, increases performance, and gives greater user control for optimal positioning of data within multi-tiered storage solutions.
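
The abstract does not give the formula or the movement rules, so the following Python sketch is only an illustration of the general idea: a single temperature that blends age and usage frequency with a user-defined variable, plus conditions that limit unnecessary migration. Every name, weight, and threshold here (`Item`, `user_priority`, `W_AGE`, `HOT_AT`, `MIN_RESIDENCY_DAYS`, and so on) is a hypothetical assumption, not the authors' algorithm.

```python
from dataclasses import dataclass


@dataclass
class Item:
    age_days: float          # time since the data was created
    accesses_last_week: int  # recent usage frequency
    user_priority: float     # hypothetical user-defined variable in [0, 1]
    tier: str                # current tier: "hot" or "cold"
    days_in_tier: float      # time since the last migration


# Hypothetical weights and thresholds, chosen only for illustration.
W_AGE, W_FREQ, W_USER = 0.2, 0.4, 0.4
HOT_AT, COLD_AT = 0.6, 0.4   # two thresholds give a hysteresis band
MIN_RESIDENCY_DAYS = 1.0     # do not move data that was moved very recently


def temperature(item: Item) -> float:
    """Blend age, usage, and user priority into one score in [0, 1]."""
    age_score = 1.0 / (1.0 + item.age_days / 30.0)         # newer -> closer to 1
    freq_score = min(item.accesses_last_week / 50.0, 1.0)  # capped at 1
    return W_AGE * age_score + W_FREQ * freq_score + W_USER * item.user_priority


def target_tier(item: Item) -> str:
    """Decide the tier, moving data only when the score clearly warrants it."""
    if item.days_in_tier < MIN_RESIDENCY_DAYS:
        return item.tier  # skip the move to avoid wasted I/O
    score = temperature(item)
    if item.tier == "cold" and score >= HOT_AT:
        return "hot"
    if item.tier == "hot" and score <= COLD_AT:
        return "cold"
    return item.tier      # inside the hysteresis band: stay put
```

In this sketch, brand-new data that is never accessed and has no user priority is not treated as hot by default, and an item whose score sits between the two thresholds stays where it is, which is one simple way to avoid repeated back-and-forth migration.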

Keywords: data temperature; hot and cold data; multi-tiered storage; metadata variable; multi-temperature system

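Classification codes: TP18 [Automation and Computer Technology: Control Theory and Control Engineering]; TP311.13 [Automation and Computer Technology: Control Science and Engineering]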

 
