An Optimistic Checkpoint Mechanism Based on Job Characteristics and Resource Availability for Dynamic Grids  


Authors: TAO Yongcai [1], JIN Hai [2], WU Song [2]

Affiliations: [1] School of Information Engineering, Zhengzhou University, Zhengzhou 450000, Henan, China; [2] Services Computing Technology and System Lab, Cluster and Grid Computing Lab, Huazhong University of Science and Technology, Wuhan 430074, Hubei, China

Source: Wuhan University Journal of Natural Sciences (武汉大学学报(自然科学英文版)), 2011, No. 3, pp. 213-222 (10 pages)

Funding: Supported by the National Natural Science Foundation of China (90412010, 60603058, and 60673174); the Ministry of Education of China and Program for New Century Excellent Talents in University (NCET-07-0334)

Abstract: Based on job characteristics and resource availability, this paper proposes an optimistic checkpoint mechanism for dynamic grids (OCM4G). Using knowledge of both, OCM4G determines whether to checkpoint a given job running on a given resource node and establishes optimal aperiodic checkpoint intervals. We evaluate OCM4G in a real grid environment (ChinaGrid), and the results show that it achieves better performance than periodic checkpointing and than the analytical method of calculating aperiodic checkpoint intervals.

Keywords: grid computing; fault tolerance; checkpoint; Markov

Classification: TP302.1 [Automation and Computer Technology: Computer System Architecture]

 
