Checkpointing and rollback recovery for network of workstations  Cited by: 1

Authors: Wang Dongsheng (汪东升), Zheng Weimin (郑纬民), Wang Dingxing (王鼎兴), Shen Meiming (沈美明)

Affiliation: Department of Computer Science and Technology, Tsinghua University

Source: Science China (Technological Sciences), 1999, Issue 2, pp. 207-214 (8 pages); Chinese title: 中国科学(技术科学英文版)

Funding: Project supported by the National "863" High-tech Program of China.

Abstract: Networks of workstations (NOW) have become one of the main trends in parallel computing. For long-running scientific programs, however, a NOW needs effective fault tolerance because of its changing nature. Checkpointing and rollback recovery is one solution to this problem. The paper first discusses the main problems of rollback recovery and analyzes the different checkpointing techniques for NOW, then describes the design and implementation of the ChaRM (checkpoint-based rollback recovery and process migration) system. A comparison of three coordinated checkpointing systems is also given.

Keywords: checkpointing; rollback recovery; network of workstations (NOW); domino effect; coordinated checkpointing

Classification code: TP393.03 [Automation and Computer Technology: Computer Application Technology]

 
