An Efficient Forward Recovery Checkpointing Scheme in Dissimilar Redundancy Computer System

Roll-forward checkpointing schemes (RFCS) are developed in order to avoid rollback in the presence of independent faults and increase the possibility that a task completes within a tight deadline. But the assumption of RFCS does not exist in most time. Run the same software on the same hardware may...

Full description

Saved in:
Bibliographic Details
Published in:2009 International Conference on Computational Intelligence and Software Engineering pp. 1 - 4
Main Authors: Wang, Guodong, Zhai, Zhengjun, Huang, Tao, Huang, Kaichen
Format: Conference Proceeding
Language:English
Published: IEEE 01-12-2009
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Roll-forward checkpointing schemes (RFCS) are developed in order to avoid rollback in the presence of independent faults and increase the possibility that a task completes within a tight deadline. But the assumption of RFCS does not exist in most time. Run the same software on the same hardware may result in correlated faults. Another question is these RFCS schemes may lose useful build-in self detection information results in performance degradation. In this paper, we propose a twice dissimilar redundancy computer based roll-forward recovery scheme (TDCS) that can avoid the correlated faults and realize fault-tolerance, without extra process. At last we use a novel technique based on a Markov reward model, to reveal our TDCS performance is quite better than the RFCS in average completion time when build-in self detection coverage be high.
DOI:10.1109/CISE.2009.5366252