A Distributed Error Recovery Technique and Its Implementation and Application on UNIX
-
Abstract
This paper presents a checkpoint setting technique to eliminate domino effect in backward recovery in distributed systems,which is very efficient,powerful,widely applicable and easy to be implememted.Besides theoretical analysis,an implementation on UNIX system and a package for software fault-tolerance are in- troduced.Then the problems of checkpoint management and process termination are discussed.
-
-