Liu Chunsheng, Wen Chuanyuan. Model and Algorithm of Backward Error Recovery of Distributed Software[J]. Journal of Computer Science and Technology, 1989, 4(3): 275-285.

Model and Algorithm of Backward Error Recovery of Distributed Software

More Information
  • Received Date: April 22, 1987
  • Revised Date: April 22, 1988
  • Backward error recovery is an important technique for software fault tolerance. Because errors propagate between communicating processes, recovery in distributed software requires the processes to cooperate in order to achieve consistent recovery. Existing techniques for achieving this, however, either reduce the level of concurrency or suffer from the domino effect. Based on a formal model of the distributed system, this paper specifies a backward recovery protocol that avoids both drawbacks. The algorithm of the protocol is rigorously proven, and an implementation is proposed.
