Journal article
A diagnosis algorithm for distributed computing systems with dynamic failure and repair
IEEE transactions on computers, Vol.33(3), pp.223-233
1984
DOI: 10.1109/TC.1984.1676419
Abstract
The problem of designing distributed fault-tolerant computing systems is considered. A model in which the network nodes are assumed to possess the ability to "test" certain other network facilities for the presence of failures is employed. Using this model, a distributed algorithm is presented which allows all the network nodes to correctly reach independent diagnoses of the condition (faulty or fault-free) of all the network nodes and internode communication facilities, provided the total number of failures does not exceed a given bound. The proposed algorithm allows for the reentry of repaired or replaced faulty facilities back into the network, and it also has provisions for adding new nodes to the system. Sufficient conditions are obtained for designing a distributed fault-tolerant system by employing the given algorithm. The algorithm has the interesting property that it lets as many as all of the nodes and internode communication facilities fail, but upon repair or replacement of faulty facilities, the system can converge to normal operation if no more than a certain number of facilities remain faulty.
Details
- Title: Subtitle
- A diagnosis algorithm for distributed computing systems with dynamic failure and repair
- Creators
- S. H Hosseini - Univ. Wisconsin, dep. electrical eng. computer sci; J. G Kuhl - Univ. Wisconsin, dep. electrical eng. computer sci; S. M Reddy - Univ. Wisconsin, dep. electrical eng. computer sci
- Resource Type
- Journal article
- Publication Details
- IEEE transactions on computers, Vol.33(3), pp.223-233
- Publisher
- Institute of Electrical and Electronics Engineers
- DOI
- 10.1109/TC.1984.1676419
- ISSN
- 0018-9340
- eISSN
- 1557-9956
- Language
- English
- Date published
- 1984
- Academic Unit
- Electrical and Computer Engineering; Public Policy Center (Archive)
- Record Identifier
- 9984197191202771