Xem mẫu
Chapter 19 Recovery and Fault Tolerance
Copyright © 2008
Introduction
• Faults, Failures, and Recovery
• Byzantine Faults and Agreement Protocols • Recovery
• Fault Tolerance Techniques • Resiliency
Operating Systems, by Dhananjay Dhamdherehamdhere 19.2 2
Copyright © 2008
Faults, Failures, and Recovery
• A fault may damage the state of a system
– Error: a part of the system state that is erroneous • Failure: unexpected behavior or situation
Operating Systems, by Dhananjay Dhamdherehamdhere 19.3 3
Copyright © 2008
Faults, Failures, and Recovery (continued)
• Recovery: for reliable operation, system is restored to a consistent state, and operation resumed
– A recovery is performed when a failure is noticed
Operating Systems, by Dhananjay Dhamdherehamdhere 19.4 4
Copyright © 2008
Classes of Faults
• Fault model: properties that determine the kinds of errors/failures that might result from a fault
• Classes of faults:
– System fault system crash
• Amnesia and partial amnesia faults
• A fail-stop fault brings a system to a halt – Process fault
• Byzantine faults: malicious or arbitrary actions – Storage fault amnesia faults
– Communication fault nonamnesia faults
Operating Systems, by Dhananjay Dhamdherehamdhere 19.5 5
Copyright © 2008
...
- tailieumienphi.vn
nguon tai.lieu . vn