Download presentation
Presentation is loading. Please wait.
1
Fault Tolerance: Basic Mechanisms mMIC-SFT September 2003 Anders P. Ravn Aalborg University
2
Fault Tolerance Means to isolate component faults Prevents system failures May increase system dependability... And mask them
3
Dependability - means Fault prevention Fault tolerance Error Removal Failure Forecasting BW p. 106,...
4
Fault Tolerance
5
FT - levels Full tolerance Graceful Degradation Fail safe BW p. 107
6
FT basis: Redundancy Time Space TryRetry... Try... BW p. 109
7
N-version programming V1 V2 V3 Driver (comporator) Comparison vectors (votes) Comparison status indicators BW p. 109 Comparison points
8
Fault classification (scope of N-VP) Origin Kind Property physical (internal/external) logical (design/interaction) omission value timing byzantine duration (permanent, transient) consistency (determinate, nondeterminate) autonomy (spontaneous, event-dependent) + (+) ++ (+) + / (+) + / +
9
Dynamic Redundancy 1.Error detection 2.Damage confinement and assessment 3.Error recovery 4.Fault treatment and continued service BW p. 114
10
Error Detection f: State x Input State x Output Environment (exception) Application BW p. 115 Assertion: precondition (input) postcondition (input, output) invariant(state, state’) Timing: WCET(f, input) Deadline (f,input) D
11
Damage Confinement Static structure Dynamic structure BW p. 117 object I I
12
Error Recovery Forward Backward BW p. 118 Repair the state – if you can ! define recovery points checkpoint state at r. p. roll back retry Domino effect
13
Recovery blocks ENSURE acceptance_test BY { module_1 } ELSE BY { module_2 }... ELSE BY { module_m } ELSE ERROR BW p. 120
14
The ideal FT-component Exception HandlerNormal mode Request/response Interface exception Interface exception Failure exception Failure exception BW p. 126
Similar presentations
© 2025 SlidePlayer.com. Inc.
All rights reserved.