fault tolerance fundamentals

17
Fault Tolerance Fundamentals ITV Model-based Analysis and Design of Embedded Software Techniques and methods for Critical Software Anders P. Ravn Aalborg University August 2011

Upload: lynna

Post on 22-Feb-2016

69 views

Category:

Documents


1 download

DESCRIPTION

Fault Tolerance Fundamentals . ITV Model-based Analysis and Design of Embedded Software Techniques and methods for Critical Software Anders P. Ravn Aalborg University August 2011. Fault Tolerance. Means to isolate component faults. Prevents system failures. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Fault Tolerance Fundamentals

Fault Tolerance Fundamentals

ITV Model-based Analysis and Design of Embedded SoftwareTechniques and methods for Critical Software

Anders P. RavnAalborg University

August 2011

Page 2: Fault Tolerance Fundamentals

Fault Tolerance

Means to isolate component faults

Prevents system failures

May increase system dependability

Page 3: Fault Tolerance Fundamentals

Dependability - attributes• Availability• Reliability• Safety• Confidentiality• Integrity• Maintainability

Page 4: Fault Tolerance Fundamentals

Dependability - impairments

• Faults • Errors• Failures

Page 5: Fault Tolerance Fundamentals

System and Component

Page 6: Fault Tolerance Fundamentals

Propagation among Components

Page 7: Fault Tolerance Fundamentals

Error Classification

(Fault Error)

• Effect

• Extent

• latent• effective

• local• distributed

Page 8: Fault Tolerance Fundamentals

Failure Classification

(Fault Error Failure)

• Consequence • benign• malign (a mishap)

Page 9: Fault Tolerance Fundamentals

Fault Tolerance

Means to isolate component faults

Prevents system failures

May increase system dependability

... And mask them

Page 10: Fault Tolerance Fundamentals

Fault Tolerance

Page 11: Fault Tolerance Fundamentals

FT - levels

• Full tolerance

• Graceful Degradation

• Fail safeBW p. 107

Page 12: Fault Tolerance Fundamentals

FT basis: Redundancy

• Time

• Space

Try Retry Retry ...

TryTryTry

...

BW p. 109

Page 13: Fault Tolerance Fundamentals

Fault Tolerance

Page 14: Fault Tolerance Fundamentals

Basic Strategies

Page 15: Fault Tolerance Fundamentals

The ideal FT-component

Exception HandlerNormal mode

Request/response

Request/response

Interfaceexception

Interfaceexception

Failureexception

Failureexception

Page 16: Fault Tolerance Fundamentals

Model Design Procedure1. Model the correct component and check that it

has the desired properties.2. Model relevant faults and introduce them as

internal transitions to error states. Check that this fault-affected.

3. Introduce into the model the mechanisms for fault detection, error recovery and masking and check that the desired properties are valid for this design.

Page 17: Fault Tolerance Fundamentals

Exercise

• What is the purpose of a watchdog-timer?• How could it be used in a space based

redundancy scheme?• - in a time based redundancy scheme?