safety and security in distributed systems

32
Safety and security in distributed systems Einar Landre

Upload: einar-landre

Post on 17-Jan-2017

403 views

Category:

Software


0 download

TRANSCRIPT

Page 1: Safety and security in distributed systems

Safety and security in distributed systemsEinar Landre

Page 2: Safety and security in distributed systems

Statoil

Page 3: Safety and security in distributed systems

Context

Page 4: Safety and security in distributed systems

Industries with the potential to injure or kill people or to do serious damage on the environment

hazardous industry

Require high-integrity systems and safety management processes to ensure safety

Page 5: Safety and security in distributed systems

high integrity systemsSystems where failure could lead to an accident and for which high reliability are claimed

- Pressure boundaries- Oil & Gas wells- Boilers

- Instrumentation & Control Systems- Emergency shutdown - Fire and gas leak detection

- Life supporting devices- Pacemakers- Infusion pumps

Page 6: Safety and security in distributed systems

system criticalityNon - Critical

Useful system- Low dependability

- System does not need to be trusted

Business - Critical Mission - Critical Safety - Critical

High Availability- Focus on cost s of

failure caused by system downtime, cost of spares, repair equipment and personnel and warranty claims

High Reliability- Increase the

probability of failure free system operation over a specified time in a given environment for a given purpose

High Safety & Integrity Level- High reliability

- High availability

- High security

- Focus is not on cost, but on preserving life and nature

Page 7: Safety and security in distributed systems

safeguarding integrity

Risk / threat based approach

Page 8: Safety and security in distributed systems

Things

Page 9: Safety and security in distributed systems

Troll A, 472 meters, the largest man made “thing” ever moved

Software was an alien concept

things anno 1995

Page 10: Safety and security in distributed systems

things anno 2015

Asgard subsea compression runs on software

Size = a football field

Page 11: Safety and security in distributed systems

things anno 2025

The subsea factory will be an Internet of Everything

Page 12: Safety and security in distributed systems

Fallacies of distributed computing:1. The network is reliable

2. Latency is zero

3. Bandwidth is infinite

4. The network is secure

5. Topology doesn’t change

6. There is one administrator

7. Transport cost is zero

networked everything's

A distributed system is one in which the failure of a computer youdidn’t even know existed can render your own computer unusable.

Leslie Lamport

Page 13: Safety and security in distributed systems

Software

Page 14: Safety and security in distributed systems

software is ubiquitousDefines the behaviour of1. Mobile devices

2. Medical devices

3. Computer Networks

4. Industrial control systems

5. Supply chains and logistics

6. Robots, cars & aircrafts

7. Human-Machine Interfaces

Institutionalizes our insights and knowledge

Page 15: Safety and security in distributed systems

before softwareTangible control logic

• Design level

• Implementation level

• Verification & test level

No cyber threats

• Intrusion

• Viruses

• Theft

• Identity

Page 16: Safety and security in distributed systems

two unique propertiesInspection & Test • Software can’t be inspected and

tested as analogous components

CPU – the single point of failure• All signals are threaded through the

one single element.

• Execution sequence is un-known

• Same defect is systemized acrossmultiple instances

Impacts how we must manage software for critical systems

Page 17: Safety and security in distributed systems

some specific challengesCommon mode failure

Malware, Viruses and Hacking

Human Factors

Blurred boundaries

Identity management

Page 18: Safety and security in distributed systems

common mode failure“results from an event which because of dependencies causes a coincidence of failure states of components in two or more separate channels of a redundancy system, leading to the defined systems failing to perform its intended function”.

Ariane 5 test launch, 1996

Page 19: Safety and security in distributed systems

malware, viruses and hacking

Motivated by financial, political, criminal or idealistic interests

Software created to cause harm• Change of system behaviour• Steal / destroy data or machines

Exploits weaknesses in• Human character• Technical designs

Horror stories:• Stuxnet and the Iranian centrifuges (Siemens control system)• Saudi Aramco hack of 35000 computers (Windows back office)

Page 20: Safety and security in distributed systems

human factors

How to minimize the probability?

Mistakes occur everywhere• Specification• Design• Implementation• Deployment• Operations

Humans make mistakes• By commission • By omission• By carelessness

Page 21: Safety and security in distributed systems

blurred boundaries

Conflicting interests, divergent situational understanding acrossdisciplines and roles.

Architects thinks and designs in terms of hierarchy and layering

Programmers thinks and designs in terms of threads of execution

Users need systems that works and solves a real world problems

Operations needs to get the job done

Page 22: Safety and security in distributed systems

identity

How to ensure that a thing or person is the one theyclaim to be?

What are the impacts on- Security

- Safety

- Integrity

- Availability

- Reliability

Page 23: Safety and security in distributed systems

Tools

Page 24: Safety and security in distributed systems

systems engineeringArchitecture centric• Design• Implementation• Deployment• Usage

Risk based• Requirements• Design• Implementation• Commissioning• Usage

Holistic and remember higher order effects

Page 25: Safety and security in distributed systems

safety & security architecture

Separation and protection of critical functions

Page 26: Safety and security in distributed systems

Human brain - planets most sophisticatedand vulnerable decision maker

human factors

• Emotions trumps facts (irrationality)

• Limited processing capacity

• Need to rest, easily bored

• Inconsistency across exemplars

• Creative, easily distracted

• Values (ethics and morale)

• Mental illness

Address our inherent weaknesses from day one

Page 27: Safety and security in distributed systems

• I have to make frequent decisions and many of them depend upon readings from sensors that can be correct, noisy, random, unavailable, or in some other state.

• The decisions I have to make often have safety consequences, they certainly have economic consequences, and some are irreversible.

• At any point in time there may be three or four actions I could take based on my sense of what’s happening on the rig

• I would like better support to determine how trustworthy my readings are, what the possible situations are and the consequences of each action.

What is the best actionto take?

enhance human decision making

Page 28: Safety and security in distributed systems

use and adhere to standards

IEC 61508 Functional safety of safety instrumented systems for the process industry sector

IEC 61511 Safety instrumented systems for the process industry sector

DO-178C Software considerations in airborne systems and equipment certification

The good thing about standards is that there are so many to choose fromAndrew S. Tanenbaum

Not sufficient on their own

Represents insights

Must be tailored to be useful

Page 29: Safety and security in distributed systems

build & use safety (security) cases

Thanks to professor Tim Kelly @ University of York

Page 30: Safety and security in distributed systems

Summary

Page 31: Safety and security in distributed systems

summaryHeading toward a world of interconnected every-things

Some of these things support hazardous industries and critical functions

Exposed to the inherent vulnerabilities in computers and software

Hazardous industries need high-integrity systems

Non-critical software practice fails for critical systems Rigorous Systems Engineering, Safety & Security Architecture and Standards

Human factors must be addressed from day oneThrough engineering and operations and use

Page 32: Safety and security in distributed systems

Safety and security in distributed systems

Einar LandreLeaderE-mail [email protected]: +4741470537

www.statoil.com

Thank you