reflections on safety and artificial intelligence braking system brakes held "off"...

101
Eric Horvitz Reflections on Safety and Artificial Intelligence Pittsburgh, PA June 27, 2016 Exploratory Technical Workshop on Safety and Control for AI Carnegie Mellon University

Upload: hoanghuong

Post on 28-Mar-2018

217 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Eric Horvitz

Reflections on Safety

and Artificial Intelligence

Pittsburgh, PAJune 27, 2016

Exploratory Technical Workshop on Safety and Control for AICarnegie Mellon University

Page 2: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

AI & Safety

Constellation of methods referred to as Artificial Intelligence will touch our lives more closely and intimately

AI moving into high-stakes applicationsHealthcareTransportationFinancePublic policyDefense

Much to do on principles, methods, and best practices

Page 3: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Safe AI Systems

Relevance of Multiple Subdisciplines

Page 4: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Safe AI Systems

Relevance of Multiple Subdisciplines

Planning

SensorFusion

ControlTheory

Machine Learning

Page 5: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Safe AI Systems

Relevance of Multiple Subdisciplines

Planning

SensorFusion

ControlTheory

Machine Learning

Metareasoning

Page 6: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Safe AI Systems

Relevance of Multiple Subdisciplines

MultiagentSystems

Mechanism Design

Planning

SensorFusion

ControlTheory

Machine Learning

Metareasoning

Page 7: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Safe AI Systems

Relevance of Multiple Subdisciplines

MultiagentSystems

Mechanism Design

Planning

SensorFusion

ControlTheory

Machine Learning

Metareasoning

RobustOptimization

Verification

Security

Page 8: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Safe AI Systems

Relevance of Multiple Subdisciplines

MultiagentSystems

Mechanism Design

Planning

SensorFusion

ControlTheory

Machine Learning

Metareasoning

RobustOptimization

HCI

Verification

Security

Page 9: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Safe AI Systems

Relevance of Multiple Subdisciplines

MultiagentSystems

Mechanism Design

Planning

SensorFusion

ControlTheory

Machine Learning

Metareasoning

RobustOptimization

HCI

Verification

Security

Safety-Critical Systems

Page 10: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

safetyˈsāftē/

noun1. the condition of being protected from or unlikely to cause danger, risk, or injury

Page 11: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

safety-critical ˈsāftēˌkridək(ə)l/

adjective1. systems whose failure could result in loss of life, significant property damage, or damage to the environment.

2. designed or needing to be fail-safe for safety purposes.

Page 12: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

adjectiveincorporating some feature for automatically counteracting the effect of an anticipated possible source of failure

noundevice or practice that, in the event of a failure, responds or results in a way that will cause no harm, or at least minimizes harm.

fail-safe \ˈfāl-ˌsāf\

Page 13: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Fail-safe

George Westinghouse, 1869Train braking system

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Page 14: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Fail-safe

George Westinghouse, 1869Train braking system

Fail-safe designAir brakes

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Page 15: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Fail-safe

George Westinghouse, 1869Train braking system

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

June 10, 1869 Union Station, Pittsburgh to Steubenville

Page 16: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Fail-safe

George Westinghouse, 1869Train braking system

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Fail-safe practiceFull-power throttle on arrested landing

Page 17: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Fail-safe

George Westinghouse, 1869Train braking system

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Fail-safe planFree return trajectory

Page 18: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Fail-safe

George Westinghouse, 1869Train braking system

Mechanism Practice Plan

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Fail-safe planFree return trajectory

Page 19: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Fail-safe

George Westinghouse, 1869Train braking system

Mechanism Practice Plan

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Page 20: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Fail-safe

George Westinghouse, 1869Train braking system

Mechanism Practice Plan

Monitoring

Brakes held "off" actively by healthy system

Brakes naturally resort to “on” if any failure of braking system

Page 21: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Growing interest in issues & directions with AI in real-world settings

Grappling with uncertainty and more general incompleteness

AAAI President’s address (2008), “Artificial Intelligence in the Open World.”

AAAI President’s address (2016), “Steps Toward Robust Artificial Intelligence.”

AI in the Open World

E. Horvitz. Artificial Intelligence in the Open World, AAAI President’s Address, Chicago, IL, July 2008.

T. Dietterich, Steps Toward Robust Artificial Intelligences, AAAI President's Address, Phoenix, AX. February, 2016.

Page 22: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Open-world complexity incomplete understanding

Uncertainties & poor-characterization of performancePoor operating regimes, unfamiliar situations

Special Considerations with AI

Page 23: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Open-world complexity incomplete understanding

Uncertainties & poor-characterization of performancePoor operating regimes, unfamiliar situations

Special Considerations with AI

Rich ontology of failures

Numerous failure modalities New attack surfaces (e.g., machine learning attack)Self-modification & gaming (e.g., modify reward fcn)Unmodeled influences

Page 24: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Open-world complexity incomplete understanding

Uncertainties & poor-characterization of performancePoor operating regimes, unfamiliar situations

Challenges of transfer across time & space

Special Considerations with AI

Rich ontology of failures

Numerous failure modalities New attack surfaces (e.g., machine learning attack)Self-modification & gaming (e.g., modify reward fcn)Unmodeled influences

Challenge of coordinating human-machine collaborations

Operational opacity

Page 25: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

AI & Open-World Complexity

Frame problem

How to tractably derive consequences of an action?

Qualification problem

Understanding preconditions required for actions to have intended effects

Ramification problem

Understanding all important effects of action

Page 26: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

AI & Open-World Complexity

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

Page 27: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

AI & Open-World Complexity

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

Page 28: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

AI & Open-World Complexity

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

Page 29: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Decision making under uncertainty & incompleteness

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Page 30: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Decision making under uncertainty & incompleteness

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Page 31: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Decision making under uncertainty & incompleteness

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Page 32: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Decision making under uncertainty & incompleteness

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Page 33: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 34: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 35: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 36: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 37: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 38: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 39: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Rise of probabilistic methods: known unknowns

Recent attention to unknown unknowns

AI & Open-World Complexity

Decision making under uncertainty & incompleteness

Page 40: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Fang, et al., 2015

Direction: Learn about abilities & failures

Performance

Successes & failures

p( fail | E, t)

Predictive model of

confidenceImage

H1

H2

H3

W1

W2

W3

W4

Input s

H3

Caption:

a man holding a tennis

racquet on a tennis court

H1

H2

H3

W1

W2

W3

Input t1

H3

W4

Deep learning about deep learning performance

Page 41: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

InferredState

InferenceReliability

Inference Inference

Performance

Successes & failures

p( fail | E, t)

Predictive model of

confidence

Direction: Learn about abilities & failures

Toyama & H. 2000

Page 42: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

InferredState

InferenceReliability

Performance

Successes & failures

p( fail | E, t)

Predictive model of

confidence

Direction: Learn about abilities & failures

Toyama & H. 2000

Page 43: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

InferredState

InferenceReliability

Inference Inference

Performance

Successes & failures

p( fail | E, t)

Predictive model of

confidence

Direction: Learn about abilities & failures

Toyama & H. 2000

Page 44: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Inference modality A

Inference modality B

Inference modality C

Direction: Robustness via analytical portfolios

Toyama & H. 2000

Page 45: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Robustness via analytical portfolios

Toyama & H. 2000

Page 46: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Robustness via analytical portfolios

Toyama & H. 2000

lights out

jolted camera

facing away

periph. distraction

back. subtract color based motion decay

Perceptual modalitiesUnmodeled situations in open world

Joint inference

Page 47: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Vary model structure, parameters, inferences

Direction: Understanding robustness via sensitivity analyses

Page 48: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Vary model structure, parameters, inferences

Direction: Understanding robustness via sensitivity analyses

Page 49: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Vary model structure, parameters, inferences

Direction: Understanding robustness via sensitivity analyses

Page 50: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Robust optimization to minimize downside

Tamar, 2015; Chow, et al., 2014; per Dietterich, AAAI lect. 2016

Robust optimization under uncertain parameters

Risk-sensitive objective

e.g., conditional-value-at-risk budget

Methods trade upside value

for reducing probability of

costly outcomes

Page 51: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Learn about unknown unknowns

Data, experience, rich simulations

Detect anomalies, unexpected variations, distributional shifts

Meta-analysis & transfer

Human engagement

Page 52: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Learn about unknown unknowns

Data, experience, rich simulations

Detect anomalies, unexpected variations, distributional shifts

Meta-analysis & transfer

Human engagement“Beat the Machine” (Attenberg, Ipeirotis, Provost 2015)

Page 53: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Learn about unknown unknowns

“Beat the Machine” (Attenberg, Ipeirotis, Provost 2015)

Data, experience, rich simulations

Detect anomalies, unexpected variations, distributional shifts

Meta-analysis & transfer

Human engagement

Page 54: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Predict new distinctions, combine open- & closed-world models

Direction: Learn about unknown unknowns

Krumm, H., 2006

Page 55: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Destinations, E1..,En, t

Day 1 Day 14

Predict new distinctions, combine open- & closed-world models

Direction: Learn about unknown unknowns

Krumm, H., 2006

Page 56: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Predict previously unseen destination

Destinations, E1..,En, t

Day 1 Day 14

Predict new distinctions, combine open- & closed-world models

Krumm, H., 2006

Direction: Learn about unknown unknowns

Page 57: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Predict previously unseen destination

Destinations, E1..,En, t

Day 1 Day 14

Predict new distinctions, combine open- & closed-world models

Direction: Learn about unknown unknowns

Krumm, H., 2006

Page 58: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Predict previously unseen destination

Destinations, E1..,En, t

Day 1 Day 14

Predict new distinctions, combine open- & closed-world models

Direction: Learn about unknown unknowns

Krumm, H., 2006

Page 59: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

2

1.5

x

1

0.5

00

0.5

y

11.5

20.2

0

-0.2

-0.4

-0.6

-0.8

-1

z

Direction: Joint modeling of key dimensions of error

Example: Learn about errors of perception & control

Probabilistic models of control

Probabilistic models of sensing 𝝋𝐨𝐛𝐬𝐭𝐚𝐜𝐥𝐞

𝝋𝐫𝐨𝐥𝐥

Sadigh & Kapoor, 2016

Page 60: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Proposed trajectory

Sample 1 Sample 2 Sample n

S.D.Mean

S.D.

𝝋𝐫𝐨𝐥𝐥

𝝋𝐨𝐛𝐬𝐭𝐚𝐜𝐥𝐞

Direction: Joint modeling of key dimensions of error

Sadigh & Kapoor, 2016

Page 61: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Proposed trajectory Trajectory safe if:

Σ

Σ Σ+> 1 − 𝜖

S.D.Mean

S.D.

Σ

𝝋𝐫𝐨𝐥𝐥

𝝋𝐨𝐛𝐬𝐭𝐚𝐜𝐥𝐞

Direction: Joint modeling of key dimensions of error

Sadigh & Kapoor, 2016

Page 62: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Proposed trajectory

Sample 1 Sample 2 Sample n

Trajectory safe if:

Σ

Σ Σ+> 1 − 𝜖

S.D.Mean

S.D.

Σ

𝝋𝐫𝐨𝐥𝐥

𝝋𝐨𝐛𝐬𝐭𝐚𝐜𝐥𝐞

𝐩 > 𝟏 − 𝝐

Direction: Joint modeling of key dimensions of error

Sadigh & Kapoor, 2016

Page 63: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Joint modeling of key dimensions of error

Value of refining models & system- Value of additional data- Value of enhancing sensors- Value of better controller

Sadigh & Kapoor, 2016

(video)

Page 64: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Joint modeling of key dimensions of error

𝐩 > 𝟏 − 𝝐

Sadigh & Kapoor, 2016

(video)

Page 65: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Joint modeling of key dimensions of error

𝐩 > 𝟏 − 𝝐

Fail-safe

Sadigh & Kapoor, 2016

(video)

Page 66: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Joint modeling of key dimensions of error

Sadigh & Kapoor, 2016 (video)

Page 67: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Verification, security, cryptography

Verification

Cryptography

Security

Safe AI Systems

Page 68: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Static analysis

Run-time verification

Whitebox fuzzing

Cybersecurity to protect attack surfaces

Appropriate use of physical security, isolation

Encryption for data integrity, protection of interprocess comms.

Direction: Verification, security, cryptography

Page 69: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Difficult to do formal analysis of large-scale system

Analysis & execution considers info. from running system

Direction: Runtime verification

Satisfy or violate desired properties?

Identify problem, future problem

Engage human

Take fail-safe action

Page 70: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

State

Direction: Metalevel analysis, monitoring, assurance

Environment

State

AI system

Page 71: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

State’

Page 72: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’

Page 73: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’

Page 74: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’

Reinforcement

Page 75: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

State

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward Reinforcement

State’ Perception

Page 76: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’ Perception

Reinforcement

Adversary

Page 77: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’ Perception

Reinforcement

Adversary

Self-modification

e.g., see: Amodei, Olah, et al., 2016

Page 78: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’ Perception

Reinforcement

Adversary

Reflective analysis• Operational faithfulness• Ensure isolation, detect mods• Identify external meddling

Page 79: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Run-time verificationStatic analysis

Learning

State

Direction: Metalevel analysis, monitoring, assurance

EnvironmentAction

Environment’

State

AI system

Reward

State’ Perception

Reinforcement

Adversary

Reflective analysis• Operational faithfulness• Ensure isolation, detect mods• Identify external meddling

Page 80: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

Models of human cognition

Transparency of state, explanation

Mastering coordination of initiatives

Page 81: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

Air France 447 (June 2009)

China Airlines 006 (Feb 1985)

747 dives 10,000 in 20 seconds. 5g, supersonic.

Unrecoverable stall.

Page 82: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

Air France 447 (June 2009)

China Airlines 006 (Feb 1985)

747 dives 10,000 in 20 seconds. 5g, supersonic.

Unrecoverable stall.

Page 83: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Human cognition

Machineintelligence

How to best work together for safety?

Rich spectrum of autonomy

Direction: Human-machine collaboration

Kamar, Hacker, H., 2012

Page 84: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Machine learning & inference

Human cognition

Machineintelligence

How to best work together for safety?

Rich spectrum of autonomy

Direction: Human-machine collaboration

Kamar, Hacker, H., 2012

Page 85: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Designs for mix of initiatives

Machine learning & inference

Human cognition

Machineintelligence

How to best work together for safety?

Rich spectrum of autonomy

Direction: Human-machine collaboration

Kamar, Hacker, H., 2012

Page 86: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

Page 87: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

Infer challenges with machine competency

Page 88: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Infer challenges with machine competency

Direction: Human-machine collaboration

Infer human attention

Page 89: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

p(a

tte

nti

on

sta

te|E

)

Time

Infer human attentionInfer challenges with machine competency

Page 90: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

Continual prediction of trajectories

p(a

tte

nti

on

sta

te|E

)

Time

Infer human attention

Page 91: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

Continual prediction of trajectories

p(a

tte

nti

on

sta

te|E

)

Time

Infer human attention

Page 92: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

Continual prediction of trajectories

p(a

tte

nti

on

sta

te|E

)

Time

Infer human attention

Page 93: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Direction: Human-machine collaboration

Safety-assuring mixed-initiative planner

- Driver’s attention over time

- Latency of human input

- Latency tolerance of situation

- Cost & influence of alerting driver

- Custom language, ongoing dialog

Gain driver attention t

Slow to defer need t’

Implement failsafe t’’

Page 94: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

• Phases of study, testing, reporting for rolling out new capabilities in safety-critical domains (akin to FDA clinical trials, post-marketing surveillance)

• Disclosure & control of parameters on failure rates, tradeoffs, preferences

• Transparency & explainability of perception, inference, action

• System self-monitoring & reporting machinery

• Isolation of components in intelligence architectures

• Detecting & addressing feedback of system’s influence on self

Direction: Develop Best Practices for Safe AI

Page 95: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

• Standard protocols for handoffs, attention, awareness, warning, in human-machine collaborations

• Policies for visible disclosure of autonomy to others (e.g., indication to others that a car is currently on automated policy)

• Fail-safe actions & procedures given predicted or sensed failures

• Enhancing robustness via co-design of environment & systems

• Testing for drift of assumptions, distributions in domains

• Special openness & adherence to best practices for data, learning, decision making for applications in governance & public policy

Direction: Develop Best Practices for Safe AI

Page 96: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Addressing concerns of public

Significant differences of opinion, including experts

Direction: Address concerns about “superintelligences”

Page 97: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Addressing concerns of public

Significant differences of opinion, including among experts

Direction: Address concerns about “superintelligences”

Page 98: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Alan Turing Script, BBC broadcast, 1951

Addressing concerns of public

Significant differences of opinion,

Direction: Address concerns about “superintelligences”

Page 99: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

Alan Turing Script, BBC broadcast, 1951

Addressing concerns of public

Significant differences of opinion,

Direction: Address concerns about “superintelligences”

“For it seems possible that once the machine thinking method had started, it would not take long to outstrip our feeble powers.

…they would be able to converse with each other to sharpen their wits.

At some stage therefore, we should have to expect the machines to take control in the way that is mention in Samuel Butler’s Erewhon.”

Alan Turing, 1951

Page 100: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe

• Do we understand possibilities?

• What kind of research should done proactively?

• Can we “backcast” from imagined poor outcomes

• Designs of clear ways to thwart possibilities, ease concerns

Addressing concerns of public

Significant differences of opinion, including experts

Direction: Address concerns about “superintelligences”

Page 101: Reflections on Safety and Artificial Intelligence braking system Brakes held "off" actively by healthy system rakes naturally resort to “on” if any failure of braking system Fail-safe