humans to the rescue: troubleshooting ai systems with human-in-the-loop

20
Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop Ece Kamar Senior Researcher, Microsoft Research AI [email protected]

Upload: crowdsourcing-week

Post on 22-Jan-2018

242 views

Category:

Technology


4 download

TRANSCRIPT

Page 1: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Humans to the Rescue: Troubleshooting

AI Systems with Human-in-the-loop

Ece Kamar

Senior Researcher, Microsoft Research AI

[email protected]

Page 2: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Exciting Times

Page 3: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

AI and the Crowd

training data

accuracy

test data

Page 4: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Power of Data

[Banko&Brill, 2001]

Page 5: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

In the Wild

Page 6: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

In the Wild

Page 7: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Hybrid Intelligence

Human Intelligence

AI Systems

Page 8: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

AI Applied to Critical Domains

Page 9: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Power of the Hybrid

[Courtesy of Murray Campbell]

Page 10: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Troubleshooting of ML Systems

training data

accuracy

test data

querysystem

response

execution

data

In the lab

In the wild

What is the performance in the wild?

How does the system fail?

Why does the system fail?

How the system can be improved?

Page 11: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Biases in ML

[Lakkaraju, K., Caruana, Horvitz; AAAI 2017]

Page 12: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Biases in ML

[Lakkaraju, K., Caruana, Horvitz; AAAI 2017]

Page 13: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Biases in ML

[Lakkaraju, K., Caruana, Horvitz; AAAI 2017]

Page 14: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Where do Blind Spots Come From?

M

cats

dogs

cat

(conf = 0.96)

Unknown unknowns: Data points with confident but incorrect predictions.

Blind-spots: Feature spaces with high concentration of unknown unknowns

Page 15: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Blind-spots Detection

execution data

Beat the Machine [Attenberg, Ipeirotis, Provost, 2011]

Exploration of Unknown Unknowns[Lakkaraju, K., Caruana, Horvitz, 2011]

Step 1:

Descriptive

Space

Partitioning

execution data

Step 2:

Multi-armed

Bandit

based

Exploration

Page 16: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Troubleshooting Complex Systems

Page 17: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Challenge

Possible fixes

for each

component

Limited development time

Where to invest

development time for

biggest impact?

Page 18: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Human-assisted troubleshooting methodology

system

outputComponent

1

Component

2

Component

3

I/OI/O

Evalu

ation

Failures

Fixes

[Nushi, K., Kossmann, Horvitz, 2011]

Page 19: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Complex Issues

Fairness Biases

TransparencyResponsibility

Good vs. Bad

Policy & Law

Page 20: Humans to the Rescue: Troubleshooting AI Systems with Human-in-the-loop

Complex challenges

require collective efforts

No AI is perfect