machine learning for prediction of cancer drug response•enable high productivity for deep learning...

75
Machine Learning for Prediction of Cancer Drug Response Rick Stevens Argonne National Laboratory The University of Chicago Crescat scientia; vita excolatur

Upload: others

Post on 21-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Machine Learning for Prediction of Cancer Drug Response

Rick StevensArgonne National Laboratory

The University of ChicagoCrescat scientia; vita excolatur

Page 2: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

CANcer Distributed Learning Environment (CANDLE)DOE-NCI partnership to advance

exascale development through cancer

research

NCINational Cancer

InstituteDOEDepartment

of EnergyCancer driving

computing advances

Computingdriving cancer

advances

2 2

Rick Stevens and Tom Brettin

Argonne National Laboratory

University of Chicago

“... But the true method of experience on the contrary first lights

the candle, and then by means of the candle shows the way;

commencing as it does with experience duly ordered and digested, not bungling or erratic, and from it educing axioms, and from established axioms again new experiments;”

―Francis Bacon in Novum Organum

August 28, 2018

Presented to:ECP AD KPP Review

Page 3: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

The NCI-DOE Partnership is Extending the Frontiers of Precision Oncology (Three Pilots)

• Cancer Biology – Molecular Scale Modeling of RAS Pathways– Unsupervised Learning and Mechanistic models– Mechanism Understanding and Drug Targets

• Pre-clinical Models – Cellular Scale PDX and Cell Lines– ML, Experimental Design, Hybrid Models– Prediction of Drug Response

• Cancer Surveillance – Population Scale Analysis– Natural Language and Machine Learning– Agent Based Modeling of Cancer Patient Trajectories

3

Page 4: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 5: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

What Is CANCER?

Page 6: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

What is Cancer?

• Large number of complex diseases• Each behave differently depending on cell type from which

originate• Age on onset, invasiveness, response to treatment

• Common General Properties• Abnormal cell growth/division (cell proliferation)• Malignant tumors• Spread to other regions of body (metastasis)

Page 7: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 8: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 9: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Match Normal Pairs (GDC) showing translationIn Gene Expression Feature Space

N

TN T

N

T

N

T

Mutations that Change Cell Behavior

Page 10: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Colon-Rectal

UterusNormal Tissue

Tumor TissueNormal Tissue

Tumor Tissue

Page 11: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 12: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 13: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

IEEE_J_Biomed_Health_Inform_2015_Sheng.pdf

Drug Response is specific to Cancer (tissue) type and specific genetic variance in each tumor

Tum

ors C

lust

ered

by

Resp

onse

Drugs

Green meansSensitive

Red meansResistant

Page 14: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Patient Derived Xenograft ModelsPilot 1

14

Cancer Cell Lines

CL and PD Xenografts

Page 15: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Machine Learning In Cancer Research• Cancer Susceptibility• Cancer Detection and Diagnosis• Cancer Recurrence• Cancer Prognosis and Survival• Cancer Classification and Clustering• Cancer Drug Response Prediction• Cancer Genomics Analysis• Cancer Medical Records Analysis• Cancer Biology

Page 16: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Deep Learning in Cancer ⟹ many Methods• AutoEncoders – learning data representations for

classification and prediction of drug response, molecular trajectories

• VAEs and GANs – generating data to support methods development, data augmentation and feature space algebra, drug candidate generation

• CNNs – type classification, drug response, outcomes prediction, drug resistance

• RNNs – sequence, text and molecular trajectories analysis

Page 17: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Machine Learning In Cancer Research• Cancer Susceptibility• Cancer Detection and Diagnosis• Cancer Recurrence• Cancer Prognosis and Survival• Cancer Classification and Clustering• Cancer Drug Response Prediction• Cancer Genomics Analysis• Cancer Medical Records Analysis• Cancer Biology

Page 18: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Dose Response and Therapeutic WindowsWe want to predict the growth rate for given drug and dose and eventually therapeutic windows

Page 19: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Modeling Cancer Drug Response

19

! = "(#, $)

gene expression levelsSNPsprotein abundancemicroRNAmethylation

IC50GI50% growthZ-score

descriptorsfingerprintsstructuresSMILESdose

Drug (s)

→Tumor

Response

! = "(#, $1 , $2)

Page 20: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Top 29 Single Drug ML Models

Many Methods

(No Free Lunch)

Page 21: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Ensemble ML Model for PredictingNarciclasineResponse

N=741AUC ~0.885-Fold Cross Validation Accuracy ~83.6%

Page 22: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 23: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

S100 Calcium Binding Protein A13Involved in the regulation of a number of cellular processes such as cell cycle progression and differentiation

Page 24: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

S100 Calcium Binding Protein A13 Higher Expression Levels ⟹ Lowered Response to Drug

Response Expression Level

Page 25: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

S100 Calcium Binding Protein A13

Renal Cancer1.97e-4 (Prognostic, unfavourable)

Page 26: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 27: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

We a single model that can train on data from many cancer samples, many drugs and can predict drug response across wide range of tumors and drug combinations

Page 28: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Other learningalgorithms

Page 29: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Model Accuracy as a Function of Training Set Size

MINST (10 digit) Accuracy for Exponentially Increasing Dataset Sizes

MNIST Dataset

10 digits

6,000 examples of each

best models are 99%+

Note: Log Scale

Page 30: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

How much data do we need?• Three heuristics that are sometimes used• X times the number of classes in the data {X ~ 1000, X ~ 10,000}

(Drugs x Cancer Samples x Dose x Response categories) (1000 x 1000 x 10 x ) ~ 100M - ~1B• X times the number of features{ X ~100, X ~1000}

(10,000 x 100) ~ 1M - ~10M• X times the number of model parameters { X ~ 10, X ~100}

(10M x 10 ) ~ 100M to (100M x 100) ~ 10B

• 1M to 10B training examples

• Current training sets are in the low end of this range

Page 31: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 32: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Deep Learning Model for Drug Pair Response

Page 33: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Drug “Synergy”Computational and Structural Biotechnology Journal 13 (2015) 504–513

Page 34: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Deep Learning Model for Drug Pair Response

DNN Model explains 94% of the variance

Page 35: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Do ML models transfer across studies?Do ML models transfer across “bio” model Types?Transfer Learning and Model Transfer

“transfer learning” is using training data from another (possibly

related) area to accelerate training and improve generalizability,

generally requires additional training in the target domain

“model transfer” is using models trained in one area to predict in

another without tuning in the target domain

Until we have sufficient data from PDXs to “tune” models trained

with CL we are in strong “model transfer” regime JNCI_J_Natl_Cancer_Inst_2013_Gillet.pdf

Page 36: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Cross Study Validation Targets (Cell Lines)

Page 37: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Batch Effects Removal

Page 38: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Cross Study Validation – Models are trained on one study and predict on the other studies not used in training (strong model transfer)

Page 39: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Can we create a unified deep learning model to solve tasks across multiple domains?

One Model To Learn Them All

Page 40: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 41: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

1006

Baseline Random Forest Cross-Study RunBest out of study R2 = 0.45

Page 42: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

UnoMT Multitask Deep Learning Cross-StudyBest out of Study R2 = 0.61

19.2

aux tasks

Page 43: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

InterestingBiology

ModelUncertainty

Machine Learning Models with UQ

[High-]Throughput Experiments

AdditionalTraining Data

Training Data

Page 44: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

P1 Challenge Problem Workflow(s) Specification

Data PreparationBatch NormalizationData Augmentation

Outlier RemovalScaling/Quantization

Concordance Processing

Model DiscoveryResidual Networks

Convolution

Multitask Networks

Population Based HPO

Training Inference

Outputs

Ensembles

Domain Adaptation

Cross-validation

UQ

Source – Target Pairs

Drug Combinations

Accuracy / K-rank / R2

Feature importanceFactorial Design

Learning Curves

Confidence Scoring

Performance Analysis

Transfer Learning

Page 45: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

P1 Challenge Problem Workflow(s) Specification

Data PreparationBatch NormalizationData Augmentation

Outlier RemovalScaling/Quantization

Concordance Processing

Model DiscoveryResidual Networks

Convolution

Multitask Networks

Population Based HPO

Training Inference

Outputs

Ensembles

Domain Adaptation

Cross-validation

UQ

Source – Target Pairs

Drug Combinations

Accuracy / K-rank / R2

Feature importanceFactorial Design

Learning Curves

Confidence Scoring

Performance Analysis

Transfer Learning

105 – 106

units of work

105 – 106

units of work

106 – 108

units of work

Page 46: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

CANDLE Challenge Problem Statement

Enable the most challenging deep learning problems in Cancer research to be pursued on the most capable supercomputers in the DOE

Page 47: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

ECP-CANDLE: CANcer Distributed Learning EnvironmentCANDLE Approach

Develop an exscale deep learning environment for cancer

Build on open source deep learning frameworks

Optimize for CORAL and Exascale platforms

Support all three pilot project needs for deep dearning

Collaborate with DOE computing centers, HPC vendors and ECP co-design and software technology projects

47

Page 48: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

CANDLE Components• CANDLE Python Library – make it easy to run on DOE Big Machines,

scale for HPO, UQ, Ensembles, Data Management, Logging, Analysis• CANDLE Benchmarks – exemplar codes/models and data representing

the three primary challenge problems • Runtime Software – Supervisor, Reporters, Data Management, Run

Data Base• Tutorials – Well documented examples for engaging the community• Contributed Codes – Examples outside of Cancer, including Climate

Research, Materials Science, Imaging, Brain Injury• Frameworks – Leverage of Tensorflow, Keras, Horovod, LBANN, etc.• LL Libraries – CuDNN, MKL, etc. (tuned to DOE machines)

Page 49: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

CANDLE Target Open Source Frameworks

Page 50: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Candle Functional Targets• Enable high productivity for deep learning centric workflows

• Support Key DL frameworks on DOE supercomputers (Keras, TF,

Mxnet, CNTK)

• Support multiple paths to concurrency (Ensembles, Data and

Model Parallel)

• Manage training data, model search, scoring, optimization,

production training and inference (End-to-End Workflow)

• CANDLE runtime/supervisor (interface with batch schedulers)

• CANDLE Python library for improving model development (UQ,

HPO, CV, MV)

• Well documented open examples and tutorials on Github

• Leverage as much open source as possible (build only what we

need to add to existing frameworks)

Page 51: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

7 10 CANDLE Benchmarks

Benchmark Type Data ID ODSample

SizeSize of

Network

Additional (activation, layer

types, etc.)

1. P1: B1 Autoencoder MLP RNA-Seq 105 105 15K 5 layers Log2 (x+1) à [0,1] KPRM-UQ

2. P1: B2 Classifier MLP SNP àType

106 40 15K 5 layers Training Set Balance issues

3. P1: B3 Regression MLP+LCN expression; drug descs

105 1 3M 8 layers Drug Response[-100, 100]

4. P2: B1 Autoencoder MLP MD K-RAS 105 102 106-108 5-8 layers State Compression

5. P2: B2 RNN-LSTM RNN-LSTM MD K-RAS 105 3 106 4 layers State to Action

6. P3: B1 RNN-LSTM RNN-LSTM Path reports

103 5 5K 1-2 layers Dictionary 12K +30K

7. P3: B2 Classification CNN Path reports

104 102 105 5 layers Biomarkers

Benchmark Owners:• P1: Fangfang Xia (ANL)• P2: Brian Van Essen (LLNL)• P3: Arvind Ramanathan (ORNL)

51

https://github.com/ECP-CANDLE

Page 52: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

BFP16 Probably as Good as FP32 for Training

Page 53: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

P1B3 Operation Profile

Page 54: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

P1B3 Matrix Sizes and Times for One Pass on x86

Page 55: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

CANDLE System Architecture

CANDLE Supervisor

Workflow Manager(Swift-T EMEWS)

ALCF Theta, Cooley

NERSCCori

OLCFTitan,

SummitDev

Hyperparameter Optimization FrameworksHyperopt, mlrMBO, Spearmint

BenchmarksDatasetsModels

ExperimentsRuns

Metadata Store Model Store

Data API

Model Descriptions

Model Weights

CANDLE Database Integrator Website

Hardware Resources

Benchmark Spec

HyperparameterSpec

Hardware Spec

CANDLE Specifications

ML/DL Benchmarks

Pilot 1 Pilot 2 Pilot 3

Page 56: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

GitHub and FTP

• ECP-CANDLE GitHub Organization:• https://github.com/ECP-CANDLE

• ECP-CANDLE FTP Site:• The FTP site hosts all the public datasets for the benchmarks

http://ftp.mcs.anl.gov/pub/candle/public/

Page 57: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Basic Take Away Points• Cancer changes cell behavior. We can assay gene expression, SNPs, protein abundance,

etc. to characterize these changes. Assays are averaged over the cells in a sample.

• Molecular assay data can be used to predict properties of patient tumors: Cancer Type, Cancer Site, Normal vs Tumor etc. (These predictors are quite accurate when trained on large-scale (GDC) data 98%-99% accurate)• Model systems (Cell Lines, Organoids, Xenografts) resemble patient tumors from gene

expression profiles and are assayed in the same way.

• Drug responses of “Biological” Models are similar to patient response in some cases (how similar is open question)

• Machine Learning Models can predict drug response of “Biological” Models• ML can be used to generate and screen drugs for development

• ML eventually can be used to select drugs and drug combinations for a patients

Page 58: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

AcknowledgementsMany thanks to DOE, NSF, NIH, DOD, ANL, UC, Moore Foundation, Sloan Foundation, Apple, Microsoft, Cray, Intel and IBM for supporting my research group over the years

Page 59: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 60: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Three Approaches to Uncertainty Quantification

• Train on distributions and predict distributions

•Bootstrap with ensembles during training

•Dropout during inference as a Bayesian approximation(Yarin Gal, University of Cambridge)

Page 61: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 62: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Intuition behind UQ (Gaussian Process Models)

Page 63: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Bootstrapping UQ in Deep Neural Networks

Gerhard Paass, Assessing and Improving Neural Network Predictions by the Bootstrap Algorithm, NIPS, 659

Page 64: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Combined Synergy and Uncertainty Map

Page 65: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Hybrid Models in Cancer Cancer Discov; 5(3); 237-8

• Student-Teacher Approach to use Mechanistic Models as Oracles for a subset of the conditions (i.e. where we have very good predictive skill for a sub-problem)

• Integration of pathway Information as constraints/hints to the machine learning models either explicitly or implicitly

• To fill-in gaps in the mechanistic models with machine learned functions

Page 66: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Dropout!

Page 67: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Dropout vs Bootstrap

Page 68: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

0 0.05 0.1 0.15 0.2 0.25std predicted over 100 bootstrap

0.06

0.08

0.1

0.12

0.14

0.16

0.18

0.2

0.22

0.24

95%

CI f

or o

bser

ved-

(mea

n pr

edic

ted)

102 103 104 105

number of predictions ranked by std over 100 bootstrap

0.06

0.08

0.1

0.12

0.14

0.16

0.18

0.2

0.22

0.24

95%

CI f

or o

bser

ved-

(mea

n pr

edic

ted)

Highly confident predictions (small bootstrap std) have high accuracy (with high confidence the predictions are in a small interval around the true value).

Order coherence and calibration

Page 69: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Three Approaches to Uncertainty Quantification• Train on distributions and predict distributions

•Bootstrap with ensembles during training

•Dropout during inference as a Bayesian approximation(Yarin Gal, University of Cambridge)

Page 70: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,
Page 71: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Intuition behind UQ (Gaussian Process Models)

Page 72: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Bootstrapping UQ in Deep Neural Networks

Gerhard Paass, Assessing and Improving Neural Network Predictions by the Bootstrap Algorithm, NIPS, 659

Page 73: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Dropout!

Page 74: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

Dropout vs Bootstrap

Page 75: Machine Learning for Prediction of Cancer Drug Response•Enable high productivity for deep learning centric workflows •Support Key DL frameworks on DOE supercomputers (Keras, TF,

0 0.05 0.1 0.15 0.2 0.25std predicted over 100 bootstrap

0.06

0.08

0.1

0.12

0.14

0.16

0.18

0.2

0.22

0.24

95%

CI f

or o

bser

ved-

(mea

n pr

edic

ted)

102 103 104 105

number of predictions ranked by std over 100 bootstrap

0.06

0.08

0.1

0.12

0.14

0.16

0.18

0.2

0.22

0.24

95%

CI f

or o

bser

ved-

(mea

n pr

edic

ted)

Highly confident predictions (small bootstrap std) have high accuracy (with high confidence the predictions are in a small interval around the true value).

Order coherence and calibration