ai for genomics - scripps research · 2019. 3. 28. · li yin (scripps) alex wells (stanford)...

22
AI for Genomics Amalio Telenti Scripps Research Translational Institute and Department of Integrative Structural and Computational Biology The Scripps Research Institute La Jolla, CA

Upload: others

Post on 23-Mar-2021

6 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

AI for GenomicsAmalio Telenti

Scripps Research Translational Institute and Department of Integrative Structural and Computational Biology

The Scripps Research InstituteLa Jolla, CA

Page 2: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Nvidia

Page 3: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

https://verhaert.com/wp-content/uploads/2018/05/Machine-vs-Deep-Learning-InputOutput.jpg

Page 4: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

•Explain

•Classify

•Predict

•Establish Causality

Page 5: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

2001 2003• HumanGenome

• Encodeproject

~2014

• Nucleosome3Dgenome

~2015

• Deeplearningingenomics• Functionalscreens(CRISPRandparallelreporters)

~2016

• Largescaledeepsequencingofthehumanpopulation

gnomAD|genomeAggregationDatabase

2008• GTEX

Telenti et al, Hum Mol Genet 2018

AI for genomics

Page 6: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Zou et al, Nat Genet 2019

A primer on deep learning in genomics

Page 7: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

A primer on deep learning in genomics

Zou et al, Nat Genet 2019

Page 8: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Two User Cases

Step 1

Step 2

Page 9: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

https://verhaert.com/wp-content/uploads/2018/05/Machine-vs-Deep-Learning-InputOutput.jpg

Page 10: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Machine learning model of non-coding genome essentiality

Wells…..di Iulio, BioRxiv 2018

Page 11: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Machine learning model of non-coding genome essentiality

20% of data HGMD new set ncRNA pathogenic variants

Page 12: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Curated Mendelian non-coding variants

Wells…..di Iulio, BioRxiv 2018

Page 13: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Identification of essential regulatory elements

Page 14: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Two User Cases

Step 1

Step 2

Page 15: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

https://verhaert.com/wp-content/uploads/2018/05/Machine-vs-Deep-Learning-InputOutput.jpg

Page 16: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Transfer learning

Gu, di Iulio

• Transfer learning from human to mouse and other mammals• Goal is to provide alignment-free universal model for eukaryotic genomes

Deep Learning (CNN)Training with

human promoters

Transfer to mouse

promoters

Page 17: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Notable progress

•New biology•Better processivity of data

Page 18: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Predicting Splicing from Primary Sequence withDeep Learning

• A deep neural network models mRNA splicing

• Accurately predicts noncoding cryptic splice mutations

• Estimates that ~10% of pathogenic mutations in patients with rare genetic disorders are caused by this mechanism

Jaganathan et al., 2019, Cell 176, 1–14

Page 19: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Imputation run time: Deep learning approach versus common statistic methodology

Raquel Dias & Ali Torkamani, Scripps

Page 20: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Where to find the new AI genome-wide scores?

Li Yinomni.telentilab.com

Page 21: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

Conclusions and perspectives”Application of deep learning to genomic datasets is primed to revolutionize genome analysis” (Editorial Nat Genet 2019).

• How to design deep learning systems that support medical decisions (for example, genome interpretation)? • How to avoid biases in training sets and how to interpret

predictions?• There is a need for iterative experimentation, in which deep

learning predictions can be validated by functional laboratory tests or by formal clinical assessment.

Page 22: AI for Genomics - Scripps Research · 2019. 3. 28. · Li Yin (Scripps) Alex Wells (Stanford) ZijingGu (Scripps) Shang-Fu Chen (Scripps) Raquel Dias (Scripps) Ali Torkamani (Scripps)

AcknowledgementsJulia di Iulio (Scripps)Li Yin (Scripps)Alex Wells (Stanford)Zijing Gu (Scripps)Shang-Fu Chen (Scripps)Raquel Dias (Scripps)Ali Torkamani (Scripps)Kyle Farh (Illumina)