prediction of disease by pathway-based integrative genomic...

16
Harvard-MIT Division of Health Sciences & Technology Prediction of Disease by Pathway-Based Integrative Genomic and Demographic Analysis Skanda Koppula 14 , Amin Zollanvari 123 , Gil Alterovitz 1234* PRIMES Conference May 18, 2013 1 Center for Biomedical Informatics, Harvard Medical School [Boston, MA 02115]. 2 Children’s Hospital Informatics Program at Harvard-MIT Division of Health Science [Boston, MA 02115]. 3 Partners Healthcare Center for Personalized Genetic Medicine [Boston, MA 02115]. 4 Dept.of Electrical Engineering and Computer Science at MIT [Cambridge, MA 02139]. * Corresponding author. Contact: [email protected]

Upload: others

Post on 09-Jan-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Prediction of Disease by Pathway-Based Integrative

Genomic and Demographic Analysis

Skanda Koppula14, Amin Zollanvari123,

Gil Alterovitz1234*

PRIMES Conference

May 18, 2013

1 Center for Biomedical Informatics, Harvard Medical School [Boston, MA 02115].

2 Children’s Hospital Informatics Program at Harvard-MIT Division of Health Science [Boston, MA 02115].

3 Partners Healthcare Center for Personalized Genetic Medicine [Boston, MA 02115].

4 Dept.of Electrical Engineering and Computer Science at MIT [Cambridge, MA 02139].

* Corresponding author. Contact: [email protected]

Page 2: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Introduction

Why prediction-based analysis of data?

Flexible model types

Gauge effect of feature on phenotype

…effective diagnostic tools!

Page 3: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Introduction

Why prediction-based analysis of data?

Flexible model types

Gauge effect of feature on phenotype

…effective diagnostic tools!

Try analysis on a different level!

SNP 1

SNP 2 Gene A

SNP 3

SNP 4 Gene B

Pathway X

Page 4: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Introduction

Why prediction-based analysis of data?

Flexible model types

Gauge effect of feature on phenotype

…effective diagnostic tools!

Try analysis on a different level?

Use inter-gene relations!

No black-box around disease mechanism

More knowledge about features with no data

Page 5: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Introduction

Why prediction-based analysis of data?

Flexible models [data type, number of features]

Easy to measure effect of feature on phenotype

Effective diagnostic tool

Try analysis on a different level?

Pathway-based predictive models

Page 6: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Predictive Framework :

TAN and Naïve Bayes

Page 7: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Alcoholism

2.5 million 14%

“increasing consumption of alcohol even in face of adverse consequences”

twin adoption

studies environmental

studies

The datasets:

• COGA (1653 patients)

• COGEND (1350 patients)

Page 8: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

- KEGG [Kyoto Encyclopedia of

Genes and Genomes]

- GO [Gene Ontology] …

Page 9: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Page 10: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Genetic-Only Model

Alcoholism

Immune INTERFERON_GAMMA_PRODUCTION INTERFERON_GAMMA_BIOSYNTHETIC_PROCESS REGULATION_OF_INTERFERON_GAMMA_BIOSYNTHETIC_PROCESS POSITIVE_REGULATION_OF_CYTOKINE_BIOSYNTHETIC_PROCESS DEFENSE_RESPONSE_TO_VIRUS IMMUNE_EFFECTOR_PROCESS

Peptide Metabolism BIOGENIC_AMINE_METABOLIC_PROCESS

AMINO_ACID_DERIVATIVE_METABOLIC_PROCESS PEPTIDE_METABOLIC_PROCESS

KEGG_ARGININE_AND_PROLINE_METABOLISM

Cardiovascular KEGG_VIRAL_MYOCARDITIS

KEGG_DILATED_CARDIOMYOPATHY

Absorption and Excretion KEGG_PROXIMAL_TUBULE_BICARBONATE_RECLAMATION INORGANIC_ANION_TRANSPORT ALCOHOL_METABOLIC_PROCESS

Nervous System CENTRAL_NERVOUS_SYSTEM_DEVELOPMENT

BRAIN_DEVELOPMENT

Page 11: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Genetic-Only Model

Alcoholism

Immune INTERFERON_GAMMA_PRODUCTION INTERFERON_GAMMA_BIOSYNTHETIC_PROCESS REGULATION_OF_INTERFERON_GAMMA_BIOSYNTHETIC_PROCESS POSITIVE_REGULATION_OF_CYTOKINE_BIOSYNTHETIC_PROCESS DEFENSE_RESPONSE_TO_VIRUS IMMUNE_EFFECTOR_PROCESS

Peptide Metabolism AMINO_ACID_DERIVATIVE_METABOLIC_PROCESS

PEPTIDE_METABOLIC_PROCESS BIOGENIC_AMINE_METABOLIC_PROCESS

KEGG_ARGININE_AND_PROLINE_METABOLISM

Cardiovascular KEGG_VIRAL_MYOCARDITIS

KEGG_DILATED_CARDIOMYOPATHY

Absorption and Excretion KEGG_PROXIMAL_TUBULE_BICARBONATE_RECLAMATION INORGANIC_ANION_TRANSPORT ALCOHOL_METABOLIC_PROCESS

Nervous System CENTRAL_NERVOUS_SYSTEM_DEVELOPMENT

BRAIN_DEVELOPMENT

Page 12: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Genetic-Demographic Model

Race

Sex Income

Location of Childhood Home

Sexually Abused as Child

Level of Education

Experienced non-physical trauma Neglected as Child Experienced Sexual Trauma

Height Age Frequency with which attends religious services Weight

ROC > 0.55

ROC < 0.55

Page 13: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Genetic-Demographic Model

Increase due to more # features?

No! Replacement increases accuracy by 2.8%

Why?

Genes and demo. factors boost each other

Inorganic Anion Transport contains {CLCNX gene group}

on X-chromosome

Page 14: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Lung Cancer

Pathway AUROC

Estrogen receptor regulation (carm1 and -er) 0.75

Eukaryote Translation Initiation Factor (eif4, eif2) 0.73

rnaPathway 0.73

ST_Tumor_Necrosis_Factor_Pathway 0.72

vegfPathway 0.67

MAP00010_Glycolysis_Gluconeogenesis 0.66

P53_UP 0.66

Page 15: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Next Steps

1. Insight from inter-feature relationships?

2. Application for layman to use predictive framework?

3. In vitro validation of identified pathways

4. Other learning structures?

Page 16: Prediction of Disease by Pathway-Based Integrative Genomic ...math.mit.edu/research/highschool/primes/materials/2013/conf/11-3-Koppula.pdfPrediction of Disease by Pathway-Based Integrative

Harvard-MIT Division of

Health Sciences & Technology

Acknowledgements

PRIMES program for providing me with this opportunity

Dr. Gerovitch, Professor Etingof, and Professor Khovanova

Professor Alterovitz

NIH Grants:

5R21DA025168-02 (G. Alterovitz)

1R01HG004836-01 (G. Alterovitz)

4R00LM009826-03 (G. Alterovitz)

Thank You! Questions?