family-based analysis of genome-wide gene gene interactions… · i dr. andreas beyer i my...

33
Motivation Methods Results Family-based analysis of genome-wide gene × gene interactions Marit Ackermann Biotec TU Dresden July 9, 2009 Marit Ackermann Biotec TU Dresden Family-based analysis of genome-wide gene × gene interactions

Upload: others

Post on 22-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

  • Motivation Methods Results

    Family-based analysis ofgenome-wide gene × gene interactions

    Marit Ackermann

    Biotec TU Dresden

    July 9, 2009

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Motivation

    MethodsFamily-based Association TestExternal Data

    ResultsExampleDiscussion

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Outline

    Motivation

    MethodsFamily-based Association TestExternal Data

    ResultsExampleDiscussion

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Epistasis

    I Epistasis: interaction between two or more genes

    I known to be fundamental for the function of regulatorypathways in mammals

    I implies its importance for the development of complexdiseases such as cancer, Alzheimer‘s disease, diabetes

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Traditional Approaches

    I for yeast and worms large scale double knock-outs andknock-downs exist

    I linkage and association studies in mammals concentrate oneither single locus associations or interactions between fewpreselected loci

    I major reasons: non-availability of large and suitable data foranalysis of interaction effects, low power of the studies

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Genome-Wide Screen in Mammals

    I recent advances in biotechnology allow genome-widegenotyping of thousands of individuals→ can be used to study epistatic effects over wholegenome

    I genotyped individuals possibly related→ take population structure into account; even make use ofknown relationships

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Outline

    Motivation

    MethodsFamily-based Association TestExternal Data

    ResultsExampleDiscussion

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Family-based Association Test

    Outline

    Motivation

    MethodsFamily-based Association TestExternal Data

    ResultsExampleDiscussion

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Family-based Association Test

    Method

    I idea: two markers whosegenotypes are correlated arelikely to interact

    I measure association via χ2-testfor contingency table

    BB Bb bbAA nAABB nAABb nAAbbAa nAaBB nAaBb nAabbaa naaBB naaBb naabb

    I make use of family information to avoidspurious findings:compare observed allele combination withwhat could have been inherited from parents

    I additional correction for allelic drift

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Family-based Association Test

    Method

    I idea: two markers whosegenotypes are correlated arelikely to interact

    I measure association via χ2-testfor contingency table

    BB Bb bbAA nAABB nAABb nAAbbAa nAaBB nAaBb nAabbaa naaBB naaBb naabb

    I make use of family information to avoidspurious findings:compare observed allele combination withwhat could have been inherited from parents

    I additional correction for allelic drift

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Family-based Association Test

    Problem

    I extremely large number of interactions(example: 10,000 markers: ∼ 108 interactions)

    I leads to underpowered analysis, many false positive findings

    I need to complement with additional, external information

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Family-based Association Test

    Problem

    I extremely large number of interactions(example: 10,000 markers: ∼ 108 interactions)

    I leads to underpowered analysis, many false positive findings

    I need to complement with additional, external information

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    External Data

    Outline

    Motivation

    MethodsFamily-based Association TestExternal Data

    ResultsExampleDiscussion

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    External Data

    Databases

    I use public knowledge about gene × gene interactions toconfirm results;e.g. STRING: database of known and predicted physical andfunctional interactions

    I include information from regulatory pathways

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Outline

    Motivation

    MethodsFamily-based Association TestExternal Data

    ResultsExampleDiscussion

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Outline

    Motivation

    MethodsFamily-based Association TestExternal Data

    ResultsExampleDiscussion

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Data

    I Solberg, L.C. et al. (2006). A protocol for high-throughputphenotyping, suitable for quantitative trait analysis in mice.Mammalian Genome, 17, 129-146.

    I genotype data from more than 2000 outbred mice consistingof ∼ 12, 000 markers

    I only consider interactions on two different chromosomes

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Modified χ2-Test

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Modified χ2-Test

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Confirmation with STRING

    I fraction of SNP pairswith a low χ2 p-valuethat lie close tointeracting genes

    I proportion ofconfirmed interactionsshould increase withincreasing χ2 score

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Confirmation with STRING

    I fraction of SNP pairswith a low χ2 p-valuethat lie close tointeracting genes

    I proportion ofconfirmed interactionsshould increase withincreasing χ2 score

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Confirmation with STRING

    I fraction of SNP pairswith a low χ2 p-valuethat lie close tointeracting genes

    I proportion ofconfirmed interactionsshould increase withincreasing χ2 score

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Incorporating Pathway Information

    I interactions in one pathway can becrucial, e.g. when signal weakenedby two consecutive dysfunctionalpathway members

    I interactions between pathwaysindicate common endpoint

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Incorporating Pathway Information

    I interactions in one pathway can becrucial, e.g. when signal weakenedby two consecutive dysfunctionalpathway members

    I interactions between pathwaysindicate common endpoint

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Example: KEGG Pathway

    KEGG: database of signaling andmetabolic pathways

    histogram of KEGG interaction p−values

    −log10 p−value

    Den

    sity

    0 1 2 3 4

    0.0

    0.5

    1.0

    1.5

    2.0

    2.5

    3.0

    indicates importance ofolfactory receptors in em-bryonic development andinterplay with notch

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Example

    Example: KEGG Pathway

    KEGG: database of signaling andmetabolic pathways

    histogram of KEGG interaction p−values

    −log10 p−value

    Den

    sity

    0 1 2 3 4

    0.0

    0.5

    1.0

    1.5

    2.0

    2.5

    3.0

    indicates importance ofolfactory receptors in em-bryonic development andinterplay with notch

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Discussion

    Outline

    Motivation

    MethodsFamily-based Association TestExternal Data

    ResultsExampleDiscussion

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Discussion

    I we propose a new approach to infer epistatic interactions inmammals

    I works on a genome-wide scale

    I population structure explicitly taken into account

    I other counfounding factors readily included

    I data integration from different sources increases power andfacilitates biological interpretation of results

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Discussion

    I we propose a new approach to infer epistatic interactions inmammals

    I works on a genome-wide scale

    I population structure explicitly taken into account

    I other counfounding factors readily included

    I data integration from different sources increases power andfacilitates biological interpretation of results

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Discussion

    I we propose a new approach to infer epistatic interactions inmammals

    I works on a genome-wide scale

    I population structure explicitly taken into account

    I other counfounding factors readily included

    I data integration from different sources increases power andfacilitates biological interpretation of results

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Discussion

    I we propose a new approach to infer epistatic interactions inmammals

    I works on a genome-wide scale

    I population structure explicitly taken into account

    I other counfounding factors readily included

    I data integration from different sources increases power andfacilitates biological interpretation of results

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Discussion

    I we propose a new approach to infer epistatic interactions inmammals

    I works on a genome-wide scale

    I population structure explicitly taken into account

    I other counfounding factors readily included

    I data integration from different sources increases power andfacilitates biological interpretation of results

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

  • Motivation Methods Results

    Discussion

    Acknowledgements

    I Dr. Andreas Beyer

    I my colleagues in theCellular Networks andSystems Biology group

    I Klaus Tschira Foundation for funding

    Marit Ackermann Biotec TU Dresden

    Family-based analysis of genome-wide gene × gene interactions

    MotivationMethodsFamily-based Association TestExternal Data

    ResultsExampleDiscussion