the type 2 diabetes knowledge portal: accelerating …...has produced an open-access resource, the...

1
The Type 2 Diabetes Knowledge Portal: accelerating T2D research through community access to human genetic information and tools MARIA C. COSTANZO, ON BEHALF OF THE ACCELERATING MEDICINES PARTNERSHIP IN TYPE 2 DIABETES Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA Discovery of genetic determinants of type 2 diabetes (T2D) can spark insights into novel therapeutic targets. The Accelerating Medicines Partnership in T2D aims to catalyze such insights by forging collaboration between multiple stakeholders to produce and aggregate data, develop and implement analytical methods and tools, and build infrastructure for data storage and presentation. This collaboration has produced an open-access resource, the T2D Knowledge Portal. Data are stored in a Data Coordinating Center at the Broad Institute and also at a federated node at the European Bioinformatics Institute. Federation allows integrated analysis across data sets, while protecting information that must remain in its site of origin. Patient privacy is protected for all data in the Portal: interfaces present summary results, while some tools allow custom analysis of individual-level data but prevent direct access to the data. The Knowledge Portal includes GWAS and exome and whole-genome sequences covering > 260,000 individuals from large-scale T2D consortia and projects, including health-care systems and biobanks. Genetic data are added regularly, and other data types (e.g., epigenomic annotations, expression) and traits (T2D complications, longitudinal study data) will also be incorporated. The Portal provides a user-friendly interface that enables scientists to identify variants associated with T2D or related traits, or to investigate the effects of perturbing a specific gene or sequence. Versatile tools allow sophisticated queries, such as custom association analysis on individuals within specific phenotypic ranges to identify those who would be of interest for genotype-based callback studies. We are continuously improving the T2D Knowledge Portal to maximize global access to the wealth of genetic information for T2D research. Introduction > 100 loci are now known to be associated with T2D, explaining 10-15% of heritability genetic mapping identifies known T2D drug targets genetic information could help us understand, predict, and treat T2D The challenge: spark insights into T2D etiology and accelerate identification of new therapeutic targets huge amounts of data are needed to draw significant conclusions data are warehoused in disparate locations data are not easily accessible to those outside large consortia data are not easily manipulated by non-experts data from different studies need to be aggregated and harmonized individual-level data must be protected comprehensive T2D associations from the latest studies access to data housed at EBI Federated Node early access and unpublished results public GWAS results for T2D-related traits AMP T2D Data Coordinating Center and its Federated Node aggregate T2D association data Gene Page integrates evidence to suggest hypotheses about the role of a gene in T2D The AMP T2D Consortium is a collaboration among the following organizations, which also provide funding and/or governance: Variant Finder generates sets of variants meeting custom criteria iabete Contact Information Type 2 Diabetes Knowledge Portal www.type2diabetesgenetics.org [email protected] Funding and guidance are also provided by: @T2DKP BUT1.00 1.10 1.20 1.30 1.40 1.50 1.60 1.70 1.80 1.90 2.00 2000 2003 2006 2007 2008 2009 2010 2011 2012 2013 2014 10.3 4.96 T2D odds ratio Genes implicated in T2D Variant Page summarizes associations and the potential regulatory role of the region Interactive Data page filters datasets by data type or phenotype Variant At-a-glance section is a visual display of phenotypic associations and their significance Epigenomic annotations section shows chromatin states in different tissues across the region of the variant LocusZoom interactive visualization displays variant associations and LD relationships over a genomic region, allows custom conditional analysis View minor allele frequencies in different ancestries Top section integrates evidence on the associations, allele frequencies, and predicted impact of variants in a gene to provide an immediate indication of whether a gene is significantly linked to any phenotypes LocusZoom displays chromatin states across the region along with variants and associations Search across datasets to retrieve variants meeting custom criteria: associated phenotypes p-value and effect size thresholds allele frequency and count genomic location consequence for encoded protein presence in specific dataset or ancestry-specific cohort View pre-computed burden test results for high-impact variants, or run a custom burden test after filtering samples by phenotypic range (e.g., compare lean T2D cases to obese controls)

Upload: others

Post on 10-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Type 2 Diabetes Knowledge Portal: accelerating …...has produced an open-access resource, the T2D Knowledge Portal. Data are stored in a Data Coordinating Center at the Broad

The Type 2 Diabetes Knowledge Portal: accelerating T2D research through community access to human genetic information and tools

MARIA C. COSTANZO, ON BEHALF OF THE ACCELERATING MEDICINES PARTNERSHIP IN TYPE 2 DIABETES Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA

Discovery of genetic determinants of type 2 diabetes (T2D) can spark insights into novel therapeutic targets. The Accelerating Medicines Partnership in T2D aims to catalyze such insights by forging collaboration between multiple stakeholders to produce and aggregate data, develop and implement analytical methods and tools, and build infrastructure for data storage and presentation. This collaboration has produced an open-access resource, the T2D Knowledge Portal.

Data are stored in a Data Coordinating Center at the Broad Institute and also at a federated node at the European Bioinformatics Institute. Federation allows integrated analysis across data sets, while protecting information that must remain in its site of origin. Patient privacy is protected for all data in the Portal: interfaces present summary results, while some tools allow custom analysis of individual-level data but prevent direct access to the data.

The Knowledge Portal includes GWAS and exome and whole-genome sequences covering > 260,000 individuals from large-scale T2D consortia and projects, including health-care systems and biobanks. Genetic data are added regularly, and other data types (e.g., epigenomic annotations, expression) and traits (T2D complications, longitudinal study data) will also be incorporated.

The Portal provides a user-friendly interface that enables scientists to identify variants associated with T2D or related traits, or to investigate the effects of perturbing a specific gene or sequence. Versatile tools allow sophisticated queries, such as custom association analysis on individuals within specific phenotypic ranges to identify those who would be of interest for genotype-based callback studies. We are continuously improving the T2D Knowledge Portal to maximize global access to the wealth of genetic information for T2D research.

Introduction

•  > 100 loci are now known to be associated with T2D, explaining 10-15% of heritability

•  genetic mapping identifies known T2D drug

targets •  genetic information could help us understand,

predict, and treat T2D

The challenge: spark insights into T2D etiology and accelerate identification of new therapeutic targets

•  huge amounts of data are needed to draw significant conclusions

•  data are warehoused in disparate locations •  data are not easily accessible to those outside

large consortia •  data are not easily manipulated by non-experts •  data from different studies need to be aggregated

and harmonized •  individual-level data must be protected

•  comprehensive T2D associations from the latest studies

•  access to data housed at EBI Federated Node

•  early access and unpublished results

•  public GWAS results for T2D-related traits

AMP T2D Data Coordinating Center and its Federated Node aggregate T2D association data

Gene Page integrates evidence to suggest hypotheses about the role of a gene in T2D

The AMP T2D Consortium is a collaboration among the following organizations, which also provide funding and/or governance:

Variant Finder generates sets of variants meeting custom criteria

iabete

Contact Information

Type 2 Diabetes Knowledge Portal www.type2diabetesgenetics.org [email protected]

Funding and guidance are also provided by:

@T2DKP

BUT… 1.001.101.201.301.401.501.601.701.801.902.00

2000 2003 2006 2007 2008 2009 2010 2011 2012 2013 2014

10.34.96

T2D

odd

s ra

tio

Genes implicated in T2D

Variant Page summarizes associations and the potential regulatory role of the region

Interactive Data page filters datasets by data type or phenotype

Variant At-a-glance section is a visual display of phenotypic

associations and their significance

Epigenomic annotations section shows chromatin states in different

tissues across the region of the variant

LocusZoom interactive visualization displays variant associations and LD relationships over a genomic region, allows custom

conditional analysis

View minor allele frequencies in different

ancestries

Top section integrates evidence on the associations, allele frequencies, and predicted impact of variants in a gene to provide an immediate indication of whether a gene is significantly linked to any

phenotypes

LocusZoom displays chromatin states across the region along with variants and associations

Search across datasets to retrieve variants meeting custom criteria: •  associated phenotypes •  p-value and effect size thresholds •  allele frequency and count •  genomic location •  consequence for encoded protein •  presence in specific dataset or ancestry-specific

cohort

View pre-computed burden test results for high-impact variants, or run a custom burden test after filtering samples by phenotypic

range (e.g., compare lean T2D cases to obese controls)