one genome is not enough gil mcvean the oxford big data institute

Post on 17-Jan-2016

217 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

One genome is not enoughGil McVean

The Oxford Big Data Institute

The Gene X hypothesis

N = 1

Do 40% of males have mental retardation?

What constitutes evidence?

• Class B – Functional relevance

• Class A – Statistical association– Transmission within pedigree– Association within population

Precision comes from numbers: Multiple sclerosis at 50,000

120 140 160 180

1

2

4

8

16

32

64

128

256

Age at risk:

80-89

70-7960-69

50-59

40-49

5000 people

120 140 160 180

1

2

4

8

16

32

64

128

256 Age at risk:

80-89

70-79

60-69

50-59

40-49

50 000 people

120 140 160 180

1

2

4

8

16

32

64

128

256Age at risk:

80-89

70-79

60-69

50-59

40-49

500 000 people

Haz

ard

Ratio

(95%

CI)

Usual SBP (mmHg) Usual SBP (mmHg)

Courtesy of Prospective Studies Collaboration, unpublished

Usual SBP (mmHg)

The value of large numbers: Ischaemic heart disease and systolic blood pressure

Medical data is big and growing…

Genome sequence

High dimensional profiling

Imaging

Electronic medical records

Mobile health

…at a population scale

500,000

500,000

1,000,000

100,000100,000

Challenges of data sharing

• Volume– How do we cope with the computational and analytical scale?

• Heterogeneity– How do we ensure we are measuring the same thing?

• Privacy– Do we have to share individual level data to achieve power?

• Security– How can we ensure that data are used appropriately?

• Engagement– How do we get people excited about sharing their data?

An international partnership is needed

Challenges of data sharing

• Volume– How do we cope with the computational and analytical scale?

• Heterogeneity– How do we ensure we are measuring the same thing?

• Privacy– Do we have to share individual level data to achieve power?

• Security– How can we ensure that data are used appropriately?

• Engagement– How do we get people excited about sharing their data?

N = 1 | N > 100,000

top related