computers are to biology what mathematics is to physics

40
“Computers are to biology what mathematics is to physics” – Harold Morowitz

Upload: others

Post on 16-May-2022

3 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Computers are to biology what mathematics is to physics

“Computers are to biology what mathematics is to physics”– Harold Morowitz

Page 2: Computers are to biology what mathematics is to physics

Dowell Lab Overview

Transcriptomics Individual Differences

Page 3: Computers are to biology what mathematics is to physics

Techniques of the Trade

Page 4: Computers are to biology what mathematics is to physics

Techniques: Comparative Genomics

Page 5: Computers are to biology what mathematics is to physics

Techniques: Comparative Genomics

Page 6: Computers are to biology what mathematics is to physics

Techniques: Systems Biology

Page 7: Computers are to biology what mathematics is to physics

Techniques: Systems Biology

Page 8: Computers are to biology what mathematics is to physics

Techniques: Machine Learning

Page 9: Computers are to biology what mathematics is to physics

Transcriptomics

• Unbiased detection and characterization of transcripts(Today)

• Transcriptional Regulation (Tues)

• Noncoding Transcripts (Tues)

Page 10: Computers are to biology what mathematics is to physics

Conventional Microarray Design

Page 11: Computers are to biology what mathematics is to physics

Example data from tiled microarray of two yeast strains

Page 12: Computers are to biology what mathematics is to physics

Normalization critical for multiple array comparison

Yij = Yij − bjyij = Q(Yij)

Page 13: Computers are to biology what mathematics is to physics

Normalization critical for multiple array comparison

Yij = Yij − bjyij = Q(Yij)

Page 14: Computers are to biology what mathematics is to physics

Probe Affinity Correction and Gene Averaging

Page 15: Computers are to biology what mathematics is to physics

Probe Affinity Correction and Gene Averaging

Page 16: Computers are to biology what mathematics is to physics

Identify segment boundaries by dynamic programmingalgorithm

S(a, b)→ S(a, k) + L(k + 1, b) | L(a, b)

L(a, b)→ fB(a, b) | fT (a, b)

Page 17: Computers are to biology what mathematics is to physics

Identify segment boundaries by dynamic programmingalgorithm

S(a, b)→ S(a, k) + L(k + 1, b) | L(a, b)

L(a, b)→ fB(a, b) | fT (a, b)

Page 18: Computers are to biology what mathematics is to physics

Identify segment boundaries by dynamic programmingalgorithm

S(a, b)→ S(a, k) + L(k + 1, b) | L(a, b)

L(a, b)→ fB(a, b) | fT (a, b)

Page 19: Computers are to biology what mathematics is to physics

Identify segment boundaries by dynamic programmingalgorithm

S(a, b)→ S(a, k) + L(k + 1, b) | L(a, b)

L(a, b)→ fB(a, b) | fT (a, b)

Page 20: Computers are to biology what mathematics is to physics

Identify segment boundaries by dynamic programmingalgorithm

S(a, b)→ S(a, k) + L(k + 1, b) | L(a, b)

L(a, b)→ fB(a, b) | fT (a, b)

Page 21: Computers are to biology what mathematics is to physics

Identify segment boundaries by dynamic programmingalgorithm

S(a, b)→ S(a, k) + L(k + 1, b) | L(a, b)

L(a, b)→ fB(a, b) | fT (a, b)

Page 22: Computers are to biology what mathematics is to physics

Identify segment boundaries by dynamic programmingalgorithm

S(a, b)→ S(a, k) + L(k + 1, b) | L(a, b)

L(a, b)→ fB(a, b) | fT (a, b)

Page 23: Computers are to biology what mathematics is to physics

Random priming versus poly-dT priming

Page 24: Computers are to biology what mathematics is to physics

What regions are differentially transcribed?

yij = αi + gij + εij

gij ⇒ gsj + β~xi

Page 25: Computers are to biology what mathematics is to physics

Estimate posterior probability that the samples are different

yij = gsj + β~xi + αi + εij

P(Esj > Esj ′ |y,Θ)

Page 26: Computers are to biology what mathematics is to physics

Example region with differential expression

Page 27: Computers are to biology what mathematics is to physics

Differential expression hints at transcript interactions

Page 28: Computers are to biology what mathematics is to physics

In general, antisense transcripts consistent withinterference

Page 29: Computers are to biology what mathematics is to physics

A screen for same strand overlapping transcription

Danford, et. al. (RECOMB 2010)

Page 30: Computers are to biology what mathematics is to physics

SER3: Canonical same strand transcription

Martens et. al. Nature (2004)

Page 31: Computers are to biology what mathematics is to physics

Variety of transcriptional architectures detected

Page 32: Computers are to biology what mathematics is to physics

Verifying same strand transcription

Page 33: Computers are to biology what mathematics is to physics

RNA-Seq: Nucleotide resolution

Wang et. al. Nature Reviews Genetics 10, 57-63 (2009)

Page 34: Computers are to biology what mathematics is to physics

GRO-Seq: Nuclear Run-On Sequencing

Supplementary figures S1-S8, Table S1

Figure S1

Figure S1: Overview of the GRO-seq method. Polymerases are allowed to run-on ~100

bases in isolated nuclei in the presence of sarkosyl and Br-UTP. The RNA is then base

hydrolyzed to ~100 bases and bound to agarose beads that are coated with an !-BrdUTP

antibody. 5’-7meG caps are then removed, and the ends of the RNA are prepared for

adapter ligations. Illumina small RNA adapters are added to the 5’ end follwed by the 3’end,

with an additional round of immuno-enrichment after each adapter ligation. The RNAs are

then reverse transcribed, amplified, and PAGE purified prior to sequencing from the 5’end on

the Illumina 1G genome analyzer. See SOM for detailed description and protocol.

Core, Waterfall, Lis. Science 322:1845-1848 (2008)

Page 35: Computers are to biology what mathematics is to physics

Clustering: Tissue Expression as Bag of “Words”

Gerber et. al. PLoS Comp. Biol (2007)

Page 36: Computers are to biology what mathematics is to physics

Hierarchical Dirichlet Clustering

Page 37: Computers are to biology what mathematics is to physics

Expression largely conserved across species

Page 38: Computers are to biology what mathematics is to physics

Related Projects in the Lab ...

• To what extent are patterns of transcription conserved?

• How does transcription respond to perturbation by a drug(Nutlin)?

• Over a developmental time course, how and when aretranscripts made?

Page 39: Computers are to biology what mathematics is to physics

Acknowledgments

MIT

• David Gifford• Timothy Danford• Alex Rolfe

Whitehead Institute

• Gerry Fink• Stacie Bumgarner• Sudeep Agarwala• An Jansen• Paula Grisafi• Doug Bernstein

ABI Sequencing and Initial Assembly by Broad Institute

Page 40: Computers are to biology what mathematics is to physics

Transcriptomics

• Unbiased detection and characterization of transcripts

• Transcriptional Regulation (Tues)

• Noncoding Transcripts (Tues)