http://cs273a.stanford.edu [bejerano fall11/12] 1 thank you for the midterm feedback! projects will...
TRANSCRIPT
http://cs273a.stanford.edu [Bejerano Fall11/12] 1
Thank you for the midterm feedback!
Projects will be assigned this weekend.
http://cs273a.stanford.edu [Bejerano Fall11/12] 2
Lecture 12
Mid Term Feedback Feedback
Cis-Regulation
Cellular Signaling
MidTerm Feedback Feedback
•You like us!–You like your teachers and love your TAs
•We like you too ;)
•We also like recruiting (at all levels) from this class–Talk to us if you’re interested
http://cs273a.stanford.edu [Bejerano Fall11/12] 3
Big Picture Reminder
We are surveying the functional classes of elements encoded by the human genome.
We have previously discussed:•Repetitive sequences (mobile elements & low complexity)
•Protein coding genes (and pseudogenes)
•Non coding RNAs (structural, micro, long, anti-sense etc.)
Today we’ll cover the last (known :) large class of functional elements, concluding the functional tour.
http://cs273a.stanford.edu [Bejerano Fall11/12] 4
http://cs273a.stanford.edu [Bejerano Fall11/12] 5
Gene Regulation
•gene (how to)•control region(when & where)
DNA
DNA binding
proteins
RNA geneProtein coding
http://cs273a.stanford.edu [Bejerano Fall11/12] 6
unicellular
multicellular
Unicellular vs. Multicellular
http://cs273a.stanford.edu [Bejerano Fall11/12] 7
Broad View of Transcription
Chromatin / Proteins
DNA / Proteins
Extracellular signals
http://cs273a.stanford.edu [Bejerano Fall11/12] 8
Vertebrate Transcription Regulation
http://cs273a.stanford.edu [Bejerano Fall11/12] 9
Pol II Transcription
Key components:• Proteins• DNA sequence• DNA epigenetics
Protein components:• General Transcription factors• Activators• Co-activators
http://cs273a.stanford.edu [Bejerano Fall11/12] 10
Activators & Co-Activators
Protein - DNA
Protein - Protein
http://cs273a.stanford.edu [Bejerano Fall11/12] 11
The Core Promoter
http://cs273a.stanford.edu [Bejerano Fall11/12] 12
Chromatin Remodeling
“off”
“on”
13
Nucleosome tail modifications
•Lysine acetylations.•Histone Acetyl-Transferases (HAT) & Histone Deacetylases (HDAC).
•Lysine and Argenine Metylations.–Modified by histone-
metyl-transferase.
•Phosphorilation.•Ubiquitination.
–H2A ubiquitination affects 10-15% of this histone in most eukaryotic cells
•ADP-ribosylation .
http://cs273a.stanford.edu [Bejerano Fall11/12] 14
http://cs273a.stanford.edu [Bejerano Fall11/12] 15
CpG islands
Functionally:Unmethylated
Evolutionary:CpGs are rareoutside “CpG islands”
http://cs273a.stanford.edu [Bejerano Fall11/12] 16
Transcription Factor (TF) Binding Sites
http://cs273a.stanford.edu [Bejerano Fall11/12] 17
TFs in the Human Genome
http://cs273a.stanford.edu [Bejerano Fall11/12] 18
Synergy / non-linear additivity
Gene
DNA
http://cs273a.stanford.edu [Bejerano Fall11/12] 19
Combinatorial Regulatory Code
Gene
2,000 different proteins can bind specific DNA sequences.
A regulatory region encodes 3-10 such protein binding sites.
When all are bound by proteins the regulatory region turns “on”,and the nearby gene is activated to produce protein.
Proteins
DNA
DNA
Protein binding site
http://cs273a.stanford.edu [Bejerano Fall11/12] 20
Distal Transcription Regulatory Elements
http://cs273a.stanford.edu [Bejerano Fall11/12] 21
Enhancers
http://cs273a.stanford.edu [Bejerano Fall11/12] 22
Basal factors RNAP II
Enhancer with bound protein
promoter
Enhancers: action over very large distances
http://cs273a.stanford.edu [Bejerano Fall11/12] 23
Transient Transgenic Enhancer Assay
Reporter GeneMinimal PromoterConservedElement
Construct is injected into 1 cell embryos
Taken out at embryonic day 10.5-14.5
Assayed for reporter gene activity
in situ
transgenic
http://cs273a.stanford.edu [Bejerano Fall11/12] 24
Vertebrate Enhancer Combinatorics
Sall1limbneuraltube brain
http://cs273a.stanford.edu [Bejerano Fall11/12] 25
Vertebrate Enhancer Combinatorics
http://cs273a.stanford.edu [Bejerano Fall11/12] 26
What are Enhancers?
What do enhancers encode?
Surely a cluster of TF binding sites.
[but TFBS prediction is hard, fraught with false positives]
What else? DNA Structure related properties?
So how do we recognize enhancers?
Sequence conservation across multiple species
[weak but generic]
http://cs273a.stanford.edu [Bejerano Fall11/12] 27
Most Non-Coding Elements likely work in cis…
9Mb
“IRX1 is a member of the Iroquois homeobox gene family.
Members of this family appear to play multiple roles
during pattern formation of vertebrate embryos.”
gene deserts
regulatory jungles
Every orange tick mark is roughly 100-1,000bp long, each evolves under purifying selection, and does not code for protein.
http://cs273a.stanford.edu [Bejerano Fall11/12] 28
Many non-coding elements tested are cis-regulatory
http://cs273a.stanford.edu [Bejerano Fall11/12] 29
Vertebrate Gene Regulation
gene (how to)
control region(when & where)
DNA
proximal: in 103 letters
distal: in 106 letters
DNA binding
proteins
http://cs273a.stanford.edu [Bejerano Fall11/12] 30
Gene Expression Domains: Independent
http://cs273a.stanford.edu [Bejerano Fall11/12] 31
Repressors / Silencers
http://cs273a.stanford.edu [Bejerano Fall11/12] 32
What are Enhancers?
What do enhancers encode?
Surely a cluster of TF binding sites.
[but TFBS prediction is hard, fraught with false positives]
What else? DNA Structure related properties?
So how do we recognize enhancers?
Sequence conservation across multiple species
[weak but generic]
Verifying repressors is trickier [loss vs. gain of function].
How do you predict an enhancer from a repressor? Duh...
repressors
repressorsRepressors
http://cs273a.stanford.edu [Bejerano Fall11/12] 33
Insulators
http://cs273a.stanford.edu [Bejerano Fall11/12] 34
Cis-Regulatory Components
Low level (“atoms”):• Promoter motifs (TATA box, etc)• Transcription factor binding sites (TFBS)
Mid Level:• Promoter• Enhancers• Repressors/silencers• Insulators/boundary elements• Cis-regulatory modules (CRM)• Locus control regions (LCR)
High Level:• Epigenetic domains / signatures• Gene expression domains• Gene regulatory networks (GRN)
http://cs273a.stanford.edu [Bejerano Fall11/12] 35
Disease Implications: Genes
genome
gene
protein
Limb Malformation
Over 300 genes already
implicated in limb malformations.
http://cs273a.stanford.edu [Bejerano Fall11/12] 36
Disease Implications: Cis-Reg
genome
gene
NO protein
made
Limb Malformation
Growing number of cases (limb, deafness, etc).
http://cs273a.stanford.edu [Bejerano Fall11/12] 37
Transcription Regulation & Human Disease
[Wang et al, 2000]
http://cs273a.stanford.edu [Bejerano Fall11/12] 38
Critical regulatory sequences
Lettice et al. HMG 2003 12: 1725-35
Single base changes
Knock out
http://cs273a.stanford.edu [Bejerano Fall11/12] 39
Other Positional Effects
[de Kok et al, 1996]
http://cs273a.stanford.edu [Bejerano Fall11/12] 40
Genomewide Association Studies point to non-coding DNA
http://cs273a.stanford.edu [Bejerano Fall11/12] 41
WGA Disease
9p21 Cis effects
http://cs273a.stanford.edu [Bejerano Fall11/12] 42
Follow up study:
http://cs273a.stanford.edu [Bejerano Fall11/12] 43
Cis-Regulatory Evolution: E.g., obile Elements
[Yass is a small town in New South Wales, Australia.]
Gene
Gene
What settings make these“co-option” events happen?
Gene
Gene
http://cs273a.stanford.edu [Bejerano Fall11/12] 44
Britten & Davidson Hypothesis: Repeat to Rewire!
[Britten & Davidson, 1971]
[Davidson & Erwin, 2006]
http://cs273a.stanford.edu [Bejerano Fall11/12] 45
Modular: Most Likely to Evolve?
Chimp Human
46
Human Accelerated Regions• Human-specific substitutions in conserved
sequences
46[Pollard, K. et al., Nature, 2006] [Prabhakar, S. et al., Science, 2008][Beniaminov, A. et al., RNA, 2008]
Human Chimp
http://cs273a.stanford.edu [Bejerano Fall11/12] 47
Signal Transduction
48
Cell Communication
Lodish, 20-1
49
Wnt and Hedgehog signaling
Jacob & Lum Science 2007
50
Signaling Pathways Important in Developmental Biology
• Wnt/Frizzled through -catenin
• Hedgehog
• TGF- family through Smads
• Growth factors via JAK-STATs
• Notch
• Integrin
• TNF
http://cs273a.stanford.edu [Bejerano Fall11/12] 51
Giant State Machine…
Chromatin / Proteins
DNA / Proteins
Extracellular signals