genome-wide association study between dse polymorphism and poly-a usage in human population hiren...

25
Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Upload: abraham-berry

Post on 18-Jan-2016

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Genome-wide association study between DSE polymorphism and Poly-A

usage in Human population

Hiren KarathiaSridhar Hannenhalli

Page 2: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Transcription & Polyadenylation (Poly-A)

Page 3: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Objectives

• Genome-wide estimation of alternate Poly-A (PA) usage on 3’UTR

• Genome-wide Prediction and investigation of

polymorphisms in DSE (Downstream Sequence Element) motifs

• Population-wide correlation study between the PA usage and DSE polymorphisms

Page 4: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Annotation status of Poly-A sites on 3’UTR of

Human Genome (hg19 – 2009)

1 2 3 4 5 6 7 8 9 10 11

Poly-A usage transcripts 12224 4784 1567 588 189 107 19 19 3 2 1

3.16227766016838

31.6227766016838

316.227766016838

3162.27766016838

31622.7766016838

Frequency of Transcripts Vs

Cleavage Points

log1

0 (T

rans

crip

t fre

quen

cy)

1 2 3 4 5 6 7 8 9 10 11

Poly-A usage transcripts 12224 4784 1567 588 189 107 19 19 3 2 1

3.16227766016838

31.6227766016838

316.227766016838

3162.27766016838

31622.7766016838

Frequency of Transcripts Vs

Cleavage Points

log1

0 (T

rans

crip

t fre

quen

cy)

37% - Multiple Poly-A points

Target of the analysis

Page 5: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

RNA-Seq processing for Human Samples

SampleFastq files

BWA SamtoolsBAM file Merged BAM file

Samtools

Samtools

Sorted BAM fileDe-duplicated file

Picard tool

Indexing the BAM

Samtools

SAM file

Calculate Coverage

Bed tools

Calculate Relative usage of PAs Python script

Symbol Group of Samples Male Female DNA RNABR British in England and Scotland 1 1 FI Finnish in Finland 1 1 UT Utah residents with Northern and Western European ancestry 1 1 YO Yoruba in Ibadan, Nigeria 1 1

Differential Expression of UTR

Cuffdiff tools

Python script

De-novo assembly

Page 6: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Genome-wide estimation of alternate Poly-A (PA) usage on 3’UTR

PA1 Coverage PA2 Coverage

PA1 Junction PA2 Junction

Complete UTR coverage

Coverage (Stop codon – PA1 junction) / DistancePA1 Usage = Complete (complete 3’ UTR) / Distance

Coverage (Stop codon - PA2 junction) / Distance PA2 Usage = Coverage (complete 3’UTR) / Distance

Stop Codon

Cleaved 3’UTR

Page 7: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Prediction of DSE

Coding Strand of DNA

Sample A RNA-Seq

Sample A DNA-Seq

De-novo assembled 3’UTR fragment

Prediction of DSE motif

Template Strand of DNA

Page 8: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Frequency of Poly-A usage in the samples

BR - 1 (F) BR - 2 (M) FN - 1 (F) FN - 2 (M) UT - 1 (F) UT - 1 (M)

Not Expressed 5499 5833 5211 5677 5849 5514

Single PA Usage 9185 8913 9302 9037 8852 9012

Multiple PA Usage 4819 4757 4988 4787 4802 4975

500

1500

2500

3500

4500

5500

6500

7500

8500

9500

Freq

uenc

y of

Tra

nscr

ipt

Page 9: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Correlation of different PA usage in a Human Sample

PA1 – PA2 PA2 – PA3

r = - 0.643; p = 0.0 r = - 0.182; p = 1.06e-33

Page 10: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Correlation of PA usage and corresponding DSE polymorphism

Utah-Fe

male<=

>Briti

sh-M

ale

Utah-Fe

male<=

>Briti

sh-Fe

male

Nigeria

-Male

<=>U

tah-M

ale

Nigeria

-Male

<=>F

innish-M

ale

British

-Male

<=>B

ritish

-Female

Finnish

-Male

<=>B

ritish

-Male

Nigeria

-Male

<=>B

ritish

-Female

British

-Female

<=>U

tah-Fe

male

Finnish

-Male

<=>U

tah-M

ale

Nigeria

-Male

<=>U

tah-Fe

male

Utah-M

ale<=

>Briti

sh-M

ale

Finnish

-Female

<=>F

innish-M

ale

Nigeria

-Male

<=>B

ritish

-Male

0

1

2

3

4

5

6

TT composition in DSE motifs

-LO

G (P

)

Finnish

-Male

<=>B

ritish

-Male

Nigeria

-Male

<=>F

innish-M

ale

Finnish

-Female

<=>B

ritish

-Male

British

-Female

<=>U

tah-Fe

male

Nigeria

-Male

<=>B

ritish

-Male

Finnish

-Male

<=>U

tah-M

ale

Finnish

-Female

<=>B

ritish

-Female

British

-Female

<=>B

ritish

-Male

Utah-Fe

male<=

>Briti

sh-Fe

male

Nigeria

-Male

<=>B

ritish

-Female

Nigeria

-Male

<=>B

ritish

-Female

British

-Female

<=>B

ritish

-Female

Utah-Fe

male<=

>Briti

sh-M

ale

Nigeria

-Male

<=>U

tah-Fe

male0

0.51

1.52

2.53

3.5

GT composition in DSE motifs

-LO

G (P

)

British

-Female

<=>B

ritish

-Female

Finnish

-Female

<=>B

ritish

-Male

Utah-Fe

male<=

>Briti

sh-M

ale

Finnish

-Male

<=>U

tah-M

ale

Nigeria

-Male

<=>B

ritish

-Male

Utah-Fe

male<=

>Briti

sh-Fe

male

Nigeria

-Male

<=>B

ritish

-Female

Nigeria

-Male

<=>B

ritish

-Female

00.5

11.5

22.5

33.5

GG composition in DSE motifs

- LO

G (P

)

Finnish

-Male

<=>U

tah-M

ale

Nigeria

-Male

<=>F

innish-M

ale

Nigeria

-Male

<=>B

ritish

-Male

Nigeria

-Male

<=>U

tah-Fe

male

Utah-Fe

male<=

>Briti

sh-M

ale

Finnish

-Female

<=>U

tah-M

ale

Nigeria

-Male

<=>B

ritish

-Female

Nigeria

-Male

<=>U

tah-M

ale

British

-Female

<=>B

ritish

-Male

Utah-Fe

male<=

>Briti

sh-Fe

male0

1

2

3

4

5

6

Length of DSE motifs

-LO

G (P

)

Page 11: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Correlation of PA usage and corresponding DSE polymorphism

Page 12: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli
Page 13: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Functional enrichment of Genes associated with Differential PA Usage and

Polymorphic for of DSEs in Population

Page 14: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Thank you !!

Page 15: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli
Page 16: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Differential Expression of complete 3’UTR

Page 17: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Inter/Intra group correlation of a PA usage

r = 0.8; p = 0.0 r = 0.8; p = 0.0

r = 0.98; p = 0.0

PA1 usageBR1 – BR2 FN1 – FN2

BR1 – FN1

Page 18: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Statistics of predicted DSE motifs

Sample PA type Mean(Motif Length) Max(Motif Length) Min(Motif Length) Mean(Distance) Max(Distance) Min(Distance)

BR-1Single 12 79 9 30 89 1

Multiple 12 52 9 34 89 1

BR-2 Single 12 62 9 31 89 1

Multiple 12 52 9 34 89 1

FN - 1 Single 12 90 9 35 89 1

Multiple 12 54 9 39 89 1

Find Polymorphism in the DSEs

Find Correlation between the PA-usage and DSE polymorphism

Pending

Page 19: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli
Page 20: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Alternate Poly-A selection mechanism

Page 21: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Complete 3’UTR coverage VS

Alternate 3’UTR coverage

Differential expression of complete 3’UTR usage Differential expression of PA Usage

Page 22: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Poly Adenylation Usage on 3’UTR

PA1 Coverage PA2 Coverage

PA1 Junction PA2 Junction

Complete UTR coverage

PA1 CoverageRelative PA1 Usage = Longest UTR Coverage

PA2 CoverageRelative PA2 Usage = Longest UTR Coverage

Stop Codon

Intron

Cleaved 3’UTR

Page 23: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

DSE statisticSample PA type Mean(Motif Length) Max(Motif Length) Min(Motif Length) Mean(Distance) Max(Distance) Min(Distance)

BR-1

Single 12 79 9 30 89 1

Multiple 12 52 9 34 89 1

BR-2

Single 12 62 9 31 89 1

Multiple 12 52 9 34 89 1

FN - 1

Single 12 90 9 35 89 1

Multiple 12 54 9 39 89 1

Page 24: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

+ strand

- strand

Gene Strand

Template Strand

+ Read

+ Read

+ Read

- Read

- Read

RNA Strand DNA Strand

Page 25: Genome-wide association study between DSE polymorphism and Poly-A usage in Human population Hiren Karathia Sridhar Hannenhalli

Locations of annotated multiple PA locations on 3’UTR

PA1 Junction PA2 JunctionStop CodonCleaved 3’UTR

PA1 Junction PA2 JunctionStop Codon

PAs on same exon

PAs on multiple exonsr = 0.2578p = 8.44e10-111

Poly-A Location

Leng

th o

f 3’

UTR