deep proteome and trancriptome mapping of human cervical cancer cell line

51
Deep proteome and transcriptome mapping of a human cancer cell line BARBA | BIADOMANG |CUA | DAYANAN | LOPEZ AUTHORS: N. Nagaraj, J. Wisniewski, T. Geiger, J. Cox, M. Kircher, J. Kelso, S. Paabo, M. Mann JOURNAL: Molecular Systems Biology DATE PUBLISHED: October 29, 2011

Upload: jamie-barba

Post on 04-Jul-2015

763 views

Category:

Technology


2 download

TRANSCRIPT

Page 1: Deep proteome and trancriptome mapping of human cervical cancer cell line

Deep proteome and

transcriptome mapping

of a human cancer

cell line

BARBA | BIADOMANG |CUA | DAYANAN | LOPEZ

AUTHORS: N. Nagaraj, J. Wisniewski, T. Geiger, J. Cox, M. Kircher, J. Kelso, S. Paabo, M. Mann

JOURNAL: Molecular Systems Biology

DATE PUBLISHED: October 29, 2011

Page 2: Deep proteome and trancriptome mapping of human cervical cancer cell line
Page 3: Deep proteome and trancriptome mapping of human cervical cancer cell line

Human genome is comprise of a mere 20,000protein coding genes.

Page 4: Deep proteome and trancriptome mapping of human cervical cancer cell line
Page 5: Deep proteome and trancriptome mapping of human cervical cancer cell line

RNA-Seq Transcriptomics

Transcripts between

8000-16000

of protein coding genes

Page 6: Deep proteome and trancriptome mapping of human cervical cancer cell line

High-res MS-based Proteomics

Essentially complete proteome of model

organism (yeast)

Limited to 4000-6000 protein groups in

mammalian systems

Page 7: Deep proteome and trancriptome mapping of human cervical cancer cell line

“…explore a human proteome in the depth achievable with current technology and to

compare it with the corresponding transcriptome.”

Page 8: Deep proteome and trancriptome mapping of human cervical cancer cell line

METHODOLOGY

Page 9: Deep proteome and trancriptome mapping of human cervical cancer cell line

HeLa Cells Lysate

Flash freezingStore at

-80oC

Cell LysisSonication

CentrifugationProtein content determination

Page 10: Deep proteome and trancriptome mapping of human cervical cancer cell line

Protein Fractionation by Gel

Filtration

0.1mL cell lysate

Load onto GL Column

Elution with buffer

Page 11: Deep proteome and trancriptome mapping of human cervical cancer cell line

Protein Digestion and Peptide

Fractionation

Removal of detergent

Protein Digestion

LysC

Gluc

Trypsin

Page 12: Deep proteome and trancriptome mapping of human cervical cancer cell line

Mass Spectroscopy

PurificationRP C18

ChromatographyMS

Page 13: Deep proteome and trancriptome mapping of human cervical cancer cell line

RNA sequence

Extraction QuantificationRNA library preparation

Enrichment

FragmentationRNA fragments

copied into DNABlunt ends conversion

Addition of deoxyadenosine

Ligation of forked adaptors

Amplify Sequencing

Page 14: Deep proteome and trancriptome mapping of human cervical cancer cell line

Data Availability

Gene and Transcript Quantification

Data analysis

Page 15: Deep proteome and trancriptome mapping of human cervical cancer cell line

RESULTS AND DISCUSSION

Page 16: Deep proteome and trancriptome mapping of human cervical cancer cell line

Sample: The HeLa cells

• HeLa cells– Human cervical carcinoma cell line

• “Immortal cells”: can grow indefinitely, be frozen for decades

• Standardized field of tissue culture

– Named after Henriette Lacks by a scientist at John Hopkins Hospital• A piece of her tumor was taken

• Her cells never died

– Prolific growth maybe due to HPV18• HPV18 viral proteins (E6 and E7) suppresses p53 and pRb

gene products, respectively.

Page 17: Deep proteome and trancriptome mapping of human cervical cancer cell line

Proteome Coverage Study

• Objective: to achieve maximum proteome coverage at a reasonable measurement time

– Procedure: Investigate effects of protein fractionation, proteolytic digestion, peptide fractionation, and reverse phase chromatography

Page 18: Deep proteome and trancriptome mapping of human cervical cancer cell line

Proteome Coverage Study

• Protein fractionation– Gel filtration: separation based on size and shape

• Proteolytic digestion– Trypsin: C-terminal side of lysine and arginine

– Glu-C: C-terminal of glutamic residues

– Lys-C: carboxyl side of lysine residues

– Note: Protein digestion heavily affects effective protein characterization and identification by mass spectrometry• Overlapping fragments = larger sequence coverage

Page 19: Deep proteome and trancriptome mapping of human cervical cancer cell line

Proteome Coverage Study

• Pipette-based prefractionations

– Strong anion exchange resin

• Independent of pH

• Mostly used for deep coverage of the composition of the sample or if specific peptides should be enriched

• Reverse phase chromatography

– Reduced the complexity of the peptide mixture by selecting peptides for tandem mass spectrometry according to their polarity

Page 20: Deep proteome and trancriptome mapping of human cervical cancer cell line

Proteome Coverage Study

• LC-MS/MS analysis– Peptide MS spectra

• Interpretation by comparison with lists from generated from theoretical digestion of protein

– Fragment MS/MS spectra• Interpretation by comparison from theoretical

fragmentation of peptide

– Elution time of peptide is based on its polarity

– Repeated extensively, in order to increase the number of peptides, thereby making the protein less complex, for which tandem mass spectra are acquired

Page 21: Deep proteome and trancriptome mapping of human cervical cancer cell line

Proteome Coverage Study

• Procedure is referred as “shotgun” proteomics

– Most successful strategy to achieve extensive proteome coverafe

– Summary: protein sample is extracted from their biological source, subjected to enzymatic digestion, the resulting peptide mixtures are analysed by LC-MS/MS

• Additional augmented fractionation steps for proteins/peptides can also be conducted

Page 22: Deep proteome and trancriptome mapping of human cervical cancer cell line

MaxQuant Computational Proteomics Environment

• MaxQuant– Quantitative proteomics software package

designed for analyzing large mass spectrometric data sets

– Has an integrated search engine, Andromeda

– Supported instrument: LTQ-Orbitrap• Orbitrap: ions circulate around a central, spindle-

shaped electrode

• Highly accurate: axial frequency oscillation, determined with high precision, is proportional to the square root of m/z.

Page 23: Deep proteome and trancriptome mapping of human cervical cancer cell line

MaxQuant Computational Proteomics Environment

• Number of runs: 2 337 336 high resolution fragmentation spectra and high-accuracy precursor masses

• Search Engine: Andromeda– Algorithm: uses a probability based approach to

match tandem mass (MS/MS) spectra to peptide sequences in databases

– Median peptide score: 121, 6% below has a score of 60• For each score, corresponds to the sum of the highest ions

score for each distinct sequence

Page 24: Deep proteome and trancriptome mapping of human cervical cancer cell line

MaxQuant Computational Proteomics Environment

• Average identification of fragmentation spectra: 43%

• Average absolute mass deviation of the precursors for the matched fragment masses: 1.2 and 4.8 p.p.m

Page 25: Deep proteome and trancriptome mapping of human cervical cancer cell line

MaxQuant Computational Proteomics Environment

• Result of analysis

– Identified and quantified number of peptides

• 163 784 peptides

– FDR (false discovery rate): 1%

– Out of 163 784 peptides

• 84 051 from tryptic digestion

• 52 108 from Lys-C digestion

• 44 704 from Glu-C digestion

Page 26: Deep proteome and trancriptome mapping of human cervical cancer cell line

MaxQuant Computational Proteomics Environment

• Result of analysis– From obtained data, MaxQuant identified 10 255

proteins with 99% confidence• Lower bound of the number of proteins expressed in HeLa

cells

– There were observed overlapping fragments of enzymatic cleavage• Tryptic digestion: yielded highest number of identifications• Lys-C digewstion: 85% overlapped with Trypsin• Glu-C digestion: 5.2% novel identifications• Shows that <5% of all proteins were only identified by one

peptide• Taken all together, >25% median sequence coverage

Page 27: Deep proteome and trancriptome mapping of human cervical cancer cell line

ENSEMBL database and GENSCAN predictions

• ENSEMBL-annotated human protein-coding genes– MS/MS spectra: searched against the ENSEMBL

database with GENSCAN predictions

– 10 255 proteins were mapped to 9207 human protein-coding genes• Most identified number of genes at chromosome 1

• Least number of identified genes at chromosome 21

– GENSCAN preidictions: >1900 peptides not known to ENSEMBL genes

Page 28: Deep proteome and trancriptome mapping of human cervical cancer cell line

ZAB

Page 29: Deep proteome and trancriptome mapping of human cervical cancer cell line

Completeness of Detected Proteome

• Inspecting the macrocomplexes which are functionally necessary

• Proteosome, spliceosome, histone modifying complexes and respiratory chain complexes

• Corum protein complex database

Page 30: Deep proteome and trancriptome mapping of human cervical cancer cell line

Corum protein complex database

• Collection of experimentally verified mammalian protein complexes

– protein complex function

– Localization

– subunit composition

– literature references

Page 31: Deep proteome and trancriptome mapping of human cervical cancer cell line

• Mean proteome coverage of all Corum protein was >95%

• Transcriptome coverage 96.5%

• Among the lower coverage which is due to cell type specificity are (next slide)

Page 32: Deep proteome and trancriptome mapping of human cervical cancer cell line

• Sarcoglycan-sarcospan provides structural integrity in muscle tissues

• SNARE for neurotransmitter release in synapses• ITGA2b-ITGB3 - a fibronectin receptor that plays a

crucial role in coagulation

Complex Normally Expressed % Coverage

Sarcoglycan-sarcospan

Muscle 20

SNARE (Soluble N-

ethylmaleimide sensitive factor Attachment protein

Receptor )

Neuronal tissue 40

ITGA2b-ITGB3 Platelets 50

Page 33: Deep proteome and trancriptome mapping of human cervical cancer cell line

• 5% of the HeLa cell population was in mitosis

– 61/63 proteins in a reference set of cell cycle-specific proteins

– High coverage of the most metabolic pathways pertaining to basic cellular function

• Comprehensiveness of the proteome is hard to determine by comparison with pathway databases because they contain cell type-specific proteins

Page 34: Deep proteome and trancriptome mapping of human cervical cancer cell line

Quantitative Analysis

• Deep-sequencing transcriptomics– Proteomics data - >90% complete

• Transciptome + proteome data – 10,000 - 12, 000 genes expressed in HeLa cells

• iBAQ (intensity based absolute quantification)– incorporating individual peptide signals in MS and

normalized by the number of observable peptides of the protein

– Estimate the absolute amount of each protein

Page 35: Deep proteome and trancriptome mapping of human cervical cancer cell line

Quantitative Analysis

• 40 most abundant protein comprised 25% of the proteome

– Filamin A, pyruvate kinase, enolase, vimentin, Hsp 60

• 600 proteins-> 75% of the HeLa cell proteome mass

Page 36: Deep proteome and trancriptome mapping of human cervical cancer cell line

Quantitative Analysis

• Contribution of each protein to the total mass in combination with the knowledge of number of cells in the initial sample

– roughly estimate the absolute copy number of the proteins in HeLa cells

Page 37: Deep proteome and trancriptome mapping of human cervical cancer cell line

Quantitative Analysis

• Ranked distribution of proteins

– 90% protein is within a range of a factor of 60 above or below the median protein copy number of 18, 000 molecules per cel

– The lower half accounts for <2% of its total mass

Page 38: Deep proteome and trancriptome mapping of human cervical cancer cell line

Quantitative Analysis

• Protein abundance values

– Used to estimate the proportional contribution of any:

• individual protein,

• protein complex and

• protein class

to the total proteome

Page 39: Deep proteome and trancriptome mapping of human cervical cancer cell line

Quantitative Analysis

• Ribosomes (encoded by only 1% human gene)

– 195 proteins contributed 6% to total protein mass

• Actin cytoskeletoncontributes four-fold more to the proteome mass than expected from the number of genes and proteins

Page 40: Deep proteome and trancriptome mapping of human cervical cancer cell line

Quantitative Analysis

• Integral membrane

– 25% of the genome

– 7.6% protein mass

Page 41: Deep proteome and trancriptome mapping of human cervical cancer cell line

Quantitative Analysis

• Protein folding

– 2% of the identified proteome by number

– 8% of proteome mass

Page 42: Deep proteome and trancriptome mapping of human cervical cancer cell line

Quantitative Analysis

Percentage to the Total Mass

“Protein folding” proteins

Integral membrane Proteins

Human genome 25 2

Protein Mass 7.6 8

• Differences are due to cell-type specific functions of these proteins

Page 43: Deep proteome and trancriptome mapping of human cervical cancer cell line

Structural proteins and proteins in basic machineries

Regulatory proteins

>

Page 44: Deep proteome and trancriptome mapping of human cervical cancer cell line
Page 45: Deep proteome and trancriptome mapping of human cervical cancer cell line

Ribosome proteins form tight cluster at the top end

Page 46: Deep proteome and trancriptome mapping of human cervical cancer cell line

Proteosome also abundant but not its regulatory subunits (factor of 100 less)

Page 47: Deep proteome and trancriptome mapping of human cervical cancer cell line

Cytoskeletal and metabolic proteins extend over a broad range

Page 48: Deep proteome and trancriptome mapping of human cervical cancer cell line

Enolase – highest expression value

Glycogen phosphorylase – 100,000-fold less at protein level and 10,000 less at transcript level

Page 49: Deep proteome and trancriptome mapping of human cervical cancer cell line

Regulatory proteins such as protein kinases and transcription factors have, on average, lower expression than the structural proteins

Each category spans a large expression range

Expression levels can provide starting points for systems biologicalmodeling of the cell

Page 50: Deep proteome and trancriptome mapping of human cervical cancer cell line

“Given the rapid technological progress in both fields, we predict that the required

depth of 10,000–12,000 genes will be routinely reachable

soon.”

TRANSCIPTOME RNA-Seq

PROTEOME High-res MS

Page 51: Deep proteome and trancriptome mapping of human cervical cancer cell line

THANK YOU