phospho-itraq: assessing isobaric labels for the · pdf filephospho-itraq: assessing isobaric...

11
Phospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry Pieter Glibert, Paulien Meert, Katleen Van Steendam, Filip Van Nieuwerburgh, Dieter De Coninck, Lennart Martens, §,Maarten Dhaenens, ,and Dieter Deforce* ,,Laboratory of Pharmaceutical Biotechnology and Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium § Department of Medical Protein Research, VIB, B-9000 Ghent, Belgium * S Supporting Information ABSTRACT: The ability to distinguish between phosphopep- tides of high and low stoichiometry is essential to discover the true extent of protein phosphorylation. We here extend the strategy whereby a peptide sample is briey split in two identical parts and di erentially labeled preceding the phosphatase treatment of one part. Our use of isobaric tags for relative and absolute quantitation (iTRAQ) marks the rst time that isobaric tags have been applied for the large-scale analysis of phosphopeptides. Our Phospho-iTRAQ method focuses on the unmodied counterparts of phosphorylated peptides, which thus circumvents the ionization, fragmenta- tion, and phospho-enrichment di culties that hamper quantitation of stoichiometry in most common phosphopro- teomics methods. Since iTRAQ enables multiplexing, simultaneous (phospho)proteome comparison between internal replicates and multiple samples is possible. The technique was validated on multiple instrument platforms by adding internal standards of high stoichiometry to a complex lysate of control and EGF-stimulated HeLa cells. To demonstrate the exibility of Phospho- iTRAQ with regards to the experimental setup, the proteome coverage was extended through gel fractionation, while an internal replicate measurement created more stringent data analysis opportunities. The latest developments in MS instrumentation promise to further increase the resolution of the stoichiometric measurement of Phospho-iTRAQ in the future. The data have been deposited to the ProteomeXchange with identier PXD001574. KEYWORDS: phosphoproteome, stoichomtery, iTRAQ, dephosphotylation, threshold calculation, in-gel Phospho-iTRAQ INTRODUCTION One of the most extensively studied protein modications is phosphorylation, a reversible posttranslational modication (PTM) with a central role in a broad range of cellular processes such as cell metabolism, homeostasis, transcription, and apoptosis. Because of their transient character, phosphor- ylations initiate and propagate signal transduction pathways, which make these fundamental to inter- and intracellular communication. 1,2 Over 2% of the genome encodes for kinases, the family of enzymes that catalyze the transfer of a phosphate group to serine, threonine, or tyrosine residues. It is estimated that 30% of all proteins are phosphorylated, which illustrates the high prevalence of this PTM. 3 Detection of phosphor- ylation is nevertheless challenging considering the dynamic regulation, low stoichiometry, and heterogeneous character of the phosphosites. 4 Numerous dedicated phosphoproteomics workows have therefore been developed to unravel phosphor- ylation networks and the activity of their modulating enzymes (as extensively reviewed in refs 2 and 5). In these workows, phosphopeptides are typically enriched prior to mass spectrometry analysis to remove the strong background interference caused by the bulk of nonphosphory- lated peptides. 4 Apart from requiring thorough optimization and a corresponding high level of expertise of the analyst, enrichment is also responsible for sample loss and limited quantitation accuracy. 5 Subsequent mass spectrometry analysis is furthermore confounded by low ionization eciency and inferior fragmentation of phosphopeptides. Conventional MS/ MS analysis with collision-induced dissociation predominantly results in the neutral loss of the phosphate group, which leads to poor sequence coverage due to reduced backbone fragmentation. An additional stage of fragmentation in MS3 or the use of alternative fragmentation technologies, such as higher energy C-trap dissociation or electron transfer dissociation, can partially facilitate the detection and identi- cation of phosphopeptides. 3,6 While these methodologies are not yet routinely applied in most laboratories, they have allowed multiple specialized studies to report a grand total of over 100 000 phosphorylation sites. 2 Received: August 27, 2014 Article pubs.acs.org/jpr © XXXX American Chemical Society A DOI: 10.1021/pr500889v J. Proteome Res. XXXX, XXX, XXXXXX

Upload: vandieu

Post on 28-Mar-2018

219 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

Phospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale StudyOf Phosphopeptide StoichiometryPieter Glibert,† Paulien Meert,† Katleen Van Steendam,† Filip Van Nieuwerburgh,† Dieter De Coninck,†

Lennart Martens,§,‡ Maarten Dhaenens,†,⊥ and Dieter Deforce*,†,⊥

†Laboratory of Pharmaceutical Biotechnology and ‡Department of Biochemistry, Ghent University, B-9000 Ghent, Belgium§Department of Medical Protein Research, VIB, B-9000 Ghent, Belgium

*S Supporting Information

ABSTRACT: The ability to distinguish between phosphopep-tides of high and low stoichiometry is essential to discover thetrue extent of protein phosphorylation. We here extend thestrategy whereby a peptide sample is briefly split in twoidentical parts and differentially labeled preceding thephosphatase treatment of one part. Our use of isobaric tagsfor relative and absolute quantitation (iTRAQ) marks the firsttime that isobaric tags have been applied for the large-scaleanalysis of phosphopeptides. Our Phospho-iTRAQ methodfocuses on the unmodified counterparts of phosphorylatedpeptides, which thus circumvents the ionization, fragmenta-tion, and phospho-enrichment difficulties that hamperquantitation of stoichiometry in most common phosphopro-teomics methods. Since iTRAQ enables multiplexing, simultaneous (phospho)proteome comparison between internal replicatesand multiple samples is possible. The technique was validated on multiple instrument platforms by adding internal standards ofhigh stoichiometry to a complex lysate of control and EGF-stimulated HeLa cells. To demonstrate the flexibility of Phospho-iTRAQ with regards to the experimental setup, the proteome coverage was extended through gel fractionation, while an internalreplicate measurement created more stringent data analysis opportunities. The latest developments in MS instrumentationpromise to further increase the resolution of the stoichiometric measurement of Phospho-iTRAQ in the future. The data havebeen deposited to the ProteomeXchange with identifier PXD001574.

KEYWORDS: phosphoproteome, stoichomtery, iTRAQ, dephosphotylation, threshold calculation, in-gel Phospho-iTRAQ

■ INTRODUCTION

One of the most extensively studied protein modifications isphosphorylation, a reversible posttranslational modification(PTM) with a central role in a broad range of cellularprocesses such as cell metabolism, homeostasis, transcription,and apoptosis. Because of their transient character, phosphor-ylations initiate and propagate signal transduction pathways,which make these fundamental to inter- and intracellularcommunication.1,2 Over 2% of the genome encodes for kinases,the family of enzymes that catalyze the transfer of a phosphategroup to serine, threonine, or tyrosine residues. It is estimatedthat 30% of all proteins are phosphorylated, which illustratesthe high prevalence of this PTM.3 Detection of phosphor-ylation is nevertheless challenging considering the dynamicregulation, low stoichiometry, and heterogeneous character ofthe phosphosites.4 Numerous dedicated phosphoproteomicsworkflows have therefore been developed to unravel phosphor-ylation networks and the activity of their modulating enzymes(as extensively reviewed in refs 2 and 5).In these workflows, phosphopeptides are typically enriched

prior to mass spectrometry analysis to remove the strong

background interference caused by the bulk of nonphosphory-lated peptides.4 Apart from requiring thorough optimizationand a corresponding high level of expertise of the analyst,enrichment is also responsible for sample loss and limitedquantitation accuracy.5 Subsequent mass spectrometry analysisis furthermore confounded by low ionization efficiency andinferior fragmentation of phosphopeptides. Conventional MS/MS analysis with collision-induced dissociation predominantlyresults in the neutral loss of the phosphate group, which leadsto poor sequence coverage due to reduced backbonefragmentation. An additional stage of fragmentation in MS3or the use of alternative fragmentation technologies, such ashigher energy C-trap dissociation or electron transferdissociation, can partially facilitate the detection and identi-fication of phosphopeptides.3,6 While these methodologies arenot yet routinely applied in most laboratories, they haveallowed multiple specialized studies to report a grand total ofover 100 000 phosphorylation sites.2

Received: August 27, 2014

Article

pubs.acs.org/jpr

© XXXX American Chemical Society A DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

Page 2: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

To draw accurate biological conclusions, it is mandatory toobtain quantitative information on the phosphomodifications.Most quantitative approaches aim to elucidate the relativephosphorylation changes between samples by employing directlabel-free comparisons of phosphoenriched fractions ormetabolic or chemical labeling methods such as stable isotopelabeling by amino acids in cell culture (SILAC), use of isobarictags for relative and absolute quantitation (iTRAQ), or tandemmass tag (TMT).5−7 Such methods, however, forego theinvestigation of phosphorylation stoichiometry, also known asoccupancy, since such analysis requires the simultaneousmonitoring of the unphosphorylated counterpart.8 However,keeping this complete picture is crucial, for example, for in vitrostudies of specific kinases where artificially high enzymeconcentrations or prolonged incubation times often result inthe annotation of low-level spurious phosphosites that arebiologically irrelevant.5,9 The stoichiometry measurement of asingle phosphosite is possible by spiking specific isotopicallylabeled (AQUA) peptides of that particular phosphorylatedpeptide and its unmodified complement, but this strategy isinconvenient for large-scale applications and requires priorinsight of the phosphorylation site. Additionally, for com-parative studies, the bias of the sample preparation steps cannotbe excluded with AQUA, and specific knowledge of the relativeprotein concentrations is not available as with the formermentioned labeling approaches.10

One interesting labeling strategy that is specifically designedto quantify occupancy circumvents the common difficulties ofstandard phosphopeptide detection by focusing on thenonphosphorylated counterparts of the phosphopeptides.11

Herein, a peptide sample is briefly split in two identical parts,which are differentially labeled preceding a phosphatasetreatment of one part and mock-treatment of the other.Immediately afterward, the two parts are recombined for thesubsequent LC−MS/MS analysis. Dephosphorylation of aphosphopeptide will induce a 79.979 mass shift resulting in askewed label ratio in the unphosphorylated peptide’s reporterregion, which now specifies the phosphorylation stoichiom-etry.12 Although the principle has been described previ-ously,12−19 only one phosphatase-based workflow was adaptedspecifically for the large-scale study of a complete lysatewhereby known phosphopeptides present in a database werequantified by isotopic labels.20 Although this report isextensively referenced, the method is not yet routinely appliedfor discovery purposes. As an alternative approach, each half ofa peptide mixture can be labeled with isobaric tags prior to thephosphatase treatment of one-half. By virtue of the completemultiplexing until the fragmentation stage and subsequentquantitation at the MS2 level, isobaric labels have the capacityto analyze up to ten samples simultaneously.21 Previous reportshave already prototyped this strategy with TMT and iTRAQlabels on relatively simple protein samples but haveconcentrated mainly on the precision of the stoichiometrymeasurement.11,12,19,22

Here we extend for the first time the isobaric tag equivalentof this approach for the large-scale quantitative phosphoanalysisof complex mixtures. The technique was validated on threedifferent instrument platforms on epidermal growth factor(EGF)-stimulated HeLa cells.23 We discuss a novel dataanalysis approach for the discovery of high stoichiometryphosphopeptides in complex peptide mixtures and present away to interpret the sizable data set without prior matching ofthe peptide identifications to a phospho-database. Because all

peptides and not only the phosphorylated fraction of theproteome is measured in this approach, instruments with high-speed acquisition capabilities and adequate resolution foraccurate reporter quantitation are indispensable. Still, becauseof the minimal technical variation that is introduced throughoutthe workflow (as for Wu et. al.20), implementation of extensivefractionation and purification steps, such as protein gel-electrophoresis and 2DLC at the peptide level, allows formore in-depth coverage of the (phospho) proteome. Whatmakes this approach even more attractive is that the iTRAQmultiplexing allows for a replicate measurement within oneexperiment, which creates a new data analysis workflow thatmeets the stringent requirements of peptide quantitation. Allthree instrument panels were able to annotate a large set ofknown phosphoproteins. Extending the latter sample fractio-nation and using novel methodologies to avoid dilution ofreporter ratios in MS/MS will allow for the annotation ofincreasing numbers of phosphopeptide with increasingly lowstoichometries in the future.

■ MATERIALS AND METHODS

Chemicals and Materials

Cells were obtained from ATCC. The growth medium, EGF,media supplements, phosphate buffered saline (PBS), and thePeppermintStick internal protein standard were from Invi-trogen (Carlsbad, CA, USA). Readyprep sequential extractionkit, tributylphosphine (TBP), 4x Laemmli sample buffer, 2-Mercaptoethanol (BME), Tris-HCl precast polyacrylamidegels, and the Precision Plus Protein Marker were obtainedfrom Biorad (Hercules, CA, USA). Triethylammoniumbicarbonate (TEABC), sodium dodecyl sulfate (SDS), N-cyclohexyl-3-aminopropanesulfonic acid (CAPS), calciumchloride (CaCl2), and tween-20 were from Millipore (Billerica,MA, USA). Modified trypsin was acquired from Promega(Fitchburg, WI, USA), and the iTRAQ reagents were fromABSciex (Framingham, MA, USA). Acetonitrile and formic acidwere from Biosolve, LCMS-grade (Dieuze, France). Alkalinephosphatase was obtained from different manufactures: calfintestinal phosphatase was from New England Biolabs(Ipswich, MA, USA) and Sigma-Aldrich (St. Louis, MO,USA), thermosensitive alkaline phosphatase was from Promega,and Escherichia coli alkaline phosphatase was from Sigma-Aldrich. All other reagents were purchased from Sigma-Aldrichunless stated otherwise.Cell Culture and Lysis

HeLa cells were cultured at 37 °C in Dulbecco’s Modified EagleMedium (5% CO2) supplemented with 1% (w/v) L-glutamine,10% (w/v) FBS, and 50 IU/ml penicillin/streptomycin. Cellswere starved in FBS-free medium for 17 h before harvesting.For the EGF stimulation, the medium was removed, enrichedwith 150 ng/mL EGF, and immediately readded to the cells for10 min. Next, the EGF-stimulated and control cells wereincubated for 15 min at 37 °C with PBS-based dissociationbuffer (Invitrogen) and detached by cell scraping. Cells werewashed twice by precooled PBS and centrifuged at 4 °C. Allsubsequent steps were also performed at 4 °C unless declaredotherwise. Cell lysis was performed by suspending the cellpellet in the R1-buffer from the Readyprep sequentialextraction kit at a density of 1 mL/106 cells. R1 comprises 40mM Tris-buffer, which we supplemented with 2% (v/v)phosphatase inhibitor cocktail 2&3 (P5726 and P0044,Sigma-Aldrich), 2% (v/v) TBP, 0.3% (v/v) benzonase

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

B

Page 3: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

(E1014, Sigma-Aldrich), and 1 tablet/10 mL R1 complete miniEDTA-free protease inhibitor cocktail (118361700001, Roche,Penzberg, Germany). After sonication in an ice-bath for 10 min(Branson 2510 sonication bath), the protein extracts weremixed with 25:1 volumes of precooled acetone for overnightincubation at −20 °C. The next day, after a centrifugation stepat maximum speed (14 000 rpm) for 15 min, the acetone wasremoved, and the pellet was resuspended in 0.5 M TEABC.After an additional spin, the pellet was removed, and thesupernatant was withheld as the “R1 extract”. A two-step lysismethod was performed to the EGF-treated cells before gelfractionation, which thereby extended the “Phospho-iTRAQ”method to increase the protein coverage. The protein extract ofthe cells in the R1-buffer was centrifuged at a maximum speed.To include more proteins, the pellet was resuspended in theR3-buffer (containing urea, thiourea, detergents, and ampho-lytes) from the Readyprep sequential extraction kit andcombined with the initial R1 lysate, which together formedthe “R1+R3 extract”. The protein content of all the extracts wasdetermined by the 2D Quant Kit (GE Healthcare, LittleChalfont, United Kingdom).

Gel Fractionation and Digest

Twenty microgram samples of the “R1 extract” from both theEGF-stimulated and control cells were digested analogous tothe iTRAQ protocol (ABSciex). A 500 μg sample of the“R1+R3 extract” derived from the EGF-stimulated cells wasdiluted 3:1 in 4x Laemmli sample buffer and reduced with 10%(v/v) BME for 10 min at 95 °C. Next, 1.5% (w/w) of thePeppermintStick internal protein standard (IS1) was added, andthe sample was loaded in the central well of the 10% CriterionTris−HCl Gel, 11 cm IPG + 1 well. Electrophoresis of the gelwas performed for 30 min at 150 V and 60 min at 200 V in aCriterion Cell (Biorad) and monitored by the Precision PlusProtein Marker in the outer well. After a fixation and washingstep, the gel was cut into four equal pieces comprising differentmolecular weight fractions, which were transferred to glasstubes. The proteins were in-gel digested according to optimizedconditions.24 All the obtained peptides were brought to drynessby the SpeedVac and stored at −20 °C until further use.

Labeling

Peptides of the R1 extracts were redissolved in 40 μL of 50 mMTEABC, while peptides from the molecular weight fractions of

Figure 1. The Phospho-iTRAQ approach workflow. (A) Experimental workflow: in the “in-solution” approach (left panel), the soluble proteinextracts (R1) of control and EGF-stimulated HeLa cells were spiked with an internal peptide standard (IS2) of heavy phosphopeptides and comparedby Phospho-iTRAQ. When only one sample is mined (EGF), the flexibility of the protocol allows for gel purification and fractionation and thus forcomplementing the soluble protein extract (R1) with the hydrophobic fraction diluted in strongly denaturing buffer (R3) to increase the number ofannotated phosphopeptides (“in-gel” approach, right panel). The EGF-stimulated cells were spiked with a phosphoprotein internal standard (IS1)before fractionation on a 1D PAGE into four molecular weight fractions followed by Phospho-iTRAQ (Supporting Information). (B) Phospho-iTRAQ protocol: a peptide sample is briefly split in two identical parts and differentially labeled preceding the phosphatase treatment of one part.Afterward, samples are immediately recombined and split into three parts for the LC−MS/MS analysis on three different instruments. (C) DataAnalysis: raw data was processed by the respective vendor’s software, and MGF-files were searched against the SwissProt Human database usingMascot. Exported DAT-files were imported into Rover for ranking of the iTRAQ ratios and were further analyzed in Excel. (D) Data: initiallyphosphorylated peptides have skewed iTRAQ ratios and arise out of the center of the log-normal distribution of the whole precursor population. Themean of the log-normal ratio distribution is located around zero since the vast majority of the peptides in the data have equal 114/115 or 116/117reporters.

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

C

Page 4: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

the R1+R3 extract were redissolved in 80 μL of 50 mMTEABC. As a second internal standard (IS2), ± 7 pmol of eachphosphopeptide from the heavy MS phosphomix 1, 2, and 3(MSP1H, MSP2H, MSP3H: Sigma-Aldrich. Composition, seeSupporting Information) was spiked into each sample.Subsequently, R1 extracts were equally divided in two parts(Figure 1), and the fraction of the R1+R3 extract was divided infour parts of 20 μL. Before the iTRAQ labeling, each reporterwas reconstituted in 250 μL of ethanol and combined with therelated label of a second 4-plex set. For the R1 extracts, the twohalves of the control sample were labeled with 50 μL of the 114and 115 reporter, the two halves of the EGF-stimulated samplewith the 116 and 117 reporter, and each quarter of the fourmolecular weight fractions of the R1+R3 extract with 50 μL of adifferent 4-plex label (see Supporting Information). Thelabeling was performed for 2 h at room temperature bycontinuous shaking before samples were dried out.

Dephosphorylation

The labeled peptides in the 20 vials (both R1 extracts were splitin two, and each gel fraction of the R1+R3 extract was split infour) were redissolved in 12 μL of calf intestinal phosphatase(CIP) buffer (100 mM NaCl, 50 mM Tris-HCl, 10 mM MgCl2,1 mM DTT) and incubated for 45 min at room temperature toremove the residual free labels. Next, the 114−116 and 115−117 labeled peptides were combined,which resulted in 10 vials(114−116 and 115−117 for the ”R1 extracts” as well as the fourmolecular weight fractions of the “R1+R3 extract”). To the115−117 samples, 30 μL of the dephosphorylating enzymecocktail (6.25% 0.25U/μL Escherichia coli alkaline phosphatase(P4069, Sigma-Aldrich)), 31.25% 10U/μL CIP 1 (M0290, NewEngland Biolabs), 31.25% 10U/μL CIP 2 (P4978, Sigma-Aldrich), and 31.25% 1U/μL thermosensitive alkaline phos-phatase (M9910, Promega) was added. For the 114−116control samples, 30 μL of CIP buffer was supplemented beforeall samples were incubated at 37 °C for the overnightdephosphorylation. Phosphatases were inactivated the nextday by heating the samples for 30 min at 85 °C in the presenceof EDTA (50 mM). The 114−116 and 115−117 units weremerged, and the five samples were cleaned by C18 tips (Sigma-Aldrich) to desalt and to remove free labels and excess ofphosphatases. The peptide samples that were eluted with 80%acetonitrile (ACN), 20% water, and 0.1% formic acid (FA)were spilt into three equal parts and were dried outimmediately.

LC−MS/MS Analysis

Peptides were resolved in 0.1% FA, and approximately 0.5−1μg was brought on column each run. The nanoLC−MS/MSanalysis of the three parts was performed on different ESI massspectrometry platforms: the first part on a TripleTof 5600(ABsciex), the second part on a QExactive (Thermo Fisher),and the third part on a SYNAPT G2-Si (Water Cooperation).(1) For the analysis on the TripleTof 5600, the chromatog-raphy was done on an Eksigent ekspert nanoLC 400 System onNanoLC column, 3 μm, ChromXP C18CL, with a 120 mingradient going from 5−90% ACN, 0.1% FA, and a 300 nL/minflow. During acquisition, each survey scan accumulatedprecursors in the range of 400−1250 m/z for 250 ms, fromwhich the top 20 were fragmented for MS/MS every cycle at aratio of 180 ms/precursor. Resolution was >15 k in MSMS;dynamic exclusion 10s. (2) The QExactive was coupled to aDionex Ultimate 3000 RS nanoLC system. After trapping on aprecolumn, peptides were separated on the Acclaim Pep-

Map100, 75 μm × 50 cm by a 90 min gradient, whereby theamount of ACN 0.1% FA increased from 5−70%, also at aconstant flow rate of 300 nL/min. In a scan range of 380−2000m/z, a full MS and data-dependent MS/MS scan wasperformed on the top 10 by a normalized collision energy(NCE) 30 stepped + collision energy 10% in all cases.Resolution was >25 k in MSMS; dynamic exclusion 25 s. (3)For the measurements on the SYNAPT G2-Si, a Nano-ACQUITY system separated the samples with a Waters BEHC18, 75 μm × 150 mm analytical column after a trapping stepof 8 min. Through changing the amount of ACN, 0.1% FAchanged from 1−40% for 90 min at a constant flow rate of 300nL/min. The range of 400−1600 m/z was inspected during the0.1 s survey scan, from which the top ten of the precursors werefragmented by ramping the collision energy in the “iTRAQ”mode for a 0.1 s/MS/MS scan if the threshold of 10 k counts/spectrum was reached. Resolution was >20 k in MSMS;dynamic exclusion 30 s.

Data Analysis

Data acquisition, recording, and preprocessing were carried outwith software of the respective mass spectrometer manufac-turers. Equally, the raw data were processed by specializedsoftware packages of the vendors: ProteinPilot 4.0.8085 for theTripleTof 5600 of ABSciex, Proteome Discoverer 1.4 for theThermo Scientific QExactive, and PLGS 3.0.1 for the SYNAPTG2-Si of Waters Cooperation. The same software packageswere used to export the created peak lists in the MGF format.The database searches of these MGF files were performed withMascot 2.4.0. (Matrix Science) against the SwissProt Humandatabase (59 084 sequences, supplemented with the sequencesof the internal standards). Initially, the prevalence of the mostcommon modifications was explored by an error-tolerantdatabase search.25 Details and search parameters can befound in the Supporting Information. The result files of thesesearches were exported as Mascot-DAT file formats (Peptide e-value <0.05), and each run was then imported separately in theRover software for individual processing (https://code.google.com/p/compomics-rover/).26 After obtaining Rover statisticson the 114/115 and 116/117 ratios, the data were filtered inorder to export the log2 ratios and z-scores of unique or “razor”peptides for consecutive data validation. Data validation andfrequency plot distribution of z-scores (at a bin size of 0.1)were done in Excel, whereby the z-scores of the four molecularweight fractions of the R1+R3 extracts were combined.Additional database searches were performed by changing thevariable modifications in order to annotate the “heavy” internalstandard phosphopeptides (see Supporting Information).Phosphoprotein enrichment in the obtained protein lists was

assessed by a right-sided Fisher enrichment test at a significancelevel of 5%. Known phosphoproteins in humans were obtainedfrom the PhosphoSitePlus database (http://www.phosphosite.org/homeAction.do). The total number of proteins known inhumans was estimated based on the described proteins in thePFAM database. Proteins involved in the EGF pathway wereidentified by querying the BioCarta database.To assess the relative amount of phosphopeptides present in

the lists generated by the Phospho-iTRAQ approach, these listswere compared to the known phosphopeptides present in thePhosphoSitePlus database by means of a protein blast asintegrated in the NCBI blast 2.2.28 package. The procedureallowed for partial mappings between Phospho-iTRAQpeptides and the phosphopeptides. That is, stretches of at

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

D

Page 5: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

least five amino acids in both peptide sequences had to mapunambiguously to each other (irrespective of overhangs thatresult from, for example, tryptic miscleavages), that is, no gapsor mutations were allowed in the procedure.The mass spectrometry proteomics data have been deposited

to the ProteomeXchange Consortium27 via the PRIDE partnerrepository with the data set identifier PXD001574 and10.6019/PXD001574.

■ RESULTS AND DISCUSSIONLarge-scale quantitative analysis of phospho-stoichiometry isessential to explore the true extent of phosphorylation sinceoccupancy is associated with functionality.28 In this report, wetherefore present for the first time the large-scale exploration ofan isobaric tag-based protocol that focuses on the unphos-phorylated counterparts of phosphopeptides for its analysis.Details of the optimized protocol are described in the Matierialsand Methods section, and a schematic overview of the workflowis given in Figure 1. For the comparative Phospho-iTRAQanalysis, HeLa cells were either treated with EGF for 10 min orleft untreated before lysis. The protocol was validated onspiked-in high stoichiometry peptides and was expanded by gelfractionation of the EGF lysate, which coordinately valorizedthe duplication potential of 4-plex iTRAQ and its impact ondata analysis.

Validation of the Phospho-iTRAQ Protocol throughEvaluation of the Internal Phosphopeptide Standards

To validate the effectiveness of the Phospho-iTRAQ method,we added a commercially available internal phosphopeptidestandard (IS2 in Figure 1) composed of 21 different heavypeptide sequences with at least one phosphorylated amino acid

into the sample after digestion. A set of these spiked heavypeptides was identified in all samples (see SupportingInformation). Not all peptides were annotated, however,because of the weak ionization efficiency of some of thesequences and the technical and inter-run variations character-istic of data-dependent acquisition (DDA). As observed on rawmass spectra and the processed data of the nonphosphorylatedcounterparts of the heavy peptides, all of the 114/115 and 116/117 iTRAQ ratios were skewed, whereby the phosphatase-treated 115 and 117 reporters substantially exceeded the 114and 116 labels of the control parts, which hence characterizedthese peptides as phosphopeptides.In Figure 2, the MS/MS spectrum and the range of the

“silent” iTRAQ region of the EVQAEQPSSSSPR peptide (m/z519.2628, 3+) from the R1 sample are given as an example,which illustrate that the approach is able to identifyphosphopeptides in complex peptide mixtures on all threeinstruments. Additional proof is provided if the phosphopep-tides themselves are annotated (EVQAEQPSSpSSPR m/z818.3734, 2+) as these iTRAQ ratios are inverted comparedto their unphosphorylated counterparts (presented in Figure 2,right upper and lower panel). In this manner, thedephosphorylation efficiency can be verified, which furtherallows for the accurate calculation of the stoichiometry asdescribed elaborately in refs 12 and 19. If the nativephosphorylated peptide is not annotated, the method stillprovides a relative simple way to identify high stoichiometryphosphopeptides on a large scale. As an alternative for thephosphatase treatment, chemical dephosphorylation by meansof cerium oxide has also been suggested to ensure thesensitivity of the methodology.22 While the stoichiometry ofthe peptides is well-defined with this approach, complementary

Figure 2. Detailed inspection of high stoichiometry peptides. MS/MS spectra of the EVQAEQPSSSSPR phosphopeptide from the IS2 in thedephosphorylated (left) and phosphorylated (right) form. The range of the iTRAQ reporter region and given ratios of the 114/115 control and 116/117 EFG-treated samples show that the Phospho-iTRAQ method is able to identify phosphopeptides in complex peptide mixtures on all threeinstruments, although variation between the different ratios is observed. The annotated native phosphopeptide has an inverted iTRAQ ratiocompared to its unphosphorylated counterpart.

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

E

Page 6: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

analysis is still required to determine the exact amino acid ofthe identified phosphopeptide that is modified, especially incase of multiphosphorylated peptides.

Addressing the Variation between Annotated Peptideswith Similar Stoichiometry

The overall average log2 Mascot ratio of the confidentlyannotated IS2 peptides (i.e., Mascot peptide score >25) was−3.56 (SD, 2.31; N, 42) for the control and −2.84 (SD, 1.46;N, 42) for the EGF-stimulated sample. The corresponding log2ratios of the IS2, which was spiked after the in-gel digestion, ison average −3.09 (SD, 1.84; N, 138) for 114/115 and −3.12(SD, 1.56; N, 138) for 116/117. Although all spiked IS2

peptides were completely phosphorylated (100% stoichiom-etry), generating strongly skewed “Phospho-iTRAQ ratios”,variation between the ratios of different peptides was observedas reflected by the SD. Equally, as also seen in Figure 2, theratios of peptides that are annotated multiple times in one runor over multiple LC−MS/MS runs are not identical.

This ratio dilution is attributed to (1) underestimation of thefold change, which is a known iTRAQ bias, and (2) the“isobaric tagging elephant”, coeluting peptides with similar m/zvalues that are coselected for fragmentation dilute theratios.29−32 This results in both a scattering of the ratios ofnonphosphopeptides outside the center of the normal ratiodistribution of the whole data set, and more importantly,contributes to the underestimation of the phosphopeptideoccupancy. Most of the spectra will suffer from this dilution, asis illustrated by the fact that nearly none of the IS2 peptides(100% phosphorylated) had zero intensity at the 114 and 116reporters, while these were not subjected to dephosphorylation.Although the isobaric tagging elephant currently still challengesthe stoichiometry measurements, emerging technologies suchas ion-mobility separation (SYNAPT G2-Si) and multinotchselection (Orbitrap Fusion) hold great promise to reduce thisdiluting effect and correspondingly increase the accuracy ofisobaric labeling techniques in the near future, as was recentlyalso suggested for the localization of organelle proteins byisotope tagging (LOPIT) technique.33−35 Furthermore, simply

Figure 3. A pipeline to set thresholds during data analysis. Upper panel: the frequency plot of the z-scores of all the peptides with no serine,threonine, or tyrosine (No STY) in the sequence is used to determine the cutoffs that mark the phosphopeptides. Therefore, z-scores thatcorrespond with the 1%, 2.5%, 5%, 10%, 12.5%, 15%, 20%, and 25% subgroup at the left side of the No STY distribution are transposed (only 1, 2.5,and 10% are shown) to the STY data set. All the STY peptides with a z-score lower than these exported cutoffs are pinpointed as phosphopeptides.The (low) z-scores of the identified IS2 peptides in this data set confirm the effectiveness of Phospho-iTRAQ. Lower panel: depending on the %cutoff, an increasing amount of precursors are classified as phosphopeptides on the different instruments. The multiplexing allows comparison of(phospho-) peptides between the control and EGF-stimulated sample. On average, one-third of the phosphopeptides identified in the lysate of boththe control and EGF-stimulated cells is shared (purple) between the two conditions. Note that the number of unique peptides isolated for each ratiothreshold is the number in the “control/EGF only” category plus the “shared” number.

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

F

Page 7: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

improving separation of the sample prior to analysis has beenshown to dramatically lower such interferences as well.32

However, because the bulk of the peptides in a data setintrinsically have 1:1 ratios, dilution will predominantly causeextreme ratios to move to the middle of the distribution.Supplemented by the notion that the fold change is generallyunderestimated in iTRAQ approaches, ratios are not expectedto move away from the center of the distribution, and thesmallest ratio of each multiply annotated sequence is arguablythe most accurate one.

A Comparative (Phospho) Proteome Analysis ofEGF-Stimulated and Nonstimulated HeLa Cells on ThreeDifferent Mass Spectrometry Instruments

The database searches with identical parameters resulted in theannotation of 1283 proteins out of the 31 562 spectra from theTripleTof 5600 (50% annotation rate), 1567 proteins out of the34 629 spectra from the QExactive (42%), and 905 proteins outof the 21 822 spectra from the SYNAPT G2-Si (32%).Different LC gradients were applied, so no direct comparisonbetween manufacturers can be made. For large-scale datamining, we applied Rover, a software specifically developed forthe visualization, analysis, and validation of quantitativeproteomics data. The software does not predict thephosphomodification but only performs normalization ofselected ratios from mass spectrometry data sets. This allowsthe ranked iTRAQ ratios and corresponding z-scores of theunique and razor peptides of each run to be exported fromRover and to set cutoffs in order to select peptides with skewedlabel ratios (Figure 3).The small z-scores of the IS2 peptides confirm our previous

conclusions on the effectiveness of the approach to discoverhigh stoichiometry peptides in a large data set (Figure 3). Thez-scores of the spiked phosphopeptides also mark the thresholdfor defining phosphorylation in each data set, that is, all theannotated peptides in the data with a z-score below this valueare pinpointed as highly phosphorylated. Alternatively,

predefined mixtures of heavy phosphorylated and theirnonphosphorylated heavy counterparts could similarly set thethresholds of different sections over the full range of occupancystates.8,20 However, even with such an alternative approach,separating the overlapping z-score ratio distribution of thephosphorylated and nonphosphorylated peptides remainsextremely challenging because: (1) only a limited amount ofpeptides in the nonenriched data are phosphorylated in thebulk of nonphosphorylated peptides; (2) most of thephosphorylated peptides are of low stoichiometry (over 50%of the peptides have a stoichiometry <30%);20 and (3) thepreviously mentioned isobaric tagging elephant still leads tooverlap between the normal distribution of the ratios of thenonphosphorylated peptides (1:1 ratios) and that from initiallyphosphorylated peptides (deviating ratios).To assess a threshold for filtering out phosphopeptides, we

first extracted the z-scores of all annotated peptides that cannotbe phosphorylated: peptides with no serine, threonine, ortyrosine (No STY). On the No STY distribution, z-scores atdifferent quantile levels (1−25% on the left site of thedistribution) were chosen as a cutoff value (Figure 3, upperpanels). To select the phosphopeptides in the data, these z-score cutoffs were subsequently transposed to the ranked z-scores of the peptides that have the ability to be phosphorylated(with an STY in the sequence). Different levels, from the leftpercentile to the quantiles of the distribution, were applied toset the threshold on the 114/115 control sample and the 116/117 EGF-treated sample (see Table S2, Supporting Informationfor an example of the data analysis workflow and obtainedphosphopeptide sequences). The graphs in Figure 3 (lowerpanel) illustrate the amount of unique phosphopeptides thatare shared or only identified in one of both conditions at eachselected threshold. A low cutoff only annotates highstoichiometry peptides, while a higher cutoff defines a largerset of phosphopeptides, including those with a lowerstoichiometry that could be useful for discovery studies.Interestingly, on average, 35% of the identified phosphopep-

Figure 4. Situation of the Phospho-iTRAQ (PiTRAQ) approach in the field of quantitative MS. Technical variation introduced during the mostcommonly used proteomics approaches is directly correlated to the number of steps in the protocol that are done separately. In the Phospho-iTRAQapproach, the labeling and dephosphorylation is the only step wherein the samples are separate, which thus allows for indefinite fractionation bothprior to and following this step. P, untreated peptide half; deP:, dephosphorylated peptide half. Adapted from Bantscheff et. al.37

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

G

Page 8: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

tides were shared between the two independently generatedprotein lysates from the control and stimulated cells, with theamount of shared peptides increasing toward lower stoichiom-etry (i.e., higher cutoff).Depending on the considered cutoff level, an increasing

percentage of falsely identified peptides would be expected.However, when the lists of phosphopeptides were matchedwith a database of known HeLa phosphosites obtained bydifferent techniques,36 the amount of phosphopeptides that areequally present in the database changed only slightly, 536/931(58%) at the 5% threshold and 1527/2769 (55%) at the 25%threshold. Similarly, the list of corresponding proteinscontained 77−81% known phosphoproteins at all thresholds(enrichment Fisher p-value of <2.2e−16 (PhosphositePlus36),while no clear declining trend over the different thresholds wasfound (Supporting Information). Importantly, with increasingthreshold, two known EGF-signaling proteins were identified,where none were found in the high stoichiometry list (<5%).Note that a 100% overlap of peptides with known databases isnot to be expected; we circumvent the need for enrichmentsteps and alternative fragmentation approaches that inevitablyintroduce technical bias, and we thus expect to findcomplementary phosphorylation events that escape detectionin other strategies.By looking at the whole sample instead of enriched

phosphopeptide fractions as a trade-off for a degree ofstoichiometric measurement, the Phospho-iTRAQ approachwould greatly benefit from an extended workflow comprising“unlimited” fractionation and replicate measurements toincrease the reliability of the phosphopeptide annotations.Indeed, the approach allows for doing just that.

The Flexibility in Experimental Setup Provided byPhospho-iTRAQ Allows for Deeper (Phospho) ProteomeMining and Increased Reliability of Individual PhosphoEvents

It is invaluable that the approach can be extended “indefinitely”to attain higher proteome coverage without compromisingquantitation. This is possible because contrary to a conven-tional iTRAQ, or any other quantitative, approach, samples areonly briefly split for the differential dephosphorylation of one-half, and no technical variation can accumulate before or afterthis step (Figure 4). Only the approach of Wu et. al. providesthe same degree of freedom in fractionating the samples.20

Importantly, apart from fractionation, 1D PAGE also allowsuse of strongly denaturing buffers to solubilize the morehydrophobic proteins, which normally cannot be done in aconventional iTRAQ approach. Also, fractionation is recom-mended to minimize the effect of coeluted and cofragmentedpeptides and thus the effect of the isobaric taggingelephant.30,32 A second extension of the protocol is the useof four labels to analyze only one sample. This creates theopportunity of duplicate measurements in one experiment andgenerates the possibility of more stringent data analysis.To first validate the potential impact of protein fractionation

on quantitation, the EGF lysate was spiked with protein IS1

followed by gel separation into four molecular weight fractions(see Supporting Information for the validation of the molecularweight separation). The database searches of the merged MGFfiles confirm the expected extension of the proteome coverage:the annotation of 2070 proteins out of 105 572 spectra fromthe TripleTof 5600 (37% annotation rate), 2913 proteins out of155 848 spectra from the QExactive (33%), and 1976 proteinsout of 82 556 spectra from the SYNAPT G2-Si (31%).

Figure 5. Data analysis pipeline for the gel-based Phospho-iTRAQ approach (R1+R3). A replicate setup where both 114/115 and 116/117 measurethe same peptides allows for more stringent data analysis. The graph demonstrates the percentages of identified phosphopeptides that are sharedbetween the 114/115 and 116/117 replicates at different cutoff levels. We suggest to focus on those peptides that were found below the 20% cutoff(black square) in both ratios. The theoretical chance of this event happening randomly is 4% (black dotted line). The table shows the number ofpeptides and corresponding proteins that were isolated at this threshold on each of the platforms. The % of proteins that are known phosphoproteins(PhosphoSitePlus) and the respective Fisher p-value for enrichment are also given along with the proteins in this list that are known responders toEGF signaling. The Venn diagram displays the number of phosphopeptides from the left 20% percent of the distribution of either ratio that areshared between the acquisitions on the different instruments.

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

H

Page 9: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

Additionally, of all 17 annotated peptides, only one peptide ofcasein beta (CASB_BOVIN (IS1)) was located outside theoverall distribution of the 114/115: FQSEEQQQTEDELQDK,a known phosphopeptide (Supporting Information).38 Thisillustrates that extending the protocol by additional fractiona-tion steps at the protein level increases the proteome coveragewithout influencing the technique’s capability to isolatephosphopeptides.In this gel-based Phospho-iTRAQ approach, the samples

were split into four equal parts to coordinately implement aninternal replicate analysis (Supporting Information). Contraryto the comparative in-solution Phospho-iTRAQ analysis (cfr.supra), 114/115 and 116/117 now independently measure thesame phosphorylation event. Again, the ranked 114/115 and116/117 z-scores of the No STY peptide population allow thesetting of phosphocutoffs at different quantile levels on the STYpeptide distribution. However, this time, the peptides that havea skewed ratio in both 114/115 and 116/117 can be consideredto be more reliable phosphopeptides. The graph in Figure 5illustrates how the number of such peptides greatly exceeds theamount that can be theoretically expected. Of interest, manualanalysis surfaced an unexpected phenomenon that is circum-vented this way: apparently an N-terminally acetylated peptidestarting with an alanine creates a b1 ion of m/z 114.05, whichcan be mistaken during data processing to be the 114.11reporter label. Even such unexpected phenomena, however, nolonger influence the final selection of phosphopeptides whenthe 116/117 ratio is equally taken into account.Taken together, we suggest to use the theoretical chance of

both ratios being in the same quantile as a “false discoverycutoff” in this extended gel-based Phospho-iTRAQ strategy.Setting the threshold at 20% and requiring a duplicatemeasurement below this threshold (114/115 as well as 116/117) now becomes a justifiable choice, as the possibility of anypeptide arriving here by chance is only 0.2 × 0.2 = 0.04, or 4%.Although a 25% overlap between the duplicate analyses at thisthreshold might still appear quite narrow, this is an intrinsicconsequence of setting thresholds. For instance, if for onepeptide the 114/115 ratio is 0.5 and the 116/117 ratio is 0.55,one ratio might very well be included at the 20% threshold for114/115, while the other only appears at the 25% threshold for116/117. As demonstrated in the table in Figure 5, allinstruments showed a strong enrichment in phosphoproteinswhen the above-mentioned criteria were applied (Fisher p-value< 2.2e−16). As predicted, more proteins from the EGF pathwaywere identified after protein fractionation, and it can thus beexpected that additional fractionating at the peptide level, aswas done by, for example, Wu et. al.,20 will further completethis list.The fact that we only annotated a few known mediators in

EGF signaling can be attributed to two main reasons. (1) Weonly sample one time point after stimulation (10 min). Asshown in the landmark paper by Olsen et. al.,23 EGF-activatedproteins can be divided into several temporal profiles over a 20min range, and we thus expect to only find a portion of all theknown proteins involved in EGF signaling. Some phosphor-ylations could thus be absent or not of high stoichiometry inour samples (supported by the notion that we start to pickthem up at higher percentile thresholds). (2) Some specificpeptides are not identified because they simply were notisolated for fragmentation during the DDA runs. By extendingthe setup and increasing the fractionation even further, as wasdone by Wu et al., who made a total of 20 fractions (as opposed

to our four) that detected over 80 000 different peptides,should allow the digging out of more phosphorylation eventsfrom the bulk of other peptides. Note that at least on oneplatform, the EGF receptor was identified (phosphopeptidesequence GSTAENAEYLR), which illustrates the additionalbenefit of 1D PAGE as a means of including more hydrophobicproteins by the use of strongly denaturing buffers (R3). TheSupporting Information shows the temporal changes in thisEGF receptor pY1197 phosphorylation as detected by Olsen et.al. (http://www.phosida.com/) and illustrates that this eventwas detected despite the fact that it is at its decline at 10 min,after a peak at 5 min.It is important to emphasize that the technical replicates were

performed completely independently on three different instru-ment platforms using DDA with different LC gradients. It isthus reasonable to expect only a small overlap between theseruns: (1) the above-mentioned consequence of setting athreshold is that it excludes many peptides with a lowerstoichiometry near, but above, the 20% quantile; (2) a DDArun is very irreproducible, even on identical injections, and notmany peptides in our analysis have been selected forfragmentation on every instrument; (3) because the datafrom each instrument is a merged file from four DDA runs ondifferent MW fractions, the said variation in DDA acquisitionaccumulates in this data set; (4) identification of the spectra isvery dependent on data quality of each individual MS/MSspectrum. Still, in the total data set, 2420 of the STY peptidesfrom the left 20% of the distribution were present in at leastone ratio of two of the three runs (see Venn diagram in Figure5). Taken together, we argue that this considerable overlapfurther underlines the capabilities of the Phospho-iTRAQapproach, and we emphasize that this (gel-based) strategy canbe applied on any mass spectrometry platform that hasadequate speed and mass resolution.

■ CONCLUDING REMARKSWe here present and evaluate an isobaric tag based protocolthat relies on phosphatase treatment of differentially labeledpeptides for large-scale quantitative analysis of phosphopep-tides. The Phospho-iTRAQ method focuses only on thenonphosphorylated counterparts of the phosphopeptides andavoids specialized and relatively imprecise workflows such asphosphoenrichment. Since multiple labels are used, informationon relative protein expression can still be obtained in ourmethod.The application of Phospho-iTRAQ on spiked-in phospho-

proteins and phosphopeptides demonstrated its ability todiscover high stoichiometry peptides in a large data set and thison multiple LC−MS/MS platforms. The iTRAQ data at thespectrum level allows estimation of the phosphorylationstoichiometry, while the ratio distribution of all annotatedpeptides allows peptides with high stoichiometry to beidentified by their deviating z-scores. The dilution effect ofthe coeluted and coselected peptides (isobaric taggingelephant) still challenges the accuracy of the methodologybut can be partially addressed by the multiplexing potential ofthe isobaric labels, which allows extensive fractionation and theinclusion of replicates. Furthermore, newly arising technologies,such as ion-mobility separation, hold great promise to diminishthis interference and thus increase the stoichiometric resolutionof the method.Additional confirmation of the identified phosphopeptides

with complementary techniques however remains necessary,

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

I

Page 10: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

because our approach is not able to pinpoint the exactphosphorylation site. Notably, the existence of different “falselocalization scores” and the notion of migrating phosphategroups in collisionally activated peptides puts this shortcominginto some perspective.39,40 As with most quantitative analyses,it is up to the researcher to decide where to put the threshold inthe ranked ratio distribution of large-scale data that marks thelist of the phosphopeptide enriched identifications. Indeed,depending on the goal of the experiment, one can focusexclusively on the more confidently annotated high stoichiom-etry peptides (low cutoff), or one can rather isolate a larger butoverall less reliable list of peptides for an exploratory study(high cutoff), which will thus increase the need for additionalbiological replicates to define the common subsets of differentruns that will again augment the confidence of thephosphopeptide detection.

■ ASSOCIATED CONTENT*S Supporting Information

Composition of the internal standards (IS1 and IS2); scheme ofthe labeling of the R1+R3 extract (in-gel approach); details ofthe database search parameters; %phosphoproteins identified atdifferent thresholds in the in-solution approach; boxplot for thevalidation of the molecular weight separation by 1DPAGE;explanation for why extending sample fractionation does notcompromise quantitative power; temporal changes in EGFRpY1197 phosphorylation as described by Olsen et al.23; andtables showing all the annotated peptides of the IS2, an exampleof the data analysis workflow, and a list of the phosphopeptidesof all samples. This material is available free of charge via theInternet at http://pubs.acs.org.

■ AUTHOR INFORMATIONCorresponding Author

*E-mail: [email protected]. Phone: +32 (0)9 264 8067, +32 (0)498 818 606.Author Contributions⊥These authors contributed equally.Notes

The authors declare no competing financial interest.

■ ACKNOWLEDGMENTSWe thank Sofie Vande Casteele for the assistance with themaintenance of our Waters Premier LC−MS, the platformapplied for the development, and optimization of the protocol.We thank Niklaas Colaert for the support with the Roversoftware and Kristof De Beuf from the FIRE StatisticalConsulting of the Ghent University for the help with thestatistics. For the LC−MS acquisitions of the final samples, weacknowledge Ernst Bouvin and Christian Baumann fromABSciex; Hans Vissers, Chris Hughes, and Jan Claereboudtfrom the Waters Corporation; and Wilfried Voorhorst andHeike Schaefer from Thermo Fisher Scientific. We also thankthe PRIDE team for creating a platform to share our datapublically. This research is funded by a grant from the Fund ofScientific Research Flanders (FWO: 3G073112) and a Ph.D.grant from the Institute for the Promotion of Innovationthrough Science and Technology in Flanders (IWT-Vlaande-ren) awarded to P.M. D.D. and L.M. acknowledge the supportof Ghent University (Multidisciplinary Research Partnership“Bioinformatics: from nucleotides to networks”). L.M. also

acknowledges support from the PRIME-XS project funded bythe European Union seventh Framework Program under GrantAgreement No. 262067.

■ REFERENCES(1) Junger, M. A.; Aebersold, R. Mass spectrometry-drivenphosphoproteomics: Patterning the systems biology mosaic. Dev.Biol. 2014, 3, 83−112.(2) Thingholm, T. E.; Jensen, O. N.; Larsen, M. R. Analyticalstrategies for phosphoproteomics. Proteomics 2009, 9 (6), 1451−68.(3) Tichy, A.; Salovska, B.; Rehulka, P.; Klimentova, J.; Vavrova, J.;Stulik, J.; Hernychova, L. Phosphoproteomics: Searching for a needlein a haystack. J. Proteomics 2011, 74 (12), 2786−97.(4) Roux, P. P.; Thibault, P. The coming of age of phosphoproteo-micsFrom large data sets to inference of protein functions. Mol. Cell.Proteomics 2013, 12 (12), 3453−64.(5) Dephoure, N.; Gould, K. L.; Gygi, S. P.; Kellogg, D. R. Mappingand analysis of phosphorylation sites: A quick guide for cell biologists.Mol. Biol. Cell 2013, 24 (5), 535−42.(6) Nilsson, C. L. Advances in quantitative phosphoproteomics. Anal.Chem. 2012, 84 (2), 735−46.(7) Linke, D.; Hung, C. W.; Cassidy, L.; Tholey, A. Optimizedfragmentation conditions for iTRAQ-labeled phosphopeptides. J.Proteome Res. 2013, 12 (6), 2755−63.(8) Olsen, J. V.; Vermeulen, M.; Santamaria, A.; Kumar, C.; Miller,M. L.; Jensen, L. J.; Gnad, F.; Cox, J.; Jensen, T. S.; Nigg, E. A.;Brunak, S.; Mann, M. Quantitative phosphoproteomics revealswidespread full phosphorylation site occupancy during mitosis. Sci.Signaling 2010, 3 (104), ra3.(9) Harvey, S. L.; Enciso, G.; Dephoure, N.; Gygi, S. P.;Gunawardena, J.; Kellogg, D. R. A phosphatase threshold sets thelevel of Cdk1 activity in early mitosis in budding yeast. Mol. Biol. Cell2011, 22 (19), 3595−608.(10) Vaudel, M.; Sickmann, A.; Martens, L. Peptide and proteinquantification: A map of the minefield. Proteomics 2010, 10 (4), 650−70.(11) Zhang, X.; Jin, Q. K.; Carr, S. A.; Annan, R. S. N-Terminalpeptide labeling strategy for incorporation of isotopic tags: A methodfor the determination of site-specific absolute phosphorylationstoichiometry. Rapid Commun. Mass Spectrom. 2002, 16 (24), 2325−32.(12) Pflieger, D.; Junger, M. A.; Muller, M.; Rinner, O.; Lee, H.;Gehrig, P. M.; Gstaiger, M.; Aebersold, R. Quantitative proteomicanalysis of protein complexes: Concurrent identification of interactorsand their state of phosphorylation. Mol. Cell. Proteomics 2008, 7 (2),326−46.(13) Bonenfant, D.; Schmelzle, T.; Jacinto, E.; Crespo, J. L.; Mini, T.;Hall, M. N.; Jenoe, P. Quantitation of changes in proteinphosphorylation: A simple method based on stable isotope labelingand mass spectrometry. Proc. Natl. Acad. Sci. U. S. A. 2003, 100 (3),880−5.(14) Domanski, D.; Murphy, L. C.; Borchers, C. H. Assaydevelopment for the determination of phosphorylation stoichiometryusing multiple reaction monitoring methods with and withoutphosphatase treatment: Application to breast cancer signalingpathways. Anal. Chem. 2010, 82 (13), 5610−20.(15) Hegeman, A. D.; Harms, A. C.; Sussman, M. R.; Bunner, A. E.;Harper, J. F. An isotope labeling strategy for quantifying the degree ofphosphorylation at multiple sites in proteins. J. Am. Soc. Mass Spectrom.2004, 15 (5), 647−53.(16) Johnson, H.; Eyers, C. E.; Eyers, P. A.; Beynon, R. J.; Gaskell, S.J. Rigorous determination of the stoichiometry of protein phosphor-ylation using mass spectrometry. J. Am. Soc. Mass Spectrom. 2009, 20(12), 2211−20.(17) Kanshin, E.; Wang, S.; Ashmarina, L.; Fedjaev, M.; Nifant’ev, I.;Mitchell, G. A.; Pshezhetsky, A. V. The stoichiometry of proteinphosphorylation in adipocyte lipid droplets: Analysis by N-terminal

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

J

Page 11: Phospho-iTRAQ: Assessing Isobaric Labels for the · PDF filePhospho-iTRAQ: Assessing Isobaric Labels for the Large-Scale Study Of Phosphopeptide Stoichiometry ... (P5726 and P0044,

isotope tagging and enzymatic dephosphorylation. Proteomics 2009, 9(22), 5067−77.(18) Previs, M. J.; VanBuren, P.; Begin, K. J.; Vigoreaux, J. O.;LeWinter, M. M.; Matthews, D. E. Quantification of proteinphosphorylation by liquid chromatography−mass spectrometry. Anal.Chem. 2008, 80 (15), 5864−72.(19) Przybylski, C.; Junger, M. A.; Aubertin, J.; Radvanyi, F.;Aebersold, R.; Pflieger, D. Quantitative analysis of protein complexconstituents and their phosphorylation states on a LTQ-Orbitrapinstrument. J. Proteome Res. 2010, 9 (10), 5118−32.(20) Wu, R.; Haas, W.; Dephoure, N.; Huttlin, E. L.; Zhai, B.; Sowa,M. E.; Gygi, S. P. A large-scale method to measure absolute proteinphosphorylation stoichiometries. Nat. Methods 2011, 8 (8), 677−83.(21) Hebert, A. S.; Merrill, A. E.; Stefely, J. A.; Bailey, D. J.; Wenger,C. D.; Westphall, M. S.; Pagliarini, D. J.; Coon, J. J. Amine-reactiveneutron-encoded labels for highly plexed proteomic quantitation. Mol.Cell. Proteomics 2013, 12 (11), 3360−9.(22) Jia, W.; Andaya, A.; Leary, J. A. Novel mass spectrometricmethod for phosphorylation quantification using cerium oxidenanoparticles and tandem mass tags. Anal. Chem. 2012, 84 (5),2466−73.(23) Olsen, J. V.; Blagoev, B.; Gnad, F.; Macek, B.; Kumar, C.;Mortensen, P.; Mann, M. Global, in vivo, and site-specificphosphorylation dynamics in signaling networks. Cell 2006, 127 (3),635−48.(24) Glibert, P.; Van Steendam, K.; Dhaenens, M.; Deforce, D.iTRAQ as a method for optimization: Enhancing peptide recoveryafter gel fractionation. Proteomics 2014, 14 (6), 680−4.(25) Creasy, D. M.; Cottrell, J. S. Error-tolerant searching ofuninterpreted tandem mass spectrometry data. Proteomics 2002, 2(10), 1426−34.(26) Colaert, N.; Helsens, K.; Impens, F.; Vandekerckhove, J.;Gevaert, K. Rover: A tool to visualize and validate quantitativeproteomics data from different sources. Proteomics 2010, 10 (6),1226−9.(27) Vizcaino, J. A.; Cote, R. G.; Csordas, A.; Dianes, J. A.; Fabregat,A.; Foster, J. M.; Griss, J.; Alpi, E.; Birim, M.; Contell, J.; O’Kelly, G.;Schoenegger, A.; Ovelleiro, D.; Perez-Riverol, Y.; Reisinger, F.; Rios,D.; Wang, R.; Hermjakob, H. The proteomics identifications (PRIDE)database and associated tools: Status in 2013. Nucleic Acids Res. 2013,41 (Database issue), D1063−9.(28) Olsen, J. V.; Mann, M. Status of large-scale analysis of post-translational modifications by mass spectrometry. Mol. Cell. Proteomics2013, 12 (12), 3444−52.(29) Ow, S. Y.; Salim, M.; Noirel, J.; Evans, C.; Rehman, I.; Wright,P. C. iTRAQ underestimation in simple and complex mixtures: “Thegood, the bad, and the ugly”. J. Proteome Res. 2009, 8 (11), 5347−55.(30) Evans, C.; Noirel, J.; Ow, S. Y.; Salim, M.; Pereira-Medrano, A.G.; Couto, N.; Pandhal, J.; Smith, D.; Pham, T. K.; Karunakaran, E.;Zou, X.; Biggs, C. A.; Wright, P. C. An insight into iTRAQ: Where dowe stand now? Anal. Bioanal. Chem. 2012, 404 (4), 1011−27.(31) Christoforou, A.; Lilley, K. S. Taming the isobaric taggingelephant in the room in quantitative proteomics. Nat. Methods 2011, 8(11), 911−3.(32) Vaudel, M.; Burkhart, J. M.; Radau, S.; Zahedi, R. P.; Martens,L.; Sickmann, A. Integral quantification accuracy estimation forreporter ion-based quantitative proteomics (iQuARI). J. ProteomeRes. 2012, 11 (10), 5072−80.(33) Ting, L.; Rad, R.; Gygi, S. P.; Haas, W. MS3 eliminates ratiodistortion in isobaric multiplexed quantitative proteomics. Nat.Methods 2011, 8 (11), 937−40.(34) Sturm, R. M.; Lietz, C. B.; Li, L. Improved isobaric tandem masstag quantification by ion mobility mass spectrometry. Rapid Commun.Mass Spectrom. 2014, 28 (9), 1051−60.(35) Christoforou, A.; Arias, A. M.; Lilley, K. S. Determining proteinsubcellular localization in mammalian cell culture with biochemicalfractionation and iTRAQ 8-plex quantification. Methods Mol. Biol.2014, 1156, 157−74.

(36) Hornbeck, P. V.; Kornhauser, J. M.; Tkachev, S.; Zhang, B.;Skrzypek, E.; Murray, B.; Latham, V.; Sullivan, M. PhosphoSitePlus: Acomprehensive resource for investigating the structure and function ofexperimentally determined post-translational modifications in man andmouse. Nucleic Acids Res. 2012, 40 (Database issue), D261−70.(37) Bantscheff, M.; Lemeer, S.; Savitski, M. M.; Kuster, B.Quantitative mass spectrometry in proteomics: Critical review updatefrom 2007 to the present. Anal. Bioanal. Chem. 2012, 404 (4), 939−65.(38) Magrane, M.; Consortium, U. UniProt knowledgebase: A hub ofintegrated protein data. Database (Oxford) 2011, 2011, bar009.(39) Vaudel, M.; Breiter, D.; Beck, F.; Rahnenfuhrer, J.; Martens, L.;Zahedi, R. P. D-score: A search engine independent MD-score.Proteomics 2013, 13 (6), 1036−41.(40) Mischerikow, N.; Altelaar, A. F.; Navarro, J. D.; Mohammed, S.;Heck, A. J. Comparative assessment of site assignments in CID andelectron transfer dissociation spectra of phosphopeptides discloseslimited relocation of phosphate groups. Mol. Cell. Proteomics 2010, 9(10), 2140−8.

Journal of Proteome Research Article

DOI: 10.1021/pr500889vJ. Proteome Res. XXXX, XXX, XXX−XXX

K