organizing ms/ms proteomic data for publication with scaffold

Post on 07-Feb-2016

34 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

Organizing MS/MS Proteomic Data for Publication with Scaffold. Brian C. Searle Proteome Software Inc. Portland, Oregon ABRF2009, Memphis TN February 10 th , 2009. Creative Commons Attribution. Publication Standards. - PowerPoint PPT Presentation

TRANSCRIPT

Organizing MS/MS Proteomic Data for Publication with Scaffold

Brian C. SearleProteome Software Inc.

Portland, Oregon

ABRF2009, Memphis TNFebruary 10th, 2009

Creative Commons Attribution

Publication Standards

• In 2006 MCP published guidelines for reporting peptide and protein identifications

• Other proteomics journals have adopted similar standards

What do the Guidelines Do?

The guidelines ensure enough information to:– Understand and critically assess the results– Enforce a low level of reliability– Provide enough data concerning potentially

questionable results to allow for reassessment

What do the Guidelines Do?

The guidelines ensure enough information to:– Understand and critically assess the results– Enforce a low level of reliability– Provide enough data concerning potentially

questionable results to allow for reassessment

Publication guidelines have played a critical role in the acceptance of proteomic analysis

Publication Standards are Hard!

Publication Standards are Hard!

They're hard for: • the authors who have to comply

Publication Standards are Hard!

They're hard for: • the authors who have to comply• the reviewers who must police compliance

Publication Standards are Hard!

They're hard for: • the authors who have to comply• the reviewers who must police compliance• the journals because there’s a huge amount of

supplemental data in non-standard formats– PowerPoint guarantee readability, but difficult to use– RAW file formats can expire, but are much more useful

Publication Standards are Hard!

They're hard for: • the authors who have to comply• the reviewers who must police compliance• the journals because there’s a huge amount of

supplemental data in non-standard formats

But they don’t have to be!

Scaffold Goals

• Make it easier to organize data• Make tables useful for publication• Make figures that could be dropped directly

into manuscripts

• Clear fit when the guidelines were announced

Scaffold Makes theGuidelines Easier for Authors

• Collate relevant data from search engine files– Most search engines supported: Mascot,

SEQUEST, X! Tandem, Phenyx, SpectrumMill, OMSSA, IdentityE

Scaffold Makes theGuidelines Easier for Authors

• Collate relevant data from search engine files– Most search engines supported: Mascot,

SEQUEST, X! Tandem, Phenyx, SpectrumMill, OMSSA, IdentityE

– Collapse related parameters across all files– Clearly point out missing metadata – Produce comparable search engine and

instrument independent probabilities

Collate Relevant Data • Peak picking software, version,

altered parameters

• Database Selection– Database name and version

– Species restriction

– Number of proteins searched

• Database search parameters– Search engine name and version

– Enzyme specificity

– # missed cleavages

– Fixed/variable modifications

– Mass tolerances

• Peptide selection criteria

Collate Relevant Data • Peak picking software, version,

altered parameters

• Database Selection– Database name and version

– Species restriction

– Number of proteins searched

• Database search parameters– Search engine name and version

– Enzyme specificity

– # missed cleavages

– Fixed/variable modifications

– Mass tolerances

• Peptide selection criteria

Collate Relevant Data

Collate Relevant Data

This Button Gets:•Protein accession numbers•Number of unique peptides•% sequence coverage

Collate Relevant Data

While This Button Gets:•Peptide sequences identified•Precursor m/z and charge•Score and peptide probability

• One hit wonders and modifications require validation

• Distribute the Scaffold file to reviewers

Collate Relevant Data

Collate Relevant Data

• Similar table and reports for iTRAQ and TMT quantitative data in Scaffold Q+

That’s it!

Making the Guidelines Easier for Reviewers to Police

• All metadata is guaranteed to be present

Making the Guidelines Easier for Reviewers to Police

• All metadata is guaranteed to be present

• Quick to review key proteins and spectra

Making the Guidelines Easier for Reviewers to Police

• All metadata is guaranteed to be present

• Quick to review key proteins and spectra

• Easy to validate statistical assumptions

Probability Assumptions Usually Work

But Not Always!

Distributing Guideline Compliant Data

• Journals can distribute Scaffold files containing the entire data set

Distributing Guideline Compliant Data

• Journals can distribute Scaffold files containing the entire data set

• Authors can share their raw data– Peak list exports for a variety of formats

(MGF, DTA, PKL, etc)

Distributing Guideline Compliant Data

• Journals can distribute Scaffold files containing the entire data set

• Authors can share their raw data– Peak list exports for a variety of formats

(MGF, DTA, PKL, etc)

• Journals are allowed to distribute the original version of the free viewer so the files can ALWAYS be opened!

Conclusions

Scaffold makes it:• Easier to pass publishing data guidelines• Easier to review data and to police standards• Easier to safely distribute supplemental

material in a useful format

top related