organizing ms/ms proteomic data for publication with scaffold

28
Organizing MS/MS Proteomic Data for Publication with Scaffold Brian C. Searle Proteome Software Inc. Portland, Oregon ABRF2009, Memphis TN February 10 th , 2009 Creative Commons Attribution

Upload: varden

Post on 07-Feb-2016

34 views

Category:

Documents


0 download

DESCRIPTION

Organizing MS/MS Proteomic Data for Publication with Scaffold. Brian C. Searle Proteome Software Inc. Portland, Oregon ABRF2009, Memphis TN February 10 th , 2009. Creative Commons Attribution. Publication Standards. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Organizing MS/MS Proteomic Data for Publication with Scaffold

Organizing MS/MS Proteomic Data for Publication with Scaffold

Brian C. SearleProteome Software Inc.

Portland, Oregon

ABRF2009, Memphis TNFebruary 10th, 2009

Creative Commons Attribution

Page 2: Organizing MS/MS Proteomic Data for Publication with Scaffold

Publication Standards

• In 2006 MCP published guidelines for reporting peptide and protein identifications

• Other proteomics journals have adopted similar standards

Page 3: Organizing MS/MS Proteomic Data for Publication with Scaffold

What do the Guidelines Do?

The guidelines ensure enough information to:– Understand and critically assess the results– Enforce a low level of reliability– Provide enough data concerning potentially

questionable results to allow for reassessment

Page 4: Organizing MS/MS Proteomic Data for Publication with Scaffold

What do the Guidelines Do?

The guidelines ensure enough information to:– Understand and critically assess the results– Enforce a low level of reliability– Provide enough data concerning potentially

questionable results to allow for reassessment

Publication guidelines have played a critical role in the acceptance of proteomic analysis

Page 5: Organizing MS/MS Proteomic Data for Publication with Scaffold

Publication Standards are Hard!

Page 6: Organizing MS/MS Proteomic Data for Publication with Scaffold

Publication Standards are Hard!

They're hard for: • the authors who have to comply

Page 7: Organizing MS/MS Proteomic Data for Publication with Scaffold

Publication Standards are Hard!

They're hard for: • the authors who have to comply• the reviewers who must police compliance

Page 8: Organizing MS/MS Proteomic Data for Publication with Scaffold

Publication Standards are Hard!

They're hard for: • the authors who have to comply• the reviewers who must police compliance• the journals because there’s a huge amount of

supplemental data in non-standard formats– PowerPoint guarantee readability, but difficult to use– RAW file formats can expire, but are much more useful

Page 9: Organizing MS/MS Proteomic Data for Publication with Scaffold

Publication Standards are Hard!

They're hard for: • the authors who have to comply• the reviewers who must police compliance• the journals because there’s a huge amount of

supplemental data in non-standard formats

But they don’t have to be!

Page 10: Organizing MS/MS Proteomic Data for Publication with Scaffold

Scaffold Goals

• Make it easier to organize data• Make tables useful for publication• Make figures that could be dropped directly

into manuscripts

• Clear fit when the guidelines were announced

Page 11: Organizing MS/MS Proteomic Data for Publication with Scaffold

Scaffold Makes theGuidelines Easier for Authors

• Collate relevant data from search engine files– Most search engines supported: Mascot,

SEQUEST, X! Tandem, Phenyx, SpectrumMill, OMSSA, IdentityE

Page 12: Organizing MS/MS Proteomic Data for Publication with Scaffold

Scaffold Makes theGuidelines Easier for Authors

• Collate relevant data from search engine files– Most search engines supported: Mascot,

SEQUEST, X! Tandem, Phenyx, SpectrumMill, OMSSA, IdentityE

– Collapse related parameters across all files– Clearly point out missing metadata – Produce comparable search engine and

instrument independent probabilities

Page 13: Organizing MS/MS Proteomic Data for Publication with Scaffold

Collate Relevant Data • Peak picking software, version,

altered parameters

• Database Selection– Database name and version

– Species restriction

– Number of proteins searched

• Database search parameters– Search engine name and version

– Enzyme specificity

– # missed cleavages

– Fixed/variable modifications

– Mass tolerances

• Peptide selection criteria

Page 14: Organizing MS/MS Proteomic Data for Publication with Scaffold

Collate Relevant Data • Peak picking software, version,

altered parameters

• Database Selection– Database name and version

– Species restriction

– Number of proteins searched

• Database search parameters– Search engine name and version

– Enzyme specificity

– # missed cleavages

– Fixed/variable modifications

– Mass tolerances

• Peptide selection criteria

Page 15: Organizing MS/MS Proteomic Data for Publication with Scaffold

Collate Relevant Data

Page 16: Organizing MS/MS Proteomic Data for Publication with Scaffold

Collate Relevant Data

This Button Gets:•Protein accession numbers•Number of unique peptides•% sequence coverage

Page 17: Organizing MS/MS Proteomic Data for Publication with Scaffold

Collate Relevant Data

While This Button Gets:•Peptide sequences identified•Precursor m/z and charge•Score and peptide probability

Page 18: Organizing MS/MS Proteomic Data for Publication with Scaffold

• One hit wonders and modifications require validation

• Distribute the Scaffold file to reviewers

Collate Relevant Data

Page 19: Organizing MS/MS Proteomic Data for Publication with Scaffold

Collate Relevant Data

• Similar table and reports for iTRAQ and TMT quantitative data in Scaffold Q+

That’s it!

Page 20: Organizing MS/MS Proteomic Data for Publication with Scaffold

Making the Guidelines Easier for Reviewers to Police

• All metadata is guaranteed to be present

Page 21: Organizing MS/MS Proteomic Data for Publication with Scaffold

Making the Guidelines Easier for Reviewers to Police

• All metadata is guaranteed to be present

• Quick to review key proteins and spectra

Page 22: Organizing MS/MS Proteomic Data for Publication with Scaffold

Making the Guidelines Easier for Reviewers to Police

• All metadata is guaranteed to be present

• Quick to review key proteins and spectra

• Easy to validate statistical assumptions

Page 23: Organizing MS/MS Proteomic Data for Publication with Scaffold

Probability Assumptions Usually Work

Page 24: Organizing MS/MS Proteomic Data for Publication with Scaffold

But Not Always!

Page 25: Organizing MS/MS Proteomic Data for Publication with Scaffold

Distributing Guideline Compliant Data

• Journals can distribute Scaffold files containing the entire data set

Page 26: Organizing MS/MS Proteomic Data for Publication with Scaffold

Distributing Guideline Compliant Data

• Journals can distribute Scaffold files containing the entire data set

• Authors can share their raw data– Peak list exports for a variety of formats

(MGF, DTA, PKL, etc)

Page 27: Organizing MS/MS Proteomic Data for Publication with Scaffold

Distributing Guideline Compliant Data

• Journals can distribute Scaffold files containing the entire data set

• Authors can share their raw data– Peak list exports for a variety of formats

(MGF, DTA, PKL, etc)

• Journals are allowed to distribute the original version of the free viewer so the files can ALWAYS be opened!

Page 28: Organizing MS/MS Proteomic Data for Publication with Scaffold

Conclusions

Scaffold makes it:• Easier to pass publishing data guidelines• Easier to review data and to police standards• Easier to safely distribute supplemental

material in a useful format