an exploratory method to reconstruct pathways

22
An Exploratory Method to Reconstruct Pathways Cory Tobin

Upload: dyanne

Post on 21-Jan-2016

28 views

Category:

Documents


0 download

DESCRIPTION

Cory Tobin. An Exploratory Method to Reconstruct Pathways. Collaborators. Dr. Matteo Pellegrini Shawn Cokus @ UCLA. Outline. Purpose Methods Sample Data Possible Uses Final Remarks. Purpose. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: An Exploratory Method to Reconstruct Pathways

An Exploratory Method to Reconstruct Pathways

Cory Tobin

Page 2: An Exploratory Method to Reconstruct Pathways

Collaborators

Dr. Matteo Pellegrini

Shawn Cokus

@ UCLA

Page 3: An Exploratory Method to Reconstruct Pathways

Outline

• Purpose

• Methods

• Sample Data

• Possible Uses

• Final Remarks

Page 4: An Exploratory Method to Reconstruct Pathways

Purpose

Reconstruct signal transduction pathways & protein complexes using protein-protein interactions reported

on the web

Page 5: An Exploratory Method to Reconstruct Pathways

Materials

• Python

• Yahoo! Search API

• ProstgreSQL

• Django Web Framework

Page 6: An Exploratory Method to Reconstruct Pathways

Methods

• Construct high likelihood / low noise queries• Ex: “Jak2 phophorylates Stat5”

• Query Yahoo! for every permutation of 2 proteins in a given species

• Use high likelihood joining words…

Page 7: An Exploratory Method to Reconstruct Pathways

Joining Words

• Phosphorylates• Methylates• Acetylates• Activates• Deactivates• Binds to

• Inhibits• Dephosphorylates• Glycosylates• Ubiquitinates• Interacts with

Page 8: An Exploratory Method to Reconstruct Pathways

Full Query

“Jak2 acetylates OR phosphorylates OR

methylates OR binds to OR interacts

with Stat5”

Page 9: An Exploratory Method to Reconstruct Pathways

Hindrance

• Doing pair-wise queries for all N proteins in an organism requires N*N queries

• E. coli has >4000 genes

(16,000,000 queries)

• Yahoo! allows 5k / day / computer

Page 10: An Exploratory Method to Reconstruct Pathways

Possible Solutions

Recruit 4k computers and finish in a day

Find a better method

OR

Page 11: An Exploratory Method to Reconstruct Pathways

Better Method

• Only specify the first symbol

• Iterate through the results and only

take results whose word following the

joining symbol corresponds to a valid

symbol

Page 12: An Exploratory Method to Reconstruct Pathways

Full Query

“Jak2 acetylates OR phosphorylates OR

methylates OR binds to OR interacts

with”

Page 13: An Exploratory Method to Reconstruct Pathways

Another Hindrance

• The symbol “thE”(and others like it)

• Searches need to be case insensitive to account for “p53” and “P53”

• Recognizes the word “the” as the protein “thE”

Page 14: An Exploratory Method to Reconstruct Pathways

Solution

• Use a list of stop words

• Very common, non-interesting

words

• If the name appears in that list of stop

words, just forget about that protein

all together

http://www.dcs.gla.ac.uk/idom/ir_resources/linguistic_utils/stop_words

Page 15: An Exploratory Method to Reconstruct Pathways

Methods (cont.)

• After we have this data in a database...

• Create a web interface to the data so others can search for protein interactions (Shwe)

Page 16: An Exploratory Method to Reconstruct Pathways

Data

KEGG - Yeast MAPK

Our Datahttp://www.genome.jp/dbget-bin/show_pathway?sce04010+YGR040W

Page 17: An Exploratory Method to Reconstruct Pathways

Data (cont.)

KEGG - Yeast Cell Cycle

http://www.genome.jp/dbget-bin/get_pathway?org_name=sce&mapno=04110

Our Data

Page 18: An Exploratory Method to Reconstruct Pathways

Data (cont.)

KEGG - Yeast 26S Proteasome

Our Data

http://www.genome.jp/dbget-bin/show_pathway?sce03050+YER012W

Page 19: An Exploratory Method to Reconstruct Pathways

Possible Uses

• General reference for protein

interactions

• Curate other databases

Page 20: An Exploratory Method to Reconstruct Pathways

Final Remarks

• Only works well detecting signal pathways and protein complexes

• Not metabolic pathways

• It is possible to get high quality, interesting data without much noise or complex text analysis algorithms

Page 21: An Exploratory Method to Reconstruct Pathways

References

• Kyoto Encyclopedia of Genes and Genomes http://www.genome.jp/kegg/

• Cytoscape Network Visualization http://www.cytoscape.org/

• Yahoo! Developer Network http://developer.yahoo.com/

Page 22: An Exploratory Method to Reconstruct Pathways

Acknowledgements

• Dr. Matteo Pellegrini

• Everyone in the lab

• SoCalBSI

• NIH / NSF