intro spring 2009 bioinformatiatics proteomics. workflow spring 2009 bioinformatiatics proteomics...

48
Intro Spring 2009 Bioinformatiatics Proteomics

Upload: silvester-lee

Post on 12-Jan-2016

217 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

IntroSpring 2009 BioinformatiaticsBioinformatiatics

Proteomics

Page 2: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

workflowSpring 2009 BioinformatiaticsBioinformatiatics

Proteomics Workflow

• Sample Prep• Sequencing• Database Search• Protein ID• Protein Interactions

Page 3: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Identification

Quantification

General workflow of proteomics analysis

External data sourcestaxonomy, ontologies, bibliography…

Applications Systems biology (pathways, interactions..) biomarker-discovery, drug targets

Proteins/peptides

2D gel image aquisition and storage

MALDI, MS/MS

Store peak lists and all meta data

Digestion and/or

separation

PMF

MS/MS

DIGE

LC-MS & Tags

Page 4: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Sequence data bases:EMBL Nucleotide Sequence Database GenBank UniProtKB/Swiss-Prot & TrEMBL Ensemble EST database PIR

Identification

Quantification

General workflow of proteomics analysis

Proteins/peptides

Digestion and/or

separation

MALDI, MS/MS

2D Page data basesSwiss 2D PAGE, Gelbank, Cornelia, WordPAGE

Make 2D

Imaging tools:Melanie, PDQuest ProgenesisDelta 2D

Storing/ organising:ProteincsapeMSight

KEGG PDB DIPOMIMReactomePROSITPfamSPINBONDSTRINGAmiGODavidPubMedMEDLINE

MascotSequestAldentePopitamPhenyxFindModProfoundPepFragMS-FitOMSSASearch XLinksTagIdent

Page 5: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

General workflow of proteomics analysis

Proteins/peptides

Digestion and/or

separation

2D Page data bases

Make 2D

Imaging Softwares:The ability to compare two gels (images) and then identify differently expressed spots

•Melanie•PDQuest•Progenesis•Delta 2DProteinscape –platform for storing, organizing

data

MSight -representation of mass spectra along with data from the separation

2D gel databases:Data integration on the webImage data and textual information

•Swiss 2D PAGE •Gelbank •Cornelia•WordPAGE

Page 6: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Laser capture

Spring 2009 BioinformatiaticsBioinformatiatics

Laser-Capture Micro dissection, LMC

Technique for selectively sampling certain cells within a tissue

Biopsy

Transfer film

Glass slide

Genomic/proteomic analysis

Tissue sample

Laser beam activates film

Selected cells are transferred

Tumor

Cells

Modified from “National Cancer Institute”, US National Institutes of Health:

http://www.cancer.gov/cancertopics/understandingcancer/moleculardiagnostics/Slide29

Page 7: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

TemplateSpring 2009 BioinformatiaticsBioinformatiatics

Page 8: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

FractionationSpring 2009 BioinformatiaticsBioinformatiatics

Affinity Purification

Page 9: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

2D gels at SwissprotSpring 2009 BioinformatiaticsBioinformatiatics

Swissprot ExPaSy Database

2D Electrophoresis

Page 10: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

TemplateSpring 2009 BioinformatiaticsBioinformatiatics

Protein Digestion•Primary sequence must be accessible•Denature – urea in solution or SDS in gel•Reduce & alkylate disulfide bonds between cysteines

•dithiothreitol (DTT) & Iodoacetamide (IAA)

•Digest with enymes•Purify peptide fragments

Page 11: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

TemplateSpring 2009 BioinformatiaticsBioinformatiatics

Page 12: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

codon UsageSpring 2009 BioinformatiaticsBioinformatiatics

Standard Genetic Code (transl_table=1)

AAs = FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGGStarts = ---M---------------M---------------M----------------------------Base1 = TTTTTTTTTTTTTTTTCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAGGGGGGGGGGGGGGGGBase2 = TTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGBase3 = TCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAG

AAs = FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGGStarts = ---M---------------M------------MMMM---------------M------------Base1 = TTTTTTTTTTTTTTTTCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAGGGGGGGGGGGGGGGGBase2 = TTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGBase3 = TCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAG

The Bacterial and Plant Plastid Code (transl_table=11)

AAs = FFLLSSSSYYQQCC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGGStarts = -----------------------------------M----------------------------Base1 = TTTTTTTTTTTTTTTTCCCCCCCCCCCCCCCCAAAAAAAAAAAAAAAAGGGGGGGGGGGGGGGGBase2 = TTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGTTTTCCCCAAAAGGGGBase3 = TCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAGTCAG

The CiliateHexamita Nuclear Code (transl_table=6)

Page 13: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Unusual amino acidsSpring 2009 BioinformatiaticsBioinformatiatics

Unusual Amino Acids

Page 14: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

phosphorylationSpring 2009 BioinformatiaticsBioinformatiatics

Phosphorylation - signal transduction

mRNA

mRNA

Page 15: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

TemplateSpring 2009 BioinformatiaticsBioinformatiatics

Page 16: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

TemplateSpring 2009 BioinformatiaticsBioinformatiatics

Page 17: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

TemplateSpring 2009 BioinformatiaticsBioinformatiatics

Page 18: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

TemplateSpring 2009 BioinformatiaticsBioinformatiatics

Page 19: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

TemplateSpring 2009 BioinformatiaticsBioinformatiatics

Page 20: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

TemplateSpring 2009 BioinformatiaticsBioinformatiatics

Page 21: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

MS analysis

Page 22: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Antibody arrays

Good for low-abundance proteinsProblem is antibody specificity

Page 23: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Array-based protein interaction detection

Page 24: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Protein microarrays

Page 25: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Yeast Two-Hybrid System

Page 26: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

How to organize information?• Gene Ontology

– Biological process• Frequently from biochemical analyses• In silico analysis

– Molecular function• Biochemical analysis

– Cellular component• Biochemical analysis• GFP or other tagging

Page 27: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Interaction maps - Grid

Page 28: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

challenges

• Complexity – some proteins have >1000 variants

• Need for a general technology for targeted manipulation of gene expression

• Limited throughput of todays proteomic platforms

• Lack of general technique for absolute quantitation of proteins

Page 29: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Protein Profiling

• 2D gel electrophoresis

• Difference gel electrophoresis (DIGE)

• LC-MS/MS using coded affinity tagging(ICAT, iTrac, SILAC..)

• ProteinChip Array (SELDI analysis)

• Antibody arrays

Measure the expression of a set of proteins in two samples and compare them - Comparative proteomics

Page 30: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

IntroSpring 2007 BioinformatiaticsBioinformatiatics

RNA and Protein Structure Prediction

Page 31: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein
Page 32: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

SSU Secondary StructureSSU Secondary StructureSSU Secondary StructureSSU Secondary Structure

Page 33: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Ribosome

Page 34: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Ribosome

Page 35: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

-Beaudry & Joyce, Science, 1992

Frq. Of mutation

(%; n=25) after

9 generations.

Page 36: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein
Page 37: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

M13 vector sequenceM13 vector sequence

TATAGGGCGAATTGAATTTAGCGG

or

ATTAACCCTCACTAAAGGGACTAG

to

CCCTT

Page 38: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

PseudoknotsPseudoknots

Page 39: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Hammerhead RibozymeHammerhead Ribozyme

Page 40: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

tRNA Secondary StructuretRNA Secondary Structure

Page 41: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

RNA Tertiary StructureRNA Tertiary Structure

Page 42: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein
Page 43: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein
Page 44: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein
Page 45: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Pyruvate KinasePyruvate Kinase

Page 46: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Human DNA clamp PCNA

Page 47: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein

Chou-Fasman ParametersChou-Fasman Parameters

Page 48: Intro Spring 2009 Bioinformatiatics Proteomics. workflow Spring 2009 Bioinformatiatics Proteomics Workflow Sample Prep Sequencing Database Search Protein