24th Feb 2006 Jane Lomax Gene Ontology tutorial • Talk: Using the Gene Ontology (GO) for Expression Analysis • Practical: Onto-Express analysis tool • Talk: GO further • Practical: AmiGO browser • Practical: Annotation exercise


24th Feb 2006 Jane Lomax

Gene Ontology tutorial

• Talk: Using the Gene Ontology (GO) for Expression Analysis

• Practical: Onto-Express analysis tool

• Talk: GO further

• Practical: AmiGO browser

• Practical: Annotation exercise


Using the Gene Ontology (GO) for Expression Analysis


GO for Expression Analysis

• What is GO?
• Structure of GO
• GO tools for microarray analysis
• Onto-Express tutorial


What is the Gene Ontology?

• Set of biological phrases (terms) which are applied to genes:
– protein kinase
– apoptosis
– membrane


What is the Gene Ontology?

• Genes are linked, or associated, with GO terms by trained curators at genome databases
– known as ‘gene associations’ or GO annotations

• Some GO annotations created automatically


[Diagram labels: genome and protein databases; GO annotations (gene -> GO term); GO database; associated genes]


What is the Gene Ontology?

• Allows biologists to make inferences across large numbers of genes without researching each one individually


Copyright ©1998 by the National Academy of Sciences

Eisen, Michael B. et al. (1998) Proc. Natl. Acad. Sci. USA 95, 14863-14868


GO structure

• GO isn’t just a flat list of biological terms

• terms are related within a hierarchy



GO structure

• This means genes can be grouped according to user-defined levels

• Allows broad overview of gene set or genome


How does GO work?

• GO is species independent
– some terms, especially lower-level, detailed terms, may be specific to a certain group
  • e.g. photosynthesis
– But when collapsed up to the higher levels, terms are not dependent on species


How does GO work?

What information might we want to capture about a gene product?

• What does the gene product do?
• Where and when does it act?
• Why does it perform these activities?


GO structure

• GO terms divided into three parts:
– cellular component
– molecular function
– biological process


Cellular Component

• where a gene product acts


• Enzyme complexes in the component ontology refer to places, not activities.


Molecular Function

• activities or “jobs” of a gene product

glucose-6-phosphate isomerase activity


Molecular Function

insulin binding

insulin receptor activity


Molecular Function

drug transporter activity


Molecular Function

• A gene product may have several functions; a function term refers to a single reaction or activity, not a gene product.

• Sets of functions make up a biological process.


Biological Process

a commonly recognized series of events

cell division


Biological Process

transcription


Biological Process

regulation of gluconeogenesis


Biological Process

limb development


Biological Process

courtship behavior


Ontology Structure

• Terms are linked by two relationships
– is-a
– part-of


Ontology Structure

[Diagram: the terms cell, membrane, chloroplast, mitochondrial membrane and chloroplast membrane, connected by is-a and part-of links]


Ontology Structure

• Ontologies are structured as a hierarchical directed acyclic graph (DAG)

• Terms can have more than one parent and zero, one or more children
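A minimal sketch of what multiple parentage means in practice, using the five terms from the cell / membrane / chloroplast diagram. The transcript does not preserve which arrow joins which pair of terms, so the edges in `PARENTS` are an assumption, and `ancestors` is a hypothetical helper, not part of any GO tool.

```python
from collections import deque

# Child term -> list of (relationship, parent term).
# ASSUMPTION: this wiring is a plausible reading of the slide diagram;
# the transcript itself only lists the term names and the two link types.
PARENTS = {
    "cell": [],
    "membrane": [("part_of", "cell")],
    "chloroplast": [("part_of", "cell")],
    "mitochondrial membrane": [("is_a", "membrane")],
    # Two parents: this is what makes the structure a DAG rather than a tree.
    "chloroplast membrane": [("is_a", "membrane"), ("part_of", "chloroplast")],
}


def ancestors(term, parents=PARENTS):
    """Return every term reachable by following parent links upwards."""
    seen = set()
    queue = deque([term])
    while queue:
        current = queue.popleft()
        for _relationship, parent in parents[current]:
            if parent not in seen:
                seen.add(parent)
                queue.append(parent)
    return seen


if __name__ == "__main__":
    print(sorted(ancestors("chloroplast membrane")))
```

Walking upwards like this is also how a gene annotated to a detailed term can be counted under any of its broader ancestor terms.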


Ontology Structure

[Diagram: the terms cell, membrane, chloroplast, mitochondrial membrane and chloroplast membrane]

Directed Acyclic Graph (DAG) - multiple parentage allowed


Anatomy of a GO term

id: GO:0006094    (unique GO ID)
name: gluconeogenesis    (term name)
namespace: process    (ontology)
def: The formation of glucose from noncarbohydrate precursors, such as pyruvate, amino acids and glycerol. [http://cancerweb.ncl.ac.uk/omd/index.html]    (definition)
exact_synonym: glucose biosynthesis    (synonym)
xref_analog: MetaCyc:GLUCONEO-PWY    (database ref)
is_a: GO:0006006    (parentage)
is_a: GO:0006092    (parentage)
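The term record above is a list of `tag: value` lines, so it can be read with a few lines of code. The following is a rough sketch only: `parse_stanza` is a hypothetical helper written for this example and handles just the simple layout shown on the slide, not the full ontology file format.

```python
# The gluconeogenesis record from the slide, one tag per line.
STANZA = """\
id: GO:0006094
name: gluconeogenesis
namespace: process
def: The formation of glucose from noncarbohydrate precursors, such as pyruvate, amino acids and glycerol. [http://cancerweb.ncl.ac.uk/omd/index.html]
exact_synonym: glucose biosynthesis
xref_analog: MetaCyc:GLUCONEO-PWY
is_a: GO:0006006
is_a: GO:0006092
"""


def parse_stanza(text):
    """Collect 'tag: value' lines into a dict of lists.

    Values are kept in lists because a tag such as is_a can occur
    more than once (a term may have several parents).
    """
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue
        # Split on the first ': ' only, so IDs like GO:0006094 stay intact.
        tag, _, value = line.partition(": ")
        fields.setdefault(tag, []).append(value.strip())
    return fields


if __name__ == "__main__":
    term = parse_stanza(STANZA)
    print(term["id"][0], term["name"][0], term["is_a"])
```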


GO tools

• GO resources are freely available to anyone to use without restriction
– Includes the ontologies, gene associations and tools developed by GO

• Other groups have used GO to create tools for many purposes:

http://www.geneontology.org/GO.tools


GO tools

• Affymetrix also provide a Gene Ontology Mining Tool as part of their NetAffx™ Analysis Center which returns GO terms for probe sets


GO tools

• Many tools exist that use GO to find common biological functions from a list of genes:

http://www.geneontology.org/GO.tools.microarray.shtml


GO tools

• Most of these tools work in a similar way:
– input a gene list and a subset of ‘interesting’ genes
– tool shows which GO categories have most interesting genes associated with them, i.e. which categories are ‘enriched’ for interesting genes
– tool provides a statistical measure to determine whether enrichment is significant
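The first two steps of that workflow can be sketched in a few lines. This is an illustration only: the gene names, the annotations and the helper functions are all made up for the example, and real tools differ in how they count and rank categories.

```python
from collections import Counter

# HYPOTHETICAL toy data: gene -> set of GO categories it is annotated to.
ANNOTATIONS = {
    "gene1": {"mitosis", "apoptosis"},
    "gene2": {"mitosis"},
    "gene3": {"glucose transport"},
    "gene4": {"mitosis", "glucose transport"},
    "gene5": {"apoptosis"},
    "gene6": {"mitosis"},
}
# HYPOTHETICAL subset of 'interesting' (e.g. differentially regulated) genes.
INTERESTING = {"gene3", "gene4"}


def category_counts(genes, annotations):
    """Count how many of the given genes are annotated to each category."""
    counts = Counter()
    for gene in genes:
        counts.update(annotations.get(gene, ()))
    return counts


def enrichment_table(all_genes, interesting, annotations):
    """Return (category, observed, expected) rows, most enriched first."""
    background = category_counts(all_genes, annotations)
    observed = category_counts(interesting, annotations)
    rows = []
    for category, on_list in background.items():
        # Expected count if the interesting genes were a random sample.
        expected = len(interesting) * on_list / len(all_genes)
        rows.append((category, observed[category], expected))
    return sorted(rows, key=lambda row: row[1] / row[2], reverse=True)


if __name__ == "__main__":
    for category, seen, expected in enrichment_table(ANNOTATIONS, INTERESTING, ANNOTATIONS):
        print(f"{category}: observed {seen}, expected {expected:.2f}")
```

Ranking by the observed/expected ratio is only a first look; the third step, a significance measure, is what separates real enrichment from chance.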


Microarray process

• Treat samples
• Collect mRNA
• Label
• Hybridize
• Scan
• Normalize
• Select differentially regulated genes
• Understand the biological phenomena involved


Traditional analysis

Gene 1: Apoptosis, Cell-cell signaling, Protein phosphorylation, Mitosis, …

Gene 2: Growth control, Mitosis, Oncogenesis, Protein phosphorylation, …

Gene 3: Growth control, Mitosis, Oncogenesis, Protein phosphorylation, …

Gene 4: Nervous system, Pregnancy, Oncogenesis, Mitosis, …

Gene 100: Positive ctrl. of cell prolif., Mitosis, Oncogenesis, Glucose transport, …


Traditional analysis

• gene by gene basis

• requires literature searching

• time-consuming


Using GO annotations

• But by using GO annotations, this work has already been done for you!

GO:0006915 : apoptosis


Grouping by process

Apoptosis: Gene 1, Gene 53

Mitosis: Gene 2, Gene 5, Gene 45, Gene 7, Gene 35, …

Positive ctrl. of cell prolif.: Gene 7, Gene 3, Gene 12, …

Growth: Gene 5, Gene 2, Gene 6, …

Glucose transport: Gene 7, Gene 3, Gene 6, …
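Grouping by process is simply the gene-by-gene listing turned inside out. A small sketch, assuming the annotations are already available as a gene -> process mapping; the three genes are copied from the "Traditional analysis" slide with the elided entries left out, and `group_by_process` is a hypothetical helper.

```python
from collections import defaultdict

# Gene -> processes, as listed on the "Traditional analysis" slide.
# The trailing "…" entries on the slide are not filled in here.
GENE_TO_PROCESSES = {
    "Gene 1": ["Apoptosis", "Cell-cell signaling", "Protein phosphorylation", "Mitosis"],
    "Gene 2": ["Growth control", "Mitosis", "Oncogenesis", "Protein phosphorylation"],
    "Gene 4": ["Nervous system", "Pregnancy", "Oncogenesis", "Mitosis"],
}


def group_by_process(gene_to_processes):
    """Invert a gene -> processes mapping into process -> genes."""
    grouped = defaultdict(list)
    for gene, processes in gene_to_processes.items():
        for process in processes:
            grouped[process].append(gene)
    return dict(grouped)


if __name__ == "__main__":
    for process, genes in group_by_process(GENE_TO_PROCESSES).items():
        print(f"{process}: {', '.join(genes)}")
```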


GO for microarray analysis

• Annotations give ‘function’ label to genes

• Ask meaningful questions of microarray data, e.g.
– genes involved in the same process, same/different expression patterns?


Using GO in practice

• statistical measure
– how likely it is that your differentially regulated genes fall into that category by chance

microarray (1000 genes) -> experiment -> 100 genes differentially regulated

mitosis – 80/100
apoptosis – 40/100
p. ctrl. cell prol. – 30/100
glucose transp. – 20/100

[Bar chart: number of differentially regulated genes (axis 0 to 80) for mitosis, apoptosis, positive control of cell proliferation and glucose transport]


Using GO in practice

• However, when you look at the distribution of all genes on the microarray:

Process               Genes on array   # genes expected in 100 random genes   # genes occurred
mitosis               800/1000         80                                     80
apoptosis             400/1000         40                                     40
p. ctrl. cell prol.   100/1000         10                                     30
glucose transp.       50/1000          5                                      20
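The expected counts in that table, and one possible significance measure, can be reproduced as follows. The slides do not say which statistic is used, so the hypergeometric tail probability below is an assumption (one common choice for this kind of question), and the function names are hypothetical.

```python
from math import comb

ARRAY_SIZE = 1000  # genes on the array
SELECTED = 100     # differentially regulated genes

# Category -> (genes on the array in the category, observed among the 100 selected).
# Numbers are taken from the table on the slide.
CATEGORIES = {
    "mitosis": (800, 80),
    "apoptosis": (400, 40),
    "p. ctrl. cell prol.": (100, 30),
    "glucose transp.": (50, 20),
}


def expected_count(in_category, selected=SELECTED, total=ARRAY_SIZE):
    """Genes expected in the category if `selected` genes were picked at random."""
    return selected * in_category / total


def hypergeom_tail(observed, in_category, selected=SELECTED, total=ARRAY_SIZE):
    """P(X >= observed) when drawing `selected` genes without replacement."""
    upper = min(in_category, selected)
    favourable = sum(
        comb(in_category, k) * comb(total - in_category, selected - k)
        for k in range(observed, upper + 1)
    )
    return favourable / comb(total, selected)


if __name__ == "__main__":
    for name, (in_category, observed) in CATEGORIES.items():
        exp = expected_count(in_category)
        p = hypergeom_tail(observed, in_category)
        print(f"{name}: expected {exp:g}, observed {observed}, p = {p:.3g}")
```

Mitosis and apoptosis occur exactly as often as a random draw would predict, so their tail probabilities are large; positive control of cell proliferation and glucose transport occur three and four times more often than expected, and their tail probabilities are very small.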


The tutorial

• Analysing microarray data using GO with Onto-Express

http://vortex.cs.wayne.edu/projects.htm#Onto-Express


The tutorial - Onto-Express

http://vortex.cs.wayne.edu/projects.htm#Onto-Express


Onto-Express walkthrough

http://vortex.cs.wayne.edu/projects.htm#Onto-Express