introduction to functional analysis -...
TRANSCRIPT
![Page 1: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/1.jpg)
Daniel Rico, PhD. [email protected]
::: Introduction to Functional Analysis
Course on Functional Analysis
Madrid, November 6th, 2012.
![Page 2: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/2.jpg)
::: Schedule.
1. Biological (Functional) Databases 2. Threshold-based and threshold free methods 3. Threshold-based example 1: FatiGO. 4. Threshold-based example 2: DAVID.
![Page 3: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/3.jpg)
Arabidopsis thaliana
Homo sapiens
Mus musculus
Rattus norvegicus
Drosophila melanogaster
Caenorhabditis elegans
Saccharmoyces cerevisae
Gallus gallus
Danio rerio
HGNC symbol
EMBL acc
RefSeq
PDB
Protein Id
IPI….
Genes IDs
Gene Ontology Biological Process Molecular Function Cellular Component
UniProt/Swiss-Prot
UniProtKB/TrEMBL
Ensembl IDs
EntrezGene
Affymetrix
Agilent
KEGG pathways
Regulatory elements miRNA
CisRed
Transcription Factor Binding Sites
REACTOME pathways
InterPro Motifs
Bioentities from literature:
Diseases terms Chemical terms
Gene Expression in tissues
Keywords Swissprot
Biological databases
![Page 4: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/4.jpg)
Gene Ontology CONSORTIUM h"p://www.geneontology.org
• The objective of GO is to provide controlled vocabularies for the description of the molecular function, biological process and cellular component of gene products.
• These terms are to be used as attributes of gene products by
collaborating databases, facilitating uniform queries across them.
• The controlled vocabularies of terms are structured
![Page 5: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/5.jpg)
GO structure The three categories of GO Molecular Function
the tasks performed by individual gene products; examples are transcription factor and DNA helicase
Biological Process
broad biological goals, such as mitosis or purine metabolism, that are accomplished by ordered assemblies of molecular functions
Cellular Component
subcellular structures, locations, and macromolecular complexes; examples include nucleus, telomere, and origin recognition complex
GO tree structure
IS_A relation
PART_OF relation
![Page 6: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/6.jpg)
Pathways
![Page 7: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/7.jpg)
What is a Biological pathway?
ª an ordered series of molecular events that leads to the crea:on new molecular product, or a change in a cellular state or process.
Boehringer Mannheim/Roche "Biochemical Pathways" wall chart (1968)
![Page 8: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/8.jpg)
h6p://www.genome.ad.jp/kegg/pathway.html
![Page 9: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/9.jpg)
h6p://www.biocarta.com/genes/index.asp
![Page 10: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/10.jpg)
h6p://www.reactome.org/
![Page 11: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/11.jpg)
h6p://www.pathwaycommons.org
![Page 12: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/12.jpg)
h6p://www.whichgenes.org/
![Page 13: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/13.jpg)
::: Schedule.
1. Biological (Functional) Databases 2. Threshold-based and threshold free methods 3. Threshold-based example 1: FatiGO. 4. Threshold-based example 2: DAVID.
![Page 14: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/14.jpg)
The two-steps approach
• Genes of interest are selected using the experimental value.
• Selected genes are compared to the background.
Threshold-based functional analysis
Study the enrichment in functional
terms in groups of genes defined by the experimental value.
FatiGO
GOminer
DAVID
Marmite
Threshold-free functional analysis
Select genes taking into account
their functional properties.
FatiScan
GSEA
MarmiteScan
• Under a systems biology
perspective. • Detect blocks of functionally
related genes.
![Page 15: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/15.jpg)
Class1 Class2
6est cut-‐off
FDR<0.05
FDR<0.05
Biological meaning?
Threshold-based functional analysis
![Page 16: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/16.jpg)
ES/NES staOsOc
-‐
+
Class1 Class2
Gene Set 1
6est cut-‐off
Gene Set 2
Gene Set 3
Gene set 3 enriched in Class 2
Gene set 2 enriched in Class 1
Threshold-free functional analysis
![Page 17: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/17.jpg)
Threshold-‐based funcOonal analysis
Comparing a gene list of interest with
a background gene list
![Page 18: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/18.jpg)
6est cut-‐off
FDR<0.05
FDR<0.05
List 1
List 2 (background)
Class1 Class2
List 1b / List 2b
![Page 19: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/19.jpg)
Testing the distribution of GO terms among two lists of genes
Biosynthesis 60% Biosynthesis 20%
Sporulation 20% Sporulation 20%
List 1
List 2
(background)
Genes in List 1 are significantly enriched in biosynthesis, but not in sporulation.
Are this two groups of genes
carrying out different
biological roles?
8 4 No biosynthesis 2 6 Biosynthesis B A
![Page 20: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/20.jpg)
§ List1: genes of interest (they are significantly over- or under-expressed when two classes of experiments are compared, co-located in the chromosomes, etc.)
§ List2:the background (typically the rest of genes). § Select suitable database, Run...
List2
Remove genes repeated in list1
Remove genes repeated between
both lists
Remove genes repeated in list2
Extract functional
terms
Comparing groups of genes
List1 “clean” List1
“clean” List2
DAVID/BABELOMICS
GO
KEGG Interpro
KW Bioentities
Gene Expression
TF ...
011000101010101001 ...... 11001010 ........... 010001010 ........... 0110001010 ........... 1111001111...............
Matrix of functional
terms Fisher´s test/
EASE test
Adjust p-value by FDR
Significant functional
terms
![Page 21: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/21.jpg)
DAVID
![Page 22: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/22.jpg)
http://david.abcc.ncifcrf.gov/
![Page 23: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/23.jpg)
![Page 24: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/24.jpg)
Click in here
![Page 25: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/25.jpg)
Explore the different annotations you wish to use
Try Clustering, Chart and Table... what are the differences?
![Page 26: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/26.jpg)
::: How the functional profiling should never be done
It is not uncommon to find the following asserOon in papers and talks: “then we examined our set of genes selected in this way (whatever) and we discover that 65% of them were related to metabolism, so we can conclude that our experiment acOvates metabolism genes”.
AnnotaOon is not a funcOonal enrichment analysis result!!!
![Page 27: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/27.jpg)
::: BioGPS: A nice tool to explore gene annotation !!
http://biogps.org/
![Page 28: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/28.jpg)
FaOGO
![Page 29: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/29.jpg)
http://babelomics.bioinfo.cipf.es/
![Page 30: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/30.jpg)
![Page 31: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/31.jpg)
http://bioinfo.cipf.es/babelomicstutorial/enrichment_analysis
More examples in:
![Page 32: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/32.jpg)
http://www.reactome.org/ReactomeGWT/entrypoint.html#PathwayAnalysisDataUploadPage http://www.reactome.org/userguide/Usersguide.html#Pathway_Analysis
New tool integrated in REACTOME
![Page 33: Introduction to Functional Analysis - bioinfo.cnio.esbioinfo.cnio.es/files/training/Sixth_Functional_Analysis_of_Gene... · Daniel Rico, PhD. drico@cnio.es::: Introduction to Functional](https://reader031.vdocuments.net/reader031/viewer/2022022119/5e1010ae7da82923b94dedea/html5/thumbnails/33.jpg)
Many of these slides have been taken and adapted from originals by: Gonzalo Gómez Babelomics, DAVID and Reactome teams.
ACKNOWLEDGEMENTS