netbiosig2012 chaozhang-mosaic
DESCRIPTION
With the improvement of high-throughput technology, the dramatic increase of large-scale data in both biomolecular concentration and biomolecular interactions has resulted in many biological networks, such as protein interaction networks, gene regulatory networks, and metabolic networks. Although functional analysis is the fundamental step of better understanding biological networks, utilizing vast wealth of data and huge amount of knowledge to annotate and analyze the function of biological networks is still challenging in nowadays bioinformatics. Many software tools are available to visualize and analyze function-derived biological networks, but most of them are isolated with simple functions. One challenge faced by these visualization tools is how to make sense of such networks often represented as massive “hairballs.” Many network analysis algorithms filter or partition networks based on topological features, or mathematically model networks rely on their statistical properties, sidestepping the issue of making sense of the network itself altogether. On other hand, traditional functional enrichment analysis methods regard a network as a list of genes, and annotate networks with gene set enrichment methods. However, it does not consider the topological dynamics of network which might lead to the different functions under different conditions. Therefore, it is necessary to consider molecular interactions to correctly and specifically annotate biological networks. As one of the most successful open source frameworks in bioinformatics, Cytoscape is a powerful network visualization platform that actively supports independent plugin development. By integrating model-view-controller design pattern and Cytoscape techniques, it makes possible an integrated ontology-annotated biological network visualization and analysis platform. In the first stage of the project, we successfully developed two interactive plugins -- Mosaic (http://nrnb.org/tools/mosaic) and NOA (http://nrnb.org/tools/noa) -- to address both visualization and analysis respectively. Mosaic supports interactive network annotation and visualization that includes partitioning, layout and coloring based on biologically-relevant ontologies. It shows slices of a given network in the visual language of biological pathways, which are familiar to any biologist and are ideal frameworks for integrating knowledge, and also provides researchers with an interactive tool to evaluate biological interactions within the context of well-defined processes, functions and cellular localization while retaining all original network information. NOA first introduced link ontology that assigns functions to interactions based on the known annotations of joint genes via optimizing two novel indexes ‘Coverage’ and ‘Diversity’. Then, NOA generates two alternative reference sets to statistically rank the enriched functional terms for a given biological network. It has been proved to be more efficient not only in...TRANSCRIPT
1
Network-Ontology Visualization and Analysis
Chao ZhangComputer Science Department
2
GO annotation
Background
Up to October 26, 2010, there have been more than 2,753,338 annotations covering 48 species in GO database
annotation
3
Enrichment analysis
Background
4
Importance and common drawbacks
• Function enrichment analysis– DAVID (2003) Genome biol 3,364
(3,293)– DAVID (2008) Nature protcols 1,908
(1,802)– Bingo (2005) Bioinformatics 657 (609)
• Limitations– Lack of good visualization tool– All for gene set, not network
Background
5
Workflow
Background
Visualization+
Analysis
•Including as more as possible functions of other software
•Relatively independent modules with interactions
•Very flexible, 4 individual plugins or 1 bundle
6
Visualization (Mosaic)
• What’s the difference?
Mosaic
7
Visualization (Mosaic)
• Mosaic shows slices of a given network in the visual language of biological pathways, which are familiar to any biologist and are ideal frameworks for integrating knowledge
Mosaic
Biological Process
Molecular functionCellular
components
8
Visualization (Mosaic)
• Partitioning with BP• Layout with CC• Coloring with MF• Automatic annotation• ID mapping• Overview network
.
Mosaic
9
1st Partitioning/decomposition
• Decompose a huge biological network to sub-networks (Biological process)
Mosaic
10
2nd Layout/Pathway
• Currently only support PathVisio and WikiPathway formats (Cellular components)
Mosaic
11
3rd Coloring/Highlighting
• Molecular function
Other projects>AVALON
12
3rd Coloring/Highlighting
• Molecular function
Other projects>AVALON
13
Overview network
Mosaic
14
Analysis (NOA)
NOA
Molecular systems biology, 2009
Normal DiseaseDisease
The same gene list
Different phenotype
15
Edge-based algorithm
NOA: a novel Network Ontology Analysis method
Nucleic Acids Research, 2009
NOA
Blast2GO, GoAnnotator, Goanna, etc
FatiGO, DAVID, g:profiler,BiNGO, etc
NOA
Gene annotation
Gene list annotation
Network annotation
N e t w o r k
Network
Single genes
Single genes
G ene set
Gene set
16
Analysis (NOA)
• Edge-base algorithm• Batch mode• Heatmap• Overview network• Interaction with Mosaic
NOA
17
Node/edge-based algorithms
Other projects>AVALON
18
Node/edge-based algorithms
Other projects>AVALON
19
Batch mode
Other projects>AVALON
20
Batch mode
Other projects>AVALON
21
Acknowledgement
Project advisor:Alexander R. Pico (UCSF)Dong Xu (Uni. Of Missouri)
Mosaic:Kristina Hanspers (UCSF)Allan Kuchinsky (Agilent)Nathan Salomonis (UCSF)
NOA:Jiguang Wang (Columbia Uni.)Luonan Chen (CAS)Xiangsun Zhang (CAS)
Google’ Summer of Code Program
Acknowledgement
22
Question?
Mosaic:http://nrnb.org/tools/mosaic/
NOA:http://nrnb.org/tools/noa/
Chao Zhang:[email protected]
Acknowledgement