j goecks - the galaxy visual analysis framework

20
The Galaxy Visual Analysis Framework Jeremy Goecks, The Galaxy Team, Anton Nekrutenko, and James Taylor 1

Upload: jan-aerts

Post on 10-May-2015

1.296 views

Category:

Education


1 download

DESCRIPTION

Presentation at BOSC2012 by J Goecks - The Galaxy Visual Analysis Framework

TRANSCRIPT

Page 1: J Goecks - The Galaxy Visual Analysis Framework

The  Galaxy  Visual  Analysis  Framework

Jeremy  Goecks,  The  Galaxy  Team,  Anton  Nekrutenko,  and  James  Taylor

1

Page 2: J Goecks - The Galaxy Visual Analysis Framework

What  is  Galaxy?

Web-based GUI for genomics✦ for  complete  analyses:  obtain  and  integrate  data,  analyze,  

visualize,  share,  publish

A tool integration framework that makes it simple to chain tool usage together step-by-step or create complex workflows

Open source software  that  makes  it  simple  to  ✦ integrate  your  own  tools  and  data✦ customize  and  run  on  your  own  resources

2

http://usegalaxy.org http://galaxyproject.org

Page 3: J Goecks - The Galaxy Visual Analysis Framework

Goal

An  open,  Web-­‐based  approach  for  

making  highly  interactive  visual  analysis  

tools  for  NGS  datasets

3

Page 4: J Goecks - The Galaxy Visual Analysis Framework

Goal

An  open,  Web-­‐based  approach  for  

making  highly  interactive  visual  analysis  

tools  for  NGS  datasets

4

distributed,  extendable,  sharable,  fast

Page 5: J Goecks - The Galaxy Visual Analysis Framework

Goal

An  open,  Web-­‐based  approach  for  

making  highly  interactive  visual  analysis  

tools  for  NGS  datasets

5

distributed,  extendable,  sharable,  fast

flexible,  customizable

Page 6: J Goecks - The Galaxy Visual Analysis Framework

Goal

An  open,  Web-­‐based  approach  for  

making  highly  interactive  visual  analysis  

tools  for  NGS  datasets

6

distributed,  extendable,  sharable,  fast

flexible,  customizable visualization  +  tools

Page 7: J Goecks - The Galaxy Visual Analysis Framework

Goal

An  open,  Web-­‐based  approach  for  

making  highly  interactive  visual  analysis  

tools  for  NGS  datasets

7

distributed,  extendable,  sharable,  fast

flexible,  customizable visualization  +  tools

needs  to  scale  to  huge  datasets  

Page 8: J Goecks - The Galaxy Visual Analysis Framework

Demo

8

Page 9: J Goecks - The Galaxy Visual Analysis Framework

Trackster

9

Page 10: J Goecks - The Galaxy Visual Analysis Framework

Paramamonster

10

Page 11: J Goecks - The Galaxy Visual Analysis Framework

Circster

11

Page 12: J Goecks - The Galaxy Visual Analysis Framework

Trackster

Completely  Web-­‐based✦ no  downloads,  no  add-­‐ons,  no  Flash

Supports  arbitrarily  large  NGS  datasets✦ SAM/BAM,  BED,  GFF/GTF,  VCF,  WIG

Highly  flexible✦ e.g.  custom  rainbow  tracks

Integrated  with  Galaxy  tool  framework✦ dynamic  filtering✦ re-­‐running  tools

12

Page 13: J Goecks - The Galaxy Visual Analysis Framework

Paramamonster

Visualization  for✦ tool  parameter  space✦ outputs  from  different  settings

Can  easily  find  good  settings  by  visual  inspection✦ for  many  settings,  across  multiple  regions

Can  explore  parameter  space  systematically  or  ad-­‐hoc

13

Page 14: J Goecks - The Galaxy Visual Analysis Framework

Circster

Circos-­‐like  visualization  that  provides  genome-­‐wide  views

Complements  Trackster

Very  much  a  work  in  progress

14

Page 15: J Goecks - The Galaxy Visual Analysis Framework

Architecture

15

Datasets

...

Tools

...

Galaxy

Web browser

Galaxy HTML UI

Page 16: J Goecks - The Galaxy Visual Analysis Framework

Architecture

16

Datasets

...

Tools

... Dat

a Pr

ovid

ers

Galaxy

Web browser

Galaxy HTML UI

Page 17: J Goecks - The Galaxy Visual Analysis Framework

Architecture

17

Datasets

...

Tools

... Dat

a Pr

ovid

ers

Galaxy

...

Web browser

Data Managers

Visualizationsd3.js (SVG)HTML5: Canvas, CSS

Galaxy HTML UI

Page 18: J Goecks - The Galaxy Visual Analysis Framework

Future  Directions

Non-­‐genomic  visualizations✦ phylogenetic  trees✦ scatterplots

Integration  of  multiple  visualizations✦ multiple  views  in  same  visualization✦ views  in  different  visualizations

18

Page 19: J Goecks - The Galaxy Visual Analysis Framework

Supported by the NHGRI (HG005542, HG004909, HG005133, HG006620), NSF (DBI-0850103), Penn State University, Emory University, and the Pennsylvania Department of Public Health

Dan Blankenberg Nate Coraor

Greg von Kuster

Enis Afgan Dannon Baker

Jeremy Goecks

Anton NekrutenkoJames Taylor

Dave Clements Jennifer Jackson

19

Page 20: J Goecks - The Galaxy Visual Analysis Framework

http://galaxyproject.org

Galaxy  publications:  http://galaxyproject.org/wiki/Citing

Tech  Track  Talk  (TT08):  Sunday,  2:30p

[email protected]

Thanks!  Questions?