visual compression of workflow visualizations with automated detection of macro motifs

Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs
Eamonn Maguire, Philippe Rocca-Serra, Susanna-Assunta Sansone, Jim Davies and Min Chen
University of Oxford e-Research Centre; University of Oxford Department of Computer Science
VIS 2013, 13th-18th October 2013

Upload: eamonn-maguire

Post on 14-Dec-2014


DESCRIPTION

VIS 2013 presentation.
Paper is available here: http://www.oerc.ox.ac.uk/personal-pages/emaguire/AutoMacron.pdf
Code is available here: http://github.com/isa-tools/automacron

TRANSCRIPT

Page 1: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Eamonn Maguire, Philippe Rocca-Serra, Susanna-Assunta Sansone, Jim Davies and Min Chen

University of Oxford e-Research Centre; University of Oxford Department of Computer Science

VIS 2013, 13th-18th October 2013

Page 2: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Some terminology

Motif: a commonly observed subgraph. Motifs are very commonly used in biology (protein-protein interaction, transcription/regulation networks), chemistry, and even visualization (e.g. VisComplete).

Macro: a single instruction that expands automatically into a more complex set of instructions.

Workflow: literally a flow of work, showing the processes enacted from start to finish in, say, business processes, software execution, analysis procedures or, in our case, biological experiments.

Workflows are used to enable reproducibility, e.g. VisTrails in our VIS community (40,000 downloads).

Page 4: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Roadmap

Workflow -> Automatically detect motifs -> Substitute motifs with ‘macros’

Page 7: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Blockades

Current Motif Detection Algorithm Limitations

No semantics

Limited motif sizes (max 10)

Deciding what should be a Macro

Macros in electronic circuit diagrams are the product of years of refinement.

Macros in biological workflows, for instance, are new... how do we determine what should be a macro?

Page 8: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Example case

Biology

VIS 2013, 13th-18th October 2013

Page 9: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Extension on Previous Work

Taxonomy-based Glyph Design (Maguire et al., 2012, IEEE TVCG)

Visualizing (ISA-based) workflows of biological experiments

Page 10: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

A Typical Biological Experiment

Hypothesis -> Experiment -> Results & Analysis -> Paper

Page 11: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Representing an Experiment - Workflows! Reproducibility!

Key: material, protocol, chemical, data

Source name -> Sampling Protocol -> Sample name -> Chemical Label -> Labeling Protocol -> Labeled Extract -> Hybridisation Protocol -> Assay Name -> Scanning Protocol -> Raw Data File -> Feature Extraction Protocol -> Processed Data File

Workflows describe the flow of work from a biological sample to the data file. The workflow varies between technologies, but there is a large commonality in steps. For example, the labeling step is very common in DNA microarray experiments.
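As a rough illustration of the workflow on this slide, the chain of steps can be held as a typed, directed sequence. This is only a sketch: the step names come from the slide, but the `kind_of` heuristic that assigns each step to one of the key's four categories is an assumption made for the example, not the ISA model's actual typing.

```python
# Step names as listed on the slide, in order.
STEPS = [
    "Source name", "Sampling Protocol", "Sample name", "Chemical Label",
    "Labeling Protocol", "Labeled Extract", "Hybridisation Protocol",
    "Assay Name", "Scanning Protocol", "Raw Data File",
    "Feature Extraction Protocol", "Processed Data File",
]


def kind_of(name: str) -> str:
    """Guess a node kind from its name (hypothetical heuristic for this sketch)."""
    if name.endswith("Protocol"):
        return "protocol"
    if name.endswith("File"):
        return "data"
    if name.startswith("Chemical"):
        return "chemical"
    return "material"


# Directed edges between consecutive steps, and the typed view of the chain.
EDGES = list(zip(STEPS, STEPS[1:]))
KINDS = [kind_of(step) for step in STEPS]

for step, kind in zip(STEPS, KINDS):
    print(f"{kind:9s} {step}")
```

Keeping the node kind next to the node name is what later allows motifs to be compared on semantics as well as on shape.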

Page 12: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Our Process

[Figure: process overview. BIOLOGICAL WORKFLOW REPOSITORY -> MOTIF EXTRACTION ALGORITHM -> MOTIFS -> RANKING ALGORITHM -> MACRO SELECTION VIA UI (DOMAIN EXPERT) -> SELECTED MACROS -> GLYPH DESIGN / MACRO ANNOTATION -> MACRO INSERTION IN GRAPH. Example motifs (e.g. "Branch & Merge") are shown with their values: 2.871 (occurrence 1092, workflows 476, compression 3276); 2.4 (occurrence 600, workflows 240, compression 2400); -2.43 (occurrence 20, workflows 10, compression 200).]

Page 14: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Workflow Repository

9,670 Biological Experiment Workflows

Why such a large number?

We can statistically make suggestions to users about which motifs can be macros, based on a number of metrics (detailed later), and we can robustly test our algorithm's performance across a huge cross-section of experiments.

Page 16: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Motif Extraction Algorithm

Page 17: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

The Current Weaknesses

Existing tools: FANMOD, mFinder etc.

No semantics (edge or node)

Small node limit, normally < 10

Imagine n-grams with no information other than topology, e.g. bi-grams of DNA ‘motifs’ where instead of A-T, T-C, T-G you get x-x, x-x, x-x.
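The n-gram analogy can be made concrete with a few lines of Python. This is a toy sketch, not taken from FANMOD or mFinder: the sequence `SEQ` is made up, and replacing every symbol with `x` stands in for a motif finder that sees topology only.

```python
from collections import Counter


def bigrams(seq: str) -> list[str]:
    """Return adjacent pairs of symbols, written as 'a-b'."""
    return [f"{a}-{b}" for a, b in zip(seq, seq[1:])]


SEQ = "ATCTG"  # made-up sequence for illustration

# With labels, each bi-gram carries meaning.
labelled = Counter(bigrams(SEQ))

# Without labels, every position looks the same, so every bi-gram collapses to x-x.
topology_only = Counter(bigrams("x" * len(SEQ)))

print(labelled)       # four distinct labelled bi-grams
print(topology_only)  # one bi-gram, x-x, counted four times
```

Once the labels are dropped, all four bi-grams become indistinguishable, which is why no function can be inferred from purely topological motifs.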

Page 18: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

The Problem... Current Motif Extraction Algorithms

What’s up?

We can’t infer function from these results

Ah, and you can’t have macros without function...

Exactly!

Unable to infer function -> unable to produce a macro

Page 19: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Solution

[Figure: state-transition model with states s0-s4 and transitions labelled A-H]

Legend:

a starting state

a normal state, with a "legal" motif

a holding state, with a pseudo-motif

a transition that generates a motif

a transition that does not generate a motif

More detail about each individual case, A-H, is available in the paper.

Page 22: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Resulting In...

From our algorithm, running over 9,670 workflows, we retrieved ~12,000 motifs up to depth 12.

Semantically aware

Limited by depth, not node count: we have motifs with > 80 nodes

Essentially, more complicated, topologically sensitive n-grams
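To give a feel for "semantically aware, depth-limited" motifs, here is a deliberately simplified sketch. It is not the state-transition algorithm from the paper: it only enumerates linear paths (no branching or merging), it assumes each workflow is an acyclic adjacency dictionary, and the names `typed_paths` and `count_motifs` and the two tiny workflows are invented for illustration.

```python
from collections import Counter


def typed_paths(graph, types, max_depth):
    """Yield the node-type sequence of every directed path with 2..max_depth nodes."""
    def walk(node, path):
        path = path + (types[node],)
        if len(path) >= 2:
            yield path
        if len(path) < max_depth:
            for nxt in graph.get(node, ()):
                yield from walk(nxt, path)

    for start in types:
        yield from walk(start, ())


def count_motifs(workflows, max_depth):
    """Count typed path motifs across a collection of (graph, types) workflows."""
    counts = Counter()
    for graph, types in workflows:
        counts.update(typed_paths(graph, types, max_depth))
    return counts


# Two made-up workflows: node id -> successors, and node id -> node kind.
wf1 = ({"s": ["p1"], "p1": ["e"]},
       {"s": "material", "p1": "protocol", "e": "material"})
wf2 = ({"a": ["p"], "p": ["d"]},
       {"a": "material", "p": "protocol", "d": "data"})

counts = count_motifs([wf1, wf2], max_depth=3)
for motif, n in counts.most_common():
    print(n, " -> ".join(motif))
```

Because the key is the sequence of node kinds rather than the bare shape, two paths of the same length are only counted as the same motif when their semantics agree, and the only size limit is the depth bound.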

Page 24: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Ranking Algorithm... because 12,000 is just too much.

Page 28: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Ranking Algorithm

M1 - Occurrences in data repository: 1,043

M2 - Workflow Presence: 640...

M3 - Compression Potential

Each of At, Aw and Ac is mapped to a fixed range [−1, 1] using a linear mapping based on the min-max range of that indicator, yielding three normalized metrics M1, M2 and M3.

No algorithm would be complete without a weighting element, so each metric can be weighted. We use a default weight of 1.
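The normalization and weighting described here can be sketched as follows. The min-max mapping to [−1, 1] follows the slide; combining the three normalized metrics as a weighted sum is an assumption of this sketch (the slides only say each metric can be weighted), and the function names are invented. The sample values are the occurrence, workflow and compression figures shown in the process overview diagram.

```python
def normalize(values):
    """Linearly map values onto [-1, 1] using their own min-max range."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0 for _ in values]  # degenerate range: no spread to rank on
    return [2.0 * (v - lo) / (hi - lo) - 1.0 for v in values]


def scores(occurrences, workflows, compression, weights=(1.0, 1.0, 1.0)):
    """Weighted sum of the three normalized metrics (assumed combination rule)."""
    m1 = normalize(occurrences)
    m2 = normalize(workflows)
    m3 = normalize(compression)
    w1, w2, w3 = weights
    return [w1 * a + w2 * b + w3 * c for a, b, c in zip(m1, m2, m3)]


# Raw indicators for three motifs, as shown in the overview figure.
occurrences = [1092, 600, 20]
workflows = [476, 240, 10]
compression = [3276, 2400, 200]

ranked = scores(occurrences, workflows, compression)
print(ranked)
```

With only three motifs in the sample, the best and worst motifs sit at the extremes of every metric. Over the full set of ~12,000 motifs the ranges would be set by the whole corpus, so individual scores would differ from this toy run.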

Page 34: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Ranking Algorithm

[Figure: macro selection UI]

Each motif is shown with 3 normalized metrics, the motif subgraph and 3 glyph representations.

Filter by pattern presence: linear, branching and merging

Filter by min/max depth

Motifs arranged by depth

Depth 6 motifs with magnified view in B and detailed popup of selected motif in D

Per-motif labels: Occurrences, Workflow presence, Compression Potential, Score, Downgrade Icon, Adjusted Score

Figure labels: Subset of, 1000, 1200, 200

Page 36: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Glyph Design

Page 37: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Glyph Design

Things we’d like to see...

Topology/structure within a macro

Node type

Density

Annotation

Page 38: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Glyph Design

[Figure: three macro glyph designs, each annotated with the properties its visual channels encode]

Encoded properties: annotation; node type (colour or colour/shape); topology (arrangement or overall); length; breadth

Page 39: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

STATE-TRANSITION MODEL EXAMPLES

[Figure: the annotated glyph designs (annotation, node type, topology, length, breadth) shown next to example paths through the state-transition model (states s0-s4, transitions A-H)]

Page 42: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Macro Insertion for Workflow Compression

Page 47: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Macro Insertion for Workflow Compression

[Figure: macro insertion applied to a workflow, panels A-D]
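A minimal sketch of the idea behind macro insertion, under strong simplifying assumptions: the workflow is a flat list of step names (taken from the microarray example earlier in the talk), the motif is a contiguous run of steps, and the function name `insert_macro` and the macro label are hypothetical. The actual tool works on graphs with branching and merging.

```python
def insert_macro(steps, motif, macro_name):
    """Replace every contiguous occurrence of `motif` in `steps` with one macro node."""
    out = []
    i = 0
    n = len(motif)
    while i < len(steps):
        if n > 0 and steps[i:i + n] == motif:
            out.append(macro_name)  # collapse the whole motif into a single node
            i += n
        else:
            out.append(steps[i])
            i += 1
    return out


workflow = [
    "Source name", "Sampling Protocol", "Sample name", "Chemical Label",
    "Labeling Protocol", "Labeled Extract", "Hybridisation Protocol",
    "Assay Name", "Scanning Protocol", "Raw Data File",
    "Feature Extraction Protocol", "Processed Data File",
]
labeling_motif = ["Chemical Label", "Labeling Protocol", "Labeled Extract"]

compressed = insert_macro(workflow, labeling_motif, "Labeling (macro)")
print(len(workflow), "->", len(compressed), "nodes")
print(compressed)
```

Each substitution removes the motif's nodes and adds one macro node, so a motif of k nodes saves k - 1 nodes per occurrence in this simplified setting.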

Page 48: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Evaluation

User Testing

Performance

Page 53: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Community Dissemination

VIS 2013, 13th-18th October 2013

Page 54: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Dissemination of macros to community

[Figure: panels A and B]

Automacron API available as an OSGi plugin for ISAcreator

Page 55: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Roadmap

Workflow -> Automatically detect motifs -> Substitute motifs with ‘macros’

Page 58: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Overcoming the blockades

Current Motif Detection Algorithm Limitations

No semantics

Limited motif sizes (max 10)

-> New semantically enabled algorithm

Deciding what should be a Macro

Macros in electronic circuit diagrams are the product of years of refinement.

Macros in biological workflows, for instance, are new... how do we determine what should be a macro?

-> Statistically informed selection from analysis of a large corpus of workflows

Page 59: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

Summary

New semantically enabled motif discovery algorithm

Statistically informed selection of macro candidates for use in biological workflow visualizations

Automated macro image generation, inferred from algorithm states

Integration of the final selections, and the utility to compress workflows, in the ISAcreator tool for curators and biologists alike

Open source - we want you to extend!

Page 60: Visual Compression of Workflow Visualizations with Automated Detection of Macro Motifs

github.com/isa-tools/automacron

You can download this software now! And yes. It is open source!

Co-authors: Philippe Rocca-Serra, Susanna-Assunta Sansone, Jim Davies, Min Chen

Also: Alejandra Gonzalez Beltran for many useful discussions

Bye.