progress report presentation

Progress Report Presentation Recovery.gov Visualization Using Dual Treemap for Bi-Hierarchical Exploration Rachel Schwartz Puneet Sharma Miguel Rios Tak Yeon Lee

Upload: kira

Post on 23-Feb-2016



TRANSCRIPT

Page 1: Progress Report Presentation

Progress Report Presentation

Recovery.gov Visualization
Using Dual Treemap for Bi-Hierarchical Exploration

Rachel Schwartz, Puneet Sharma, Miguel Rios, Tak Yeon Lee

Page 2: Progress Report Presentation

Introduction

Recovery Act

Existing Visualization

Task Analysis

Newspaper Headlines

Spotfire Tryout

Concept

Demo

Schedule

Page 3: Progress Report Presentation

About Recovery Act

• Signed into law on Feb. 17, 2009 by President Barack Obama
• Total: $787 billion
• 28 agencies
• Contracts, grants, and loans
• 130,362 recipient reports:
  - 13,080 reports on contracts
  - 116,675 on grants
  - 607 on loans

Page 4: Progress Report Presentation

What is a Recovery Act Report?

Agency Report
• Plan
• How the money is DISTRIBUTED

Recipient Report
• Who received the money
• How the money is USED
• How many jobs are created

[Diagram: Recovery.gov; Agency > Prime Recipients > Sub Recipients > Vendors; Contract / Grant / Loan]

Page 5: Progress Report Presentation

Existing Visualizations

Recovery.gov

Eye On the Stimulus

Wall Street Journal

msnbc.com


Page 9: Progress Report Presentation

Existing Visualizations

Summary
- Geographical maps and tables are the most common
- Browsing is the main activity supported
- Comparison is often considered but not fully supported

Not suitable for analytic tasks

Page 10: Progress Report Presentation

Examples of Analytic Tasks on Recovery Act

Investigate how journalists find headlines in the Recovery Act reports

Page 11: Progress Report Presentation

Examples of Analytic Tasks on Recovery Act

How do journalists find headlines?

- State- and county-wise comparison is common
- Census data is useful for
  - Finding states/counties in a similar context
  - Validating fairness of funding (Is the money given to the places that need it?)
- Two hierarchies are equally important
  - Agency hierarchy
  - Spatial hierarchy

Page 12: Progress Report Presentation

Bi-Hierarchical Structure

1. Agency Tree (DISTRIBUTION): Agency > Projects > Prime / Sub Recipients
* Suitable for tracking how the money is distributed
* Associated with industry and recipient information

2. Spatial Tree (AGGREGATION): State > County
* Intuitive
* Associated with Census data

How to support EXPLORATION within and across both hierarchies is the key point
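The two hierarchies can be read as two roll-ups over the same set of award records. The following is a minimal sketch under that assumption; the record fields (`agency`, `project`, `state`, `county`, `amount`), the sample values, and the `rollup` helper are hypothetical and do not come from the Recovery.gov data.

```python
from collections import defaultdict

# Hypothetical award records; names and amounts are illustrative only.
AWARDS = [
    {"agency": "Department of Energy", "project": "P1",
     "state": "Maryland", "county": "Howard", "amount": 100.0},
    {"agency": "Department of Energy", "project": "P2",
     "state": "Washington", "county": "King", "amount": 50.0},
    {"agency": "Department of Health and Human Services", "project": "P3",
     "state": "Maryland", "county": "Howard", "amount": 25.0},
]


def rollup(records, levels):
    """Sum `amount` at every prefix of `levels`; returns {path_tuple: total}."""
    totals = defaultdict(float)
    for rec in records:
        path = ()
        for level in levels:
            path += (rec[level],)
            totals[path] += rec["amount"]
    return dict(totals)


# The same records feed both trees; only the grouping keys differ.
agency_tree = rollup(AWARDS, ["agency", "project"])
spatial_tree = rollup(AWARDS, ["state", "county"])
```

Each key is a path from the root of one tree, so `agency_tree[("Department of Energy",)]` and `spatial_tree[("Maryland",)]` are cell sizes in the two treemaps drawn from one shared dataset.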

Page 13: Progress Report Presentation

Spotfire

Data sources:
• Agency Report
• Recipient Report: Contract, Grant, Loan
• US Census Bureau Data: population, population over 65, female percentage, White percentage, Black percentage, Hispanic percentage, infant deaths, high school graduate percentage, bachelor's degree percentage, housing units, housing unit percent change, median household income, people in poverty, labor force, unemployment rate, number of firms, women-owned firms percentage

Purpose
• Understanding the dataset
• Exploring the capability of current visualization techniques
• Modeling the task flow
• Finding our contribution

Why Spotfire? Spotfire has the most features for multivariate comparison tasks.

Page 14: Progress Report Presentation

Spotfire / findings.

The most effective job creators are questionable.

[Treemaps: Agency > Recipient and State > County. Size by award amount; colored by money per job (how much money was spent to create each job); dataset filtered to money per job between $10 and $2,000. Labeled values: $20.94 per job, $98.84 per job, $417.61 per job]
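The money-per-job measure and the $10 to $2,000 filter can be sketched as below. This is an illustration only: the field names `award_amount` and `jobs_created` are assumptions, and treating records with no reported jobs as undefined is a choice made here, not something stated on the slide.

```python
def money_per_job(award_amount, jobs_created):
    """Dollars spent per job created, or None when no jobs were reported."""
    if jobs_created is None or jobs_created <= 0:
        return None
    return award_amount / jobs_created


def filter_by_money_per_job(records, low=10.0, high=2000.0):
    """Keep records whose money per job lies in [low, high]."""
    kept = []
    for rec in records:
        mpj = money_per_job(rec["award_amount"], rec["jobs_created"])
        if mpj is not None and low <= mpj <= high:
            # Copy the record and attach the derived value used for coloring.
            kept.append({**rec, "money_per_job": mpj})
    return kept
```

Values far below the lower bound are the ones that make "most effective job creator" claims questionable, since a very small award paired with a large job count is more likely a reporting problem than real efficiency.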

Page 15: Progress Report Presentation

Spotfire / findings. Florida, the biggest "senior town" in the US, gets the most money from the military

Page 16: Progress Report Presentation

Critique of Spotfire

• Brushing is great, but highlighted cells do not show how large the related portion is

select

highlight

Actual Portion of the selection

LOS ANGELES

Page 17: Progress Report Presentation

Critique of Spotfire

• Color scheme and filter cannot disclose data inconsistency
  - The filter can only filter out empty values
  - The color scheme provides a basic linear spectrum only

How can we highlight elements that have missing or invalid values?

Page 18: Progress Report Presentation

Critique of Spotfire

• The basic color scheme works poorly with an exponential distribution
  - Only the two most extreme values are distinguished
  - The basic color scheme supports only a linear spectrum

Page 19: Progress Report Presentation

Critique of Spotfire

• Using the filter takes much time and effort
• Trellis needs more flexibility
  - Comparison between a state and a county is not easy
  - (Future work) Sort-by-Attribute, Cluster-by-Feature, and more possibilities

Page 20: Progress Report Presentation

Concept.

Synchronized Dual Treemap for Exploring Bi-Hierarchical Data

Contribution to Recovery Act accountability and transparency
- Providing an analytic tool for citizen watchdogs
- Supporting the sense-making process for the dataset

Contribution to treemap visualization and Spotfire
- Improving brushing with proportional highlight
- Improving filter interaction
- New features for Trellis

Page 21: Progress Report Presentation

Task flow

1. OVERVIEW: exploring bi-hierarchical data; understanding general trends on the dual treemap
2. ZOOM IN & DYNAMIC FILTER: narrowing down by zoom-in filter; dynamic filter by project/regional attributes
3. TAKE SNAPSHOTS: keep treemap snapshots for comparison
4. DETAIL COMPARISON: find patterns and outliers of the treemaps in the shoebox

Page 22: Progress Report Presentation

Basic UI Layout

[Mock-up with five panels:
- Agency treemap ([All Agency] | Project): Dept. of Energy, General Services Administration, Corps of Engineers, Department of the Army, Department of Health and Human Services
- Spatial treemap ([All State] | County): Washington, Tennessee, New York, South Carolina, California, Texas, Colorado, Idaho
- Filter / Color Scheme
- Details
- Shoebox: snapshot thumbnails (D. Energy, Washington, Tennessee, D. Health, …, Exec. President, NASA)]

Page 23: Progress Report Presentation

Narrowing Down by Zoom-In Filter

- When zooming into one treemap, the other treemap is redrawn with the filtered data
- Zooming out removes the filter

[Diagram: Agency Tree (AGENCY > PROJECT > PRIME/SUB RECIPIENT) and Spatial Tree (STATE > COUNTY) over shared data. The shared data passes through these states: entire data; zoom in on Department of Energy, filtered by [Department of Energy]; zoom in on Maryland, filtered by [Department of Energy] & [Maryland]; zoom out to All Agency, filtered by [Maryland]; zoom out to All State, entire data. After each zoom the other tree is redrawn.]
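One way to read the zoom-in filter is that each tree contributes at most one active selection and the shared data is the intersection of those selections. The sketch below follows that reading; the `DualTreemapState` class, its method names, and the record fields are hypothetical, and only one level of zoom per tree is modeled.

```python
class DualTreemapState:
    """Tracks the zoom selection of each tree and derives the shared data."""

    def __init__(self, records):
        self.records = records
        # One active selection per hierarchy; None means "zoomed out".
        self.zoom = {"agency": None, "state": None}

    def zoom_in(self, key, value):
        """Zoom into `value` on the tree keyed by `key`; both trees redraw."""
        self.zoom[key] = value
        return self.shared_data()

    def zoom_out(self, key):
        """Zooming out removes that tree's filter."""
        self.zoom[key] = None
        return self.shared_data()

    def shared_data(self):
        """Records matching every active selection."""
        return [
            rec for rec in self.records
            if all(v is None or rec[k] == v for k, v in self.zoom.items())
        ]


# Hypothetical records, illustrative only.
RECORDS = [
    {"agency": "Department of Energy", "state": "Maryland", "amount": 100.0},
    {"agency": "Department of Energy", "state": "Washington", "amount": 50.0},
    {"agency": "Department of Health and Human Services", "state": "Maryland", "amount": 25.0},
]
```

Replaying the sequence from the slide, zooming into Department of Energy and then Maryland leaves one record, zooming out of the agency tree leaves the two Maryland records, and zooming out of the spatial tree restores the entire data.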

Page 24: Progress Report Presentation

Brushing and Proportional Highlight

The related portion of each element is highlighted

select
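Proportional highlight can be sketched as computing, for every cell of the other treemap, the share of its total that belongs to the brushed selection. The function name, the `selected` predicate, and the record fields below are assumptions made for illustration.

```python
def proportional_highlight(records, selected, cell_key, amount_key="amount"):
    """Return {cell: fraction of the cell's total amount that is selected}."""
    totals = {}
    hits = {}
    for rec in records:
        cell = rec[cell_key]
        totals[cell] = totals.get(cell, 0.0) + rec[amount_key]
        if selected(rec):
            hits[cell] = hits.get(cell, 0.0) + rec[amount_key]
    # Cells with a zero total get a zero fraction to avoid dividing by zero.
    return {
        cell: (hits.get(cell, 0.0) / total if total else 0.0)
        for cell, total in totals.items()
    }
```

A renderer could then fill only that fraction of each cell's area, so a state that receives a small part of its money from the selected agency looks different from one that receives all of it.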

Page 25: Progress Report Presentation

Keep Treemap Snapshots for Comparison

[Mock-up: the dual treemap (agency cells: Dept. of Energy, General Services Administration, Corps of Engineers, Department of the Army, Department of Health and Human Services; state cells: Washington, Tennessee, New York, South Carolina, California, Texas, Colorado, Idaho) with snapshots collected in the Shoebox: D. Energy, Washington, Tennessee, D. Health, …, Exec. President, NASA]

Page 26: Progress Report Presentation

Shoebox

Find patterns and outliers of treemaps
- More advanced features are planned, but implementation is not guaranteed:
  - Cluster by treemap features (size-color correlation, uniformity, diversity, …)
  - Sort by attributes (award amount, population, job creation, …)
  - Snapshot as a bookmark of settings

Page 27: Progress Report Presentation

Extended Color Scheme: Highlight Invalid Data

- Based on a set of predefined rules, treemap elements that have empty or invalid values are highlighted (this overrides the standard color scheme based on other attributes)
- Users are assumed not to know any inconsistency pattern in advance; otherwise an ordinary filter would suffice

Rule 1. Zip code not found in the standard zip code table
Rule 2. Congressional District code not found in the state’s CD code table
Rule 3. Agency code not found in the standard agency code table

[Mock-up: "Highlight Invalid Data" options for zip code, Congressional District code, and agency code, shown alongside the other filters]
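The three rules amount to lookups against reference tables, with the highlight color taking priority over the normal color. In this sketch every table entry, field name, and color value is a placeholder invented for illustration, not real reference data.

```python
# Placeholder reference tables; real ones would be loaded from standard sources.
VALID_ZIP_CODES = {"00001", "00002"}
VALID_CD_CODES = {"MD": {"01", "02"}, "WA": {"01"}}  # per-state CD codes
VALID_AGENCY_CODES = {"AG01", "AG02"}

HIGHLIGHT_COLOR = "#ff00ff"  # hypothetical color reserved for invalid data


def invalid_fields(rec):
    """Return the names of fields that break Rule 1, 2, or 3."""
    problems = []
    if rec.get("zip_code") not in VALID_ZIP_CODES:  # Rule 1
        problems.append("zip_code")
    state_codes = VALID_CD_CODES.get(rec.get("state"), set())
    if rec.get("cd_code") not in state_codes:  # Rule 2
        problems.append("cd_code")
    if rec.get("agency_code") not in VALID_AGENCY_CODES:  # Rule 3
        problems.append("agency_code")
    return problems


def cell_color(rec, standard_color):
    """The invalid-data highlight overrides the standard color scheme."""
    return HIGHLIGHT_COLOR if invalid_fields(rec) else standard_color
```

Because a missing field fails the lookup as well, empty values and invalid values are highlighted by the same check.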

Page 28: Progress Report Presentation

Extended Color Scheme

- A linear color scheme is not suitable for an exponential distribution. The Extended Color Scheme uses statistical percentiles to separate outliers from the main distribution.

[Figure: two color scales with breakpoints marked at 10%, 50%, 90%, and Max]
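A percentile-based scale can be sketched as follows, using the 10%, 50%, and 90% marks from the figure as breakpoints. The nearest-rank percentile definition and the function names are assumptions made here; the slides do not say how the percentiles are computed.

```python
def percentile(sorted_values, p):
    """Nearest-rank percentile of an ascending list, with integer p in 1..100."""
    n = len(sorted_values)
    rank = max(1, -(-p * n // 100))  # ceil(p * n / 100) in integer arithmetic
    return sorted_values[rank - 1]


def percentile_breakpoints(values, cuts=(10, 50, 90)):
    """Values at the chosen percentiles, used as color-bin boundaries."""
    ordered = sorted(values)
    return [percentile(ordered, p) for p in cuts]


def color_bin(value, breakpoints):
    """Index of the color bin: 0 up to the first breakpoint, last bin above all."""
    for i, boundary in enumerate(breakpoints):
        if value <= boundary:
            return i
    return len(breakpoints)


# Hypothetical skewed data: nine small values and one extreme outlier.
amounts = [1, 2, 3, 4, 5, 6, 7, 8, 9, 1000]
breaks = percentile_breakpoints(amounts)
bins = [color_bin(a, breaks) for a in amounts]
```

With a linear scale from 1 to 1000 the nine small values would all receive nearly the same color, whereas here they spread over three bins and the outlier gets a bin of its own.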

Page 29: Progress Report Presentation

Thank you!