march 3, 2015 john brosz - tfdl - dataforgood.pdf · phd & msc in computer science 3d computer...
TRANSCRIPT
![Page 1: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/1.jpg)
March 3, 2015
John Brosz
![Page 2: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/2.jpg)
PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering,
Multiresolution
Post-Doc at the InnoVis Lab (HCI & InfoVis) Focus+Context viewing, touch-based interfaces/NUI, Tree Vis
Visualization Research Coordinator Visualization Studio
Programming – workshops, help with visualizations
Other Projects
![Page 3: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/3.jpg)
Tour Time!
![Page 4: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/4.jpg)
At the University we see a wide variety
Focusing on free
1. Preparing Data
2. Visualization
![Page 5: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/5.jpg)
Interactive tool for cleaning & rearranging
Suggests changes
Web tool – sends data to external site
Import: text, CSV, JSON
Export: CSV, JSON
No longer being developed (since 2011)
http://vis.stanford.edu/wrangler/
![Page 6: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/6.jpg)
Consolidate spelling
Auto-detect outliers
Sorting & filtering
Auto-suggests changes
Import: Excel, XML, JSON, RDF, CSV
Export: Excel, CSV, ODF, HTML
http://openrefine.org/
![Page 7: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/7.jpg)
Import: csv, tsv, copy & paste from Excel
Export: JSON, ASP, ActionScript, MySQL, PHP, Python, Ruby, HTML, XML
http://shancarter.github.io/mr-data-converter/
![Page 8: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/8.jpg)
Extract data from PDFs
Stand-alone app for Windows/Mac
Interactively select table
Output: CSV, Excel
http://tabula.nerdpower.org/
![Page 9: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/9.jpg)
General Purpose
Maps & GIS
Text Analysis
Programming/APIs
![Page 10: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/10.jpg)
Strengths:
Many chart types
Interactive web output
Access to underlying data
Many data sources (live)
Drag & drop – easy to experiment
Maps
Great defaults
Link visualizations
Academic Program - Free for students
http://www.tableau.com/
![Page 11: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/11.jpg)
Tableau
Explicitly addresses visual variables
Interact with chart legend
Mapping is easier
Story points – sequential series of data views
Bottom-up
Focused on interface
Stacked Area, Gantt, Bubble
Spotfire
Interesting “first guess” visualizations
API
Stronger statistical tools
Better TreeMap
Top-Down
Focused on performance
Parallel Coordinate Plot, 3D Scatterplot
![Page 12: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/12.jpg)
As the alphabet is to reading, visual variables are to visualization.
Jacques Bertin’s Sémiologie Graphique
Jock Mackinlay significantly furthered this research in the ‘80s(hint: he’s now a VP at Tableau)
![Page 13: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/13.jpg)
![Page 14: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/14.jpg)
Merge others’ data into your own table
Collaborative data gathering
Great for maps
Resolves geocoding
data API
Limited data capacity& customization
https://www.google.com/fusiontables
![Page 15: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/15.jpg)
Web tool – Java/Flash
Many Vis Types
Scatterplot, network diagram, treemap, word tree, tag/word cloud, phrase net
Charts: matrix, bar, bubble, pie
Graphs: line, stack
Maps & cartograms
Data & charts are public
Limited data size (5MB)
http://www-958.ibm.com/software/analytics/manyeyes/
![Page 16: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/16.jpg)
Open-source software for statistics & graphics
All sorts of advanced stats Regression, linear/nonlinear models, time series analysis, clustering, nonparametric tests
Data wrangling
Charts & Plots
Command line*
Many add-ons (> 4400)
60+ Resources for R
http://www.computerworld.com/article/2497464/business-intelligence/60-r-resources-to-improve-your-data-skills.html
http://www.r-project.org/
* Several UIs: e.g., Shiny R, Quick-R, Rstudio, RExcel, etc
![Page 17: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/17.jpg)
Create SVG graphics
Data not uploaded (so remains private)
16 charts
API to create new charts
http://app.raw.densitydesign.org
![Page 18: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/18.jpg)
Import: txt, HTML, XML, PDF, RTF, & Word
lexical analysis
frequency and distribution
Export: XML, tsv, html widgets
http://voyant-tools.org/
![Page 19: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/19.jpg)
Open Source alternative to ArcGIS
Linux, Windows, Mac OSX, Unix, Android
http://www.qgis.org/
![Page 20: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/20.jpg)
Uses OpenStreetMap tiles
Generates choropleths or heat maps that can be embedded into web pages
http://www.openheatmap.com/
![Page 21: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/21.jpg)
Windows/Linux/OS X
Can handle 50K nodes & 1000K edges
Interactive
Filter
Dynamic layout
Clustering/hierarchies
http://gephi.github.io
![Page 22: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/22.jpg)
Plugin for Excel ‘07, ‘10, ‘13
Allows exploration of network graphs
Additional plugins to import from social networks:
YouTube
http://nodexl.codeplex.com/
![Page 23: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/23.jpg)
Variety of free charts
Web tool that provides URL or full API
24 different charts
Limited customization
https://developers.google.com/chart/
![Page 24: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/24.jpg)
More customizable, interactive, & dynamic than Google Charts
Less work than D3
Free for noncommerical
http://www.highcharts.com/
![Page 25: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/25.jpg)
Very flexible
Embeds data into the webpage
Animations
Many examples to build from
Need to know html, css, javascript, & SVG
Used extensively by New York Times
Many libraries that build on D3
Poor performance when you reach 10K elements
http://d3js.org/
![Page 26: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/26.jpg)
Programming language & IDE designed for non-programmers
Used by Artists, Designers, Educators, Researchers
Can be converted to Android & Java
https://processing.org/
![Page 27: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/27.jpg)
J. Brosz, M. A. Nacenta, R. Pusch, S. Carpendale and C. Hurter
Transmogrifiers.org
![Page 28: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/28.jpg)
J. Brosz, M. A. Nacenta, R. Pusch, S. Carpendale and C. Hurter
Transmogrifiers.org
Videos taken out to reduce file size, if you would like to see
them again, visit: http://www.transmogrifiers.org/description/
![Page 29: March 3, 2015 John Brosz - TFDL - DataForGood.pdf · PhD & MSc in Computer Science 3D Computer Graphics: 3D modeling, Non-photorealistic Rendering, Multiresolution Post-Doc at the](https://reader034.vdocuments.net/reader034/viewer/2022042220/5ec67fde6478104f4255f236/html5/thumbnails/29.jpg)