StatMine (New Techniques and Technologies for Statistics)

StatMine – prototype StatMine 0.2 Edwin de Jonge, Jan van der Laan & Jessica Solcer Statistics Netherlands (CBS) NTTS 2013, March 6 2013

Upload: edwin-de-jonge

Post on 03-Dec-2014
TRANSCRIPT

1. StatMine prototype 0.2. Edwin de Jonge, Jan van der Laan & Jessica Solcer, Statistics Netherlands (CBS). NTTS 2013, March 6 2013

2. StatMine
Goal: Improve the use of figures from Statistics Netherlands
How: Add an analysis layer to OutputDB (StatLine)
Working approach:
- Formulate improvement
- Develop software prototype
- Test prototype on (real) users
- Evaluate
But why?

3-4. Mission SN
The mission of Statistics Netherlands is to publish reliable and coherent statistical information that meets the needs of society (source: www.cbs.nl)

5. Evidence-based policy

6. What is the state of the Netherlands?
StatLine contains over 1,000,000,000 figures!

7. Problem 1: Figures Information

8. 1. Figures Information
We know (from user study) that some important users don't get the most out of StatLine:
- Data journalists
- Policy makers
They don't find and see interesting information because of the tabular presentation (data = table).

9. Solution 1: Visualize data!

10. Problem 2: Fragmented information

11. 2. Fragmented information
For policy makers and journalists, most information in OutputDB is fragmented. Users need to combine fragments from different statistics:
- Diabetes (insulin usage, hospital admissions, mortality, visits to doctor, obesity)
- Energy consumption vs economic growth
- Income vs economic growth
- (Perceived) public safety vs registered crimes

12. 2. Solution: Let users combine tables (even if we wouldn't)

13. Prototype StatMine 0.2
Implements:
- Visual interactive data browsing
- Combining fragments of different tables
Tested on:
- 40 SN employees (++)
- 40 policy makers (++)

14. Chart types
- Line chart: show development
- Bar chart: compare
- Bubble/scatter chart: show correlation
- Mosaic chart: show structure

15. Small multiples

17. Technical
- HTML5, JSON, R, JavaScript, CSS, SVG
- Runs on desktop; easy to move to a web server

18. Currently (2013)
All official statistics have a confidence interval. StatMine 0.3 will test whether showing uncertainty improves or changes understanding of the (quality of the) figures. This may lead to publishing interval estimates (instead of point estimates).

19. Conclusion
Visual data browsing is promising for:
- Our own statisticians (quality control)
- External policy makers and journalists
Using real end users for testing is very helpful:
- Lots of suggestions for improvement from users
- Users feel involved in the innovation process of the NSI
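
The idea of letting users combine fragments from different tables (slides 12 and 13) can be illustrated with a small sketch. This is not the StatMine implementation, which the slides do not show; it only assumes that two tables share a dimension (here a hypothetical `year` column) and joins their rows on it. The function name `combine_tables`, the column names, and all values are made up for illustration and are not CBS figures.

```python
def combine_tables(left, right, key):
    """Inner-join two tables (lists of row dicts) on a shared dimension `key`."""
    # Index the right-hand table by the shared dimension for fast lookup.
    right_by_key = {row[key]: row for row in right}
    combined = []
    for row in left:
        match = right_by_key.get(row[key])
        if match is not None:
            # Keep only rows whose key value occurs in both tables.
            combined.append({**row, **match})
    return combined


# Placeholder values for illustration only; these are not CBS figures.
energy = [
    {"year": 2010, "energy_use": 100},
    {"year": 2011, "energy_use": 98},
    {"year": 2012, "energy_use": 97},
]
growth = [
    {"year": 2011, "gdp_growth": 1.0},
    {"year": 2012, "gdp_growth": -1.0},
]

combined = combine_tables(energy, growth, "year")
for row in combined:
    print(row)
```

The combined rows hold both measures per year, which is the shape a bubble/scatter chart needs to show a possible correlation between the two statistics.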