open data in data journalists' workflow
DESCRIPTION
a talk about tools for supporting data journalism workflows and making work with open data easy. Open Data on the Web workshop: http://www.w3.org/2013/04/odw/agendaTRANSCRIPT
Open Data in Data Journalists' Workflow
Institute of Mathematics and Computer Science, University of Latvia
National Library of Latvia
Uldis Bojārs (@CaptSolo)
ODW-2013 – 24-Apr-2013
National Library of Latvia (NLL)
• Digital Library “Lettonica”– http://www.lnb.lv/en/digital-library
• Linked Open Data [Publishing]– being added into NLL’s systems
• Examples:– authority data– digital object management system– digital text corpus + named entity database
IMCS, University of Latvia
• Institute of Mathematics and Computer Science (IMCS)– http://www.lumii.lv/resource/show/170
• Open Data– making it easier for people to work with data
(discover, transform, visualize, ...)– interested in collaboration on open data projects
Make it simpler
• working with data must be as easy as possible– *frictionless* (as Rufus says)
• need a data eco-system– [work with] data more useful data
= motivation for the Data Journalism / Data Processing Tool [proposal]
Marko Lorenz, 2010 – CC BY 2.0 licensehttp://en.wikipedia.org/wiki/File:Data_driven_journalism_process.jpg
Data Visualization Pipeline (Ben Fry)
via “Speculative Maps & Open Data“ talk @ ODW-2013 by Benedikt Groß
Data Processing Tool
• The Idea:– a tool (or set of tools) covering the whole workflow
• repeatability, provenance, data publishing
– make it easy for people to use open data• graphical modeling, visualization, natural language
• Data Journalism (one of the use cases)– discovery– transformation (clean, filter, integrate, ...)– interpretation (visualization, ...)– developing a story– publishing
Research @ IMCS
• semantic web– data modeling, mapping RDBMS data to RDF, ...
• network analysis and visualization [tools]– http://www.slideshare.net/CaptSolo/exploring-th
e-networks-in-open-public-data-13391338
• computational linguistics– named entity and relationship extraction– natural language interfaces
in the context of Data Web
• important [for the web]:– data discovery– data publishing
• publish the data along with the story– make it easy to publish data as a part of the data
journalism workflow– make data discoverable for re-use– [automatically] maintain provenance info
More info• Uldis Bojārs - @CaptSolo
• National Library of Latviahttp://www.lnb.lv/en/digital-library
• IMCS: Exploring the Networks on Open Public Datahttp://www.slideshare.net/CaptSolo/exploring-the-networks-in-open-public-data-13391338
Data Journalism Tool proposal in progress,get in touch for more info