digital archiving 3.0
DESCRIPTION
Presentation given at ODW2013 (http://www.w3.org/2013/04/odw/). Goes over the need for institutions doing digital archiving to publish their meta-data as LOD and ensure formats round-tripping for the dataTRANSCRIPT
![Page 1: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/1.jpg)
Digital Archiving 3.0
“My data open on the Web, ok but how ?”
Christophe Guéret (@cgueret)
Open Data on the Web, 23 - 24 April 2013
Data Archiving and Networked Services
DANS is een instituut van KNAW en NWO
![Page 2: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/2.jpg)
A bit of context
http://cedar-project.nl
http://easy.dans.knaw.nl
![Page 3: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/3.jpg)
Put your data open on the Web!
“Sharing knowledge: EC-funded projects on scientific information in the digital age”
“E-Data & Research”, October 2011
![Page 4: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/4.jpg)
Where is your research data ?
It is available as an RDF/XML dump on my test server
Just get it from the web site of the research project
I think I have have it somewhere on a stick, let me check...
![Page 5: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/5.jpg)
All bad answers, really.
● We need research data to be
– Accessible/readable/usable by anyone– Available in many (>1) years from now– With traceable provenance and usages
● Dumping the data on a web site somewhere is not enough
![Page 6: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/6.jpg)
Solution: use a repository
● Data repositories will take over serving the data and have a page for it!
● Repository hold two type of data– The data stored
– The meta-data about this data
“Sharing knowledge: EC-funded projects on scientific information in the digital age”
![Page 7: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/7.jpg)
Which format for meta-data ?
● LOD is a perfect fit for describing data
– Use to refer to and link data items– Facilitates discovery, easy to crawl/index– One description per data item stored– Redirects to actual location of the data
● Remaining question: how much meta-data is needed?
![Page 8: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/8.jpg)
Which format for the data?
● Many formats around : PDF, SDF, DSPL, XLS, RDF, CSV, SHP, JSON-LD, ...
● Translation will imply some extra work for the data owner and not please everyone
![Page 9: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/9.jpg)
Which format for the data?
● Many formats around : PDF, SDF, DSPL, XLS, RDF, CSV, SHP, JSON-LD, ...
● Translation will imply some extra work for the data owner and not please everyone
Select vocabularies to describe your resources
Buy a DN, decide on a URI scheme for your data
Express your data as described resources
![Page 10: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/10.jpg)
Solution: use a repository
● Data repositories will take over serving your data
● Just get the data in the repository
● Repositories will take care of everything
● PS: forget about HTTP URIs for data
![Page 11: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/11.jpg)
Format evolution
● Use Content-negotiation to translate and serve different data formats
● Ensure everyone gets the format he wants
![Page 12: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/12.jpg)
Format evolution
● Use Content-negotiation to translate and serve different data formats
● Ensure everyone gets the format he wants
??
![Page 13: Digital archiving 3.0](https://reader034.vdocuments.net/reader034/viewer/2022042714/54bd4d164a795953058b45ae/html5/thumbnails/13.jpg)
Next generation archives
● Provide long term access to data in several formats
● Publish Linked Open Meta-Data about the data stored (DCAT, ...)
● Facilitate moving data around archives (LDP, ...)