semantic annotation of digital libraries. a model for science communication

32
Semantic annotation of Digital Libraries a model of science communication Francesca Di Donato SNS - ERC [email protected] This presentation is released under a Creative Commons Attribution 3.0 Unported Knowledge Triangle New formulas for science communication Pisa, May 23-24 2013

Upload: francesca-di-donato

Post on 12-Apr-2017

87 views

Category:

Internet


0 download

TRANSCRIPT

Page 1: Semantic annotation of digital libraries. A model for science communication

Semantic annotation of Digital Libraries

a model of science communicationFrancesca Di Donato

SNS - ERC [email protected]

This presentation is released under a Creative Commons Attribution 3.0 Unported

Knowledge TriangleNew formulas for science communication

Pisa, May 23-24 2013

Page 2: Semantic annotation of digital libraries. A model for science communication

A new way of carrying out research

How we produce scientific results

How we store data and results

How we disseminate them

What we can do with

How they are evaluated

Page 3: Semantic annotation of digital libraries. A model for science communication

An example as a modelEUROCORR Project (ERC -AdG , 2010-2014)

Critical edition (on paper and digital) of the unpublished correspondence to Jacob Burckhardt (1842-97)

Coordinated by Maurizio Ghelardi (SNS, Principal Investigator), the research group involves 5 people (3 humanists, 2 developers, 1 "digital humanist")

Page 4: Semantic annotation of digital libraries. A model for science communication

EUROCORR goals

1. publish new materials and tools for the study of European culture of the second half of 1800

2. enrich science of new knowledge that will help to answer interesting research questions

Page 5: Semantic annotation of digital libraries. A model for science communication

A full OA model

Access to data (physical layer)

Access to data (legal layer)

Access to data (technological layer)

Page 6: Semantic annotation of digital libraries. A model for science communication

Physical layer: manuscripts, transcriptions, metadata, documentation

1. Manuscripts (http://www.burckhardtsource.org/box_view_url_shortener?u=Z)

Page 7: Semantic annotation of digital libraries. A model for science communication

2. Transcriptions (XML-TEI)

Page 8: Semantic annotation of digital libraries. A model for science communication

The transcription workflow

Page 9: Semantic annotation of digital libraries. A model for science communication

3. Metadata

Page 10: Semantic annotation of digital libraries. A model for science communication

4. Documentation (soon on a wiki)Transparency of the research process is essential for replicate it

1. Transcription compendium

2. Transcription workflow

3. XML - TEI Schema

4. XML - TEI Schema documentation

Page 11: Semantic annotation of digital libraries. A model for science communication

Rights on data (sources) and results

1a. Manuscripts of the correspondence to Burckhardt

1b. Burckhardt’s letters (printed edition) (?)

2. Project results

Page 12: Semantic annotation of digital libraries. A model for science communication

Interoperability: Towards open science

1. Open source platform (muruca.org)

2. XML-TEI (standard)

3. Linked Open Data (standard)

Page 13: Semantic annotation of digital libraries. A model for science communication
Page 14: Semantic annotation of digital libraries. A model for science communication
Page 15: Semantic annotation of digital libraries. A model for science communication

open science (in pratice)

1. Variations, frequency of relevant terms for author

3. Salutations, dates, other (?) tag clouds

2. B.’s correspondents on a map and a timelinean example: http://goo.gl/5h6Vpa second one: http://goo.gl/w5amR

Page 16: Semantic annotation of digital libraries. A model for science communication

Semantic annotation of the DL

(thepund.it)

Page 17: Semantic annotation of digital libraries. A model for science communication
Page 18: Semantic annotation of digital libraries. A model for science communication

Annotations are linked to the Web of Data

Web of Data (RDF)mash-updbpediaeuropeanaiconclassbibserver

Ad hoc vocabsex. persons, works of art, biblio-graphies

Page 19: Semantic annotation of digital libraries. A model for science communication

What can we do with annotations?

Some example:

1. Network of people discussing about a certain topic

2. Display on a historical map the works of art mentioned in the letters

see:http://goo.gl/6xfUa

Page 20: Semantic annotation of digital libraries. A model for science communication

We can do more

What about the debate on the same topics in the Republic of Letters?

Page 21: Semantic annotation of digital libraries. A model for science communication

Other examples1. Polymath Project (2009)A collaborative space for mathematical research

3. HapMap (2002) http://hapmap.ncbi.nlm.nih.gov/

The goal of the International HapMap Project is to develop a haplotype map of the human genome which will describe the common patterns of human DNA sequence variation

2. GenBank (1996)http://www.ncbi.nlm.nih.gov/genbank/

Genetic data are immediately shared online

Page 22: Semantic annotation of digital libraries. A model for science communication

4. Galaxy Zoo http://www.galaxyzoo.org/

200.000 volontari collaborano con esperti alla classificazione delle galassie

5. Wikipedia (2000)A collaborative Encyclopedia

6. MAPPA Project (2011-14)Study predictive computational tools applicable to the archaeological potential of an urban areaCreate the first italian open digital archaeological archive

7. Transcribe Bentham http://blogs.ucl.ac.uk/transcribe-bentham/

A collaborative transcription project

Page 23: Semantic annotation of digital libraries. A model for science communication

The open scienceimperative

[M. Nielsen, Le nuove vie della scoperta scientifica, Einaudi, pp. 222-229]

Page 24: Semantic annotation of digital libraries. A model for science communication

A community of knowledge in a republic of science

Page 25: Semantic annotation of digital libraries. A model for science communication

changethe "traffic direction"

of science

Page 26: Semantic annotation of digital libraries. A model for science communication

how groups can work together to manage the commons (intangible)?

Page 27: Semantic annotation of digital libraries. A model for science communication

1. Demanding open science

Page 28: Semantic annotation of digital libraries. A model for science communication

a) Funding agencies mandate to deposit in OA archives

(ex. ERC, FP7, Horizon2020, National Science Foundation)

b) Funding agencies mandate to document researchers activities on the Web

Create agreement and consensus in the scientific community

and make a long-term strategy

Page 29: Semantic annotation of digital libraries. A model for science communication

2. Encourage open science

Page 30: Semantic annotation of digital libraries. A model for science communication

How adopting new ways of sharing can become an imperative for scientists as it is today publishing

an article?

Science economy is based on reputation

Publications: articles/books through journals/series

The citation leads to the assessment, which leads to the reward, which leads to people motivated to

contribute to the science in a new way

Page 31: Semantic annotation of digital libraries. A model for science communication

Include in the evaluation mechanisms citations of data, wikis,

comments, etc.

Altmetricshttp://altmetrics.org/manifesto/

Page 32: Semantic annotation of digital libraries. A model for science communication

Thank you

CreditsMaurizio Ghelardi (SNS)Susanne Müller (SNS-ERC)Costanza Giannaccini (SNS- ERC)Alexander auf der Heyde (SNS- ERC)Alessio Piccioli (Net7 www.netseven.it)Michele Barbera (Net7 www.netseven.it)Simone Fonda (Net7 www.netseven.it)Romeo Zitarosa (Net7 www.netseven.it)www.burckhardtsource.orgwww.muruca.orgthepund.it

See also a video on Pundit and semantic annotation at the LODLAM Challenge 2013 http://summit2013.lodlam.net/2013/04/03/pundit/