ariadne: data sharing
TRANSCRIPT
ARIADNE is funded by the European Commission's Seventh Framework Programme
Data sharing
Kate Fernie
Overview• Archaeology data: rights and licences• Open access • Open data• Open licences• Barriers and benefits of data sharing• Plan ahead: considerations• Group Discussion Exercise
Archaeology data and rightsThe processes and activities involved in archaeological result can result in the generation of Intellectual Property Rights at different stages
• The actors may include:– Owners/managers of the monument, site or artefact, e.g. national heritage
organisation, museum, private persons.– Funding bodies, who may own the IPR in the content and assign licences for its use– Organisations involved in data capture and post-processing of the content– Researchers
• Agreements may cover physical access to the monument, the IPR in the content and licences for its use
• Content includes text documents, images, 3D models, videos and original data created by the archaeological research.
• Metadata is provided for discovery and to promote re-use of the content is generally openly licenced
Copyright and research data
• Copyright protects the expression of an idea– not the idea itself.
• Data is not covered by copyright– but the arrangement of data in a spreadsheet or database is
• Copyright is assigned when a creative work is produced– Funding bodies may request copyright is assigned to themselves – Employers may claim copyright of works produced by their staff.
• How long copyright lasts varies according to the type of work and the country
• Copyright law varies from country to country. • Different institutions have different copyright clauses in their employment
contracts.
“Intellectual property rights, very broadly, are rights granted to creators and owners of works that are the result of human intellectual creativity”
Licences• Copyright protects your work• Licences are your way of saying how people may use it and
cover:– Attribution (of you as the author of the work)– Permitted uses (e.g. education, commercial uses, open access)
• Limitations on use e.g. publication of an image in a journal article– Derivatives – whether people can make copies, remix or use the
content to create new works– Share alike - a license condition that specifies that new works must
be licensed under the same terms
Some context: open access to scientific data
• The European Union promotes open access to publications and data with the aim of:– Enabling researchers to build on previous research– Fostering collaboration between researchers– Accelerating innovation– Involving citizens and society
What is open access?
Open access can be defined as providing on-line access to scientific information that is free
of charge to the end-user and is re-usable
Main routes to open access:• Open access scientific journals• Open access data repository
Archaeology and open access
Atkinson, M. and Preston S. (2015). Heybridge: A late Iron Age and Roman settlement. Excavations at Elms Farm 1993-5. Volume 2, Internet Archaeology 40. http://dx.doi.org/10.11141/ia.40.1
Open access publication Open access data archive
Related digital archive: Essex County Council (2015) Elms Farm Portfolio Project [data-set]: http://dx.doi.org/10.5284/1021668
Open data: accessible online
• Accessible online– The dataset is available online via a service– Users may need to register to access the data
http://dans.knaw.nl/en/search
Open data: re-usable
• Open data is re-usable– Available in an open format that allows for re-analysis, e.g. the data is in a
spreadsheet and not locked in a PDF document– Is more than the summarized data in publications (i.e. figures, charts, etc.) – It may be original raw data or have been cleaned, or normalized when
deposited
Open data: open licences
• Open licences permit re‐use of data for free• Includes any royalty‐free copyright licence• Example licences:
– CC0 - Creative Commons Zero (Public Domain dedication without attribution)
– CC-BY – Creative Commons Attribution– CC-BY-SA – Creative Commons Share-alike– ODC PDDL - Open Data Commons Public Domain Dedication and
Licence– ODbL – Open Data Commons Open Database licence
• http://creativecommons.org/licenses/• http://opendatacommons.org/
Open licenses: example of re-use
Barriers to data sharing• Priority of published papers / little academic reward for
development and sharing of datasets
• Existing copyrights, confidential and sensitive data
• Concerns of researchers that data could be scooped, misused or misinterpreted
• Potential reputational risk (e.g. data quality, errors,…)
• Required effort to share re-usable data, (incl. formatting, metadata creation, licensing etc.)
• Perceived lack of appropriate data archives (trusted, sustainable, ...)
Sensitive dataArchaeological datasets may sometimes include sensitive or confidential information
relating to individuals but which provides valuable historical or contextual information.
• Personal Data is data relating to living individuals which identifies them: name, age, sex, address, photographs, etc.
• Sensitive Personal Data is data that may incriminate a person such as:– Race, ethnic origin, political opinion, religious beliefs, physical/mental health,
sexual orientation, criminal proceedings or convictions.• Confidential data includes:
– Data given in confidence, or agreed to be kept confidential (i.e. not released into public domain).
– Data covered by ethical guidelines, legal requirements, or research consent forms.
• Sharing of such data can often be achieved using a combination of obtaining consent, anonymising data and regulating data access.
Benefits of data sharing• Access to data is a scholarly
communication• Citation increases the visibility • Motivates/inputs new research• Verification of research/research
integrity• Stimulates new collaborations• Re-use/-purposing of well curated
data increases research efficiency
Charles Beagrie: Keeping Research Data Safe (KRDS) benefits framework
How to reap the benefits?• Deposit data in a recognised repository which
– provides unique persistent identifiers (e.g. DOIs)– requires users to follow citation standards (e.g. DataCite)
• Provide good metadata – “no pain, no gain”– Key for data re-use without direct contact with creator– Include costs of preparing data and metadata for
publication in requests for project funding • Apply an open license that allows reuse • Cite your own data!
Attribution of Research Data• Example licences: CC-BY and ODC-BY • Attribution for a dataset citation:
– For example: Evans, T.N.L. and R.H. Moore (2014) 'The Use of PDF/A in Digital Archives: A Case Study from Archaeology' International Journal of Digital Curation . Vol. 9, No. 2, pp. 123-138. DOI: 10.2218/ijdc.v9i2.267
• Ask for a persistent identifer– resolves to an Internet location, e.g. Handles, Archive Resource Keys
(ARKs) Persistent URLs (PURLs) and Digital Object Identifiers (DOI) • Attribution helps:
– The data to be tracked and its impact– Re-use and verification of data
What we mean by open data
• Accessible online• Free at the point of use • Reusable• Openly licensed (e.g. CC-BY, CC-BY-SA)
– Licence allows derivatives to be created
Group discussion
For your own project data:
• What data will be produced/could be archived?
• Are there any barriers to you sharing the data?
• What steps will you need to carry out to do this?
• How might open access benefit your research?
AcknowledgementsARIADNE is a project funded by the European Commission under the Community’s
Seventh Framework Programme, contract no. FP7-INFRASTRUCTURES-2012-1-313193.
Teaching Materials for Research Data Management in Archaeology created by Lindsay Lloyd-Smith (2011) as part of the JISC funded DataTrain project based at the
Cambridge University Library
ARIADNE, 2014, D3.3 Report on data sharing policies: http://www.ariadne-infrastructure.eu/Resources/D3.3-Report-on-data-sharing-
policies
3D ICONS, 2014, Guidelines: http://www.3dicons-project.eu/eng/Guidelines-Case-Studies/Guidelines2