data citations: who cares?

48
Data citation... Who cares? Heather Piwowar DataONE postdoc with Dryad and NESCent DataONE summer internship meeting July 7, 2010

Upload: heather-piwowar

Post on 01-Nov-2014

1.184 views

Category:

Education


0 download

DESCRIPTION

Who cares how research data is attributed and cited? Lots of people. Presented by Heather Piwowar to DataONE summer internship 2010 group on data citatio

TRANSCRIPT

Page 1: Data citations:  who cares?

Data citation...Who cares?

Heather Piwowar

DataONE postdoc with Dryad and NESCentDataONE summer internship meeting 

July 7, 2010

Page 2: Data citations:  who cares?

http://www.metmuseum.org/toah/ho/09/euwf/ho_24.45.1.htm

Page 3: Data citations:  who cares?

http://www.flickr.com/photos/jsmjr/62443357/

Page 4: Data citations:  who cares?

http://www.flickr.com/photos/camilleharrington/3587294608/

Page 5: Data citations:  who cares?

http://www.flickr.com/photos/rkuhnau/3318245976/

Page 6: Data citations:  who cares?

http://www.flickr.com/photos/conformpdx/1796399674/

Page 7: Data citations:  who cares?

http://www.flickr.com/photos/rkuhnau/3317418699/

Page 8: Data citations:  who cares?

http://www.flickr.com/photos/zemlinki/261617721/

Page 9: Data citations:  who cares?

http://www.flickr.com/photos/tracenmatt/3020786491/

Page 10: Data citations:  who cares?

http://www.flickr.com/photos/the-o/2078239333/

Page 11: Data citations:  who cares?

Probably.

Page 12: Data citations:  who cares?

In theory.

Page 13: Data citations:  who cares?

?

Page 14: Data citations:  who cares?

• Genbank

• PDB

Page 15: Data citations:  who cares?

http://www.oxfordjournals.org/nar/database/cap/

Page 16: Data citations:  who cares?

http://www.flickr.com/photos/archeon/2941655917/

Page 17: Data citations:  who cares?

Data citation...

Page 18: Data citations:  who cares?

datasetpaper

paper

paper

paper

paper

paper

dataset

dataset

dataset

dataset

dataset

Page 19: Data citations:  who cares?

• Alas, no unique standard identifier• URL• accession number• DOI• citation to paper• citation to database• reference to supplementary material• search strategy

Page 20: Data citations:  who cares?

Example: full-text phrases containing “... accessed”

Page 21: Data citations:  who cares?

“submitted”

Page 22: Data citations:  who cares?

“downloaded”

Page 23: Data citations:  who cares?

• Citations are indexed and machine-extractable

Page 24: Data citations:  who cares?

datasetpaper

paper

paper

paper

paper

paper

dataset

dataset

dataset

dataset

dataset

Page 25: Data citations:  who cares?

• understand current practice• articulate the best best-practices

Page 26: Data citations:  who cares?

datasetpaper

paper

paper

paper

paper

paper

dataset

dataset

dataset

dataset

dataset

Page 27: Data citations:  who cares?

Who cares?

Page 28: Data citations:  who cares?

1.  Data creators

• personal reward• motivation:

• “if it really helped”• even esoteric datasets are useful

• how prevalent is scooping?• alert to possible misuses• grounded requirements

Page 29: Data citations:  who cares?

2.  Data reusers

• clear guidelines are helpful• what has been reused, for what?• what hasnʼt?

Page 30: Data citations:  who cares?

3.  Repository creators, maintainers

• funding• how much metadata• how to format• what additional tools are useful• lifecycle of data

Page 31: Data citations:  who cares?

4.  Funders

• most, best science for their money• cost/benefit of mandate• inform funding decisions:

• what has been extra useful?• what hasnʼt?

• what support is needed

Page 32: Data citations:  who cares?

5.  Journals

• increasingly called upon to mandate or fund:

• how to decide• how to rationalize

• another avenue to compete

Page 33: Data citations:  who cares?

6.  Information scientists

• extension of citation analysis for studying information behaviour

Page 34: Data citations:  who cares?

6.  Me

Page 35: Data citations:  who cares?
Page 36: Data citations:  who cares?

Articles published in journals

with a strong data-sharing

policy are more likely to have

publicly available datasets

Page 37: Data citations:  who cares?

Reuse estimate

• 2703 submissions in 2007 • GSE* in PubMed Central• Exclude author overlap• Exclude data creation

• automatically, manually

• 139

• 520

Page 38: Data citations:  who cares?
Page 39: Data citations:  who cares?
Page 40: Data citations:  who cares?
Page 41: Data citations:  who cares?

7.  You

Page 42: Data citations:  who cares?

8.  Your mom

Page 43: Data citations:  who cares?

9.  These mice

http://www.flickr.com/photos/ryanr/142455033/

Page 44: Data citations:  who cares?

10.  Scientific progress

• trace errors, fraud• increase transparency• more efficient and effective

Page 45: Data citations:  who cares?

you can not manage what you do not measure

quote: Lord Kelvinhttp://www.flickr.com/photos/archeon/2941655917/

Page 46: Data citations:  who cares?

science about our science

Page 47: Data citations:  who cares?

http://www.flickr.com/photos/druclimb/293046352/

Page 48: Data citations:  who cares?

questions?

Thanks to:

NSF, DataONE, NESCent, Dryad

UBC Dept of Zoology

NLM, U of Pittsburgh Dept of Biomedical Informatics

Open science online community and those who release their articles, datasets and photos openly