"the reality of digital science"
TRANSCRIPT
kaitlin thaneySciPy, 13 july 2011
austin, texas
the reality of ‘digital science’
Wednesday, 13 July 2011
xi. background
Wednesday, 13 July 2011
about me
Wednesday, 13 July 2011
Digital Science(the company)
Wednesday, 13 July 2011
investment armincubator rolein-house dev
Wednesday, 13 July 2011
tiered approachbuild to scale
researcher-focused
Wednesday, 13 July 2011
1. science, tech, and moving online
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
research
idea
experiment
lit review discovery
materials
publish
share results
retestanalyze
collect data
Wednesday, 13 July 2011
blocking points
idea
experiment
lit review discovery
materials
publish
share results
retestanalyze
collect data
(to name a few ... )
Wednesday, 13 July 2011
access
analysis
disseminationWednesday, 13 July 2011
text texttext
Wednesday, 13 July 2011
discovery & delivery
Wednesday, 13 July 2011
changes at the workbench
Wednesday, 13 July 2011
annotation & curation
Wednesday, 13 July 2011
social & administrative
Wednesday, 13 July 2011
gaps still exist
Wednesday, 13 July 2011
2. key constituencies
Wednesday, 13 July 2011
(3)
Wednesday, 13 July 2011
machines
researchers
decision makers
Wednesday, 13 July 2011
machines
researchers
decision makers
Wednesday, 13 July 2011
...annotation
markupsearch
discovery“behind the
scenes”...
Wednesday, 13 July 2011
[brief interlude]
Wednesday, 13 July 2011
digitisation of the scholarly canon
(content is still king)
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
not nearly there yet ...
Wednesday, 13 July 2011
Wednesday, 13 July 2011
barriers to “access”
Wednesday, 13 July 2011
still the starting point
Wednesday, 13 July 2011
patents are no better(in many cases, worse)
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
can streamline
Wednesday, 13 July 2011
name disambiguation
Wednesday, 13 July 2011
10,11-dihydro-5-methyl-5H-dibenzo[b,e][1,4]diazepin-11-one
(still strains the minds of the best)
Wednesday, 13 July 2011
machine readability is key.
agreement is hard.
Wednesday, 13 July 2011
10,11-dihydro-5-methyl-5H-dibenzo[b,e][1,4]diazepin-11-one
Wednesday, 13 July 2011
“everything is metadata ...
everything can be a label.”
- david weinberger
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
machines
researchers
decision makers
Wednesday, 13 July 2011
a few edge cases (though no field is perfect)
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
CC-BY-2.0 - Plaxco Lab - http://www.flickr.com/photos/34857812@N04/
Wednesday, 13 July 2011
Wednesday, 13 July 2011
trackingexpiration calibration
Wednesday, 13 July 2011
the non-digital
+ordering
processing
Wednesday, 13 July 2011
protocols parameterscalibration
misc. lit
Wednesday, 13 July 2011
managing information
different types of “data”
Wednesday, 13 July 2011
often gigabytes, not terabytes
Wednesday, 13 July 2011
“i invented a folder based system ...”
Wednesday, 13 July 2011
“i invented a folder based system ...”
“yeah, we had a LIMS. it only ever got used to store photos
from lab nights out.”
Wednesday, 13 July 2011
Wednesday, 13 July 2011
why?
experimentation reliance
data moves, grows legs
funder/instit’n pressure
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
machines
researchers
decision makers
Wednesday, 13 July 2011
rewards, incentives the “ why ”
Wednesday, 13 July 2011
data capture(of a different sort)
Wednesday, 13 July 2011
the “social issue”best practices
behaviour roadblocksdiscipline / researcher specific
Wednesday, 13 July 2011
paper’s still the currency
Wednesday, 13 July 2011
imperfect system
Wednesday, 13 July 2011
“Right now we're going through a Cambrian explosion of metrics.”
- Johan Bollen
Nature 465, 864-866 (2010) | doi:10.1038/465864a
Wednesday, 13 July 2011
there’s been a drastic spike in terms of sheer volume and type
Wednesday, 13 July 2011
citation / impact factorh - index
weighted citations (eigenfactor, sjr)“betweenness centrality”
alt-metrics, etc.
Wednesday, 13 July 2011
difficult to ... harmonise
track /maintain / mapunderstand
(even still measure)
Wednesday, 13 July 2011
administrators / funders =
part of the research cycle
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
Wednesday, 13 July 2011
3. the reality
Wednesday, 13 July 2011
“the future is here ... just not evenly
distributed yet.”- William Gibson
Wednesday, 13 July 2011
changing understandings,
paradigms
Wednesday, 13 July 2011
technology can helpdesign decisions are key
plan for the irrational
Wednesday, 13 July 2011
more efficient researchincrease productivityenable reproducibility
Wednesday, 13 July 2011
thank you.
@kaythaney
Wednesday, 13 July 2011