Heartbeat: measuring installed base by analyzing downloads and Scientific Software Network Map James Howison University of Texas at Austin

Upload: james-howison

Post on 13-Feb-2017


TRANSCRIPT

Page 1

Heartbeat: measuring installed base by analyzing downloads and Scientific Software Network Map

James Howison, University of Texas at Austin

Page 2

Downloads

• Great desire to measure something similar to sales and/or market share

• Early focus on downloads, but …
  – A download is not a sale
  – No direct reward
  – Might be experimentation
  – Strongly correlated with number of releases

Page 3

Installed Base

• How many regular users does a piece of software have?

Page 4
Page 5

BibDesk daily downloads

Page 6

BibDesk installed base

Page 7

What’s needed?

• High frequency data
• Some notification of new releases, or
• Some driver for frequent updates
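One way to read these requirements: if regular users are notified of a release and update promptly, the surge of daily downloads above the usual level after that release approximates the installed base. The sketch below illustrates that reading only. It is not the procedure behind the BibDesk figures, and the function name, the median baseline, the 7-day window, and the sample numbers are all assumptions made for illustration.

```python
from statistics import median


def estimate_installed_base(daily_downloads, release_day, window=7, baseline_days=14):
    """Sum the downloads above baseline in the days following a release.

    Assumption (not from the slides): downloads on quiet days are mostly
    newcomers or experimenters, and the excess after a release is existing
    users updating.
    """
    start = max(0, release_day - baseline_days)
    before = daily_downloads[start:release_day]
    if not before:
        raise ValueError("need at least one day of data before the release")
    baseline = median(before)  # typical quiet-day level
    after = daily_downloads[release_day:release_day + window]
    # Days below baseline contribute nothing rather than a negative amount.
    return sum(max(0, count - baseline) for count in after)


# Made-up daily counts: a quiet week, then a release on day 7.
downloads = [100, 110, 90, 100, 105, 95, 100,
             600, 400, 250, 150, 110, 100, 100]
print(estimate_installed_base(downloads, release_day=7))  # prints 1010
```

The estimate only makes sense with daily (or finer) counts and a known release date, which is why high-frequency data and release notifications appear in the list above.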

Page 8

Current work

• Focus on software work in science
  – No convenient central repositories!

• Focusing on understanding what software is used with what
  – Complements, not dependencies

• Linking metrics from publications to runtimes, and dependencies.

Page 9
Page 10

Mentions in publications?

Page 11

@jameshowison DOI: 10.6084/m9.figshare.1146366

Types of mentions in publications

Mention Type | Example
Cite to Publication | … was calculated using biosys (Swofford & Selander 1981).
Cite to Project Name or Website | … using the program Autodecay version 4.0.29 PPC (Eriksson 1998). Reference List has: ERIKSSON, T. 1998. Autodecay, vers. 4.0.29 Stockholm: Department of Botany.
Like Instrument | … calculated by t-test using the Prism 3.0 software (GraphPad Software, San Diego, CA, USA).
URL in text | … freely available from http://www.cibiv.at/software/pda/ .
In-text name mention only | … were analyzed using MapQTL (4.0) software.
Not even name mentioned | … was carried out using software implemented in the Java programming language.
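The examples above differ in surface features (a URL, an author-year citation, a version number), which suggests a first, very coarse pass when mining text for mentions. The sketch below only flags those three signals in a sentence. It is an illustration, not the coding scheme used in the study: the function name and the regular expressions are assumptions, and they will misfire on ordinary citations and on numbers such as p-values.

```python
import re

# Hypothetical patterns, tuned only to the example sentences in the table.
URL = re.compile(r"https?://\S+")
AUTHOR_YEAR = re.compile(r"\([^()]*\b(?:19|20)\d{2}\)")  # e.g. "(Eriksson 1998)"
VERSION = re.compile(r"\b\d+\.\d+(?:\.\d+)*\b")          # e.g. "4.0" or "4.0.29"


def coarse_signals(sentence):
    """Report which surface signals of a software mention a sentence contains."""
    return {
        "url": bool(URL.search(sentence)),
        "author_year": bool(AUTHOR_YEAR.search(sentence)),
        "version": bool(VERSION.search(sentence)),
    }


for text in [
    "… was calculated using biosys (Swofford & Selander 1981).",
    "… freely available from http://www.cibiv.at/software/pda/ .",
    "… were analyzed using MapQTL (4.0) software.",
    "… was carried out using software implemented in the Java programming language.",
]:
    print(coarse_signals(text))
```

The last example triggers none of the signals, and the signals alone cannot tell a citation to a publication from a citation to a project, which is one reason a hand-coded reference dataset is useful.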

Page 12

@jameshowison DOI: 10.6084/m9.figshare.1146366

Types of Mentions

Page 13

Detecting complements

• http://scisoft-net-map.isri.cmu.edu/
• http://scisoft-net-map.isri.cmu.edu:7777/
• http://depsy.org/
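A simple starting point for detecting complements is to count how often two packages appear in the same unit of use (a session, a job, a script). The sketch below is a minimal illustration under that assumption and does not describe how the sites listed above work. The function name, the `known_dependencies` filter, and the placeholder package names are hypothetical.

```python
from collections import Counter
from itertools import combinations


def count_co_use(sessions, known_dependencies=frozenset()):
    """Count how many sessions each pair of packages shares.

    `known_dependencies` holds alphabetically sorted (a, b) pairs to skip,
    so that only complements, not dependencies, are counted.
    """
    pair_counts = Counter()
    for packages in sessions:
        # Deduplicate and sort so each pair has one canonical form.
        for pair in combinations(sorted(set(packages)), 2):
            if pair not in known_dependencies:
                pair_counts[pair] += 1
    return pair_counts


# Made-up sessions with placeholder package names.
sessions = [
    ["tool_a", "tool_b", "tool_c"],
    ["tool_a", "tool_b"],
    ["tool_c", "tool_a"],
]
counts = count_co_use(sessions, known_dependencies={("tool_a", "tool_c")})
print(counts.most_common())
```

Raw counts favour popular packages, so a real analysis would probably normalise by how often each package appears on its own.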

Page 14

Questions

• Ideas for discovering complements
  – Software used with other software

• Anyone interested in mining publications (or perhaps blogs etc.) for software mentions
  – Gold standard dataset at: github.com/jameshowison/softcite