collaboration and sharing

31
Collaboration and sharing computational research methods Carole Goble University of Manchester, UK [email protected]

Upload: jisc

Post on 10-May-2015

827 views

Category:

Education


0 download

DESCRIPTION

Presented by Carole Goble at the JISC Future of Research Conference, 19th October 2010

TRANSCRIPT

Page 1: Collaboration and Sharing

Collaboration and sharing computational research methods

Carole GobleUniversity of Manchester, [email protected]

Page 2: Collaboration and Sharing

Scientific research is generally held to be of good provenance when it is documented in detail sufficient to allow reproducibility.

http://en.wikipedia.org/wiki/Provenance#Science

Materials +

Methods+

Results=

Publication

Page 3: Collaboration and Sharing

In silico Experimental Standard Operating, Procedures, Protocols, Plans

http://usefulchem.wikispaces.com/page/code/EXPLAN001

Page 4: Collaboration and Sharing

E. Science laboris: in silico experimentsAutomated, reusable scripted analysis pipelines - workflows

Data processing , data chainingData and tool integrationAnnotation pipelinesAnalyticsSimulation steering and parameter sweepsPublication miningModel and hypothesis buildingResult Validation and comparisonData cleaning, curation and preservation

Shield from clouds and clusters details.Record provenance: steps, methods, results

http://www.mygrid.org.uk/tools/taverna/

Page 5: Collaboration and Sharing

Genetic variation in cattle species. Food security, biodiversity.

Resistance to African trypanosomiasis infection (sleeping sickness)

Liverpool (Kemp), Manchester (Brass), Nairobi

Comparing new data with reference genomes, prior results and the literature to identifying interesting differences

22 million SNPs

Page 6: Collaboration and Sharing

Little Science + Big Science

http://www.genomics.liv.ac.uk/tryps/trypsindex.html

Page 7: Collaboration and Sharing

Bottom up Effectiveness in Research

• Automated, repeatable, tracked plumbing– Using institutional and community computing

infrastructures, tools and datasets• Easier access to best of breed and “surfing” results

– non-developers access to sophisticated codes and applications, shielded from nasty computing details.

• Leverage applications, services, datasets and codes shielded from computing details. – Honors original codes and applications. Heterogeneous

coding styles and tools sets. The best applications.• Extensibility, adaptability & innovation.

– My stuff. Variant design.

Page 8: Collaboration and Sharing

Reuse, Recycle, Repurpose, Mash, Trade, Publish

Dr Paul Fisher

Dr Jo Pennock

Identify biological pathways implicated in resistance to Trypanosomiasis in cattle using mouse as a model organism.

Identify the biological pathways involved in sex dependence in the mouse model, believed to be involved in the ability of mice to expel the whipworm parasite.

Levison S.E., et al Inflammatory Bowel Diseases (2010)

Fisher P, et al Nucleic Acids Research, 2007, 35(16) 5625-5633

Page 9: Collaboration and Sharing

Global Long Tail

Page 10: Collaboration and Sharing

How do I find and share methods across the

institution, communities, the web ?

How do I connect with other authors and users?

How do I know if its any good or right for me?

Who else is using it?

Where do I comment on my experiences?

Page 11: Collaboration and Sharing

http://www.myexperiment.org

• Socially share, discover, review and reuse workflows and other scientific methods.

• Cooperative market place.• A scientific gateway.

• Commons-based Production + Social networking

• Primary contribution, reviewing and curating.

Page 12: Collaboration and Sharing

Find experts and peers, advice, workflows, packs

Contribute, review and curate workflows

Train and educateLaunch workflows Cloud Methods Commons

http://www.myexperiment.org

Page 13: Collaboration and Sharing

Facts and Figures: Boutique but Beautiful

• Public Service: 1325 workflows, 349 files, 138 packs, 4129 registered members, 235 groups, 56 different countries, ~ 3000 unique hits per month. Workflows viewed/downloaded many 1000s of times.

• Adopted by 19 workflow systems and integrated into workflow workbenches: Galaxy, Taverna– Biology, chemistry, image analysis, social science, astronomy, engineering,

music…– Specialised clones in Music & NeuroScience. – Focus of research on workflow patterns and analytics.

• JISC funding since 2007• (Other funding: Microsoft, EU, EPSRC, BBSRC)

Page 14: Collaboration and Sharing

Effectiveness and Open CollaborationOpen platform, off the shelf components, open development, open linked data, Web 3.0 funky

Google gadgets

Application plugins

Linked Data Cloud

Page 15: Collaboration and Sharing

Friends, colleagues, resources

Literature

Images

LogBook

SoftwarePresentations

Data (files, spreadsheets)

Compute resources

Backup and Archive

WorkbenchGadgets

Publishing

[Duncan Hull]

Page 16: Collaboration and Sharing

Social collaboration environments (“e-Laboratories”) Collaboration, acceleration and transparency through Human Computation

Workflow management system Collaboration, acceleration and transparency through Automated Computation.

http://www.mygrid.org.uk

Page 17: Collaboration and Sharing

Institutional challenges

Adoption of Reproducible MethodsNew Publishing and Learning Objects

Pre-and Post Publication Metadata DifferentialsCitation, Credit and Reputation

Curation Costs

Page 18: Collaboration and Sharing

Methods Matter

Reproducible (or at least

defendable) Research

many eyes

Science 2010

Page 19: Collaboration and Sharing

scientists

LocalWeb

Repositories

Graduate Students

Undergraduate Students

Virtual Learning Environment

Technical Reports

Reprints

Peer-Reviewed Journal &

Conference Papers

Preprints &

Metadata

Certified Experimental

Results & Analyses

experimentation

Data, Metadata Provenance WorkflowsOntologies

Digital Libraries

[De Roure]

Actionable scholarly publishing & learning

Page 20: Collaboration and Sharing

Actionable Compound Research Objects

Page 21: Collaboration and Sharing

Data and Method burialSupplementary informationText miningThe rise of the Wiki

Page 22: Collaboration and Sharing

Competitive advantage.Adoption.

Credit.Help.Fame.

Reputation.

Being scooped.Scrutiny.

Misinterpretation.Cost.

Blame. Reputation.

Rew

ards

Risk

s

Nature 461, 145 (10 September 2009)

Trust

Page 23: Collaboration and Sharing

“Its not ready yet”

“I need to get (another) publication first”

“We don’t have the resources or skills to prepare it for others, esp. now we finished that project”

“Its faster/easier to do it myself, and will get the credit/control too”

“Its not described enough to be usable”

“I don’t trust the quality. Its not reliable enough. Its too noisy.

“Others won’t use it properly.” “Its not worth my while”

Sharing Governance ….

Page 24: Collaboration and Sharing

Credit & Reputation Quality & ReassuranceProvenance

Page 25: Collaboration and Sharing

Crowd Contribution

CreditRewardCareerUse ProfilesCitation

Method Building too.

Nature 2008

Page 26: Collaboration and Sharing

Credit and Reputation

T Shirts are not enough

Software

Community building

Automation

Social & Cultural

Coordination

GovernanceQUALITY for REUSE

Sustainability

Page 27: Collaboration and Sharing

Public Service

Community generated and curated Content

Open Software

Teaching aid

Methodology

Dig Library/Repository

Social network

Computer science researchSoftware engineeringComputational researchersResearchersSocial Science Social experiment

Collaboration platform

Page 28: Collaboration and Sharing

Free like puppies

Page 29: Collaboration and Sharing

Take Home: Methods Matter.

• Workflows are a transformative mechanism of connecting tools and encoding know-how– Scientists stand on the shoulders of resource experts

• myExperiment is a example of a collaborative environment for connecting workflow authors and users– Authors stand on the shoulders of each other

• The Power of Collectivism.• Rewards and risks of researchers in competitive

research.• Cultural shift in reward, adoption and support for

building, sharing and curating computational methods.

Page 30: Collaboration and Sharing

AcknowledgementsmyExperiment Director: David De Roure

Developers• Jiten Bhagat• Don Cruickshank• Danius Michaelides • David Newman• Sergejs Aleksejevs• Mark Borkum• Matt Lee• Tom Foster

Allied, Contributing Projects• Thomas Laurent• Eric Nzuobontane • Ian Dunlop• Stuart Owen• Shoaib Sufi• Sean Bechhofer• Rodrigo Lopez• Steve Pettifer• Mannie Tags• Finn Bacall• Sarah Thew• Matt Gamble• Tim Clark

Users• Katy Wolstencroft• Paul Fisher• Duncan Hull• Franck Tanoh• Andrea Wiggins• Marco Roos• Jerzy Orlowski• Olga Krebs• Wolfgang Mueller• Tony Linde

Social Scientists• Yuiwei Lin• Rob Proctor• Meik Poschen• Jonathan Foster

Sponsors• Savas Parastatidis • Roger Barga• Derick Campbell• Tony Hey

Page 31: Collaboration and Sharing

Contact

David De [email protected]

Carole [email protected]

Visit myexperiment.org