science and the web simon cockell (@sjcockell) bioinformatics support unit (@nclbsu) ...

47
Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu) http://bsu.ncl.ac.uk/ [email protected]

Upload: anis-mcbride

Post on 01-Jan-2016

225 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Science and the WebSimon Cockell (@sjcockell)

Bioinformatics Support Unit (@nclbsu)

http://bsu.ncl.ac.uk/

[email protected]

Page 2: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

In the bad old days• There was the internet• But there was no web • No Facebook• No iPlayer• No Spotify• No Google• But we still managed to use it for awesome science

Page 3: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

• Yes, this really was the view of the world’s sequence databases that we used to have!

Page 4: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk
Page 5: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

But we had social resources• Email newsgroups• MUDs (kind of like a text only Second Life)• Usenet Newsgroups

• A kind of forerunner of today's message boards• Often un-moderated• Full of spam• All the science stuff of interest was under bio.* or sci.*

Page 6: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

How the web was won• The internet has been designed by scientists for scientists

pretty much since inception• The internet ‘as we know it today’ arrived in the 1990s• OK that’s just not true, but it’s what most people associate

with the internet• Aka ‘The Web’

Page 7: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

And this WAS about science• (Sir) Tim Berners-Lee is a scientist• The World-Wide Web (W3) was developed to be a pool of

human knowledge, which would allow collaborators in remote sites to share their ideas and all aspects of a common project

• Data, and pages could be linked together for the first time with images, sounds, text

• As the data grew, it needed to be searched• Search engines collated the data

Page 8: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

And so Web 1.0 began• We got used to:

• Online shopping• Having a homepage• Getting our news online• Guestbooks on websites• Animated GIFs• Wonderful colourschemes:

Page 9: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

So wtf is Web 2.0?• 20% marketing nonsense to attract investors• 20% buzzwords designed to annoy old people

• ‘mashup’ etc

• 10% missing vowels & capitalisation • flickr, tumblr etc

• 50% useful• Specifically refers to ‘user generated content’ • High availability and access to data• Networks of people and network effects• Openness• Tapping into the ‘wisdom of the crowds’

Page 10: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Concepts

• One of the social concepts most people are familiar with now is tagging:

• And the ubiquitous upvote:

Page 11: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

How can we take this out of Facebook and into science?

Page 12: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Tagging is very useful• Search and retrieval tool• Can be applied to just about anything

• Browser bookmarks• Uploaded YouTube videos• Academic Papers• Uploaded Powerpoint presentations• And most importantly they can be SHARED with other users

• We call these tags ‘folksonomies’• “a system of classification derived from the practice and method of

collaboratively creating and managing tags to annotate and categorize content”

Page 13: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Social bookmarking• If you’re going to share your folksonomies, why not share

your bookmarks too?• Maybe you don’t want to share everything you have

bookmarked in your browser…• But what about the stuff that’s related to work?• Many services exist for this, but the one that most people

know about is http://delicious.com/

• See also:• diigo.com• pintrest.com

Page 14: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

delicious.com

Page 15: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Scientific Literature• Scientific papers are as amenable to bookmarking as

websites• Needs specialized service

http://pubmed.org/22522561 http://dx.doi.org/10.1038/nm.2692

http://www.nature.com/nm/journal/vaop/ncurrent/full/nm.2692.html

Page 16: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

CiteULike• A social bookmarking service for scientific literature• Exactly like Delicious, but understands that different URLs

can point to the same resource• Allows you to exploit “network effects”

• Someone who reads lots of the same papers you do is probably interested in the same things

• They may have papers in their library you don’t know about• You may be interested in those papers too

Page 17: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

CiteULike

Page 18: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Mendeley• Similar service to CiteULike

• Bookmark papers• Define folksonomy• Discover related literature

• Also – • Manages PDFs• Manages citations (plugins for Word etc)

See also:Zotero (http://zotero.org)Papers (http://mekentosj.com/papers)

Page 19: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Mendeley - App

Page 20: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Mendeley - Web

Page 21: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Mendeley - Citations

Page 22: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

We have a data overload

• How can you define what is important?• And how can you deal with it subsequently?

Page 23: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Knowledge Discovery• No, not via Google

• One of the issues of information overload is having to go to multiple sites in order to collate the day to day information you might want to read

• This is already a solved problem

• RSS

• ‘Really Simple Syndication’

Page 24: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

RSS

Page 25: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Who has RSS?• You can have RSS feeds from

• Journal publishers• Search results (PubMed)• Blogs• CiteULike, etc.• Actually just about any dynamically updated web resource.

• RSS is everywhere• Pervasive, Useful, Centralised, Shareable

Page 26: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

FOAF and Network Effects• Already mentioned re: CiteULike• FOAF = Friend Of A Friend• Requires a network (obviously)• Can be of things (ie papers) or people• Have been loads of attempts to build ‘Facebook for

Scientists’• Begs the question – how about Facebook?

Page 27: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

On the (awesome) power of Twitter

Page 28: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk
Page 29: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Q&A• Concept dates back to internet dark ages

• ‘Usenet’ groups, revolved around specific subject areas

• With the birth of the web, ‘forums’ proliferated• Usability nightmare, hard to navigate, harder to search

• Then came expertsexchange.com• That’s Experts Exchange dot com, not Expert Sex Change dot com• Pay people to answer specific queries• Results in *bad* content (people care about the money, not the

content)

• Then came stackoverflow.com...

Page 30: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Stack Overflow

Page 31: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Stack Exchange

Page 32: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

biology.stackexchange.com

Page 33: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

academia.stackexchange.com

Page 34: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

An example - BioStar

Page 35: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Half an hour later...

Page 36: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Online Collaboration (crowd sourcing)

Page 37: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Sharing your work• Everything so far has been about sharing knowledge

• Things you read• Things you’ve seen• Things you know

• Some people go further and share all their work• So-called ‘Open Science’

• While this is an extreme, it can have benefits• Some aspects of open science are becoming mainstream

• Open Access publishing• Data sharing (now required by many funding bodies)

Page 38: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

#arseniclife

Page 39: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

#arseniclife

If this data was presented by a PhD student at their committee meeting, I'd send them back to the bench to do

more cleanup and controls.“

Page 40: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

#arseniclife

Page 41: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Open Access publishing• Your work is not behind a 'paywall'

• It can be very frustrating trying to get hold of papers from journals or publishers that the University does not have a site licence for

• Your work can be more widely read • Hard to argue that this is not desirable!

• The full text of your article is preserved and can even be analysed computationally to derive even more knowledge

Page 42: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

“Increasing pressure to share data

BBSRC expects research data generated as a result of BBSRC support to be made available with as few

restrictions as possible in a timely and responsible manner to the scientific community for subsequent research.

Applicants should make use of existing standards for data collection and management and make data available

through existing community resources or databases where possible.

http://www.bbsrc.ac.uk/organisation/policies/position/policy/data-sharing-policy.aspx

Page 43: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Data Sharing• Many services for specific data types• GEO/ArrayExpress for microarrays• PRIDE for proteomics• SRA for high-throughput sequencing• Also now facilities to share more generic research data:

• FigShare

Page 44: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

FigShare

Page 45: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Why share?• Well you might as well ask

• Why go to a conference?• “At a conference the most important things happen in the coffee break”

– Hans Ulrich Obrist

• Why talk to colleagues?

• The internet doesn’t have to be a distraction, it can be an extension of your peer group. A place to find and exchange relevant information, build your profile and do your job more efficiently

Page 46: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Your online “brand”• In this day and age a future employer is going to Google

you• Own what Google “says” about you• What you say and do in public forums is a matter of public

record - it’s ok to engage and discuss, but be wary of getting mad

• Be consistent and professional in your online ‘persona’• You might want to have a look at what your Facebook

profile exposes to the world - it may be more than you think

• LinkedIn is a great place for an online CV and networking

Page 47: Science and the Web Simon Cockell (@sjcockell) Bioinformatics Support Unit (@nclbsu)  simon.cockell@newcastle.ac.uk

Things we haven’t had time for...• Loads more useful sites and services:• Nature Network• OpenWetWare• LinkedIn• SlideShare• Prezi• BioScreenCast• Google Plus/Groups/Docs• DropBox• ResearchBlogging