five selfish reasons to work reproducibly
TRANSCRIPT
Florian Markowetz CRUK Cambridge Institute
www.markowetzlab.org
5 sel!sh reasons to work reproducibly
More publications, more grants, more awesome!
Systems Genetics of Cancer
Genetic variation • In people • In tumours • In clones
Phenotypic variation • Tumour subtypes • Aggressiveness • Survival
Cancer genome Evolution
Cancer tissue Context
Cancer genome Function
Ines
Wei Edith
Geoff
Ke Anne Joe
Leon
Andy
Amanda
Reproducible Research
• It’s the right thing to do! • The world would be a better place if everyone did it!
• It’s the foundation of Science!
• It’s the honourable thing to do!
Weak Strong Phenotype
Step 1
Step 2
Hits
Knock-down Known pathway members New RNAi Hits
Compare expression phenotypes by NEMs
NFκB
?
Anatomy of the NFκB pathway
Why is well-documented and easily accessible code+data useful?
• Easy to look up numbers and put them in manuscript
• Be con!dent your !gures and tables are up-to-date • Numbers and result automatically update when
data change. • It is engaging and more eyes can look over the
analysis. • Easier to spot mistakes.
Why is well-documented and easily accessible code+data useful?
• Easy to look up numbers and put them in manuscript
• Be con!dent your !gures and tables are up-to-date • Numbers and result automatically update when
data change. • It is engaging and more eyes can look over the
analysis. • Easier to spot mistakes.
A very engaged reviewer
• Reviewer: “I downloaded the authors’ data and tried out a variation of their analysis which gave an insigni!cant result”
• We: “Thank you, the reason is XXX and
if you do YYY everything is !ne.”
“My PI said I should continue the project of a previous
postdoc.
But that postdoc is long gone and hasn’t saved any scripts
or data.”
“Sounds alright, but my code and data are spread over so
many hard drives and directories that it would just be too much work to collect
them all in one place”
5 sel!sh reasons to work reproducibly
1. Avoid disaster
2. Easier to write papers
3. Easier to talk to reviewers
4. Continuity of your work/in the lab
5. Reputation
When do you need to worry about reproducibility?
• Before you start the project • While you do the analysis • When you write the paper • When you co-author a paper • When you review a paper
When do you need to worry about reproducibility?
• Before you start the project • While you do the analysis • When you write the paper • When you co-author a paper • When you review a paper
Scienti!c SOFT SKILLS
• Organization of project
• Tidy data
• Tidy code
• Control over tools
• Documentation
• Reproducibility
\project \data \code \analysis \paper
Less clicking and pasting, more scripting and coding
Reproducibility is important for
• Phd students
• Postdocs
• PIs
Learn tools and apply in daily work!
Create a ‘culture of reproducibility’ in your lab!