genomic futures v_pitt_kent_osu


Upload: ben-busby

Post on 12-Apr-2017


TRANSCRIPT

Page 1: Genomic futures v_pitt_kent_osu

Ben Busby, Ph.D.

Lead, Bioinformatics Training, NCBI

Chair, Department of Bioinformatics and Data Science, FAES

[email protected]

Exploring the Many Possible Futures of Genomics

Efficiently Leveraging Commercial and Open Source Bioinformatics Tools for Clinical Interventions and Research Discoveries from Very Large Datasets

Page 2: Genomic futures v_pitt_kent_osu

Please note that all views are my own and not necessarily those of any Federal agency. No mention of any commercial or non-profit entity should be considered an endorsement.

Slides available at: https://www.slideshare.net/benbusby/genomic-futures-v2

Exploring the Many Possible Futures of Genomics

• Human Genomic Variants – Chronic Disease and Cancer
• Viruses – Zika
• Bacteria – Food Borne Pathogens
• Data Transfer and Storage

Page 3: Genomic futures v_pitt_kent_osu

NCBI

Page 4: Genomic futures v_pitt_kent_osu

Review of terminology and concepts

Next Generation Sequencing

Graphic Credit: Spencer Martin, UBC

Page 5: Genomic futures v_pitt_kent_osu

Review of terminology and concepts

How Genomes are Mapped and Assembled

© Martine Zilversmit 2013

Page 6: Genomic futures v_pitt_kent_osu

http://1.usa.gov/1J1xmYs

NCBI NGS Online Workshop – Available on the NCBI YouTube Channel!

Review of terminology and concepts

How Genomes are Mapped and Assembled

Page 7: Genomic futures v_pitt_kent_osu
Page 8: Genomic futures v_pitt_kent_osu

dbGaP

Page 9: Genomic futures v_pitt_kent_osu

dbGaP

Subjects by year:

2007: 14,201
2008: 53,216
2009: 139,311
2010: 374,464
2011: 485,727
2012: 566,181
2013: 660,665
2014: 876,849
2015: 1,002,935

Page 10: Genomic futures v_pitt_kent_osu
Page 11: Genomic futures v_pitt_kent_osu

dbGaP – GWAS and PheGenI

Page 12: Genomic futures v_pitt_kent_osu

dbGaP – GWAS and PheGenI

Page 13: Genomic futures v_pitt_kent_osu

dbGaP – ClinVar

Page 14: Genomic futures v_pitt_kent_osu

ClinVar

Page 15: Genomic futures v_pitt_kent_osu

ClinVar

Page 16: Genomic futures v_pitt_kent_osu

ClinVar – Why Should we Care?

Page 17: Genomic futures v_pitt_kent_osu

ClinVar – Why Should we Care?

Page 18: Genomic futures v_pitt_kent_osu

ClinVar – Why Should we Care?

Page 19: Genomic futures v_pitt_kent_osu

ClinVar – Why Should we Care?

Page 20: Genomic futures v_pitt_kent_osu

ClinVar – Why Should we Care?

Page 21: Genomic futures v_pitt_kent_osu

Translation to the Clinic

Page 22: Genomic futures v_pitt_kent_osu

DangerTrack!

Combined score is the average of SVs, mappability, GC...

NCBI region list

Encode blacklist
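
The "combined score is the average" idea on this slide can be sketched in a few lines. This is only an illustration, not the DangerTrack implementation: the feature names, the three-bin example, and the assumption that every feature has already been scaled to a common 0-100 range per genomic bin are all hypothetical.

```python
from statistics import mean
from typing import Dict, List


def combined_score(features: Dict[str, List[float]]) -> List[float]:
    """Average per-bin feature scores into one combined score per bin.

    Assumes (hypothetically) that each feature track covers the same bins
    in the same order and is already scaled to a comparable range.
    """
    tracks = list(features.values())
    if not tracks:
        return []
    n_bins = len(tracks[0])
    if any(len(track) != n_bins for track in tracks):
        raise ValueError("all feature tracks must cover the same bins")
    # zip(*tracks) groups the values of every feature for one bin together.
    return [mean(values) for values in zip(*tracks)]


# Made-up scores for three genomic bins (illustrative values only).
example = {
    "structural_variants": [10.0, 80.0, 0.0],
    "mappability": [20.0, 100.0, 0.0],
    "gc_content": [30.0, 60.0, 0.0],
}
scores = combined_score(example)
print(scores)
```

A plain mean weights every feature equally; a weighted mean would be the natural variation if some features are considered more informative than others.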

Page 23: Genomic futures v_pitt_kent_osu

Genome in a Bottle

Page 24: Genomic futures v_pitt_kent_osu

DangerTrack!

Combined score is the average of SVs, mappability, GC...

NCBI region list

Encode blacklist

Page 25: Genomic futures v_pitt_kent_osu

We’ve run 9 hackathons over the past two years.

We will run 7 or 8 this year.

Page 26: Genomic futures v_pitt_kent_osu

Matching Expressed Variants in-memory; Analyzing with Graphs

https://f1000research.com/articles/5-674/v1

Page 27: Genomic futures v_pitt_kent_osu

phenvar.colorado.edu

Page 28: Genomic futures v_pitt_kent_osu

Variants are Often Pleiotropic

Page 29: Genomic futures v_pitt_kent_osu

Translation to the Clinic

Page 30: Genomic futures v_pitt_kent_osu

Data Science Training!

Page 31: Genomic futures v_pitt_kent_osu

Carpentries, MOOCs, Semi-Traditional Coursework and Mentoring

Page 32: Genomic futures v_pitt_kent_osu

NCBI Webinars

Page 33: Genomic futures v_pitt_kent_osu

Viral Genomes

Page 34: Genomic futures v_pitt_kent_osu

Virus Variation

Page 35: Genomic futures v_pitt_kent_osu

Virus Variation

Page 36: Genomic futures v_pitt_kent_osu

Virus Variation

Subscribe!

Page 37: Genomic futures v_pitt_kent_osu

Virus Variation

Page 38: Genomic futures v_pitt_kent_osu

EMRs and NLP

Page 39: Genomic futures v_pitt_kent_osu

Food Borne Pathogens

Page 40: Genomic futures v_pitt_kent_osu

Food Borne Pathogens

Page 41: Genomic futures v_pitt_kent_osu

Food Borne Pathogens

• Escherichia and Shigella
• Campylobacter
• Acinetobacter
• Salmonella
• Klebsiella
• Listeria

Page 42: Genomic futures v_pitt_kent_osu

Investigation of NGS: MagicBLAST!

Page 43: Genomic futures v_pitt_kent_osu

Extracting Pathogenic Information from Metagenomes

Page 44: Genomic futures v_pitt_kent_osu

Popular Metagenomics Tools!

• QIIME
• mothur
• Nephele
• MetAMOS
• Metaviz
• mash

Page 45: Genomic futures v_pitt_kent_osu

Investigation of NGS (esp. metagenomes): SRA BLAST!

Another example – Cas9

Page 46: Genomic futures v_pitt_kent_osu

Immunogenic Peptides

Page 47: Genomic futures v_pitt_kent_osu

Where to Get More Information!

Page 48: Genomic futures v_pitt_kent_osu

Upcoming Hackathons

• March 20-22, NIH Campus

• May 22-24, UC BioFrontiers Institute Boulder CO

• June 19-21, NYGC

• August 17-19, NIH Campus

• September 25-27, Pittsburgh, PA

• October 11-13, Microbial and Metagenomics, NCBI (with ASM and CDC)

Page 49: Genomic futures v_pitt_kent_osu

Intensive Internships for Grad Students, Postdocs and Clinicians at NCBI

Come work at NCBI for 4-6 weeks!

Email [email protected] for more information!

Page 50: Genomic futures v_pitt_kent_osu

My View of Data Transfer Principles

• Metadata Search
    • Rapid NoSQL (for now)
    • Integration
    • Non-ambiguous identifiers
• Transferring Small amounts of Data
    • Data still gets transferred in the cloud
    • Underlying structure
    • Finding specific data from validated formats
• Democratization of Data
    • Rapid comparison by domain experts
• Reporting
    • Metrics to report data upload and [unique IP] download of datasets
    • Post-publication User Review
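
As one way to picture the "[unique IP] download" metric in the Reporting item above, the sketch below counts distinct client addresses per dataset. The record layout (a dataset identifier paired with an IP string) and the sample identifiers are assumptions made for illustration, not a format described in the talk.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def unique_ip_downloads(records: Iterable[Tuple[str, str]]) -> Dict[str, int]:
    """Count distinct downloading IP addresses for each dataset.

    `records` is a hypothetical stream of (dataset_id, client_ip) pairs,
    one pair per download event.
    """
    ips_by_dataset = defaultdict(set)
    for dataset_id, client_ip in records:
        # A set keeps each address once, so repeat downloads are not recounted.
        ips_by_dataset[dataset_id].add(client_ip)
    return {dataset: len(ips) for dataset, ips in ips_by_dataset.items()}


# Made-up download log using documentation-reserved IP ranges.
log = [
    ("DS1", "192.0.2.1"),
    ("DS1", "192.0.2.1"),
    ("DS1", "192.0.2.7"),
    ("DS2", "198.51.100.4"),
]
counts = unique_ip_downloads(log)
print(counts)
```

Counting unique addresses rather than raw requests keeps a single client that retries a transfer from inflating the figure, at the cost of undercounting users who share an address.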

Page 51: Genomic futures v_pitt_kent_osu

Websearch!

Page 52: Genomic futures v_pitt_kent_osu


EDirect (Search API) Cookbook
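
EDirect is a command-line front end to the NCBI Entrez E-utilities, so a search can also be expressed as a plain E-utilities URL. The sketch below only builds an ESearch request URL and does not contact NCBI; the database and query term are illustrative choices, not examples taken from the cookbook on this slide.

```python
from urllib.parse import urlencode

# Base address of the NCBI Entrez E-utilities.
EUTILS_BASE = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils"


def esearch_url(db: str, term: str, retmax: int = 20) -> str:
    """Build an ESearch URL for an Entrez database and query term."""
    params = {
        "db": db,            # Entrez database name, e.g. "sra" or "pubmed"
        "term": term,        # Entrez query string
        "retmax": retmax,    # maximum number of IDs to return
        "retmode": "json",   # ask for a JSON response
    }
    return f"{EUTILS_BASE}/esearch.fcgi?{urlencode(params)}"


# Illustrative query: SRA records for Zika virus.
url = esearch_url("sra", "Zika virus[Organism]")
print(url)
```

The resulting URL can be fetched with any HTTP client; for anything beyond occasional use, NCBI asks callers to respect its request-rate limits.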

Page 53: Genomic futures v_pitt_kent_osu


New APIs!

Page 54: Genomic futures v_pitt_kent_osu

Variable Storage (and collaboration)!

Page 55: Genomic futures v_pitt_kent_osu

Federated Datasets

Page 56: Genomic futures v_pitt_kent_osu