genomic futures v2
TRANSCRIPT
![Page 1: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/1.jpg)
Ben Busby, Ph.D.Lead, Bioinformatics Training, NCBI
Chair, Department of Bioinformatics and Data Science, [email protected]
Exploring the Many Possible Futures of Genomics
Efficiently Leveraging Commercial and Open Source Bioinformatics Tools for Clinical Interventions and Research Discoveries from Very Large Datasets
![Page 2: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/2.jpg)
Please note that all views are my own and not necessarily those of any Federal agency. No mention of any commercial or non-
profit entity should be considered an endorsement.
Exploring the Many Possible Futures of Genomics
• Human Genomic Variants – Chronic Disease and Cancer• Viruses – Zika• Bacteria – Food Borne Pathogens• Data Transfer and Storage
![Page 3: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/3.jpg)
NCBI
![Page 4: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/4.jpg)
Review of terminology and conceptsNext Generation Sequencing
Graphic Credit: Spencer Martin, UBC
![Page 5: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/5.jpg)
Review of terminology and conceptsHow Genomes are Mapped and Assembled
© Martine Zilversmit 2013
![Page 6: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/6.jpg)
http://1.usa.gov/1J1xmYs
NCBI NGS Online Workshop – Available on the NCBI YouTube Channel!
Review of terminology and conceptsHow Genomes are Mapped and Assembled
![Page 7: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/7.jpg)
![Page 8: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/8.jpg)
dbGaP
![Page 9: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/9.jpg)
dbGaP
2007 2008 2009 2010 2011 2012 2013 2014 2015
14,20153,216
139,311
374,464
485,727
566,181
660,665
876,849
1,002,935
Subjects
![Page 10: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/10.jpg)
![Page 11: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/11.jpg)
dbGaP – GWAS and PheGenI
![Page 12: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/12.jpg)
dbGaP – GWAS and PheGenI
![Page 13: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/13.jpg)
dbGaP – ClinVar
![Page 14: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/14.jpg)
ClinVar
![Page 15: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/15.jpg)
ClinVar
![Page 16: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/16.jpg)
ClinVar – Why Should we Care?
![Page 17: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/17.jpg)
ClinVar – Why Should we Care?
![Page 18: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/18.jpg)
ClinVar – Why Should we Care?
![Page 19: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/19.jpg)
ClinVar – Why Should we Care?
![Page 20: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/20.jpg)
ClinVar – Why Should we Care?
![Page 21: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/21.jpg)
Translation to the Clinic
![Page 22: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/22.jpg)
Combined score is the average of SVs, mappability, GC..
NCBI region list
Encode blacklist
DangerTrack!
![Page 23: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/23.jpg)
Genome in a Bottle
![Page 24: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/24.jpg)
Combined score is the average of SVs, mappability, GC..
NCBI region list
Encode blacklist
DangerTrack!
![Page 25: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/25.jpg)
We’ve run 9 hackathons
over the past two years.
We will run
7 or 8 this year
![Page 26: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/26.jpg)
Matching Expressed Variants in-memory; Analyzing with Graphs
https://f1000research.com/articles/5-674/v1
![Page 27: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/27.jpg)
phenvar.colorado.edu
![Page 28: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/28.jpg)
Variants are Often Pleiotropic
![Page 29: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/29.jpg)
Translation to the Clinic
![Page 30: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/30.jpg)
Data Science Training!
![Page 31: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/31.jpg)
Carpentries, MOOCs, Semi-Traditional Coursework and Mentoring
![Page 32: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/32.jpg)
NCBI Webinars
![Page 33: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/33.jpg)
Viral Genomes
![Page 34: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/34.jpg)
Virus Variation
![Page 35: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/35.jpg)
Virus Variation
![Page 36: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/36.jpg)
Virus Variation
Subscribe!
![Page 37: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/37.jpg)
Virus Variation
![Page 38: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/38.jpg)
EMRs and NLP
![Page 39: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/39.jpg)
Food Borne Pathogens
![Page 40: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/40.jpg)
Food Borne Pathogens
![Page 41: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/41.jpg)
Food Borne Pathogens
• Escherichia and Shigella• Campylobacter• Acinetobacter• Salmonella• Klebsiella• Listeria
![Page 42: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/42.jpg)
Investigation of NGS:MagicBLAST!
![Page 43: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/43.jpg)
Extracting Pathogenic Information from Metagenomes
![Page 44: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/44.jpg)
44
• Qiime• Mothur
• Nepthele• MetAMOS• MetaViz
• mash
Popular Metagenomics Tools!
![Page 45: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/45.jpg)
Investigation of NGS (esp metagenomes):SRA BLAST!
Another example – Cas9
![Page 46: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/46.jpg)
Immunogenic Peptides
![Page 47: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/47.jpg)
Where to Get More Information!
![Page 48: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/48.jpg)
Upcoming Hackathons
• March 20-22, NIH Campus
• May 22-24, UC BioFrontiers Institute Boulder CO
• June 19-21, NYGC
• August 17-19, NIH Campus
• September 25-27, Pittsburgh, PA
• October 11-13, Microbial and Metagenomics, NCBI (with ASM and CDC)
![Page 49: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/49.jpg)
Intensive Internships for Grad Students, Postdocs and Clinicians at NCBI
Come work at NCBI for 4-6 weeks!
Email [email protected]
for more information!
![Page 50: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/50.jpg)
My View of Data Transfer Principles• Metadata Search
• Rapid NoSQL (for now)• Integration• Non-ambiguous identifiers
• Transferring Small amounts of Data• Data still gets transferred in the cloud• Underlying structure• Finding specific data from validated formats
• Democratization of Data• Rapid comparison by domain experts
• Reporting• Metrics to report data upload and [unique IP] download of datasets• Post-publication User Review
![Page 51: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/51.jpg)
Websearch!
![Page 52: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/52.jpg)
52
EDirect (Search API) Cookbook
![Page 53: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/53.jpg)
53
New APIs!
![Page 54: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/54.jpg)
Variable Storage (and collaboration)!
![Page 55: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/55.jpg)
Federated Datasets
![Page 56: Genomic Futures v2](https://reader035.vdocuments.net/reader035/viewer/2022062822/58b894b41a28ab3e3a8b6617/html5/thumbnails/56.jpg)