clinical grade next-generation sequencing of um

19
Clinical grade next- generation sequencing of UM Elisha Roberson, Ph.D. Depts. of Internal Medicine and Genetics Washington University in St. Louis

Upload: phyre

Post on 23-Feb-2016

30 views

Category:

Documents


0 download

DESCRIPTION

Clinical grade next-generation sequencing of UM. Elisha Roberson, Ph.D. Depts. of Internal Medicine and Genetics Washington University in St. Louis. All talk content can be tweeted and blogged. @thatdnaguy. Sanger ≠ next-gen sequencing. Sanger Consensus of a population of molecules - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Clinical grade next-generation sequencing of UM

Clinical grade next-generation sequencing of UM

Elisha Roberson, Ph.D.Depts. of Internal Medicine and Genetics

Washington University in St. Louis

Page 2: Clinical grade next-generation sequencing of UM

All talk content can be tweeted and blogged

@thatdnaguy

Page 3: Clinical grade next-generation sequencing of UM

Sanger ≠ next-gen sequencingSanger

• Consensus of a population of molecules• ~600-800bp sequence• Very low error rate• Low-throughput• Targeted• Expensive* (256 bp/$)

NGS• Single-molecules• 35 bp – 3+ kb•High error rates (1-15+%)• High-throughput• Can shotgun• Cheap* (11.5 Mbp/$)

*Our lab’s current Sanger & HiSeq2500 costs

Page 4: Clinical grade next-generation sequencing of UM

UM Clinical sequencing• CLIA/CAP– Detect actionable somatic variants• Insurance pre-approval• Submit DNA• Wait a long time• Pay $7000+ / sample

• Research (discovery, epidemiology, etc)– IRB approved DNA collection– You sequence and interpret

Page 5: Clinical grade next-generation sequencing of UM

My sequencing wishlist• Tissue

– Fresh tissue– Large amount of high-quality DNA

• No PCR amplification

• Sequencing (whole genome)– Low-error, long reads– Paired-end– 30X or greater germline coverage– 60X or greater tumor coverage

• Bioinformatics– De novo assembly of both germline and tumor, compare to

reference & each other– Genotyping algorithm that is pair-aware

Page 6: Clinical grade next-generation sequencing of UM

NGS technologies

• ABI – SOLiD– Emulsion PCR by di-base ligation

• Illumina – Solexa– Single-molecule fluor sequencing

• Life Tech - Ion Torrent– Single-molecule semiconductor sequencing

• PacBio – SMRT– Zero-mode wave guide with single polymerase in-well

Page 7: Clinical grade next-generation sequencing of UM

Where should I get DNA???

• Germline–NO blood• Sequencing single molecules!!!

– Spit kits, washed skin biopsies

• Tumor tissue– Fresh primary tumor– Fresh metastatic tumor– FFPE?

Page 8: Clinical grade next-generation sequencing of UM

Calculating sequencing depth

• NGS technology dependent• How deeply do you want to see mosaicism?

• General guidelines– 30X germline– 60X or higher tumor

• BUT sequencing follows Poisson distribution– i.e. 30X average coverage != all targets 30X

Page 9: Clinical grade next-generation sequencing of UM

% Target covered ≥30X at different average coverage

Page 10: Clinical grade next-generation sequencing of UM

Coverage also varies by tissue*

*Plot available on Figshare

Page 11: Clinical grade next-generation sequencing of UM

FASTQ Preprocessing• Demultiplex samples

– Discard no index

• Convert to PHRED quality scale– -10 x log10( probability of base error )

• Remove adapter contamination– cutadapt

• Trim low-quality trailing bases and 3’ Ns– No 5’ trimming!!!

• Run FASTQC!

Page 12: Clinical grade next-generation sequencing of UM

Alignment to reference

• Current human is GRCh37

• Repeat masking– Hard mask repeats are N– Soft mask repeats are lowercase– Prefer soft masking

• Ref aligners have generally low memory use– Mostly use Burrows-Wheeler transform– bowtie 1 & 2– bwa-aln & bwa-mem– Novoalign (high memory)

Page 13: Clinical grade next-generation sequencing of UM

De novo assembly• Most De Bruijn graphs with kmers of

sequence

• Mostly very high memory usage– Depends on depth and number of kmers– Try running diginorm first (C. Titus Brown)

• Aligners– ABySS– MIRA– SOAPdenovo– Velvet

Page 14: Clinical grade next-generation sequencing of UM

Post-processing• Picard tools– Convert to BAM format– Add read-group tags–Mark duplicates (Picard tools)

• Genome Analysis ToolKit (GATK)– Local realignment of indels– Base quality score recalibration

Page 15: Clinical grade next-generation sequencing of UM

Genotyping• General– Genome analysis toolkit• Unified Genotyper

– Samtools• Mpileup

• Somatic specific–MuTect– Somatic Sniper– VirMiD

Page 16: Clinical grade next-generation sequencing of UM

Variant filtering strategies –sequential evolution

Initiating event

Mosaic primary

Mutation with metastatic advantageleaves the eye

Metastasis has primary mutations &Metastatic mutation (maybe in primary?) &New mutations

Page 17: Clinical grade next-generation sequencing of UM

Variant filtering strategies –parallel evolution

Initiating event

Mosaic primary

Metastasis has few primary mutations &Metastatic mutation (not in primary) &New mutations

Very little overlapin mutations between

primary and met!

Page 18: Clinical grade next-generation sequencing of UM

Variant confirmation

• Sanger sequencing

• Fluidigm

• TaqMan– castPCR

• Sequenom

• Illumina Golden Gate

Discover in a focused set with sequencingType with these technologies in everything

Page 19: Clinical grade next-generation sequencing of UM

Many thanks to