metagenomes of native and electrode-enriched … · metagenomes of native and electrode-enriched...

1
Metagenomes of native and electrode-enriched microbial communities from the Soudan Iron Mine Jonathan P. Badalamenti and Daniel R. Bond Department of Microbiology and BioTechnology Institute, University of Minnesota - Twin Cities, Saint Paul, Minnesota, USA Twitter: @JonBadalamenti @wanderingbond Approach - compare metagenomes from native and electrode-enriched deep subsurface microbial communities Summary Results - long reads recover a closed genome from mine enrichements on electrodes Acknowledgments Conclusions Results - native Soudan metagenomes Introduction Soudan Underground Iron Mine Genome Comparison of Metal-Reducing Deltaproteobacteria collect Soudan brine inoculate electrode biodreactors enriched unenriched assembled metagenomes enrich +0.24 V 20° C de novo assembly IDBA_UD de novo assembly IDBA_UD hybrid assembly long read assembly HGAP N4 binning harvest cells from electrodes extract DNA extract DNA read trimming and filtering reconstruct complete genome(s) PacBio RS II long reads Illumina HiSeq short reads time (d) current (μA/cm 2 ) 0 0 10 10 20 20 30 30 40 N4 binning TFF filtrate retentate return 0.1 μm 3 μmprefilter borehole brine collection bottle Despite apparent carbon limitation, anoxic deep subsurface brines at the Soudan Underground Iron Mine harbor active microbial communities . To characterize these assemblages, we performed shotgun metagenomics of native and enriched samples. Follwing enrichment on poised electrodes and long read sequencing, we recovered from the metagenome the closed, circular genome of a novel Desulfuromonas sp. with remarkable genomic features that were not fully resolved by short read assem- bly alone. This organism was essentially absent in unenriched Soudan communities, indicating that electrodes are highly selective for putative metal reducers. Native community metagenomes suggest that carbon cycling is driven by methyl-C 1 me- tabolism, in particular methylotrophic methanogenesis. Our results highlight the promising potential for long reads in metagenomic surveys of low-diversity environ- ments. 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Mb 1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 2 Mb 2.1 2.2 2.3 2.4 2.5 2.6 2.7 2.8 2.9 3 Mb 3.1 3.2 3.3 3.4 3.5 3.6 3.7 3.8 3.9 DBIWA_3000 DBIWA_3008 DBIWA_3009 DBIWA_3010 rRNA 2 rRNA 1 CRISPR2 CRISPR1 hgcAB imcH cbcL Ca. Desulfuromonas biwabikus DDH964’ complete genome 3,924,648 bp circular 62.23% G+C 3,633 CDS 54 tRNA genome features glyoxlyate shunt nitrate respiration degradation of aromatics phosphonate transport evidence for conjugative mobule element transfer (tra genes) and chromosomal integration heavy metal resistance; mercury resistance and methylation c-type cytochromes genes on + strand phage DNA/transposons rRNA repeat sequences >500 bp repeat sequences >2,000 bp short read assembled contigs mapped reads > 6.5 kbp mapped long read coverage genes on strand 3536800 3538400 3540000 malate synthase aceB isoctrate lyase icl DBIWA_3278 DBIWA_3279 read length (kbp) Mbp > read length # of subreads 10 500 1000 1500 2000 2500 250 200 150 100 50 20 30 datasets Alpha- proteobacteria Firmicutes Alphaproteobacteria Gammaproteobacteria Gamma- proteobacteria other Unclassified Bacteria Euryarchaeota other Delta- proteobacteria IDBA_UD SPAdes SPAdes hybrid PBcR hybrid HGAP contigs 2820 3816 3338 581 132 total length 16,849,449 15,149,004 15,417,681 5,660,852 4,451,391 N 50 38,339 16,604 25,567 58,773 3,932,815 L 50 107 187 80 27 1 assembly comparison - electrode enrichment adding PacBio long reads improves metagenomic assembly PBcR SPAdes with Illumina only SPAdes hybrid IDBA_UD 800 18 15 12 9 6 3 0 1600 4000 3200 2400 contigs cumulative length (Mbp) short reads only long reads only short and long reads enriched on electrodes unenriched PacBio subread filtering pangenome all genes core genome all genes genomes gene clusters 4000 8000 12000 1 3 5 7 9 11 13 15 17 19 21 23 25 genomes gene clusters 1000 2000 3000 4000 1 3 5 7 9 11 13 15 17 19 21 23 25 568 genes 0.08 Geobacter sp. M21 Pelobacter carbinolicus Geobacter argillaceus Geopsychrobacter electrodophilus Geobacter sp. GSS01 Geobacter sp. M18 Geobacter daltonii FRC-32 Geobacter pickeringii G13 Geoalkalibacter ferrihydriticus Z-0531 Desulfuromonas acetoxidans Pelobacter propionicus Geoalkalibacter subterraneus Red1 Geobacter uraniireducens Rf4 Desulfuromonas sp. TF Geobacter metallireducens Geobacter bremensis Geobacter sp. OR-1 Desulfuvibrio vulgaris Hildenborough Pelobacter seleniigenes Desulfuromusa kysingii Ca. Geobacter sulfurreducens PCA Desulfuromonas subbituminosa Geobacter bemidjiensis Desulfuromonas thiophila Geobacter lovleyi Ca. 86 22 49 28 34 51 41 68 42 71 59 58 58 76 70 58 65 61 55 2 6 31 38 24 outgroup-rooted tree based on alignment of concatenated set of 40 conserved single-copy genes generated using Phylosift (v. 1.0.1) pan- and core genome analysis of avail- able Geobacter/Desulfuromonas clade genomes, with cytochrome plots re- ported using default clustering param- eters in get_homologues (3-6-2015 re- lease) 86 multiheme* c-type cytochrome count 86 multiheme* c-type cytochrome count *3 or more heme- binding motifs into carbon cycling the deep terrestrial biosphere low-complexity microbial communities which drive fundamental biogeochemical cycles, including redox transformations of metals - dance of putative metal reducers reducers on insoluble metal oxides as electron acceptors, such as variability in crystallinity, redox potential, and adsorption of other compounds microbes in natural communities, but at the expense of assembly contiguity There is tremendous potential for long reads in improving metagenomic assem- blies and downstream phylogenetic, bioinformatic, and biochemical predic- tions Family-level lineages, particularly among Firmicutes Methanolobus was the only Archaeon observed in unenriched metagenomes, suggesting active methyl-C 1 metabolism in situ - ronments where they exist at extremely low relative abundance However, for unenriched metagenomic datasets, additional long read cov- erage is required to address low abundance (<5%) community members Duluth Int’l Falls Bemidji ONTARIO LAKE SUPERI OR M I N N E S O T A 40 km CANADA UNITED STATES mine transect massive veins of hematite embedded in an Archaean (2.7 Gya) banded iron formation dry mine reduced metals, suggesting active microbial metabolism Soudan is actively maintained as a Minne- sota state park, allowing year-round access to the deep terrestrial biosphere We thank the Minnesota Supercomputing Institute (MSI) for developing, implementing, and main- taining the PacBio SMRT Analysis suite, and the Marine Biological Laboratory (MBL) for Illumina se- quencing under a seed grant from the Deep Carbon Observatory Census of Deep Life. Long read data was generously provided by Pacific Biosciences and we thank Karl Oles (Mayo Clinic Bioinformatics Core) for sample preparation. We also thank Chris O’Brien (Pall Life Sciences) for guidance with TFF DEEP CARBON OBSERVATORY BIOINFORMATICS CORE 0 30 60 multiheme* c-type cytochromes 20 60 100 multiheme* c-type cytochromes DDH 932 DDH 944 DDH 951 datasets natural, unenriched Soudan Mine communities unclassified Clostridiales unclassified Clostridiales unclassified Clostridiales Desulfitibacter Halothiobacillus Halothiobacillus Marinobacter Marinobacter Marinobacter Desulfopila Dehalobacter Halanaerobium Halocella Halocella Halocella Desulfosporosinus Dehalobacter Dehalobacter unclassified Halanaerobiaceae unclassified Rhodobacteraceae unclassified Proteobacteria unclassified Rhodobacteraceae unclassified Rhodobacterales unclassified Bacteroidales unclassified Bacteroidales unclassified Alteromonadales unclassified Peptococcaceae unclassified Peptococcaceae unclassified Peptococcaceae Methanolobus Methanolobus Methanolobus Geobacter Sphaerochaeta Rhodovulum Rhodovulum Demequina Demequina Roseovarius inner rings novel lineages Firmicutes Firmicutes Bacteroidetes Alphaproteobacteria Gammaproteobacteria Deltaproteobacteria Euryarchaeota unclassified Proteobacteria Order level Class level Family level deeper/ more reduced shallower/ more oxidized samples collected along redox gradient

Upload: lamkhanh

Post on 25-Aug-2018

232 views

Category:

Documents


0 download

TRANSCRIPT

Metagenomes of native and electrode-enriched microbial communities from the Soudan Iron MineJonathan P. Badalamenti and Daniel R. BondDepartment of Microbiology and BioTechnology Institute, University of Minnesota - Twin Cities, Saint Paul, Minnesota, USATwitter: @JonBadalamenti @wanderingbond

Approach - compare metagenomes from native and electrode-enriched deep subsurface microbial communitiesSummary

Results - long reads recover a closed genome from mine enrichements on electrodes

AcknowledgmentsConclusions

Results - native Soudan metagenomes

Introduction

Soudan Underground Iron Mine

Genome Comparison of Metal-Reducing Deltaproteobacteria

collect Soudan brine

inoculate electrode

biodreactors

enriched

unenrichedassembled metagenomes

enrich+0.24 V

20° C

de novo assembly

IDBA_UD

de novo assembly

IDBA_UD

hybridassembly

long readassembly

HGAP

N4 binning

harvest cellsfrom

electrodes

extract DNA

extractDNA

read trimming and filtering

reconstruct complete genome(s)

PacBio RS IIlong reads

Illumina HiSeqshort readstime (d)

curr

ent (µA

/cm

2 )

00

10

10

20

20

30

30

40

N4 binning

TFF

filtrate

retentatereturn

0.1 µm

3 µm prefilter

boreholebrine

collection bottle

Despite apparent carbon limitation, anoxic deep subsurface brines at the Soudan Underground Iron Mine harbor active microbial communities . To characterize these assemblages, we performed shotgun metagenomics of native and enriched samples. Follwing enrichment on poised electrodes and long read sequencing, we recovered from the metagenome the closed, circular genome of a novel Desulfuromonas sp. with remarkable genomic features that were not fully resolved by short read assem-bly alone. This organism was essentially absent in unenriched Soudan communities, indicating that electrodes are highly selective for putative metal reducers. Native community metagenomes suggest that carbon cycling is driven by methyl-C1 me-tabolism, in particular methylotrophic methanogenesis. Our results highlight the promising potential for long reads in metagenomic surveys of low-diversity environ-ments.

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1 Mb

1.1

1.2

1.3

1.4

1.5

1.6

1.7

1.8

1.9

2 Mb

2.12.22.

3

2.4

2.5

2.6

2.7

2.8

2.9

3 Mb

3.1

3.2

3.3

3.4

3.5

3.6

3.7

3.8

3.9

DBIWA_3000

DBIWA_3008

DBIWA_3009

DBIWA_3010

rRNA 2

rRNA 1

CRISPR2

CRISPR1

hgcAB

imcH

cbcL

‘Ca. Desulfuromonas biwabikus DDH964’complete genome 3,924,648 bp circular62.23% G+C3,633 CDS54 tRNA

genome featuresglyoxlyate shunt nitrate respirationdegradation of aromaticsphosphonate transportevidence for conjugative mobule element transfer (tra genes) and chromosomal integrationheavy metal resistance; mercury resistance and methylation

c-type cytochromes

genes on + strand

phage DNA/transposonsrRNArepeat sequences >500 bprepeat sequences >2,000 bp

short read assembled contigsmapped reads > 6.5 kbpmapped long read coverage

genes on − strand

3536800 3538400 3540000

malate synthase aceB isoctrate lyase iclDBIWA_3278 DBIWA_3279

read length (kbp)M

bp >

read

leng

th

# of

subr

eads

10

500

1000

1500

2000

2500 250

200

150

100

50

20 30

datasets

Alpha-proteobacteria

Firmicutes

AlphaproteobacteriaGammaproteobacteria

Gamma-proteobacteria

other

UnclassifiedBacteriaEuryarchaeotaother

Delta-proteobacteria

IDBA_UD SPAdes SPAdes hybrid PBcR hybrid HGAPcontigs 2820 3816 3338 581 132total length 16,849,449 15,149,004 15,417,681 5,660,852 4,451,391N50 38,339 16,604 25,567 58,773 3,932,815L50 107 187 80 27 1

assembly comparison - electrode enrichment

adding PacBio long reads improves metagenomic assembly

PBcR

SPAdes withIllumina only

SPAdes hybridIDBA_UD

800

18

15

12

9

6

3

0 1600 400032002400contigs

cum

ulat

ive

leng

th (M

bp)

short reads onlylong reads

onlyshort and long reads

enriched on electrodes

unenriched

PacBio subread filtering

pangenomeall genes

core genomeall genes

genomes

gene

clus

ters

●●

●●

●●

●● ●

● ●●

●●

●●

●●

●●

●●

●●

●●

●●

● ●

●●

●●

●●

●●

● ●●

●●

●●

●● ●

●●

●●

●●

●●

●●

● ●●

● ●●

●●

●●

●●

●●

●●

●●

●●

●● ●

●●

●●

●●

●●

●●

●● ● ●

● ●

●●

●●

●●

●●

●● ●

●●

●● ●

●●

●●

● ●

●●

●●

●●

●●

● ● ●● ● ●

●●

●●

●●

● ●●

●●

●●

●●

●● ●

●●

●●

●●

●●

●●

●●

●● ●

●●

4000

8000

1200

0

1 3 5 7 9 11 13 15 17 19 21 23 25genomes

gene

clus

ters

● ●

●●

● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

● ●● ● ● ●

● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

●●

● ● ●●

●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

●●

●● ● ● ● ● ● ● ● ● ●

● ● ● ● ● ● ● ● ● ● ●

●●

● ●● ● ● ● ●

●● ● ● ● ● ● ● ● ● ● ● ● ● ●

●● ● ●

● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

●●

● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

●● ● ● ● ● ● ● ● ●

● ● ● ● ● ● ● ● ● ● ● ● ●

●● ●

● ● ● ●● ● ●

● ● ● ● ● ● ● ● ● ● ● ● ● ●

●●

● ●

●●

● ● ●● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

1000

2000

3000

4000

1 3 5 7 9 11 13 15 17 19 21 23 25

568genes

0.08

Geoba

cter s

p. M

21

Pelob

acte

r car

binoli

cus

Geobacter argillaceus

Geopsychrobacter electrodophilus

Geobacter sp. GSS01Geob

acte

r sp.

M18

Geobacter daltonii FRC-32

Geob

acte

r pick

erin

gii G

13

Geoalkalibacter ferrihydriticus Z-0531

Desulfuromonas acetoxidans

Pelobacter propionicus

Geoalkalibacter subterraneus Red1

Geobacter uraniireducens Rf4

Desulfuromonas sp. TF

Geobacter metallireducens

Geobacter bremensisGeobacter sp. OR-1

Desulfuvibrio vulgaris

Hildenborough

Pelo

bact

er se

leniig

enes

Desulfuromusa kysingii

‘Ca.

Geobacter sulfurreducens PCA

Desulfu

romon

as su

bbitu

minosa

Geobacter bemidjiensis

Desulfuromonas thiophila

Geobacter lovleyi

‘Ca.

86

22

49

28

34

51

41

68

42

71

5958

58

7670

58

65

61

55

2

631

3824 outgroup-rooted tree based on

alignment of concatenated set of 40 conserved single-copy genes generated using Phylosift (v. 1.0.1)

pan- and core genome analysis of avail-able Geobacter/Desulfuromonas clade genomes, with cytochrome plots re-ported using default clustering param-eters in get_homologues (3-6-2015 re-lease)

86multiheme* c-typecytochrome count

86multiheme* c-typecytochrome count

*3 or more heme-binding motifs

into carbon cycling the deep terrestrial biospherelow-complexity microbial communities which drive

fundamental biogeochemical cycles, including redox transformations of metals-

dance of putative metal reducers

reducers on insoluble metal oxides as electron acceptors, such as variability in crystallinity, redox potential, and adsorption of other compounds

microbes in natural communities, but at the expense of assembly contiguityThere is tremendous potential for long reads in improving metagenomic assem-blies and downstream phylogenetic, bioinformatic, and biochemical predic-tions

Family-level lineages, particularly among FirmicutesMethanolobus was the only Archaeon observed in

unenriched metagenomes, suggesting active methyl-C1 metabolism in situ-

ronments where they exist at extremely low relative abundance

However, for unenriched metagenomic datasets, additional long read cov-erage is required to address low abundance (<5%) community members

Duluth

Int’l Falls

Bemidji

O N T A R I O

L A K E S U P E R I O R

MI N

N E S O T A

40 km

C A N A D A

U N I T E D S T A T E S

mine transect massive veins of hematite embedded in an Archaean (2.7 Gya) banded iron formation

dry mine

reduced metals, suggesting active microbial metabolismSoudan is actively maintained as a Minne-sota state park, allowing year-round access to the deep terrestrial biosphere

We thank the Minnesota Supercomputing Institute (MSI) for developing, implementing, and main-taining the PacBio SMRT Analysis suite, and the Marine Biological Laboratory (MBL) for Illumina se-quencing under a seed grant from the Deep Carbon Observatory Census of Deep Life. Long read data was generously provided by Pacific Biosciences and we thank Karl Oles (Mayo Clinic Bioinformatics Core) for sample preparation. We also thank Chris O’Brien (Pall Life Sciences) for guidance with TFF

DEEP CARBONOBSERVATORY BIOINFORMATICS CORE

●●

● ● ● ● ● ● ● ●● ● ● ● ● ● ● ● ● ● ●

● ●●

● ●●

●●

●●

●● ● ● ● ●

●● ● ● ● ●

● ● ● ●● ● ● ● ● ● ● ● ● ●

● ● ● ● ● ● ●

● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

●●

● ●

● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

●●

●●

● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●

● ● ●

● ●●

● ● ● ● ● ● ● ● ● ● ● ● ● ●

●●

● ● ●● ●

●●

● ● ● ● ● ● ● ● ● ● ● ● ●

● ●● ● ● ●

● ● ● ● ● ●● ● ● ● ● ● ● ● ● ●

●●

● ●●

● ●● ● ● ● ● ●

●● ● ● ● ● ● ● ●0

3060

multiheme* c-typecytochromes

●●

● ●

●●

● ●● ●

● ● ●● ● ● ● ● ● ● ● ●

●● ● ● ● ● ● ● ● ●

● ● ● ● ● ● ● ● ●● ●

●● ●

● ● ● ●● ● ●

●● ● ●

● ● ● ● ● ● ● ●

●●

● ●

● ● ●● ●

● ● ● ●● ● ●

● ● ● ● ● ●

●●

● ● ● ● ●●

● ● ●● ●

● ● ● ● ● ● ● ●● ●

● ● ●

● ● ● ●● ●

●● ● ● ● ● ● ● ● ● ● ●

●● ● ● ● ● ● ● ● ●

● ●● ● ● ● ● ●

● ● ● ●

●● ●

●● ●

● ● ●● ●

● ● ● ● ● ● ● ● ● ● ●

● ● ● ● ●●

●● ●

● ●● ● ● ● ●

● ● ● ●

●● ●

●●

●● ●

●● ●

● ● ● ● ● ● ● ● ● ● ●

2060

100

multiheme* c-typecytochromes

DDH932

DDH944

DDH951

datasetsnatural, unenriched Soudan Mine communities

unclassifiedClostridiales

unclassifiedClostridiales

unclassifiedClostridiales

Desulfitibacter

Halothiobacillus Halothiobacillus

Marinobacter

Marinobacter

Marinobacter

DesulfopilaDehalobacterHalanaerobium

Halocella

Halocella HalocellaDesulfosporosinusDehalobacter Dehalobacter

unclassifiedHalanaerobiaceae

unclassifiedRhodobacteraceae

unclassifiedProteobacteria

unclassifiedRhodobacteraceae

unclassifiedRhodobacterales

unclassifiedBacteroidales

unclassifiedBacteroidales

unclassifiedAlteromonadales

unclassifiedPeptococcaceae

unclassifiedPeptococcaceae

unclassifiedPeptococcaceae

Methanolobus

Methanolobus

Methanolobus

GeobacterSphaerochaeta

Rhodovulum Rhodovulum

DemequinaDemequina

Roseovarius

inner rings novel lineagesFirmicutesFirmicutesBacteroidetes

AlphaproteobacteriaGammaproteobacteria

DeltaproteobacteriaEuryarchaeota

unclassifiedProteobacteria

Order levelClass level

Family level

deeper/more reduced

shallower/more oxidized samples collected along redox gradient