chemistry & biology perspective - core · phosphopantetheinyl transferases involved in...

12
Chemistry & Biology Perspective Metagenomic Approaches for Exploiting Uncultivated Bacteria as a Resource for Novel Biosynthetic Enzymology Micheal C. Wilson 1 and Jo ¨ rn Piel 1, * 1 Institute for Microbiology, Eidgeno ¨ ssische Technische Hochschule (ETH) Zurich, Wolfgang-Pauli-Strasse 10, 8093 Zurich, Switzerland *Correspondence: [email protected] http://dx.doi.org/10.1016/j.chembiol.2013.04.011 Most biologically active microbial natural products are known from strains that can be isolated and cultivated in the laboratory. However, the genomics era has revealed that cultured bacteria represent a mere fraction of total estimated bacterial biodiversity. With the development of community genomics, termed metagenomics, the uncultivated majority became accessible for functional analysis. Through metagenomic studies, novel biocatalysts and biosynthetic pathways are being discovered at a pace previously not possible using tradi- tional molecular biology techniques. Additionally, the study of uncultivated bacteria has provided valuable insights into previously overlooked biocatalysts from cultured strains. This perspective highlights recent dis- coveries from metagenomics of uncultivated bacteria and discusses the impact of those findings on the field of natural products. Introduction With the advent of cultivation-independent methods for study- ing microbial diversity (e.g., 16S ribosomal RNA phylogeny), we now know that bacteria inhabit nearly every ecological niche imaginable: from the deepest oceans to arid deserts to the gut of nearly every multicellular organism. They can exist as free- living organisms or form symbiotic or pathogenic relationships with eukaryotic hosts or other prokaryotes. Yet, despite the ubiquity of bacteria and our extensive effort to study them, the vast majority of microbial biodiversity has not been functionally characterized (Wu et al., 2009). This is largely due to current lim- itations in cultivation technologies but also to the sheer number of estimated bacterial taxa. For instance, approximately 70% of all known bacterial phyla do not have a single cultured represen- tative (Achtman and Wagner, 2008). This reality is both exciting and somewhat daunting considering the extremely prosperous relationship between cultivated bacteria and industrial applica- tions. Specialized bacterial strains, their produced proteins, and small molecules are used in industries involved in food pro- cessing, biofuel production, cosmetics, materials chemistry, and pharmaceutical drug discovery and production. In fact, bacteria are likely the best hope for discovery of novel antibi- otics to combat the ever growing prevalence of antibiotic-resis- tant pathogens (Wright, 2012). Moreover, greater than 10,000 biologically active metabolites with therapeutic properties against cancer, HIV, inflammation, and many more human dis- eases are known from cultured actinomycetes alone, most of which are produced by a single genus (Streptomyces)(Be ´ rdy, 2005). Full genome sequencing of cultured bacteria also brings light to the fact that, in most cases, the isolated secondary me- tabolites of even well-studied producers barely scratch the sur- face of the biosynthetic potential encoded within their DNA (Bentley et al., 2002; Udwary et al., 2007). Extrapolating what we know from these prolific cultured strains, we can hypothe- size that the potential for discovery of novel biocatalysts and biosynthetic gene clusters from the uncultured majority is massive. In the last decade or so, the field of metagenomics has proven to be a fruitful means for accessing the biosynthetic machinery of uncultured bacteria, thus circumventing traditional molecular methods that rely on cultivation. Metagenomics is the analysis of DNA from a mixed population of organisms and initially involved the cloning of either total or enriched DNA directly from the environment (eDNA) into a host that can be easily culti- vated (Handelsman, 2004; Miao and Davies, 2009). More recently, advances in next generation sequencing (NGS) tech- nologies allow isolated eDNA to be sequenced and analyzed directly from environmental samples (Shokralla et al., 2012). Us- ing a variety of functional or bioinformatic screening methods, novel biocatalysts and small molecule biosynthetic genes can be identified at an increasing pace. Metagenomics in industrial applications has yielded numerous enzyme biocatalysts used in the production of important commodities, such as biofuels (Xing et al., 2012) and detergents (Lorenz and Eck, 2005). In the field of natural product drug discovery, metabolites or puta- tive pathways for most known and some previously unreported structural families have been isolated from soils (Chang and Brady, 2011; Feng et al., 2011), sponges (Freeman et al., 2012; Piel et al., 2004a), tunicates (Rath et al., 2011; Schmidt et al., 2012), insects (Piel, 2002), and many more ecological habitats using metagenomic methods. In many cases, the challenge still remains to link biosynthetic genes to products; however, several examples of successful heterologous expression or in vitro char- acterization of biosynthetic genes have been reported (Brady et al., 2001; Brady and Clardy, 2005a; Long et al., 2005). For detailed reviews, see Piel (2011), Nikolouli and Mossialos (2012), and Brady et al. (2009). Excitingly, research on unculti- vated bacteria has shown that unprecedented biosynthetic en- zymes and transformations can be found in these organisms at high frequency. In this perspective, we discuss strategies for 636 Chemistry & Biology 20, May 23, 2013 ª2013 Elsevier Ltd All rights reserved

Upload: others

Post on 20-Jul-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Chemistry & Biology

Perspective

Metagenomic Approachesfor Exploiting Uncultivated Bacteriaas a Resource for Novel Biosynthetic Enzymology

Micheal C. Wilson1 and Jorn Piel1,*1Institute for Microbiology, Eidgenossische Technische Hochschule (ETH) Zurich, Wolfgang-Pauli-Strasse 10, 8093 Zurich, Switzerland*Correspondence: [email protected]://dx.doi.org/10.1016/j.chembiol.2013.04.011

Most biologically activemicrobial natural products are known from strains that can be isolated and cultivatedin the laboratory. However, the genomics era has revealed that cultured bacteria represent a mere fraction oftotal estimated bacterial biodiversity. With the development of community genomics, termedmetagenomics,the uncultivated majority became accessible for functional analysis. Through metagenomic studies, novelbiocatalysts and biosynthetic pathways are being discovered at a pace previously not possible using tradi-tional molecular biology techniques. Additionally, the study of uncultivated bacteria has provided valuableinsights into previously overlooked biocatalysts from cultured strains. This perspective highlights recent dis-coveries frommetagenomics of uncultivated bacteria and discusses the impact of those findings on the fieldof natural products.

IntroductionWith the advent of cultivation-independent methods for study-

ing microbial diversity (e.g., 16S ribosomal RNA phylogeny),

we now know that bacteria inhabit nearly every ecological niche

imaginable: from the deepest oceans to arid deserts to the gut

of nearly every multicellular organism. They can exist as free-

living organisms or form symbiotic or pathogenic relationships

with eukaryotic hosts or other prokaryotes. Yet, despite the

ubiquity of bacteria and our extensive effort to study them, the

vast majority of microbial biodiversity has not been functionally

characterized (Wu et al., 2009). This is largely due to current lim-

itations in cultivation technologies but also to the sheer number

of estimated bacterial taxa. For instance, approximately 70% of

all known bacterial phyla do not have a single cultured represen-

tative (Achtman and Wagner, 2008). This reality is both exciting

and somewhat daunting considering the extremely prosperous

relationship between cultivated bacteria and industrial applica-

tions. Specialized bacterial strains, their produced proteins,

and small molecules are used in industries involved in food pro-

cessing, biofuel production, cosmetics, materials chemistry,

and pharmaceutical drug discovery and production. In fact,

bacteria are likely the best hope for discovery of novel antibi-

otics to combat the ever growing prevalence of antibiotic-resis-

tant pathogens (Wright, 2012). Moreover, greater than 10,000

biologically active metabolites with therapeutic properties

against cancer, HIV, inflammation, and many more human dis-

eases are known from cultured actinomycetes alone, most of

which are produced by a single genus (Streptomyces) (Berdy,

2005). Full genome sequencing of cultured bacteria also brings

light to the fact that, in most cases, the isolated secondary me-

tabolites of even well-studied producers barely scratch the sur-

face of the biosynthetic potential encoded within their DNA

(Bentley et al., 2002; Udwary et al., 2007). Extrapolating what

we know from these prolific cultured strains, we can hypothe-

size that the potential for discovery of novel biocatalysts and

636 Chemistry & Biology 20, May 23, 2013 ª2013 Elsevier Ltd All righ

biosynthetic gene clusters from the uncultured majority is

massive.

In the last decade or so, the field of metagenomics has proven

to be a fruitful means for accessing the biosynthetic machinery of

uncultured bacteria, thus circumventing traditional molecular

methods that rely on cultivation. Metagenomics is the analysis

of DNA from a mixed population of organisms and initially

involved the cloning of either total or enriched DNA directly

from the environment (eDNA) into a host that can be easily culti-

vated (Handelsman, 2004; Miao and Davies, 2009). More

recently, advances in next generation sequencing (NGS) tech-

nologies allow isolated eDNA to be sequenced and analyzed

directly from environmental samples (Shokralla et al., 2012). Us-

ing a variety of functional or bioinformatic screening methods,

novel biocatalysts and small molecule biosynthetic genes can

be identified at an increasing pace. Metagenomics in industrial

applications has yielded numerous enzyme biocatalysts used

in the production of important commodities, such as biofuels

(Xing et al., 2012) and detergents (Lorenz and Eck, 2005). In

the field of natural product drug discovery, metabolites or puta-

tive pathways for most known and some previously unreported

structural families have been isolated from soils (Chang and

Brady, 2011; Feng et al., 2011), sponges (Freeman et al., 2012;

Piel et al., 2004a), tunicates (Rath et al., 2011; Schmidt et al.,

2012), insects (Piel, 2002), and many more ecological habitats

using metagenomic methods. In many cases, the challenge still

remains to link biosynthetic genes to products; however, several

examples of successful heterologous expression or in vitro char-

acterization of biosynthetic genes have been reported (Brady

et al., 2001; Brady and Clardy, 2005a; Long et al., 2005). For

detailed reviews, see Piel (2011), Nikolouli and Mossialos

(2012), and Brady et al. (2009). Excitingly, research on unculti-

vated bacteria has shown that unprecedented biosynthetic en-

zymes and transformations can be found in these organisms at

high frequency. In this perspective, we discuss strategies for

ts reserved

Page 2: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Figure 1. General Metagenomic WorkflowGraphical schematic summarizing the stepsinvolved in a typical metagenomic workflow.

Chemistry & Biology

Perspective

accessing the biosynthetic potential of uncultivated bacteria and

highlight case studies of novel biochemistry discovered by these

methods.

Metagenomics: A Powerful Tool for AccessingUncultured BacteriaA powerful tool for investigating the biosynthetic potential of an

uncultivated bacterial population involves the construction of

an eDNA library in a suitable host (Figure 1). This begins with

the isolation of high molecular weight eDNA, for which several

protocols have been developed from a variety of different envi-

ronments (Gurgui and Piel, 2010; Liles et al., 2008; Ouyang

et al., 2010). Enrichment of specific cell populations is common

prior to eDNA isolation (Hildebrand et al., 2004; Jiao et al., 2006;

Lenk et al., 2012). This strategy is often used for samples that

contain mixtures of eu- and prokaryotic cells, such as marine

sponges and other invertebrates containing symbiotic bacteria

(Bewley et al., 1996; Piel et al., 2004a). Each population can

Chemistry & Biology 20, May 23, 2013

then be targeted for analysis separately.

As with genomic library construction of

single organisms, Escherichia coli is the

most common host; however, it is some-

times necessary to use an alternate DNA

recipient for properties such as codon

bias and precursor supply capability

when considering function-based

screening protocols (Brady et al., 2009;

Craig et al., 2009, 2010; Piel, 2011;

Wang et al., 2000). The choice of vector

can also determine the success or failure

of a particular project. Cosmid and fosmid

vectors are the most commonly used for

library construction, because they can

accept fragments from 35–45 kb, can

easily be transfected into E. coli to pro-

duce millions of clones, and are relatively

stable (single copy fosmids are more sta-

ble than cosmids). Larger insert libraries

using bacterial artificial chromosomes

(BACs) have also been successful (Gilles-

pie et al., 2002; Long et al., 2005) but are

less common, due to the challenge of

isolating DNA fragments of sufficient

sizes from eDNA. Hundreds to millions

of clones can be maintained in a 96 or

384 well format, as liquid pools (Rondon

et al., 2000), or in a low-melting-point

agarose three-dimensional (3D) matrix

(Gurgui and Piel, 2010), depending on

the number of clones required to obtain

ample coverage for screening purposes.

Once the library is constructed, it can be

subjected to functional or homology-

based screening protocols.

A common method for identifying discrete biocatalysts or me-

tabolites synthesized by small to medium-sized gene clusters

(i.e., smaller than the vector insert size) is to screenmetagenomic

library clones for modified phenotypes or specific enzymatic

function. This method is prevalent for the identification of indus-

trially relevant enzymes, such as cellulases, lypases, esterases,

and many more (Reyes-Duarte et al., 2012). Phenotypic screens

based on color, zones of inhibition on bacterial (Rondon et al.,

2000) or fungal (Chung et al., 2008) lawns, hemolytic activity on

blood agar plates (Gillespie et al., 2002), or the use of specific re-

porter genes (Guan et al., 2007; Owen et al., 2012) are employed

for identifying functional biocatalysts. A number of natural prod-

ucts and their biosynthetic machinery have been identified

through functional screening of soil metagenomes, including tur-

bomycin A (1) and B (2) (Gillespie et al., 2002), terragines (3)

(Wang et al., 2000), fatty acid derivatives (Brady et al., 2002;

Brady and Clardy, 2003), and several aromatic polyketides

(Feng et al., 2011; Figure 2). Intracellular reporters based on

ª2013 Elsevier Ltd All rights reserved 637

Page 3: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Figure 2. Natural Products WhoseBiosynthetic Pathways Were Identifiedthrough Metagenomics or Single-CellIsolation from a Metagenomic SampleThe turbomycins (1 and 2) and terragine A (3)pathways were discovered from soil meta-genomes, the patellazole A (4) and bryostatin 1 (5)pathways are from symbiotic bacteria associatedwith marine invertebrates, and the apratoxin A (6)pathway was identified by single-cell analysis of acyanobacterium.

Chemistry & Biology

Perspective

fluorescence or pigment production have been used to identify

novel quorum-sensing molecules (Williamson et al., 2005) and

phosphopantetheinyl transferases involved in polyketide and

nonribosomal peptide biosynthesis (Owen et al., 2012). In these

screens, the metagenomic library host carries a reporter gene

(e.g., GFP), usually located on a plasmid, that is activated

when a specific target from the eDNA is expressed. Functional

screens are advantageous because they do not require any pre-

vious knowledge of gene sequences and are perhaps the best

way to identify novel biochemical mechanisms.

Alternatively, gene homology-based screens rely on using

sequence similarity to previously known biosynthetic genes or

biocatalysts. Though limited in their utility for discovery of

completely novel enzymatic reactions, homology-based

methods do not require expression of biosynthetic genes for

detection. PCR or hybridization techniques are the most com-

mon strategies for identifying, quantifying, and cataloging genes

of interest frommetagenomic libraries, especially in the early ge-

nomics era. Typically, degenerate primers designed from highly

638 Chemistry & Biology 20, May 23, 2013 ª2013 Elsevier Ltd All rights reserved

conserved regions of biosynthetic targets

are used to PCR amplify gene fragments

from total or enriched eDNA. From these

fragments, specific PCR primers can be

generated for screening a metagenomic

clone library. This is a common strategy

to identify modular polyketide or nonribo-

somal peptide biosynthesis in a metage-

nomic sample, since these pathways

are more difficult to detect in functional

screens compared to individual bio-

catalysts or smaller pathways (e.g., type

II polyketide synthase [PKS] systems).

This is due to the fact that these pharma-

ceutically relevant natural products are

synthesized by gene clusters that can

easily exceed the insert size limitations

of fosmid or even BAC vectors, making

it often necessary to isolate multiple

clones or perform genome walking in

order to obtain the complete gene clus-

ter. Thus, heterologous expression of

modular PKS or nonribosomal peptide

synthetase (NRPS) pathways can usually

only be attempted after initial identifica-

tion using homology-based screens.

As high-throughput sequencing and

assembly technologies improve and pri-

ces decrease, large-scale sequencing of enriched and metage-

nomic DNA is becoming more common for homology-based

gene discovery. These NGS platforms, including 454 Life Sci-

ences pyrosequencing, Illumina, single molecule sequencing,

and others have revolutionized the way metagenomic (and tradi-

tional genomic) data are obtained. Enormous amounts of

sequencing reads can be generated within hours using the latest

platforms. An unfortunate disadvantage with NGS is that assem-

bly becomes very challenging, due to short read lengths associ-

ated with most platforms. PKS and NRPS pathways from

metagenomes can be particularly challenging to assemble

because of the highly conserved and repetitive organization of

these genes. Additionally, complex systems (e.g., animal-micro-

bial associations and soil) can contain several hundred to thou-

sand different species of bacteria and with numerous PKS and

NRPS genes. Nonetheless, this challenge can be addressed to

some degree by using paired-end libraries, deeper sequencing,

or enrichment of the producing bacteria, and gap closing using

PCR. These strategies have been used with NGS to identify

Page 4: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Chemistry & Biology

Perspective

PKS and NRPS clusters from a number of cultured strains (John-

ston et al., 2013; Liu et al., 2010) as well as the patellazole (4)

(Donia et al., 2011a) and bryostatin (5) (Sudek et al., 2007)

biosynthetic gene clusters from uncultured bacteria associated

with marine invertebrates.

An important limitation to the use of metagenomics for com-

plex communities is that biosynthetic genes can rarely be

ascribed to a specific producer, as clones carrying both biosyn-

thetic genes and phylogenetic markers are almost nonexistent in

clone libraries. Single-cell genomics has emerged as a tool to

address this dilemma and reducemetagenomic complexity. Sin-

gle cells can be isolated from complex microbial mixtures via mi-

crofluidic encapsulation, micromanipulation, fluorescence-acti-

vated cell sorting (FACS), or microdissection (Blainey, 2013).

The genome of one cell is then amplified by multiple displace-

ment amplification (MDA) to generate enough DNA for

sequencing or PCR screening (Kvist et al., 2007). For example,

single-cell isolation and sequencing was used to identify the

elusive apratoxin (6; Figure 2) pathway from the filamentous cy-

anobacteria, Lyngbya bouillonii, by separating the cyanobacteria

from its associated heterotrophic bacteria (Grindberg et al.,

2011). FACS and MDA were used to sequence the partial

genome of a sponge-associated bacterium from the unusual

candidate phylum ‘‘Poribacteria’’ and linked the bacterium to a

new class of PKS genes (Siegl et al., 2011). Differentiating the ge-

netic make-up of a single organism from a complex assemblage

may give hints to the nutritional requirements necessary for culti-

vation experiments. Many obligate symbionts, however, are

likely to remain unculturable, as they often require host-specific

nutrients or cell-cell interactions in order to grow.

Ideally, the identification of a natural product biosynthetic

gene cluster will subsequently lead to heterologous expression

of the entire cluster and production of its compound. However,

expression of whole pathways from uncultured or slow-growing

organisms remains a major bottleneck in the functional charac-

terization natural product pathways and production of their prod-

ucts. This is especially true for large PKS and NRPS clusters that

can span several cosmid or fosmid clones and can incorporate

precursors or cofactors from external metabolic pathways not

present in the host. Additional challenges to successful heterol-

ogous expression, including codon bias, promoter recognition,

host toxicity, product yield, and host versatility, must be resolved

when considering any experiment. In this case, genomic infor-

mation can assist in choosing an appropriate host organism for

heterologous expression of desired genes. Additional tech-

niques to address these challenges are steadily advancing. For

example, transformation-assisted recombination (TAR) in yeast

has been used to assemble type II PKS clusters fromoverlapping

clone inserts derived from soil eDNA (Kim et al., 2010). Genetic

engineering of broad range host strains has also led to the suc-

cessful expression of cryptic pathways (Busch and Hertweck,

2009; Komatsu et al., 2013).

By applying these methods to various habitats, uncultivated

bacteria are increasingly being recognized as a rich source for

unprecedented biosynthetic enzymology. In the following sec-

tions, wewill provide several examples that illustrate the diversity

of novel pathways and biochemistry encountered in these or-

ganisms. It should be pointed out that many of these features

were later also found in cultured bacteria and are thus not entirely

Chemistry & B

restricted to uncultivated organisms. However, the high fre-

quency at which biosynthetic novelty is encountered in the still

limited number of published metagenomic studies suggests a

significant skew toward unusual pathways in environmental pro-

karyotes and thus an exciting discovery potential.

N-to-O Exchange in EnamidesA series of saturated andmonosaturated long-chainN-acyl-tyro-

sines (7 to 8; Figure 3A) were the first novel biologically active

natural products discovered through functional screening of

metagenomic libraries (Brady andClardy, 2000). Later, a broader

range ofN-acyl-amino acids were identified along with enol ester

derivatives (9; Figure 3A; Brady et al., 2002; Brady and Clardy,

2005b). Sequencing and transposon mutagenesis of clones pro-

ducing these compounds led to the identification of a novel N-

acyl-amino acid synthase and, subsequently, to the fatty acid

enol ester biosynthetic gene cluster (Brady et al., 2002; Brady

and Clardy, 2000). Of particular note in this pathway is the un-

precedented N,O-acyltransferase, FeeH, which, through a yet

unknown biochemical mechanism, facilitates the enamine-to-

enol ester conversion (Figure 3A).

Isocyanide BiosynthesisBy functionally screening a soil-derived metagenomic E. coli li-

brary for antibiotic activity against Bacillus subtilis, an isocya-

nide-functionalized indole (10; Figure 3B) was discovered (Brady

and Clardy, 2005a). The genes isnA and isnB located on the

cloned eDNA region were shown to be responsible for biosyn-

thesis. Through stable isotope feeding studies, the indole was

shown to derive from tryptophan. IsnA is an unprecedented iso-

nitrile synthase that performs the C–N condensation, while IsnB

is a decarboxylase (Figure 3B). Extensive feeding studies re-

vealed that the carbon atom of the isonitrile group originates

from a single excised carbon (C2) of ribulose-5-phosphate

(Brady and Clardy, 2005c). To explore this remarkable transfor-

mation further, 12 additional isnA-related sequences were

recovered by homology-based screening of 400,000 soil-

derived cosmid clones, and the resulting clusters were heterolo-

gously expressed in E. coli (Brady et al., 2007). Re-evaluation

of the pvc biosynthetic gene cluster from Pseudomonas

aeruginosa, which contains the isnA homolog pvcA, showed

that themajor product of the pathway was the isocyanide natural

product paerucumarin (11; Figure 3C) and not pyoverdine, as

previously proposed (Clarke-Pearson and Brady, 2008).

Aromatic Polyketide SkeletonsPKSs are responsible for the biosynthesis of some of the most

structurally and functionally diverse bioactive natural products,

many of which are employed pharmaceutical drugs. Evolution-

arily related to fatty acid synthases, PKSs similarly catalyze the

successive Claisen condensation and modification of small

acyl units. There are three main classes of PKS enzymes or

enzyme complexes (type I, II, and III), which can be further

divided into subtypes based on their gene architecture and

whether they function iteratively or modularly. Type II PKSs

generate most of the aromatic polyketides found in bacteria

(Brady and Clardy, 2000), of which tetracyclines and daunoru-

bicin are well-known examples. While a myriad of functionaliza-

tion patterns exist among these substances, the diversity of their

iology 20, May 23, 2013 ª2013 Elsevier Ltd All rights reserved 639

Page 5: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Figure 3. Small Molecules and Proposed Biosynthesis Determined through Functional Screening and Heterologous Expression ofMetagenomic eDNA(A) Proposed biosynthesis of long-chain N-acyltyrosines ([7] and [8]) and their enol ester derivatives (9).(B) Proposed biosynthesis of isocyanide-functionalized indoles (10).(C) Structure of paerucumarin (11).(D) Representative aromatic polyketides isolated from heterologous expression of eDNA from soil metagenomes.

Chemistry & Biology

Perspective

carbon skeletons is restricted to just a few basic types. Thus, the

discovery of novel skeletons is relatively rare for this substance

class. To explore uncultivated bacteria as a resource of new ar-

omatic polyketides, clones from soil metagenomic libraries

carrying characteristic ketosynthase b (KSb) genes of type II

PKSs were identified using degenerate primers. Positive over-

lapping clones were reassembled by TAR and expressed in

Streptomyces (Feng et al., 2010, 2011; King et al., 2009). From

these studies, two new pentacyclic type II PKS core skeletons

were discovered (12 and 13; Figure 3D) along with derivatives

of two compound families whose core skeletons are extremely

rare among cultured strains (14 and 15; Figure 3D). The novel

pentacyclic product, erdacin (12) is predicted to arise from the

coupling of two smaller but similarly derived bicyclic octaketide

intermediates and has potent antioxidant activity (King et al.,

2009). The other novel pentacyclic polyketide (13) is proposed

to derive from the rearrangement of a common tetracyclic inter-

mediate. Among the rare skeletons isolated were the antibiotic

fasamycins (14), which are active against methicillin-resistant

Staphylococcus aureus and vancomycin-resistant Enterococci

by inhibiting FabF during type II fatty acid elongation (Feng

et al., 2012). The moderately antibacterial fluostatins (15) repre-

sent another rare structural family derived from the TAR-based

assembly and heterologous expression of two clones from a

640 Chemistry & Biology 20, May 23, 2013 ª2013 Elsevier Ltd All righ

multimillion member metagenomic library constructed from a

soil sample collected in the Anza Borrego desert of Southern

California (Feng et al., 2010). The five-member ring in the core

of 15 is derived from the oxidative excision of a single carbon

and rearrangement of a rabelomycin-like precursor (Feng et al.,

2010). Phylogenetic analysis of KSb domains can assist in struc-

ture predictions for aromatic polyketides from both cultured and

uncultured bacteria, and the novel structural families from meta-

genomic studies are honing the predictive capability of such

methods (Feng et al., 2011).

Expansion of Type I PKS DiversityA recently recognized subclass of modular type I PKSs is the

trans-acyltransferase (trans-AT) type (Piel, 2010). Trans-AT

PKSs differ from textbook (cis-AT) type I PKSs, in that a free-

standing AT domain protein acts iteratively to provide the acyl

extender units rather than each PKS module containing a dedi-

cated AT domain. Though trans-AT PKS genes or gene frag-

ments were initially identified in cultured bacteria (Huang et al.,

2001; Paitan et al., 1999; Scotti et al., 1993), the first extended

trans-AT PKS biosynthetic gene cluster attributed to a known

product was the pathway for the cytotoxin pederin (16) (Figure 4),

which is produced by an uncultured Pseudomonas sp. symbiont

of Paederus spp. rove beetles (Piel, 2002). To isolate the genes,

ts reserved

Page 6: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Figure 4. Polyketides Derived from Trans-AT PKS PathwaysWith the exception of leinamcyin (17), these trans-AT PKS products were derived from pathways discovered through homology-based metagenomic libraryscreening/sequencing.

Chemistry & Biology

Perspective

the ketosynthase (KS) domains of the pederin cluster were first

amplified using degenerate primers from the total DNA of ped-

erin-producing beetles. Primers from the amplified KS fragments

were then used to identify the pederin cluster from an 80,000

cosmid clone metagenomic library. The first functional evidence

for the hypothesized catalytic function of trans-AT enzymes was

provided for the leinamycin (17) PKS, showing that a single AT

loads all the expressed acyl carrier protein domains (Cheng

et al., 2003; Figure 4). From studies on trans-AT systems, it

became rapidly clear that these PKSs exhibit an exceptional

architectural and catalytic diversity. Examples of components

that are absent or rare in cis-AT PKSs include various domains

for the formation of five- and six-membered heterocyclic rings

(Gulder et al., 2011; Matilla et al., 2012; Moldenhauer et al.,

2010; Tang et al., 2004), dehydratase domains that introduce

b,g-double bonds (Kusebauch et al., 2010; Moldenhauer et al.,

2010), and enzymes and domains for diverse carbon branches

(Moldenhauer et al., 2010).

For as-yet unknown reasons, trans-AT PKSs are by far the

dominant known PKS type in uncultivated bacterial symbionts,

while they are much less frequent in free-living cultivated

bacteria. Examples include biosynthetic genes for onnamide

(18) and psymberin (irciniastatin A) (19) from marine sponges

(Figure 4), bryostatin (5; Figure 2) from a bryozoan, patellazole

(4; Figure 2) from a tunicate, and rhizoxin (20) from a fungus

(Figure 4), which all belong to diverse endosymbionts. Many

complex polyketides with exquisite bioactivities are isolated in

extremely low quantities from invertebrates but are most likely

produced by trans-AT PKSs of symbiotic bacteria. On this ratio-

nale, strategies targeting trans-AT PKS genes have been devel-

Chemistry & B

oped to provide direct access to biosynthetic pathways for such

compounds (Fisch et al., 2009; Piel et al., 2004b), which can then

be used for the development of heterologous expression sys-

tems. For example, it is evident that trans- and cis-AT PKS sys-

tems evolved separately and form distinct clades when

compared phylogenetically, allowing for targeted genome min-

ing. Structure prediction from trans-AT PKSs are challenging,

since they do not follow the same colinearity rules as textbook

cis-AT systems. However, phylogeny-based rules have been

developed to predict polyketide structures from trans-AT PKS

sequences and vice versa (Nguyen et al., 2008). These phylog-

eny-based approaches facilitated investigation of trans-AT path-

ways from both cultured and uncultured bacteria (Irschik et al.,

2010; Nguyen et al., 2008; Teta et al., 2010).

Peptide ModificationsPeptide natural products from postribosomal peptide synthesis

pathways, recently designated ribosomally synthesized and

posttranslationally modified peptides (RiPPs), are produced

by all three domains of life, and many possess potent biological

activities (Arnison et al., 2013). The biological activities of RiPPs

include cytotoxicity, antimicrobial activity, anticancer, antiviral,

and quorum sensing. Though known throughout the realm

of cultivated bacteria, functional identification of at least two

new families of RiPPs originated from metagenomic studies

(Figure 5).

Cyanobactins constitute a large class (�200 known products)

of cyclic RiPPs produced by both free-living and symbiotic cya-

nobacteria (Donia et al., 2008, 2011b). Many of these com-

pounds are cytotoxic. Common enzymatic modifications of

iology 20, May 23, 2013 ª2013 Elsevier Ltd All rights reserved 641

Page 7: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Figure 5. RiPPs from Pathways Elucidated through Metagenomic StudiesPosttranslational modifications are indicated by color: heterocyclization (pink); prenylation (green); C-methylation (orange); N-methylation (violet); epimerization(red); and hydroxylation (blue). Polytheonamides differ in the stereochemistry of the sulfoxide moiety.

Chemistry & Biology

Perspective

cyanobactins include N-to-C macrocyclization (Koehnke et al.,

2012; McIntosh et al., 2010a; Figure 6A) and heterocyclizations

(McIntosh et al., 2010b; McIntosh and Schmidt, 2010;

Figure 6B). Sometimes structures exhibit unprecedented preny-

lations of serine, tyrosine, or threonine residues facilitated by a

new family of O-prenyltransferases (O-PTs) (McIntosh et al.,

2011; Figure 6C). Several recent reviews discuss cyanobactins

in detail (Arnison et al., 2013; Schmidt and Donia, 2009). Though

the first cyanobactins were reported in 1980 from didemnid tuni-

cates (Aziz et al., 2008), it was only in 2005 when two groups

working independently identified and heterologously expressed

the patellamide (21; Figure 5) pathway (pat) from a cyanobacte-

rial symbiont, Prochloron didemni, of the tunicate Lissoclinum

patella (Long et al., 2005; Schmidt et al., 2005). While Long, Jas-

pars, and coworkers discovered the genes by expression

screening of an enriched P. didemni DNA library in E. coli,

642 Chemistry & Biology 20, May 23, 2013 ª2013 Elsevier Ltd All righ

Schmidt et al. analyzed the symbiont DNA first for NRPS and

then for RiPP genes. Once the putative pat pathway was identi-

fied as a small RiPP cluster, it was expressed in E. coli yielding

20 mg l�1. Knowledge of the patellamide pathway revealed

related enzymes encoded in numerous other symbiotic and

free-living cultured and uncultured cyanobacteria, with homo-

logs being present in about 30% of evolutionarily distant cyano-

bacterial taxa (Leikoski et al., 2009). In addition to the O-PTs,

mechanism of heterocyclization was elucidated from enzymes

in the pat and trunkamide (22; Figure 5) pathways (tru). Macro-

cyclization of cyanobactin RiPPs involves an Asp-His-Ser cata-

lytic triad analogous to macrocycle formation by thioesterase

domains of NRPSs. Upon recognition of a three-residue signa-

ture, a serine protease-like macrocyclase, PatG, cleaves a

C-terminal peptide sequence and facilitates the N-to-C macro-

cyclization (Figure 6A; Koehnke et al., 2012; McIntosh et al.,

ts reserved

Page 8: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Figure 6. Modifications of Cyanobactin Ribosomal PeptidesEnzymatic reactions for (A) heterocyclization, (B) macrocyclization, and (C) prenylation as determined through in vitro experiments.

Chemistry & Biology

Perspective

2010a). Unlike heterocyclization in nonribosomal peptide

systems, which require three distinct protein domains, the heter-

ocyclases in the pat and tru pathways involve only a single ATP-

dependent enzyme that is akin to molecular machines

(Figure 6B; McIntosh and Schmidt, 2010). Additional experi-

ments by the Schmidt group showed that combinatorial libraries

could be generated by engineering modifications in the cyano-

bactin precursor peptide (Donia et al., 2006, 2008).

The polytheonamides (23; Figure 5) are another example of

peptides originally thought to be products of an NRPS biosyn-

thetic route. Reported in 1994 from the marine sponge Theonella

swinhoei, these compounds exhibit a highly complex structure

and belong to the largest secondary metabolites known. The

polytheonamides are highly cytotoxic linear peptides comprised

Chemistry & B

of numerous nonproteinogenic amino acids, including eight

t-leucine residues, eight g-N-methylasparagines, and at least

four additional unusual amino acids (Freeman et al., 2012; Ham-

ada et al., 2005). Despite sharing structural features with nonri-

bosomal peptides, including the presence of 18 D-amino acids,

their enormous size (48 residues) merited investigation whether a

ribosomal route might be involved (Freeman et al., 2012). Using

degenerate PCR primers based on the hypothetical unmodified

precursor peptide carrying only proteinogenic amino acids, a

fragment of the precursor gene was isolated from the sponge

metagenomic DNA. The sequence information allowed isolation

of the complete gene cluster from a 980,000 clone metagenomic

library constructed from the total sponge tissue. The precursor

protein (PoyA) has an unusually long leader sequence that is

iology 20, May 23, 2013 ª2013 Elsevier Ltd All rights reserved 643

Page 9: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Chemistry & Biology

Perspective

homologous to nitrile hydratase-like enzymes and does not

share similarity to RiPP precursors from characterized peptide

families. However, genes for homologous precursors were found

in various sequenced genomes of cultivated bacteria, suggest-

ing the existence of a new natural product family (Haft et al.,

2010). Polytheonamides are the, so far, only attributed members

of this family, which is termed proteusins.

Proteusin pathways exhibit some highly unusual enzymology.

For biosynthesis of 23, up to 48 posttranslational modifications

are necessary, making these compounds themost heavily modi-

fied RiPPs known to date (Figure 5). For four hydroxylations, at

least 21 methylations and 18 epimerizations, only six modifying

enzyme candidates were identified. Functions for several of the

modifying enzyme candidates were assessed by in vitro coex-

pression with PoyA. Most and perhaps all epimerizations are

catalyzed by a single epimerase, PoyD. This unprecedented

enzyme is a member of the radical S-adenosylmethionine

(rSAM) superfamily and performs the epimerization in a unidirec-

tional fashion rather than generating an equilibrium. PoyF shows

homology to the dehydratase domain of bidomain lanthionine

synthetases and catalyzes dehydration of a threonine residue

that is eventually converted to the unusual N-acyl residue. One

or two rSAM methyltranferases, PoyB and C, are proposed to

perform a total of four methylations of the modified Thr, resulting

in a net tert-butylation. This intermediate is then hydrolytically

cleaved to release the leader peptide and yield the final N-acyl

moiety. A single SAM-dependent N-methyltransferase, PoyE,

iteratively methylates eight asparagine residues. The function

of the rSAM methyltransferases, PoyB and C, along with the

Fe(II)/a-ketoglutarate-dependent oxygenase, PoyI, remains to

be fully elucidated but are hypothesized to be involved in the

23C-methylations and four hydoxylations of the polytheonamide

precursor, respectively. Homologs of the polytheonamide en-

zymes, aswell as enzymeswith as-yet unknown function, are en-

coded in many other proteusin gene clusters in a wide range of

bacteria. Thus, similar to cyanobactins and other examples dis-

cussed in this perspective, this is another case in which studies

on uncultivated bacteria have set the stage for the discovery of

new structural and enzymatic diversity on a broader scope.

The Way ForwardAs described here, metagenomic approaches to natural product

drug discovery and biosynthetic understanding have not only

facilitated the functional characterization of uncultivated bacte-

ria but have also provided valuable insights into previously un-

known biochemistry in cultivatable organisms. Yet there are hur-

dles that must be overcome in order to fully maximize the

potential of metagenomics for natural product drug discovery

and development. As NGS technologies improve and become

cheaper and more routine, the amount of data an individual lab-

oratory can generate can become overwhelming fairly rapidly.

Automated genome mining tools, such as the recently devel-

oped antiSMASH (Medema et al., 2011) or ClusterMine360 (Con-

way and Boddy, 2013), will become critical when working with

multiple large data sets. However, since these methods rely on

homology-based queries, the identification of truly novel chem-

istry will likely rely on new strategies for bioinformatic genome

mining. These may include developing pattern recognition-

based algorithms designed to find conserved yet unexplored

644 Chemistry & Biology 20, May 23, 2013 ª2013 Elsevier Ltd All righ

open reading frames associated with natural product biosyn-

thesis. Additionally, genomics-guided functional screening and

heterologous expression is a logical next step in natural product

drug discovery and development as NGS sequencing technolo-

gies improve and prices drop. Host selection and engineering

based on phylogenetic and comparative genomic approaches

may assist in selecting the most appropriate universal host

for a particular environmental sample. A (meta)genome-first

approach may also provide a better understanding of compat-

ible gene regulation and resistance in order to identify and engi-

neer universal hosts that can express genes from awide range of

sources, as well as be used directly for library construction. This

will also require improved vectors that can carry and express

entire gene clusters stably and reliably.

We should not, however, turn away from cultivation. Recent

advances in cocultivation and environmental simulation have

shown that uncultured bacteria are not necessarily unculturable

(Stewart, 2012). Using semiporous culture chambers, two or

more organisms can be grown separately, yet a membrane al-

lows for the exchange of small molecules, such as nutrients

and growth factors. Other techniques, such as using natural

seawater as the culture medium (Kaeberlein et al., 2002) or the

use of microfluidics, have also proven successful for isolation

of rare bacterial taxa (Liu et al., 2009). Integration of ‘‘traditional’’

metagenomics with the latest tools in single-cell analysis (Blai-

ney, 2013; Kamke et al., 2012), metatranscriptomics (Yu and

Zhang, 2012), and metaproteomics (Muth et al., 2013) will

certainly aid in unveiling not only the biosynthetic potential of

uncultivated microbes but also how to express their genes het-

erologously and how to grow them effectively under laboratory

conditions.

ACKNOWLEDGMENTS

M.C.W. is funded by the Alexander von Humboldt Foundation.

REFERENCES

Achtman, M., and Wagner, M. (2008). Microbial diversity and the genetic na-ture of microbial species. Nat. Rev. Microbiol. 6, 431–440.

Arnison, P.G., Bibb, M.J., Bierbaum, G., Bowers, A.A., Bugni, T.S., Bulaj, G.,Camarero, J.A., Campopiano, D.J., Challis, G.L., Clardy, J., et al. (2013). Ribo-somally synthesized and post-translationally modified peptide natural prod-ucts: overview and recommendations for a universal nomenclature. Nat.Prod. Rep. 30, 108–160.

Aziz, R.K., Bartels, D., Best, A.A., DeJongh, M., Disz, T., Edwards, R.A., For-msma, K., Gerdes, S., Glass, E.M., Kubal, M., et al. (2008). The RAST Server:rapid annotations using subsystems technology. BMC Genomics 9, 75.

Bentley, S.D., Chater, K.F., Cerdeno-Tarraga, A.M., Challis, G.L., Thomson,N.R., James, K.D., Harris, D.E., Quail, M.A., Kieser, H., Harper, D., et al.(2002). Complete genome sequence of themodel actinomycete Streptomycescoelicolor A3(2). Nature 417, 141–147.

Berdy, J. (2005). Bioactive microbial metabolites. J. Antibiot. (Tokyo) 58, 1–26.

Bewley, C.A., Holland, N.D., and Faulkner, D.J. (1996). Two classes of metab-olites from Theonella swinhoei are localized in distinct populations of bacterialsymbionts. Experientia 52, 716–722.

Blainey, P.C. (2013). The future is now: single-cell genomics of bacteria andarchaea. FEMS Microbiol. Rev. 37, 407–427. http://dx.doi.org/10.1111/1574-6976.12015.

ts reserved

Page 10: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Chemistry & Biology

Perspective

Brady, S.F., and Clardy, J. (2000). Long-chain N-acyl amino acid antibioticsisolated from heterologously expressed environmental DNA. J. Am. Chem.Soc. 122, 12903–12904.

Brady, S.F., and Clardy, J. (2003). Synthesis of long-chain fatty acid enol estersisolated from an environmental DNA clone. Org. Lett. 5, 121–124.

Brady, S.F., and Clardy, J. (2005a). Cloning and heterologous expression ofisocyanide biosynthetic genes from environmental DNA. Angew. Chem. Int.Ed. Engl. 44, 7063–7065.

Brady, S.F., and Clardy, J. (2005b). N-acyl derivatives of arginine and trypto-phan isolated from environmental DNA expressed in Escherichia coli. Org.Lett. 7, 3613–3616.

Brady, S.F., and Clardy, J. (2005c). Systematic investigation of the Escherichiacolimetabolome for the biosynthetic origin of an isocyanide carbon atom. An-gew. Chem. Int. Ed. Engl. 44, 7045–7048.

Brady, S.F., Chao, C.J., Handelsman, J., and Clardy, J. (2001). Cloning andheterologous expression of a natural product biosynthetic gene cluster fromeDNA. Org. Lett. 3, 1981–1984.

Brady, S.F., Chao, C.J., and Clardy, J. (2002). New natural product familiesfrom an environmental DNA (eDNA) gene cluster. J. Am. Chem. Soc. 124,9968–9969.

Brady, S.F., Bauer, J.D., Clarke-Pearson, M.F., and Daniels, R. (2007). Naturalproducts from isnA-containing biosynthetic gene clusters recovered from thegenomes of cultured and uncultured bacteria. J. Am. Chem. Soc. 129, 12102–12103.

Brady, S.F., Simmons, L., Kim, J.H., and Schmidt, E.W. (2009). Metagenomicapproaches to natural products from free-living and symbiotic organisms. Nat.Prod. Rep. 26, 1488–1503.

Busch, B., and Hertweck, C. (2009). Evolution of metabolic diversity in polyke-tide-derived pyrones: using the non-colinear aureothin assembly line as amodel system. Phytochemistry 70, 1833–1840.

Chang, F.Y., and Brady, S.F. (2011). Cloning and characterization of an envi-ronmental DNA-derived gene cluster that encodes the biosynthesis of the anti-tumor substance BE-54017. J. Am. Chem. Soc. 133, 9996–9999.

Cheng, Y.-Q., Tang, G.-L., and Shen, B. (2003). Type I polyketide synthaserequiring a discrete acyltransferase for polyketide biosynthesis. Proc. Natl.Acad. Sci. USA 100, 3149–3154.

Chung, E.J., Lim, H.K., Kim, J.C., Choi, G.J., Park, E.J., Lee, M.H., Chung,Y.R., and Lee, S.W. (2008). Forest soil metagenome gene cluster involved inantifungal activity expression in Escherichia coli. Appl. Environ. Microbiol.74, 723–730.

Clarke-Pearson,M.F., and Brady, S.F. (2008). Paerucumarin, a newmetaboliteproduced by the pvc gene cluster from Pseudomonas aeruginosa. J. Bacteriol.190, 6927–6930.

Conway, K.R., and Boddy, C.N. (2013). ClusterMine360: a database of micro-bial PKS/NRPS biosynthesis. Nucleic Acids Res. 41, D402–D407.

Craig, J.W., Chang, F.Y., and Brady, S.F. (2009). Natural products from envi-ronmental DNA hosted in Ralstonia metallidurans. ACS Chem. Biol. 4, 23–28.

Craig, J.W., Chang, F.Y., Kim, J.H., Obiajulu, S.C., and Brady, S.F. (2010). Ex-panding small-molecule functional metagenomics through parallel screeningof broad-host-range cosmid environmental DNA libraries in diverse proteo-bacteria. Appl. Environ. Microbiol. 76, 1633–1641.

Donia, M.S., Hathaway, B.J., Sudek, S., Haygood,M.G., Rosovitz, M.J., Ravel,J., and Schmidt, E.W. (2006). Natural combinatorial peptide libraries in cyano-bacterial symbionts of marine ascidians. Nat. Chem. Biol. 2, 729–735.

Donia, M.S., Ravel, J., and Schmidt, E.W. (2008). A global assembly line for cy-anobactins. Nat. Chem. Biol. 4, 341–343.

Donia, M.S., Fricke, W.F., Partensky, F., Cox, J., Elshahawi, S.I., White, J.R.,Phillippy, A.M., Schatz, M.C., Piel, J., Haygood, M.G., et al. (2011a). Complexmicrobiome underlying secondary and primary metabolism in the tunicate-Prochloron symbiosis. Proc. Natl. Acad. Sci. USA 108, E1423–E1432.

Donia, M.S., Ruffner, D.E., Cao, S., and Schmidt, E.W. (2011b). Accessing thehidden majority of marine natural products through metagenomics.ChemBioChem 12, 1230–1236.

Chemistry & B

Feng, Z., Kim, J.H., and Brady, S.F. (2010). Fluostatins produced by the heter-ologous expression of a TAR reassembled environmental DNA derived type IIPKS gene cluster. J. Am. Chem. Soc. 132, 11902–11903.

Feng, Z., Kallifidas, D., and Brady, S.F. (2011). Functional analysis of environ-mental DNA-derived type II polyketide synthases reveals structurally diversesecondary metabolites. Proc. Natl. Acad. Sci. USA 108, 12629–12634.

Feng, Z., Chakraborty, D., Dewell, S.B., Reddy, B.V., and Brady, S.F. (2012).Environmental DNA-encoded antibiotics fasamycins A and B inhibit FabF intype II fatty acid biosynthesis. J. Am. Chem. Soc. 134, 2981–2987.

Fisch, K.M., Gurgui, C., Heycke, N., van der Sar, S.A., Anderson, S.A., Webb,V.L., Taudien, S., Platzer, M., Rubio, B.K., Robinson, S.J., et al. (2009). Polyke-tide assembly lines of uncultivated sponge symbionts from structure-basedgene targeting. Nat. Chem. Biol. 5, 494–501.

Freeman, M.F., Gurgui, C., Helf, M.J., Morinaka, B.I., Uria, A.R., Oldham, N.J.,Sahl, H.G., Matsunaga, S., and Piel, J. (2012). Metagenome mining revealspolytheonamides as posttranslationally modified ribosomal peptides. Science338, 387–390.

Gillespie, D.E., Brady, S.F., Bettermann, A.D., Cianciotto, N.P., Liles, M.R.,Rondon, M.R., Clardy, J., Goodman, R.M., and Handelsman, J. (2002). Isola-tion of antibiotics turbomycin a and B from ametagenomic library of soil micro-bial DNA. Appl. Environ. Microbiol. 68, 4301–4306.

Grindberg, R.V., Ishoey, T., Brinza, D., Esquenazi, E., Coates, R.C., Liu, W.T.,Gerwick, L., Dorrestein, P.C., Pevzner, P., Lasken, R., and Gerwick, W.H.(2011). Single cell genome amplification accelerates identification of the apra-toxin biosynthetic pathway from a complex microbial assemblage. PLoS ONE6, e18565.

Guan, C., Ju, J., Borlee, B.R., Williamson, L.L., Shen, B., Raffa, K.F., and Han-delsman, J. (2007). Signal mimics derived from a metagenomic analysis of thegypsy moth gut microbiota. Appl. Environ. Microbiol. 73, 3669–3676.

Gulder, T.A., Freeman, M.F., and Piel, J. (2011). The catalytic diversity of multi-modular polyketide synthases: natural product biosynthesis beyond textbookassembly rules. Top. Curr. Chem.. http://dx.doi.org/10.1007/128_2010_113

Gurgui, C., and Piel, J. (2010). Metagenomic approaches to identify and isolatebioactive natural products from microbiota of marine sponges. Methods Mol.Biol. 668, 247–264.

Haft, D.H., Basu, M.K., and Mitchell, D.A. (2010). Expansion of ribosomallyproduced natural products: a nitrile hydratase- and Nif11-related precursorfamily. BMC Biol. 8, 70–94.

Hamada, T., Matsunaga, S., Yano, G., and Fusetani, N. (2005). Polytheona-mides A and B, highly cytotoxic, linear polypeptides with unprecedentedstructural features, from the marine sponge, Theonella swinhoei. J. Am.Chem. Soc. 127, 110–118.

Handelsman, J. (2004). Metagenomics: application of genomics to unculturedmicroorganisms. Microbiol. Mol. Biol. Rev. 68, 669–685.

Hildebrand, M., Waggoner, L.E., Liu, H., Sudek, S., Allen, S., Anderson, C.,Sherman, D.H., and Haygood, M. (2004). bryA: an unusual modular polyketidesynthase gene from the uncultivated bacterial symbiont of the marine bryo-zoan Bugula neritina. Chem. Biol. 11, 1543–1552.

Huang, G., Zhang, L., and Birch, R.G. (2001). Amultifunctional polyketide-pep-tide synthetase essential for albicidin biosynthesis in Xanthomonas albilineans.Microbiology 147, 631–642.

Irschik, H., Kopp, M., Weissman, K.J., Buntin, K., Piel, J., andMuller, R. (2010).Analysis of the sorangicin gene cluster reinforces the utility of a combinedphylogenetic/retrobiosynthetic analysis for deciphering natural productassembly by trans-AT PKS. ChemBioChem 11, 1840–1849.

Jiao, J.Y., Wang, H.X., Zeng, Y., and Shen, Y.M. (2006). Enrichment for mi-crobes living in association with plant tissues. J. Appl. Microbiol. 100,830–837.

Johnston, C.W., Zvanych, R., Khyzha, N., andMagarvey, N.A. (2013). Nonribo-somal assembly of natural lipocyclocarbamate lipoprotein-associated phos-pholipase inhibitors. ChemBioChem 14, 431–435.

Kaeberlein, T., Lewis, K., and Epstein, S.S. (2002). Isolating "uncultivable" mi-croorganisms in pure culture in a simulated natural environment. Science 296,1127–1129.

iology 20, May 23, 2013 ª2013 Elsevier Ltd All rights reserved 645

Page 11: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Chemistry & Biology

Perspective

Kamke, J., Bayer, K., Woyke, T., and Hentschel, U. (2012). Exploring symbio-ses by single-cell genomics. Biol. Bull. 223, 30–43.

Kim, J.H., Feng, Z., Bauer, J.D., Kallifidas, D., Calle, P.Y., and Brady, S.F.(2010). Cloning large natural product gene clusters from the environment:piecing environmental DNA gene clusters back together with TAR. Biopoly-mers 93, 833–844.

King, R.W., Bauer, J.D., and Brady, S.F. (2009). An environmental DNA-derived type II polyketide biosynthetic pathway encodes the biosynthesis ofthe pentacyclic polyketide erdacin. Angew. Chem. Int. Ed. Engl. 48, 6257–6261.

Koehnke, J., Bent, A., Houssen, W.E., Zollman, D., Morawitz, F., Shirran, S.,Vendome, J., Nneoyiegbe, A.F., Trembleau, L., Botting, C.H., et al. (2012).The mechanism of patellamide macrocyclization revealed by the characteriza-tion of the PatG macrocyclase domain. Nat. Struct. Mol. Biol. 19, 767–772.

Komatsu, M., Komatsu, K., Koiwai, H., Yamada, Y., Kozone, I., Izumikawa, M.,Hashimoto, J., Takagi, M., Omura, S., Shin-ya, K., et al. (2013). EngineeredStreptomyces avermitilis host for heterologous expression of biosyntheticgene cluster for secondary metabolites (ACS Synth. Biol.). Published onlineJanuary 10, 2013. http://dx.doi.org/10.1021/sb3001003.

Kusebauch, B., Busch, B., Scherlach, K., Roth, M., and Hertweck, C. (2010).Functionally distinct modules operate two consecutive a,b—>b,g double-bond shifts in the rhizoxin polyketide assembly line. Angew. Chem. Int. Ed.Engl. 49, 1460–1464.

Kvist, T., Ahring, B.K., Lasken, R.S., and Westermann, P. (2007). Specific sin-gle-cell isolation and genomic amplification of uncultured microorganisms.Appl. Microbiol. Biotechnol. 74, 926–935.

Leikoski, N., Fewer, D.P., and Sivonen, K. (2009). Widespread occurrence andlateral transfer of the cyanobactin biosynthesis gene cluster in cyanobacteria.Appl. Environ. Microbiol. 75, 853–857.

Lenk, S., Moraru, C., Hahnke, S., Arnds, J., Richter, M., Kube, M., Reinhardt,R., Brinkhoff, T., Harder, J., Amann, R., andMußmann, M. (2012).Roseobacterclade bacteria are abundant in coastal sediments and encode a novel combi-nation of sulfur oxidation genes. ISME J. 6, 2178–2187.

Liles, M.R., Williamson, L.L., Rodbumrer, J., Torsvik, V., Goodman, R.M., andHandelsman, J. (2008). Recovery, purification, and cloning of high-molecular-weight DNA from soil microorganisms. Appl. Environ. Microbiol. 74, 3302–3305.

Liu, W., Kim, H.J., Lucchetta, E.M., Du, W., and Ismagilov, R.F. (2009). Isola-tion, incubation, and parallel functional testing and identification by FISH ofrare microbial single-copy cells from multi-species mixtures using the combi-nation of chemistrode and stochastic confinement. Lab Chip 9, 2153–2162.

Liu, X., Zhang, Q., Murata, K., Baker, M.L., Sullivan, M.B., Fu, C., Dougherty,M.T., Schmid, M.F., Osburne, M.S., Chisholm, S.W., and Chiu, W. (2010).Structural changes in a marine podovirus associated with release of itsgenome into Prochlorococcus. Nat. Struct. Mol. Biol. 17, 830–836.

Long, P.F., Dunlap, W.C., Battershill, C.N., and Jaspars, M. (2005). Shotguncloning and heterologous expression of the patellamide gene cluster as astrategy to achieving sustained metabolite production. ChemBioChem 6,1760–1765.

Lorenz, P., and Eck, J. (2005). Metagenomics and industrial applications. Nat.Rev. Microbiol. 3, 510–516.

Matilla, M.A., Stockmann, H., Leeper, F.J., and Salmond, G.P.C. (2012). Bac-terial biosynthetic gene clusters encoding the anti-cancer haterumalide classof molecules: biogenesis of the broad spectrum antifungal and anti-oomycetecompound, oocydin A. J. Biol. Chem. 287, 39125–39138.

McIntosh, J.A., and Schmidt, E.W. (2010). Marine molecular machines: heter-ocyclization in cyanobactin biosynthesis. ChemBioChem 11, 1413–1421.

McIntosh, J.A., Robertson, C.R., Agarwal, V., Nair, S.K., Bulaj, G.W., andSchmidt, E.W. (2010a). Circular logic: nonribosomal peptide-like macrocycli-zation with a ribosomal peptide catalyst. J. Am. Chem. Soc. 132, 15499–15501.

McIntosh, J.A., Donia, M.S., and Schmidt, E.W. (2010b). Insights into hetero-cyclization from two highly similar enzymes. J. Am. Chem. Soc. 132, 4089–4091.

646 Chemistry & Biology 20, May 23, 2013 ª2013 Elsevier Ltd All righ

McIntosh, J.A., Donia, M.S., Nair, S.K., and Schmidt, E.W. (2011). Enzymaticbasis of ribosomal peptide prenylation in cyanobacteria. J. Am. Chem. Soc.133, 13698–13705.

Medema, M.H., Blin, K., Cimermancic, P., de Jager, V., Zakrzewski, P., Fisch-bach, M.A., Weber, T., Takano, E., and Breitling, R. (2011). antiSMASH: rapididentification, annotation and analysis of secondary metabolite biosynthesisgene clusters in bacterial and fungal genome sequences. Nucleic Acids Res.39, W339–W346.

Miao, V., and Davies, J. (2009). Metagenomics and antibiotic discovery fromuncultivated bacteria. In Uncultivated Microorganisms, S.S. Epstein, ed. (Ber-lin Heidelberg: Springer-Verlag), pp. 217–236.

Moldenhauer, J., Gotz, D.C., Albert, C.R., Bischof, S.K., Schneider, K., Sus-smuth, R.D., Engeser, M., Gross, H., Bringmann, G., and Piel, J. (2010). Thefinal steps of bacillaene biosynthesis in Bacillus amyloliquefaciens FZB42:direct evidence for beta, gamma dehydration by a trans-acyltransferase poly-ketide synthase. Angew. Chem. Int. Ed. Engl. 49, 1465–1467.

Muth, T., Benndorf, D., Reichl, U., Rapp, E., and Martens, L. (2013). Searchingfor a needle in a stack of needles: challenges in metaproteomics data analysis.Mol. Biosyst. 9, 578–585. http://dx.doi.org/10.1039/C2MB25415H.

Nguyen, T., Ishida, K., Jenke-Kodama, H., Dittmann, E., Gurgui, C., Hochmuth,T., Taudien, S., Platzer, M., Hertweck, C., and Piel, J. (2008). Exploiting themosaic structure of trans-acyltransferase polyketide synthases for naturalproduct discovery and pathway dissection. Nat. Biotechnol. 26, 225–233.

Nikolouli, K., and Mossialos, D. (2012). Bioactive compounds synthesized bynon-ribosomal peptide synthetases and type-I polyketide synthases discov-ered through genome-mining and metagenomics. Biotechnol. Lett. 34,1393–1403.

Ouyang, Y., Dai, S., Xie, L., Ravi Kumar, M.S., Sun, W., Sun, H., Tang, D., andLi, X. (2010). Isolation of high molecular weight DNA from marine sponge bac-teria for BAC library construction. Mar. Biotechnol. (NY) 12, 318–325.

Owen, J.G., Robins, K.J., Parachin, N.S., and Ackerley, D.F. (2012). A func-tional screen for recovery of 40-phosphopantetheinyl transferase and associ-ated natural product biosynthesis genes from metagenome libraries. Environ.Microbiol. 14, 1198–1209.

Paitan, Y., Alon, G., Orr, E., Ron, E.Z., and Rosenberg, E. (1999). The first genein the biosynthesis of the polyketide antibiotic TA of Myxococcus xanthus co-des for a unique PKS module coupled to a peptide synthetase. J. Mol. Biol.286, 465–474.

Piel, J. (2002). A polyketide synthase-peptide synthetase gene cluster from anuncultured bacterial symbiont of Paederus beetles. Proc. Natl. Acad. Sci. USA99, 14002–14007.

Piel, J. (2010). Biosynthesis of polyketides by trans-AT polyketide synthases.Nat. Prod. Rep. 27, 996–1047.

Piel, J. (2011). Approaches to capturing and designing biologically active smallmolecules produced by uncultured microbes. Annu. Rev. Microbiol. 65,431–453.

Piel, J., Hui, D., Wen, G., Butzke, D., Platzer, M., Fusetani, N., and Matsunaga,S. (2004a). Antitumor polyketide biosynthesis by an uncultivated bacterialsymbiont of the marine sponge Theonella swinhoei. Proc. Natl. Acad. Sci.USA 101, 16222–16227.

Piel, J., Hui, D., Fusetani, N., and Matsunaga, S. (2004b). Targeting modularpolyketide synthases with iteratively acting acyltransferases from metage-nomes of uncultured bacterial consortia. Environ. Microbiol. 6, 921–927.

Rath, C.M., Janto, B., Earl, J., Ahmed, A., Hu, F.Z., Hiller, L., Dahlgren, M.,Kreft, R., Yu, F., Wolff, J.J., et al. (2011). Meta-omic characterization of thema-rine invertebrate microbial consortium that produces the chemotherapeuticnatural product ET-743. ACS Chem. Biol. 6, 1244–1256.

Reyes-Duarte, D., Ferrer, M., andGarcıa-Arellano, H. (2012). Functional-basedscreening methods for lipases, esterases, and phospholipases in metage-nomic libraries. Methods Mol. Biol. 861, 101–113.

Rondon, M.R., August, P.R., Bettermann, A.D., Brady, S.F., Grossman, T.H.,Liles, M.R., Loiacono, K.A., Lynch, B.A., MacNeil, I.A., Minor, C., et al.(2000). Cloning the soil metagenome: a strategy for accessing the geneticand functional diversity of uncultured microorganisms. Appl. Environ. Micro-biol. 66, 2541–2547.

ts reserved

Page 12: Chemistry & Biology Perspective - CORE · phosphopantetheinyl transferases involved in polyketide and nonribosomal peptide biosynthesis (Owen et al., 2012). In these screens, the

Chemistry & Biology

Perspective

Schmidt, E.W., and Donia, M.S. (2009). Chapter 23. Cyanobactin ribosomallysynthesized peptides–a case of deepmetagenomemining. Methods Enzymol.458, 575–596.

Schmidt, E.W., Nelson, J.T., Rasko, D.A., Sudek, S., Eisen, J.A., Haygood,M.G., and Ravel, J. (2005). Patellamide A and C biosynthesis by a microcin-like pathway in Prochloron didemni, the cyanobacterial symbiont ofLissoclinum patella. Proc. Natl. Acad. Sci. USA 102, 7315–7320.

Schmidt, E.W., Donia, M.S., McIntosh, J.A., Fricke, W.F., and Ravel, J. (2012).Origin and variation of tunicate secondary metabolites. J. Nat. Prod. 75,295–304.

Scotti, C., Piatti, M., Cuzzoni, A., Perani, P., Tognoni, A., Grandi, G., Galizzi, A.,and Albertini, A.M. (1993). A Bacillus subtilis large ORF coding for a polypep-tide highly similar to polyketide synthases. Gene 130, 65–71.

Shokralla, S., Spall, J.L., Gibson, J.F., and Hajibabaei, M. (2012). Next-gener-ation sequencing technologies for environmental DNA research. Mol. Ecol. 21,1794–1805.

Siegl, A., Kamke, J., Hochmuth, T., Piel, J., Richter, M., Liang, C., Dandekar, T.,and Hentschel, U. (2011). Single-cell genomics reveals the lifestyle of Poribac-teria, a candidate phylum symbiotically associated withmarine sponges. ISMEJ. 5, 61–70.

Stewart, E.J. (2012). Growing unculturable bacteria. J. Bacteriol. 194, 4151–4160.

Sudek, S., Lopanik, N.B., Waggoner, L.E., Hildebrand, M., Anderson, C., Liu,H., Patel, A., Sherman, D.H., and Haygood, M.G. (2007). Identification of theputative bryostatin polyketide synthase gene cluster from "Candidatus Endo-bugula sertula", the uncultivated microbial symbiont of the marine bryozoanBugula neritina. J. Nat. Prod. 70, 67–74.

Chemistry & B

Tang, G.-L., Cheng, Y.-Q., and Shen, B. (2004). Leinamycin biosynthesisrevealing unprecedented architectural complexity for a hybrid polyketide syn-thase and nonribosomal peptide synthetase. Chem. Biol. 11, 33–45.

Teta, R., Gurgui, M., Helfrich, E.J., Kunne, S., Schneider, A., Van Echten-Deck-ert, G., Mangoni, A., and Piel, J. (2010). Genome mining reveals trans-ATpolyketide synthase directed antibiotic biosynthesis in the bacterial phylumbacteroidetes. ChemBioChem 11, 2506–2512.

Udwary, D.W., Zeigler, L., Asolkar, R.N., Singan, V., Lapidus, A., Fenical, W.,Jensen, P.R., and Moore, B.S. (2007). Genome sequencing reveals complexsecondary metabolome in the marine actinomycete Salinispora tropica.Proc. Natl. Acad. Sci. USA 104, 10376–10381.

Wang, G.Y., Graziani, E.,Waters, B., Pan,W., Li, X., McDermott, J., Meurer, G.,Saxena, G., Andersen, R.J., and Davies, J. (2000). Novel natural products fromsoil DNA libraries in a streptomycete host. Org. Lett. 2, 2401–2404.

Williamson, L.L., Borlee, B.R., Schloss, P.D., Guan, C., Allen, H.K., and Han-delsman, J. (2005). Intracellular screen to identify metagenomic clones thatinduce or inhibit a quorum-sensing biosensor. Appl. Environ. Microbiol. 71,6335–6344.

Wright, G.D. (2012). Antibiotics: a new hope. Chem. Biol. 19, 3–10.

Wu, D., Hugenholtz, P., Mavromatis, K., Pukall, R., Dalin, E., Ivanova, N.N., Ku-nin, V., Goodwin, L., Wu, M., Tindall, B.J., et al. (2009). A phylogeny-drivengenomic encyclopaedia of Bacteria and Archaea. Nature 462, 1056–1060.

Xing, M.N., Zhang, X.Z., and Huang, H. (2012). Application of metagenomictechniques in mining enzymes from microbial communities for biofuel synthe-sis. Biotechnol. Adv. 30, 920–929.

Yu, K., and Zhang, T. (2012). Metagenomic andmetatranscriptomic analysis ofmicrobial community structure and gene expression of activated sludge. PLoSONE 7, e38183.

iology 20, May 23, 2013 ª2013 Elsevier Ltd All rights reserved 647