genome-wide identification and characterization of mlo gene … · 2020. 2. 3. · (mdmlo1), pepper...
TRANSCRIPT
-
1
Genome-Wide Identification and Characterization of MLO Gene Family in Octoploid 1
Strawberry (Fragaria ×ananassa) 2
3
4
Ronald R. Tapia1, Christopher R. Barbey2, Saket Chandra1, Kevin M. Folta2, Vance M. 5
Whitaker1 and Seonghee Lee1† 6
7
8
1Department of Horticultural Sciences, University of Florida, IFAS Gulf Coast Research and 9
Education Center, Wimauma, FL 33598, USA. 10
2Department of Horticultural Sciences, University of Florida, 1301 Fifield Hall, PO Box 110690, 11
Gainesville, FL 32611, USA. 12
13
†Corresponding author: 14
Seonghee Lee 15
Tel: (813) 419-6611 16
Fax: (813) 634-0001 17
Email: [email protected] 18
19
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
2
Abstract 20
Powdery mildew (PM) caused by Podosphaera aphanis is a major fungal disease in 21
cultivated strawberry. Mildew Resistance Locus O (MLO) is a gene family described for having 22
conserved seven-transmembrane domains. Induced loss-of-function in specific MLO genes can 23
confer durable and broad resistance against PM pathogens. However, the underlying biological 24
role of MLO genes in strawberry is still unknown. In the present study, the genomic structure of 25
MLO genes were characterized in both diploid (Fragaria vesca) and octoploid strawberry 26
(Fragaria ×ananassa), and the potential sources of MLO-mediated susceptibility were identified. 27
Twenty MLO-like sequences were identified in F. vesca, with sixty-eight in F. ×ananassa. 28
Phylogenetic analysis divides strawberry MLO genes into eight different clades, in which three 29
FveMLO and ten FaMLO genes were grouped together with the functionally known MLO 30
susceptibility. Out of ten FaMLO genes, FaMLO17-2 and FaMLO17-3 showed the highest 31
similarity to the known susceptibility MLO proteins. Gene expression analysis of FaMLO genes 32
was conducted using a multi-parental segregating population. Three expression quantitative trait 33
loci (eQTL) were substantially associated with MLO transcript levels in mature fruits, suggesting 34
discrete genetic control of susceptibility. These results are a critical first step in understanding 35
allele function of MLO genes, and are necessary for further genetic studies of PM resistance in 36
cultivated strawberry. 37
Keywords: Transmembrane domain, eQTL, Disease resistance, Rosaceae, RNA sequencing, 38
Powdery mildew disease 39
40
41
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
3
Introduction 42
Powdery Mildew (PM) is a major fungal plant disease affecting many economically 43
important cereal and horticultural crops, including strawberry. The causal agent of PM in 44
strawberry is the obligate parasite Podosphaera aphanis (Wallr.) (former Sphaerotheca 45
macularis f. sp. fragariae)1. PM primarily affects the leaf and depending on severity can also 46
affect other organs above ground2. Initially, a white powdery mycelium develops on the 47
underside of the leaves, followed by leaf upward curling while severe leaf infection can cause 48
burning at leaf margins3. Infected flower and fruits, on the other hand, can cause fruit 49
deformation and delayed ripening4. PM is widespread in many strawberry growing regions 50
worldwide, such that either open-field or high-tunnel growing system can experience severe 51
yield losses when infected field are left untreated5. Most farmers rely heavily on multiple 52
antimicrobial chemistry applications to manage PM in the field. Hence, developing new cultivars 53
with improved resistance against powdery mildew is necessary to significantly reduce reliance to 54
anti-fungal applications and promote sustainable agriculture. 55
The Mildew Resistance Locus O (MLO) gene family is present in several crop species and 56
was described for having conserved seven-transmembrane (TM) and C-terminal calmodulin-57
binding (CaMB) domains6,7. The MLO gene family has been the focus of attention in many crop 58
species because transgenic downregulation or elimination of specific endogenous MLO genes 59
has led to PM resistance8. However, alteration of MLO gene sequences can trigger negative 60
phenotypic effects including premature leaf chlorosis, altered root growth and pollen tube 61
germination9. 62
The first MLO-based resistance trait was first characterized in barley (HvMLO), where a 63
loss-of-function mutation in an MLO gene conferred broad resistance against PM pathogens10. 64
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
4
This discovery led to subsequent comparative genomic studies of MLO gene families in several 65
plant species to find suitable candidate genes. In the model plant Arabidopsis thaliana, three 66
MLO genes (AtMLO2, AtMLO6 and AtMLO12) were functionally characterized to confer PM 67
susceptibility11,12. Expression analysis of these homologs upon pathogen challenge suggested 68
functional redundancy. The Atmlo2 single mutant has only partial resistance while triple mutants 69
(Atmlo2, Atmlo6 and Atmlo12) have full resistance against PM pathogens13. Identification of the 70
functional MLO genes greatly relies on the availability of genetic information, therefore the 71
development of reference genomes for several crop species provides an opportunity to identify 72
potential MLO orthologs which can be used for functional studies. 73
MLO genes have been genetically characterized across many crops including apple 74
(MdMLO1), pepper (CaMLO2) , rose (RhMLO1), grapevine (VvMLO13), melon (CmMLO2), pea 75
(PsMLO1), tobacco (NtMLO2), tomato (SlMLO1), rice (OsMLO1), corn (ZmMLO1) and wheat 76
(TaMLO1)8,14,15 . Recently, targeted-genome mutation of the SlMLO1 gene resulted in the 77
development of PM-resistant Slmlo1 tomato variety16. The durable and broad-spectrum 78
effectiveness of MLO-based resistance could provide a useful target for strawberry improvement 79
to PM disease. 80
The modern cultivated strawberry (Fragaria ×ananassa) is an allo-octoploid (2n=8x=56) 81
resulting from hybridization between a Chilean strawberry (F. chiloensis) and a North American 82
native strawberry (F. virginiana)17. Further domestication of F. ×ananassa produced large and 83
tasty berries that have become one of the world’s most widely grown fruit crops. In 2010, the 84
genome of the diploid progenitor species F. vesca was sequenced18 and has been used as an 85
excellent genetic resource towards molecular marker development19 and for gene-trait 86
association studies in Fragaria species20-22. However, the F. ×ananassa genome is far more 87
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
5
complicated than its diploid progenitor, as diploid species contain two copies for each gene 88
whereas octoploid species may possess up to eight. Recently, a new F. ×ananassa reference 89
genome was developed, and the complex evolutionary origin of octoploid strawberry has been 90
deciphered23. This new reference genome sequence will serve as powerful genetic resource to 91
unravel complexity of octoploid strawberry genome for gene-traits association studies including 92
causal MLO genes in strawberry breeding programs. 93
The goals of this study were to characterize MLO gene family in diploid (F. vesca) and 94
octoploid (F. ×ananassa) strawberries, compare their gene structures, compare transcript levels 95
throughout the plant and between individuals, and identify potential MLO genes associated with 96
PM resistance and fruit ripening in octoploid strawberry. The data reported here are a critical 97
first step in understanding allele function of strawberry MLO genes which will be useful for 98
future genomics and functional studies. 99
100
Results 101
Genome-wide identification of MLO gene family in diploid and octoploid strawberry 102
Using the Arabidopsis MLO protein sequence as a BLAST query, 20 putative MLO genes 103
were identified using the latest diploid genome annotation F. vesca v4.0.a124. These 20 MLO 104
genes were renamed FveMLO1 through FveMLO20 based on their ordered chromosomal 105
positions (Fig. 1A, Table S1a). Predicted proteins ranged between 200 to 903 aa with an average 106
of 524 aa. Three truncated MLO proteins were identified as putative pseudogenes (Table S2). 107
The number of predicted MLO genes in diploid strawberry agrees with the previous 108
characterization of strawberry MLO genes using the first F. vesca draft genome25. 109
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
6
The FveMLO genes identified in F. vesca were then used to identify gene orthologs in the 110
octoploid strawberry annotated genome F. ×ananassa v1.0.a123. Analysis revealed 68 predicted 111
MLO genes across 28 chromosomes (Fig. 1B, Table S1b). Of 20 FveMLO genes, 17 were 112
matched with high amino acid sequence identity to a putative ortholog, however, conserved 113
sequences could not be found for FveMLO6, FveMLO7 or FveMLO8. Based on their putative 114
orthology to F. vesca, the octoploid MLO genes were named FaMLO1 through FaMLO20, 115
respectively. To distinguish homoeologous MLO genes, FaMLO1-1 to FaMLO1-4 were used for 116
each FveMLO1 homolog. Predicted MLO proteins have amino acid sequence lengths ranging 117
between 144 and 2,365 with an average of 560 amino acids (Table S2). 118
119
Gene structure and synteny analysis of octoploid strawberry MLO genes 120
Twenty FveMLO genes were distributed randomly across six chromosomes of the diploid 121
F. vesca genome: one in Fvb2, two in Fvb7, four in Fvb5, seven in Fvb3 and three in Fvb1 and 122
Fvb6 (Fig. 1, Table S1a). The coding DNA sequence composition ranged from 1 to 16 exons 123
(Fig. S1-A). In the octoploid genome, 68 FaMLO genes were distributed across every 124
chromosome except Fvb4-1 and Fvb4-3 (Fig. 1B, Table S1b). The intron-exon structures of 125
FaMLO genes have more variation as compared with its diploid progenitor F. vesca, with coding 126
DNA sequence (CDS) composition ranging from 1 to 23 exons (Fig. S1-B). Despite this 127
structure complexity, many FaMLO genes demonstrated high homology of DNA sequence with 128
the diploid progenitor, F. vesca (Fig. S2). Two additional FaMLO sequences in chromosome 4-129
2 (FaMLO19-4) and 4-4 (FaMLO12-7) were identified in F. ×ananassa, which are not present in 130
F. vesca genome (Fig. 1A-B). 131
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
7
The four subgenomes in the modern cultivated strawberry are a result of allo-132
polyploidization with four specific diploid progenitor genomes, which have subsequently 133
undergone substantial subgenome conversion23. To elaborate on this recent discovery, we display 134
synteny networks for putative MLO genes between diploid (F. vesca) and octoploid (F. 135
×ananassa) strawberry (Fig. 2). Most putative FaMLO orthologs co-localized in the same 136
chromosome of F. vesca genome. Additional FaMLO orthologs of FveMLO12 and FveMLO19 137
were identified in subgenomes Fvb4-4 and Fvb4-2, respectively. Meanwhile, no orthologs were 138
found for FveMLO5, FveMLO6 and FveMLO7 in the F. ×ananassa genome. Variants in copy 139
number of homoeologous genes for each FaMLO were identified with two homeologs each for 140
FaMLO18 and FaMLO19, three for FaMLO1 and FaMLO9, four for FaMLO2, FaMLO3, 141
FaMLO5, FaMLO20 and FaMLO10 through FaMLO17, and five for FaMLO4 (Fig. 2). The 142
variation in the distribution of MLO genes showed a wide diversity in the genome composition 143
of octoploid strawberry. Furthermore, there were at least two FaMLO homeologs with 80% or 144
more sequence similarity with FveMLO orthologs (Fig. S2). 145
146
Domain organization and structure characterization of octoploid strawberry MLO genes 147
The MLO gene was first identified in barley and characterized as membrane protein with 148
seven transmembrane (TM) domains and a uniquely-identified “MLO-functional domain”6. To 149
examine the conserved protein domains of the strawberry MLO genes, the deduced amino acid 150
sequence of predicted MLO proteins found in diploid and octoploid strawberries were subjected 151
to theoretical domain prediction using online software InterProScan (https://www.ebi.ac.uk) and 152
NCBI's conserved domain database (https://www.ncbi.nlm.nih.gov /Structure/cdd). Most MLO 153
proteins from either diploid or octoploid strawberry contain the conserved domain of MLO and 154
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
8
TM, which covers a large portion of the protein (Fig. 3). To predict TM domains and subcellular 155
localization of strawberry MLO proteins, CCTOP26 and WoLF PSORT27 software were used to 156
find differences in the number of TM domains and subcellular localization among FveMLO and 157
FaMLO proteins (Fig. 3, Table S2). All FveMLO proteins were predicted to localize within the 158
plasma membrane except for FveMLO7, which was predicted to localize in the extracellular 159
matrix. Out of 68 FaMLO genes, 61 were predicted to localize within plasma membrane while 160
nine were predicted to localize in other organelles: four in the chloroplast, two in ER, two in the 161
nucleus and one in Golgi bodies (Table S2). Thirteen out of 20 FveMLO proteins have seven TM 162
domains, while seven have between three to six TM domains. The FaMLO gene family has a 163
high degree of variation in TM domain composition (Table S2). Only 35 FaMLO proteins have 164
seven TM domains while the remaining FaMLO proteins have TM domains ranging between 165
zero and eight (Fig. 3). 166
The domain organization of some FaMLO proteins showed high levels of conservation 167
with the diploid ancestor species, F. vesca. For example, there were at least two homoeologs of 168
FaMLO2, FaMLO4, FaMLO5, FaMLO9, FaMLO11, FaMLO12, FaMLO15, FaMLO16, 169
FaMLO17 and FaMLO20 that showed F. vesca-like domain structures (Fig. 3). However, one to 170
three homoeologs of some FaMLO proteins including FaMLO3, FaMLO4, FaMLO5, FaMLO10, 171
FaMLO 12, FaMLO13, FaMLO414, FaMLO16, FaMLO17, FaMLO18, FaMLO19 and 172
FaMLO20 showed more diverse protein structures which caused by either protein sequence 173
truncation or extension (Fig. 3). 174
Variation in protein characteristics of FaMLO gene family, especially between 175
homoeologous proteins, could imply that some MLO proteins are unique and might not share the 176
same function. MLO proteins that were localized in the plasma membrane and contain conserved 177
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
9
transmembrane protein domains were selected for further phylogenetic analysis of functional 178
divergence. 179
180
Phylogenetic relationship analysis of MLO genes in strawberry 181
To study the evolutionary relationships between strawberry MLO genes and MLO genes 182
from other plant species, we aligned the deduced amino acid sequence of 20 MLO genes from F. 183
vesca (FveMLOs) and 68 from F. ×ananassa (FaMLOs) with previously characterized MLO 184
from Arabidopsis (AtMLOs), rice (OsMLO1), corn (ZmMLO1), barley (HvMLO), tomato 185
(SlMLO1), pepper (CaMLO2) and other rosaceous crops such as apple (MdMLOs) and peach 186
(PpMLOs). Phylogenetic analysis using MUSCLE and FastTree divided MLO genes into eight 187
different clades (Fig. 4). Functionally characterized MLO genes that are associated with PM 188
resistance from selected monocots and dicots were clustered in clade IV and V, respectively. 189
Among strawberry MLO genes, FveMLO12 and FaMLO12 together with HvMLO, OsMLO1 and 190
ZmMLO1 were grouped in clade IV while FveMLO10, FveMLO17, FveMLO20, FaMLO10, 191
FaMLO17 and FaMLO20 together with SlMLO1, CaMLO1, AtMLO2, AtMLO6 and AtMLO12 192
were grouped in clade V. Remaining putative MLO sequences were distributed to six other 193
groups. The results showed that three FveMLO and ten FaMLO proteins have a close 194
evolutionary relationship with the MLO proteins known to confer PM susceptibility. 195
To further investigate the candidate susceptibility-conferring FaMLO genes, namely 196
FaMLO10-1, FaMLO10-4, FaMLO17-1, FaMLO17-2, FaMLO17-3, FaMLO17-4, FaMLO20-1, 197
FaMLO20-2, FaMLO20-3 and FaMLO20-4, their deduced amino acid sequences were aligned 198
with their orthologs from F. vesca and AtMLO2, AtMLO6 and AtMLO12 from Arabidopsis (Fig. 199
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
10
S3). We identified conserved domains including seven TM, CaMB and two C-terminal protein (I 200
and II) domains of MLO genes7,28. FaMLO17-2, FaMLO17-3, FaMLO20-1, FaMLO20-2, 201
FaMLO20-3 and FaMLO20-4 proteins possessed seven TM domains and conserved CaMB and 202
C-terminal I and II domains; however, FaMLO10-1, FaMLO10-4, FaMLO17-2 and FaMLO17-4 203
showed truncated protein sequences at C-terminal end, resulting in the loss of these conserved 204
domains. Among candidate MLO proteins, FaMLO17-2 and FaMLO17-3 protein sequences are 205
most identical to known susceptibility-conferring Arabidopsis MLO proteins, and therefore could 206
have potential association with PM resistance in octoploid strawberry. 207
208
Expression profiling of FaMLO genes in cultivated strawberry 209
Expression patterns of previously characterized MLO genes suggested diverse biological 210
functions for the MLO gene family29. To investigate functional strawberry MLO genes in fruit, 211
we performed RNA sequencing analysis to evaluate transcript profiles in selected strawberry 212
varieties, ‘Mara de Bois’, ‘Florida Elyana’, ‘Winter Dawn’, ‘Florida Radiance’ and ‘Strawberry 213
Festival’. Variation in transcript abundance was observed among MLO genes and 27 FaMLO 214
genes were constitutively expressed (>1 Transcript per Million, TPM) across samples (Fig. 5A 215
and Table S3). We found that FaMLO14 homeologs had the highest fruit transcript abundance 216
among FaMLO genes. Meanwhile, FaMLO1, FaMLO13 and FaMLO15-18, including their 217
homeologs, were not detected in all varieties (Table S3). MLO genes clustered in clade V also 218
showed variation in gene expression among them and between their homoeologous genes. 219
Transcripts from at least two homeologs of FaMLO10, FaMLO17 and FaMLO20 were detected 220
suggesting that genes are functional. To characterize MLO transcript accumulation patters in 221
octoploid strawberry, raw data from the strawberry gene expression atlas study30 were re-222
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
11
assembled using the recently published octoploid strawberry genome23. The steady-state 223
accumulation of several MLO transcripts are specific to different tissues, with MLO genes such 224
as FaMLO1 and FaMLO17 showing tissue-specific gene expression. The gene FaMLO17 and 225
its two homeologs are preferentially expressed in root tissues (Fig. 5B and Table S4). 226
227
Identification of eQTL associated with FaMLO transcripts in strawberry mature fruit 228
RNA-seq analysis identified FaMLO transcript level variation across fruit developmental 229
stages (Fig. 5B), supporting a potential role of MLO gene family in the fruit. To characterize 230
FaMLO fruit expression variation associated with genetic differences between individuals 231
(eQTL), we performed eQTL analysis using 61 octoploid IStraw35 genotypes and mature fruit 232
transcriptomes and identified at least three eQTL. These include maker-Fvb4-4-augustus-gene-233
48.50 (FaMLO12-7), snap_masked-Fvb3-2-processed-gene-33.29 (FaMLO5-2) and 234
snap_masked-Fvb3-3-processed-gene-28.38 (FaMLO5-3), the latter two of which are 235
homoeologous genes (Fig. 5C). Single marker analysis of MLO eQTL also shows a general 236
association between homozygous marker genotypes and lower MLO transcript levels (Fig. S4). 237
Because the ‘Camarosa’ genomic locus of each transcript is known, it could be shown that some 238
of the identified transcript eQTL coincide with the location of their corresponding MLO gene 239
locus, and hence were considered as cis-eQTL (Table 1). The most significant IStraw35 SNP 240
marker name and position for each MLO gene is provided with the eQTL phase, minor allele 241
frequency, p-value (FDR-adjusted) and heritability estimates (Table 1). 242
243
244
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
12
Discussion 245
The MLO gene family is an important target in many agricultural crops for the 246
improvement of resistance against PM pathogens. The previous characterization identified that 247
some MLO genes are involved in PM susceptibility, and that loss-of-function of those genes can 248
confer durable and broad-spectrum resistance. In this study, it was found that variants of F. vesca 249
MLO genes and F. ×ananassa MLO genes were distributed across the genomes. However, we 250
observed copy number variation of FaMLO homeologs showing differences in composition 251
among subgenomes, supporting the distinct origin of each subgenome during the evolution of 252
octoploid strawberry29. The unique FaMLO sequences in chromosome four (Fig. 1B and Fig. 2) 253
that are not present in F. vesca might suggest an acquisition of chromosome segments from other 254
diploid progenitors. Also, we did not find any significant homologs for FveMLO6, FveMLO7 255
and FveMLO8 using the F. ×ananassa ‘Camarosa’ reference genome (Fig. 1B). The loss of 256
genes could be evolutionary significant in the development of polyploid species31 or other 257
possible functional divergence during domestication. Additional octoploid reference genomes 258
would explain such phenomena, as the chromosomal localization of FaMLO genes and possible 259
misorientation in genome sequence for some subgenomes can be identified. 260
The average MLO protein length (~500 amino acids) was similar in both F. vesca and F. 261
×ananassa, although few FaMLO proteins had more structural variations (SV) as compared with 262
other homologous genes (Fig. 3). SV is defined as a change in genome sequence such as 263
deletion, insertion and inversion caused by mutations which in turn can affect biological gene 264
function32. Furthermore, SV between homeologs of FaMLO10 and FaMLO12 causes changes in 265
their gene expression profile (Fig. 5A-B) and subcellular localization (Table S2). SV between 266
homeologs might result in different gene regulation of gene expression and consequently lead to 267
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
13
novel functions33. Diverse gene expression patterns among FaMLO genes across different 268
tissues (Fig. 5B) suggested that MLO proteins might be specialized for different biological 269
responses. This may be analogous to the unique expression patterns observed for Arabidopsis 270
MLO genes, where transcription of each MLO gene is distinct and regulated differently by 271
various biotic and abiotic stimuli34 272
In the present study, we found that number of the TM domains of FveMLOs and FaMLOs 273
varied between zero to eight. This variation was also observed in previous genome-wide MLO 274
studies in other Rosaceous crops such as apple and peach35. Phylogenetic analysis grouped 275
strawberry MLO genes into eight different clades, which is consistent with previously classified 276
MLO gene families in diploid strawberry25 and other Rosaceous crops35. Functional MLO genes 277
associated with PM resistance are grouped in clade IV and V for monocots and dicots, 278
respectively13. Interestingly, we found FveMLO12, FaMLO12-1, FaMLO12-2 and FaMLO12-4 279
grouped together in clade IV with previously characterized HvMLO, OsMLO, ZmMLO1 and 280
PpMLO12 35. In the present study, we found at least three clade V FveMLO and FaMLO genes, 281
which are potential candidate genes involved in PM resistance. MLO-based resistance to the PM 282
pathogen P. aphanis in a strawberry was successfully tested through RNAi-induced gene 283
silencing of FaMLO genes using antisense MLO from peach (PpMLO1)36. In Arabidopsis, 284
mutation of AtMLO2 showed only partial resistance while complete resistance was obtained after 285
knocking-out both AtMLO6 and AtMLO1237. Redundancy of gene function among homologs 286
could arise from gene duplication for adaptation to varying selection pressures and changing 287
environments38,39. High gene copy-number is commonly observed in polyploid crop species due 288
to the presence of homoeologous genes, posing a considerable challenge for determining the 289
biological contribution of specific genes. The genes FaMLO17-2 and FaMLO17-3 showed the 290
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
14
highest sequence identity with known Arabidopsis MLO genes, suggesting that they might 291
contribute to strawberry MLO susceptibility. Overall, we identified a total of 10 FaMLO genes 292
with high homology to functional disease-susceptible MLO, representing candidate genes for 293
breeding new cultivars with improved PM resistance. 294
Despite considerable research into MLO as a susceptibility gene, the endogenous function 295
of the MLO family remains largely unknown. It appears most likely that specific MLO genes 296
have diverse and specialized functions29. The diverse pattern of expression for FaMLO genes in 297
different fruit developmental stages (Fig. 5B), and the presence of certain signal transduction 298
domains, supports one possible role for MLO as a ripening regulator. The conserved CaMB 299
domain in FaMLO gene family could be a binding site for CaM-dependent modulation during 300
fruit ripening. Calmodulin (CaM) is a well-characterized protein known as Ca2+ sensor that 301
triggers response to different developmental and biotic stimuli7,40. A previously-studied CaMB 302
domain-containing gene family in tomato showed fruit-ripening regulation activity, via 303
developmental network interaction with Ca2+ and calmodulin40. Furthermore, generated ripening 304
mutant altered expression of those genes indicating its important role in tomato fruit 305
development. It is possible that the conserved CaMB domain in strawberry MLO gene family, 306
together with CaM and other unknown regulators, could play a similar role in the fruit-ripening 307
process. Findings from this study presented the functional diversity of MLO gene family, 308
although further studies are needed to validate the integral role of strawberry MLO genes in 309
response to biotic and developmental stimuli. 310
311
312
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
15
Conclusions 313
Here, we identified a total of 20 MLO homologues in F. vesca and 68 in F.×ananassa. 314
Three FveMLO genes and ten FaMLO genes were clustered with previously characterized MLO 315
genes known to be in PM resistance/susceptibility in other plant species. The deduced amino 316
acid sequences of putative strawberry MLO genes showed conserved protein characteristics, 317
including transmembrane and calmodulin-binding domains that have been previously described. 318
The considerable amino acid-level variation between MLO homoeologous copies was observed, 319
suggesting possible non-redundant functions of MLOs in different subgenomes. FaMLO17-2 and 320
FaMLO17-3 are the most identical to functionally characterized MLO genes associated with PM 321
susceptibility in other plants. These genes could be functionally characterized via CRISPR gene 322
editing in future studies. eQTL analysis also identified MLO genes whose differential 323
transcription is governed by genetic segregation, which could be an indication of functional 324
diversity. These findings support and expand upon the potential roles for MLO-regulated fruit 325
development. Taken together, these data are a critical first step in understanding the allele 326
function of straberry MLO gene family, and should be useful for future functional studies to 327
better understand their role on powdery mildew resistance in strawberry. 328
329
Materials and Methods 330
Identification of the strawberry MLO genes in diploid and octoploid strawberry 331
To identify MLO gene orthologs in diploid strawberry, F. vesca, Arabidopsis MLO genes 332
were searched to the latest F. vesca genome (v4.0.a1)24. Consequently, each FveMLO gene 333
identified was used to search for predicted MLO genes in octoploid strawberry using the 334
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
16
octoploid strawberry reference genome (F. ×ananassa v1.0.a1)23. The deduced diploid and 335
octoploid strawberry protein sequences were validated by reciprocal BLAST searches obtained 336
from NCBI data sets of Arabidopsis reference genome. 337
338
Gene structure and synteny analysis of strawberry MLO genes 339
The chromosomal localization of each predicted MLO genes in F. vesca and F. 340
×ananassa was identified using available information in GDR database and visualized using 341
MapChart 2.3 software41. Gene structure featuring introns, exons and UTR of predicted MLO 342
genes were constructed using Gene Structure Display Server 2.0 (GSDS) 343
(http://gsds.cbi.pku.edu.cn/)42. Pairwise comparisons of coding DNA sequence (CDS) between 344
predicted FveMLO and FaMLO genes were obtained using ClustalW2 multiple sequence 345
alignment via Geneious software43 followed by heat map visualization to determine closely 346
related MLO genes using R-package “Lattice”44. Synteny analysis of MLO genes between F. 347
vesca and F. ×ananassa were summarized using R-package “Circlize”45. 348
349
Phylogenetic analysis of strawberry and other plant MLO genes 350
The protein sequences of MLO genes from F. vesca and F. ×ananassa were aligned to 351
available sequences and phylogenetic relationship was constructed using FastTree consensus tree 352
protein alignment via Geneious Tree Builder software43. Phylogenetic tree was constructed by 353
adding 15 MLO genes from Arabidopsis, six MLO from other Rosaceous crops, Malus 354
domesticus (MdMLOs) and Prunus persica (PpMLOs), barley (HvMLO), corn (ZmMLO1), rice 355
(OsMLO1), tomato (SlMLO1) and pepper (CaMLO2). 356
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
17
Protein characterization and domain prediction 357
The deduced amino acid sequences of putative FveMLO and FaMLO genes were 358
analyzed by different prediction software to identify functional domains and determine protein 359
topologies and sub-cellular localizations. Functional MLO domain of protein sequences was 360
predicted using CDD: NCBI's conserved domain database (https://www.ncbi.nlm.nih.gov 361
/Structure/cdd)46. Protein topology and number of transmembrane domains were predicted using 362
online software CCTOP Prediction Server26 while protein sub-cellular localization was analyzed 363
using WoLF PSORT program27. Default setting was used to run for all prediction software. 364
Visualization of proteins domains were constructed using IBS 1.0.3 (Illustrator for Biological 365
Sequences) program47. To analyze conserved amino acids of MLO genes associated with PM 366
resistance, protein sequences of FveMLO10, FveMLO17 and FveMLO20 from F. vesca and 367
FaMLO10, FaMLO17 and FaMLO20 from F. ×ananassa were aligned against functionally 368
characterized AtMLO2, AtMLO6 and AtMLO12 from Arabidopsis using MultAlin software 369
(http://multalin.toulouse.inra.fr/multalin/ )48. 370
371
Expression profile of putative strawberry MLO genes 372
To examine transcript accumulation patterns of putative strawberry MLO genes, we 373
performed RNA-seq analysis in selected strawberry cultivars including ‘Florida Elyana’, ‘Mara 374
de Bois’, ‘Florida Radiance’, ‘Strawberry Festival’ and ‘Winter Dawn’. Also, RNA-seq libraries 375
from various ‘Camarosa’ tissues30 with the study reference PRJEB12420 were downloaded from 376
the European Nucleotide Archive (https://www.ebi.ac.uk/ena). The complete 54 libraries RNA-377
seq experiment consisted of six independent green receptacle libraries, six white receptacle 378
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
18
libraries, six turning receptacle libraries, six red receptacle libraries, three root libraries, three 379
leaf libraries, and six achene libraries each for all corresponding fruit stages. For both libraries, 380
raw RNA-seq reads were assembled to the ‘Camarosa’ reference genome using CLC Genomic 381
Workbench 11 (mismatch cost of 2, insertion cost of 3, deletion cost of 3, length fraction of 0.8, 382
similarity fraction of 0.8, 1 maximum hit per read). Reads that mapped equally well to more than 383
one locus were discarded from the analysis. RNA-seq counts were quantified in Transcripts Per 384
Million (TPM). 385
386
Plant materials for eQTL analysis 387
Three pedigree-connected and segregating strawberry populations were created from 388
crosses ‘Florida Elyana’ × ‘Mara de Bois’, ‘Florida Radiance’ × ‘Mara des Bois’, and 389
‘Strawberry Festival’ × ‘Winter Dawn’ (Figure S7). These cultivars and 54 progenies were 390
selected for RNA-seq and SNP genotyping analysis and were used to identify eQTL. 391
392
Fruit transcriptomes analysis of strawberry MLO genes for eQTL analysis 393
Sixty-one fruit transcriptomes were sequenced via Illumina paired-end RNA-seq (Avg. 394
65million reads, 2x100bp), and consisted of parents and progeny from crosses of ‘Florida 395
Elyana’ × ‘Mara de Bois’, ‘Florida Radiance’ × ‘Mara des Bois’, and ‘Strawberry Festival’ × 396
‘Winter Dawn’. Reads were trimmed and mapped to the F. ×ananassa octoploid ‘Camarosa’ 397
annotated genome using CLC Genomic Workbench 11 (mismatch cost of 2, insertion cost of 3, 398
deletion cost of 3, length fraction of 0.8, similarity fraction of 0.8, 1 maximum hit per read). 399
Reads that mapped equally well to more than one locus were discarded from the analysis. RNA-400
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
19
seq counts were calculated in Transcripts Per Million (TPM). Three-dimensional principle 401
component analysis (PCA) was performed on all RNA-seq assemblies, including two replicates 402
of ‘Mara des Bois’ fruit harvested three years apart and sequenced independently (Figure S3). 403
Transcript abundances were normalized via the Box-Cox transformation algorithm performed in 404
R49 prior to eQTL analysis. The BLAST2GO pipeline was used to annotate the full ‘Camarosa’ 405
predicted gene complement. 406
407
Genome-wide association study (GWAS) of strawberry MLO gene expression in fruit 408
The Affymetrix IStraw35 Axiom® SNP array50 was used to genotype 60 individuals, 409
including six parental lines from three independent biparental RNAseq populations. Sequence 410
variants belonging to the Poly High Resolution (PHR) and No Minor Homozygote (NMH) 411
marker classes were included for association mapping. Mono High Resolution (MHR), Off-412
Target Variant (OTV), Call Rate Below Threshold (CRBT), and Other marker quality classes, 413
were discarded and not used for mapping. Individual marker calls inconsistent with Mendelian 414
inheritance from parental lines were removed. The F. vesca physical map was used to orient 415
marker positions as current octoploid maps do not include a majority of the available IStraw35 416
markers. A genome-wide analysis study (GWAS) was performed using GAPIT v251 performed 417
in R49. MLO eQTL were evaluated for significance based on the presence of multiple co-locating 418
markers of p-value < 0.05 after false discovery rate correction for multiple comparisons. MLO 419
genes demonstrating evidence of an eQTL were then associated using the ‘Holiday’ × ‘Korona’ 420
and family ‘14.95’ octoploid genetic maps to evaluate subgenomic location. Linkage Group in 421
the family ‘14.95’ map was corresponded to the ‘Camarosa’ genome physical chromosomes 422
(Michael Hardigan, UCD, personal communication). Cis vs trans eQTL determinations were 423
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
20
made by corroborating known ‘Camarosa’ physical gene position with marker positions in the 424
’14.95’ map, and by BLAST of eQTL markers to the ‘Camarosa’ genome. 425
426
ReferencesUncategorized References 427
1. Peries, O.S. Studies on strawberry mildew, caused by Sphaerotheca macularis (Wallr. ex 428
Fries) Jaczewski*. Annals of Applied Biology 50, 211-224 (1962). 429
2. Xiao, C.L. et al. Comparison of Epidemics of Botrytis Fruit Rot and Powdery Mildew of 430
Strawberry in Large Plastic Tunnel and Field Production Systems. Plant Dis 85, 901-909 431
(2001). 432
3. Paulus, A. Fungal Disease of Strawberry. HortScience 25(1990). 433
4. Peres, N.A., Mertely. J.C. Powdery Mildew of Strawberries. Electronic Data Information 434
Source UF/IFAS Extension (2018). 435
5. Nelson M.D., G.W.D., Shaw D.V. Relative Resistance of 47 Strawberry Cultivars to 436
Powdery Mildew in California Greenhouse and Field Environments. Plant Disease 437
80(1996). 438
6. Devoto, A., Piffanelli, P., Nilsson, I., Wallin, E. Panstruga, R., & Heijne, G.V., Schulze-439
Lefert, P. Topology, Subcellular, Localization, and Sequence Diversity of the Mlo Family 440
in Plants. The Journal of Biological Chemistry 274, 13 (1999). 441
7. Kim, M.C. et al. Mlo, a modulator of plant defense and cell death, is a novel calmodulin-442
binding protein. Isolation and characterization of a rice Mlo homologue. J Biol Chem 443
277, 19304-14 (2002). 444
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
21
8. Kusch, S. & Panstruga, R. mlo-Based Resistance: An Apparently Universal "Weapon" to 445
Defeat Powdery Mildew Disease. Mol Plant Microbe Interact 30, 179-189 (2017). 446
9. Pastruga, R. Serpentine plant MLO proteins as entry portals for powdery mildew fungi. 447
Plant Signalling from Genes to Biochemistry 33(2005). 448
10. Jørgensen, I.H. Discovery, characterization and exploitation of Mlo powdery mildew 449
resistance in barley. Euphytica 63, 141-152 (1992). 450
11. Acevedo-Garcia, J. et al. The powdery mildew-resistant Arabidopsis mlo2 mlo6 mlo12 451
triple mutant displays altered infection phenotypes with diverse types of phytopathogens. 452
Sci Rep 7, 9319 (2017). 453
12. Devoto, A. et al. Molecular phylogeny and evolution of the plant-specific seven-454
transmembrane MLO family. J Mol Evol 56, 77-88 (2003). 455
13. Acevedo-Garcia, J., Kusch, S. & Panstruga, R. Magical mystery tour: MLO proteins in 456
plant immunity and beyond. New Phytol 204, 273-81 (2014). 457
14. Acevedo-Garcia, J. et al. mlo-based powdery mildew resistance in hexaploid bread wheat 458
generated by a non-transgenic TILLING approach. Plant Biotechnol J 15, 367-378 459
(2017). 460
15. Zheng, Z. et al. Loss of function in Mlo orthologs reduces susceptibility of pepper and 461
tomato to powdery mildew disease caused by Leveillula taurica. PLoS One 8, e70723 462
(2013). 463
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
22
16. Nekrasov, V. et al. Rapid generation of a transgene-free powdery mildew resistant 464
tomato by genome deletion. Sci Rep 7, 482 (2017). 465
17. Njuguna, W., Liston, A., Cronn, R., Ashman, T.-L. & Bassil, N. Insights into phylogeny, 466
sex function and age of Fragaria based on whole chloroplast genome sequencing. 467
Molecular Phylogenetics and Evolution 66, 17-29 (2013). 468
18. Shulaev, V. et al. The genome of woodland strawberry (Fragaria vesca). Nat Genet 43, 469
109-16 (2011). 470
19. Anciro, A. et al. FaRCg1: a quantitative trait locus conferring resistance to 471
Colletotrichum crown rot caused by Colletotrichum gloeosporioides in octoploid 472
strawberry. Theor Appl Genet 131, 2167-2177 (2018). 473
20. Mangandi, J. et al. Pedigree-Based Analysis in a Multiparental Population of Octoploid 474
Strawberry Reveals QTL Alleles Conferring Resistance to Phytophthora cactorum. G3 475
(Bethesda, Md.) 7, 1707-1719 (2017). 476
21. Salinas, N., Verma, S., Peres, N. & Whitaker, V.M. FaRCa1: a major subgenome-specific 477
locus conferring resistance to Colletotrichum acutatum in strawberry. Theor Appl Genet 478
132, 1109-1120 (2019). 479
22. Roach, J.A. et al. FaRXf1: a locus conferring resistance to angular leaf spot caused by 480
Xanthomonas fragariae in octoploid strawberry. Theor Appl Genet 129, 1191-201 (2016). 481
23. Edger, P.P. et al. Origin and evolution of the octoploid strawberry genome. Nat Genet 51, 482
541-547 (2019). 483
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
23
24. Edger, P.P. et al. Single-molecule sequencing and optical mapping yields an improved 484
genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity. 485
Gigascience 7, 1-7 (2018). 486
25. Miao, L.X. et al. Genomic identification, phylogeny, and expression analysis of MLO 487
genes involved in susceptibility to powdery mildew in Fragaria vesca. Genet Mol Res 488
15(2016). 489
26. Dobson, L., Reményi, I. & Tusnády, G.E. CCTOP: a Consensus Constrained TOPology 490
prediction web server. Nucleic acids research 43, W408-W412 (2015). 491
27. Horton, P. et al. WoLF PSORT: protein localization predictor. Nucleic acids research 35, 492
W585-W587 (2007). 493
28. Panstruga, R. Discovery of Novel Conserved Peptide Domains by Ortholog Comparison 494
within Plant Multi-Protein Families. Plant Molecular Biology 59, 485-500 (2005). 495
29. Nguyen, V.N.T., Vo, K.T.X., Park, H., Jeon, J.-S. & Jung, K.-H. A Systematic View of 496
the MLO Family in Rice Suggests Their Novel Roles in Morphological Development, 497
Diurnal Responses, the Light-Signaling Pathway, and Various Stress Responses. 498
Frontiers in Plant Science 7(2016). 499
30. Sánchez-Sevilla, J.F. et al. Gene expression atlas of fruit ripening and transcriptome 500
assembly from RNA-seq data in octoploid strawberry (Fragaria��ananassa). Scientific 501
Reports 7, 13737 (2017). 502
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
24
31. Jiao, Y. & Paterson, A.H. Polyploidy-associated genome modifications during land plant 503
evolution. Philosophical transactions of the Royal Society of London. Series B, 504
Biological sciences 369, 20130355 (2014). 505
32. Tao, Y., Zhao, X., Mace, E., Henry, R. & Jordan, D. Exploring and Exploiting Pan-506
genomics for Crop Improvement. Mol Plant 12, 156-169 (2019). 507
33. Mutti, J.S., Bhullar, R.K. & Gill, K.S. Evolution of Gene Expression Balance Among 508
Homeologs of Natural Polyploids. G3 (Bethesda) 7, 1225-1237 (2017). 509
34. Chen, Z. et al. Expression analysis of the AtMLO Gene Family Encoding Plant-Specific 510
Seven-Transmembrane Domain Proteins. Plant Molecular Biology 60, 583-597 (2006). 511
35. Pessina, S. et al. Characterization of the MLO gene family in Rosaceae and gene 512
expression analysis in Malus domestica. BMC Genomics 15, 618 (2014). 513
36. Jiwan, D., Roalson, E.H., Main, D. & Dhingra, A. Antisense expression of peach mildew 514
resistance locus O (PpMlo1) gene confers cross-species resistance to powdery mildew in 515
Fragaria x ananassa. Transgenic Res 22, 1119-31 (2013). 516
37. Consonni, C. et al. Conserved requirement for a plant host cell protein in powdery 517
mildew pathogenesis. Nat Genet 38, 716-20 (2006). 518
38. Leister, D. Tandem and segmental gene duplication and recombination in the evolution of 519
plant disease resistance gene. Trends Genet 20, 116-22 (2004). 520
39. Panchy, N., Lehti-Shiu, M. & Shiu, S.H. Evolution of Gene Duplication in Plants. Plant 521
Physiol 171, 2294-316 (2016). 522
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
25
40. Yang, T., Peng, H. & Bauchan, G.R. Functional analysis of tomato calmodulin gene 523
family during fruit development and ripening. Horticulture research 1, 14057-14057 524
(2014). 525
41. Voorrips, R.E. MapChart: Software for the Graphical Presentation of Linkage Maps and 526
QTLs. Journal of Heredity 93, 77-78 (2002). 527
42. Guo, A.-Y. et al. GSDS 2.0: an upgraded gene feature visualization server. 528
Bioinformatics 31, 1296-1297 (2014). 529
43. Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform 530
for the organization and analysis of sequence data. Bioinformatics (Oxford, England) 28, 531
1647-1649 (2012). 532
44. Sarkar, D. Lattice: Multivariate Data Visualization with R. Springer (2008). 533
45. Brors, B., Gu, L., Schlesner, M., Eils, R. & Gu, Z. circlize implements and enhances 534
circular visualization in R. Bioinformatics 30, 2811-2812 (2014). 535
46. Marchler-Bauer, A. et al. CDD: a Conserved Domain Database for the functional 536
annotation of proteins. Nucleic acids research 39, D225-D229 (2011). 537
47. Liu, W. et al. IBS: an illustrator for the presentation and visualization of biological 538
sequences. Bioinformatics 31, 3359-61 (2015). 539
48. Mitchell, C. MultAlin–multiple sequence alignment. Bioinformatics 9, 614-614 (1993). 540
49. Team, R.D.C. R: A Language and Environment for Statistical Computing. R Foundation 541
for Statistical Computing, Vienna, Austria (2014). 542
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
26
50. Verma, S. et al. Development and evaluation of the Axiom® IStraw35 384HT array for 543
the allo-octoploid cultivated strawberry Fragaria ×ananassa. 1156 edn 75-82 544
(International Society for Horticultural Science (ISHS), Leuven, Belgium, 2017). 545
51. Tang, Y. et al. GAPIT Version 2: An Enhanced Integrated Tool for Genomic Association 546
and Prediction. The Plant Genome 9(2016). 547
548
Acknowledgement 549
This work was supported by the Florida Strawberry Research and Education Foundation 550
(FSREF), And the ‘‘Next-generation Disease Resistance Breeding and Management Solutions 551
for Strawberry’’ under award no. 2017-51181-26833). We thank all the members of the 552
strawberry genetics and breeding program, and the strawberry molecular genetics and genomics 553
program for their technical assistance. 554
555
Author contributions 556
557
Competing Financial Interest Statement 558
The authors declare that they have no competing financial interests. 559
560
561
562
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
27
563
564
565
566
567
568
569
570
571
572
573
574
575
Figure Legends 576
Figure 1. Chromosomal localization and distribution of FveMLO (A) and FaMLO (B) genes in 577
F. vesca and F. ×ananassa genomes, respectively. The seven chromosomes of F. vesca 578
(yellow) were named as Fvb1 to Fvb7 while F. ×ananassa (blue) were named as Fvb1-1 579
to Fvb1-7, Fvb2-1 to Fvb2-7, Fvb3-1 to Fvb3-7 and Fvb4-1 to Fvb4-7, respectively to 580
indicate four subgenomes within each chromosome. Putative strawberry MLO genes 581
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
28
chromosome locations were visualized using MapChart 2.3 software41. Relative 582
chromosome size was indicated by unit, Mbp. 583
Figure 2. Synteny analysis of MLO genes between F. vesca and F. ×ananassa. Syntenic regions 584
present in each chromosome of F. vesca was filled with red, light blue, green, dark blue, 585
yellow and pink sequentially. A total of 68 connecting lines between two genomes 586
denote syntenic chromosomal regions. The F. vesca and F. ×ananassa chromosomes are 587
highlighted in orange and light blue. Chromosomes Fvb4 from F. vesca, and Fvb4-1 and 588
Fvb4-3 from F. ×ananassa were not included since no putative MLO genes was 589
identified in those regions. Relative chromosome size was indicated by unit, Mbp. 590
Circular visualization of syntenic regions between F. vesca and F. ×ananassa were 591
constructed using an R-package “Circlize”. 592
Figure 3. Domain organization of deduced MLO protein sequences of F. vesca and F. 593
×ananassa. Visualization of protein domains was constructed using IBS 1.0.3 (Illustrator 594
for Biological Sequences) program47. The positions of transmembrane (blue) and MLO 595
(yellow) domains were predicted using online CCTOP prediction server 596
(http://cctop.enzim.ttk.mta.hu/) and CDD: NCBI's conserved domain database 597
(https://www.ncbi.nlm.nih.gov /Structure/cdd). 598
Figure 4. Phylogenetic tree of MLO protein sequences of strawberry., F. vesca and F. 599
×ananassa, and other plant species such as Arabidopsis, tomato, pepper, rice, barley, 600
maize, apple and peach. Strawberry MLO genes together with other MLOs were grouped 601
into eight different clades. Unrooted phylogenetic tree of 15 AtMLOs, 20 FveMLOs, 59 602
FaMLOs, five PpMLOs, six MdMLOs, CaMLO2, SlMLO1, HvMLO, OsMLO1 and 603
ZmMLO1 was constructed by using FastTree via Geneious software. 604
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
29
Figure 5. Transcript and GWAS analysis of FaMLO genes. A) Expression profile of putative 605
strawberry MLO genes in mature fruit of selected strawberry varieties, ‘Mara de Bois’, 606
‘Elyana’, ‘Winter Dawn’, ‘Radiance’ and ‘Festival’. B) Expression profile in different 607
plant tissues e.g., leaf, roots, green (GR), turning (TR), white (WR) and red (RR) 608
receptables, and green (GA), turning (TA), white (WA) and red (RA) achenes extracted 609
from RNA-seq data published by Sánchez-Sevilla, et al (2017). The mean gene 610
expression level was normalized using Transcripts Per Million (TPM). Gene expression 611
profile was visualized with heatmap using R-package “ggplot2”. C) Manhattan plots of 612
GWAS showing significant SNPs associated with FaMLO transcript in fruits. X- axis 613
corresponds to the distribution of SNPs across seven chromosomes of F. vesca while y-614
axis shows FDR-adjusted probability values. 615
616
Table Legends 617
Table 1. Summary genome-wide association analysis of loci significantly associated with MLO 618
transcript in strawberry fruit. The most significant IStraw35 SNP marker name and 619
position for each MLO gene is provided with the eQTL phase, minor allele frequency, p-620
value (FDR-adjusted), and heritability estimates. 621
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
A
B Chromosome 1 Chromosome 2
Chromosome 3 Chromosome 4
Chromosome 5 Chromosome 6
Chromosome 7
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
FveMLO1FaMLO1-1FaMLO1-2FaMLO1-3
FveMLO5FaMLO5-1FaMLO5-2FaMLO5-3FaMLO5-4
FveMLO11FaMLO11-1FaMLO11-2FaMLO11-3FaMLO11-4
FveMLO14FaMLO14-1FaMLO14-2FaMLO14-3FaMLO14-4
FveMLO18FaMLO18-1FaMLO18-2
FveMLO2FaMLO2-1FaMLO2-2FaMLO2-3FaMLO2-4
FveMLO3FaMLO3-1FaMLO3-2FaMLO3-3FaMLO3-4
FveMLO9FaMLO9-1FaMLO9-2FaMLO9-3
FveMLO6
FveMLO7
FveMLO8
FveMLO12FaMLO12-1FaMLO12-2FaMLO12-3FaMLO12-4FaMLO12-5FaMLO12-6FaMLO12-7
FveMLO15FaMLO15-1FaMLO15-2FaMLO15-3FaMLO15-4
FveMLO19FaMLO19-1FaMLO19-2FaMLO19-3FaMLO19-4
FveMLO20FaMLO20-1FaMLO20-2FaMLO20-3FaMLO20-4
FveMLO16FaMLO16-1FaMLO16-2FaMLO16-3FaMLO16-4
FveMLO17FaMLO17-1FaMLO17-2FaMLO17-3FaMLO17-4
FveMLO13FaMLO13-1FaMLO13-2FaMLO13-3FaMLO13-4
FveMLO10FaMLO10-1FaMLO10-2FaMLO10-3FaMLO10-4
FveMLO4FaMLO4-1FaMLO4-2FaMLO4-3FaMLO4-4FaMLO4-5
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
Clade II
Clade VI
Clade IV
Clade III
Clade V
Clade I
Clade VIIIClade VII
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764
-
FaMLO5-2
FaMLO5-3
FaMLO12-7
Linkage Group
A B C
(which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint
https://doi.org/10.1101/2020.02.03.932764