genome-wide identification and characterization of mlo gene … · 2020. 2. 3. · (mdmlo1), pepper...

34
1 Genome-Wide Identification and Characterization of MLO Gene Family in Octoploid Strawberry (Fragaria ×ananassa) Ronald R. Tapia 1 , Christopher R. Barbey 2 , Saket Chandra 1 , Kevin M. Folta 2 , Vance M. Whitaker 1 and Seonghee Lee 1† 1 Department of Horticultural Sciences, University of Florida, IFAS Gulf Coast Research and Education Center, Wimauma, FL 33598, USA. 2 Department of Horticultural Sciences, University of Florida, 1301 Fifield Hall, PO Box 110690, Gainesville, FL 32611, USA. Corresponding author: Seonghee Lee Tel: (813) 419-6611 Fax: (813) 634-0001 Email: [email protected] (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprint this version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764 doi: bioRxiv preprint

Upload: others

Post on 28-Jan-2021

1 views

Category:

Documents


0 download

TRANSCRIPT

  • 1

    Genome-Wide Identification and Characterization of MLO Gene Family in Octoploid 1

    Strawberry (Fragaria ×ananassa) 2

    3

    4

    Ronald R. Tapia1, Christopher R. Barbey2, Saket Chandra1, Kevin M. Folta2, Vance M. 5

    Whitaker1 and Seonghee Lee1† 6

    7

    8

    1Department of Horticultural Sciences, University of Florida, IFAS Gulf Coast Research and 9

    Education Center, Wimauma, FL 33598, USA. 10

    2Department of Horticultural Sciences, University of Florida, 1301 Fifield Hall, PO Box 110690, 11

    Gainesville, FL 32611, USA. 12

    13

    †Corresponding author: 14

    Seonghee Lee 15

    Tel: (813) 419-6611 16

    Fax: (813) 634-0001 17

    Email: [email protected] 18

    19

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 2

    Abstract 20

    Powdery mildew (PM) caused by Podosphaera aphanis is a major fungal disease in 21

    cultivated strawberry. Mildew Resistance Locus O (MLO) is a gene family described for having 22

    conserved seven-transmembrane domains. Induced loss-of-function in specific MLO genes can 23

    confer durable and broad resistance against PM pathogens. However, the underlying biological 24

    role of MLO genes in strawberry is still unknown. In the present study, the genomic structure of 25

    MLO genes were characterized in both diploid (Fragaria vesca) and octoploid strawberry 26

    (Fragaria ×ananassa), and the potential sources of MLO-mediated susceptibility were identified. 27

    Twenty MLO-like sequences were identified in F. vesca, with sixty-eight in F. ×ananassa. 28

    Phylogenetic analysis divides strawberry MLO genes into eight different clades, in which three 29

    FveMLO and ten FaMLO genes were grouped together with the functionally known MLO 30

    susceptibility. Out of ten FaMLO genes, FaMLO17-2 and FaMLO17-3 showed the highest 31

    similarity to the known susceptibility MLO proteins. Gene expression analysis of FaMLO genes 32

    was conducted using a multi-parental segregating population. Three expression quantitative trait 33

    loci (eQTL) were substantially associated with MLO transcript levels in mature fruits, suggesting 34

    discrete genetic control of susceptibility. These results are a critical first step in understanding 35

    allele function of MLO genes, and are necessary for further genetic studies of PM resistance in 36

    cultivated strawberry. 37

    Keywords: Transmembrane domain, eQTL, Disease resistance, Rosaceae, RNA sequencing, 38

    Powdery mildew disease 39

    40

    41

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 3

    Introduction 42

    Powdery Mildew (PM) is a major fungal plant disease affecting many economically 43

    important cereal and horticultural crops, including strawberry. The causal agent of PM in 44

    strawberry is the obligate parasite Podosphaera aphanis (Wallr.) (former Sphaerotheca 45

    macularis f. sp. fragariae)1. PM primarily affects the leaf and depending on severity can also 46

    affect other organs above ground2. Initially, a white powdery mycelium develops on the 47

    underside of the leaves, followed by leaf upward curling while severe leaf infection can cause 48

    burning at leaf margins3. Infected flower and fruits, on the other hand, can cause fruit 49

    deformation and delayed ripening4. PM is widespread in many strawberry growing regions 50

    worldwide, such that either open-field or high-tunnel growing system can experience severe 51

    yield losses when infected field are left untreated5. Most farmers rely heavily on multiple 52

    antimicrobial chemistry applications to manage PM in the field. Hence, developing new cultivars 53

    with improved resistance against powdery mildew is necessary to significantly reduce reliance to 54

    anti-fungal applications and promote sustainable agriculture. 55

    The Mildew Resistance Locus O (MLO) gene family is present in several crop species and 56

    was described for having conserved seven-transmembrane (TM) and C-terminal calmodulin-57

    binding (CaMB) domains6,7. The MLO gene family has been the focus of attention in many crop 58

    species because transgenic downregulation or elimination of specific endogenous MLO genes 59

    has led to PM resistance8. However, alteration of MLO gene sequences can trigger negative 60

    phenotypic effects including premature leaf chlorosis, altered root growth and pollen tube 61

    germination9. 62

    The first MLO-based resistance trait was first characterized in barley (HvMLO), where a 63

    loss-of-function mutation in an MLO gene conferred broad resistance against PM pathogens10. 64

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 4

    This discovery led to subsequent comparative genomic studies of MLO gene families in several 65

    plant species to find suitable candidate genes. In the model plant Arabidopsis thaliana, three 66

    MLO genes (AtMLO2, AtMLO6 and AtMLO12) were functionally characterized to confer PM 67

    susceptibility11,12. Expression analysis of these homologs upon pathogen challenge suggested 68

    functional redundancy. The Atmlo2 single mutant has only partial resistance while triple mutants 69

    (Atmlo2, Atmlo6 and Atmlo12) have full resistance against PM pathogens13. Identification of the 70

    functional MLO genes greatly relies on the availability of genetic information, therefore the 71

    development of reference genomes for several crop species provides an opportunity to identify 72

    potential MLO orthologs which can be used for functional studies. 73

    MLO genes have been genetically characterized across many crops including apple 74

    (MdMLO1), pepper (CaMLO2) , rose (RhMLO1), grapevine (VvMLO13), melon (CmMLO2), pea 75

    (PsMLO1), tobacco (NtMLO2), tomato (SlMLO1), rice (OsMLO1), corn (ZmMLO1) and wheat 76

    (TaMLO1)8,14,15 . Recently, targeted-genome mutation of the SlMLO1 gene resulted in the 77

    development of PM-resistant Slmlo1 tomato variety16. The durable and broad-spectrum 78

    effectiveness of MLO-based resistance could provide a useful target for strawberry improvement 79

    to PM disease. 80

    The modern cultivated strawberry (Fragaria ×ananassa) is an allo-octoploid (2n=8x=56) 81

    resulting from hybridization between a Chilean strawberry (F. chiloensis) and a North American 82

    native strawberry (F. virginiana)17. Further domestication of F. ×ananassa produced large and 83

    tasty berries that have become one of the world’s most widely grown fruit crops. In 2010, the 84

    genome of the diploid progenitor species F. vesca was sequenced18 and has been used as an 85

    excellent genetic resource towards molecular marker development19 and for gene-trait 86

    association studies in Fragaria species20-22. However, the F. ×ananassa genome is far more 87

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 5

    complicated than its diploid progenitor, as diploid species contain two copies for each gene 88

    whereas octoploid species may possess up to eight. Recently, a new F. ×ananassa reference 89

    genome was developed, and the complex evolutionary origin of octoploid strawberry has been 90

    deciphered23. This new reference genome sequence will serve as powerful genetic resource to 91

    unravel complexity of octoploid strawberry genome for gene-traits association studies including 92

    causal MLO genes in strawberry breeding programs. 93

    The goals of this study were to characterize MLO gene family in diploid (F. vesca) and 94

    octoploid (F. ×ananassa) strawberries, compare their gene structures, compare transcript levels 95

    throughout the plant and between individuals, and identify potential MLO genes associated with 96

    PM resistance and fruit ripening in octoploid strawberry. The data reported here are a critical 97

    first step in understanding allele function of strawberry MLO genes which will be useful for 98

    future genomics and functional studies. 99

    100

    Results 101

    Genome-wide identification of MLO gene family in diploid and octoploid strawberry 102

    Using the Arabidopsis MLO protein sequence as a BLAST query, 20 putative MLO genes 103

    were identified using the latest diploid genome annotation F. vesca v4.0.a124. These 20 MLO 104

    genes were renamed FveMLO1 through FveMLO20 based on their ordered chromosomal 105

    positions (Fig. 1A, Table S1a). Predicted proteins ranged between 200 to 903 aa with an average 106

    of 524 aa. Three truncated MLO proteins were identified as putative pseudogenes (Table S2). 107

    The number of predicted MLO genes in diploid strawberry agrees with the previous 108

    characterization of strawberry MLO genes using the first F. vesca draft genome25. 109

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 6

    The FveMLO genes identified in F. vesca were then used to identify gene orthologs in the 110

    octoploid strawberry annotated genome F. ×ananassa v1.0.a123. Analysis revealed 68 predicted 111

    MLO genes across 28 chromosomes (Fig. 1B, Table S1b). Of 20 FveMLO genes, 17 were 112

    matched with high amino acid sequence identity to a putative ortholog, however, conserved 113

    sequences could not be found for FveMLO6, FveMLO7 or FveMLO8. Based on their putative 114

    orthology to F. vesca, the octoploid MLO genes were named FaMLO1 through FaMLO20, 115

    respectively. To distinguish homoeologous MLO genes, FaMLO1-1 to FaMLO1-4 were used for 116

    each FveMLO1 homolog. Predicted MLO proteins have amino acid sequence lengths ranging 117

    between 144 and 2,365 with an average of 560 amino acids (Table S2). 118

    119

    Gene structure and synteny analysis of octoploid strawberry MLO genes 120

    Twenty FveMLO genes were distributed randomly across six chromosomes of the diploid 121

    F. vesca genome: one in Fvb2, two in Fvb7, four in Fvb5, seven in Fvb3 and three in Fvb1 and 122

    Fvb6 (Fig. 1, Table S1a). The coding DNA sequence composition ranged from 1 to 16 exons 123

    (Fig. S1-A). In the octoploid genome, 68 FaMLO genes were distributed across every 124

    chromosome except Fvb4-1 and Fvb4-3 (Fig. 1B, Table S1b). The intron-exon structures of 125

    FaMLO genes have more variation as compared with its diploid progenitor F. vesca, with coding 126

    DNA sequence (CDS) composition ranging from 1 to 23 exons (Fig. S1-B). Despite this 127

    structure complexity, many FaMLO genes demonstrated high homology of DNA sequence with 128

    the diploid progenitor, F. vesca (Fig. S2). Two additional FaMLO sequences in chromosome 4-129

    2 (FaMLO19-4) and 4-4 (FaMLO12-7) were identified in F. ×ananassa, which are not present in 130

    F. vesca genome (Fig. 1A-B). 131

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 7

    The four subgenomes in the modern cultivated strawberry are a result of allo-132

    polyploidization with four specific diploid progenitor genomes, which have subsequently 133

    undergone substantial subgenome conversion23. To elaborate on this recent discovery, we display 134

    synteny networks for putative MLO genes between diploid (F. vesca) and octoploid (F. 135

    ×ananassa) strawberry (Fig. 2). Most putative FaMLO orthologs co-localized in the same 136

    chromosome of F. vesca genome. Additional FaMLO orthologs of FveMLO12 and FveMLO19 137

    were identified in subgenomes Fvb4-4 and Fvb4-2, respectively. Meanwhile, no orthologs were 138

    found for FveMLO5, FveMLO6 and FveMLO7 in the F. ×ananassa genome. Variants in copy 139

    number of homoeologous genes for each FaMLO were identified with two homeologs each for 140

    FaMLO18 and FaMLO19, three for FaMLO1 and FaMLO9, four for FaMLO2, FaMLO3, 141

    FaMLO5, FaMLO20 and FaMLO10 through FaMLO17, and five for FaMLO4 (Fig. 2). The 142

    variation in the distribution of MLO genes showed a wide diversity in the genome composition 143

    of octoploid strawberry. Furthermore, there were at least two FaMLO homeologs with 80% or 144

    more sequence similarity with FveMLO orthologs (Fig. S2). 145

    146

    Domain organization and structure characterization of octoploid strawberry MLO genes 147

    The MLO gene was first identified in barley and characterized as membrane protein with 148

    seven transmembrane (TM) domains and a uniquely-identified “MLO-functional domain”6. To 149

    examine the conserved protein domains of the strawberry MLO genes, the deduced amino acid 150

    sequence of predicted MLO proteins found in diploid and octoploid strawberries were subjected 151

    to theoretical domain prediction using online software InterProScan (https://www.ebi.ac.uk) and 152

    NCBI's conserved domain database (https://www.ncbi.nlm.nih.gov /Structure/cdd). Most MLO 153

    proteins from either diploid or octoploid strawberry contain the conserved domain of MLO and 154

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 8

    TM, which covers a large portion of the protein (Fig. 3). To predict TM domains and subcellular 155

    localization of strawberry MLO proteins, CCTOP26 and WoLF PSORT27 software were used to 156

    find differences in the number of TM domains and subcellular localization among FveMLO and 157

    FaMLO proteins (Fig. 3, Table S2). All FveMLO proteins were predicted to localize within the 158

    plasma membrane except for FveMLO7, which was predicted to localize in the extracellular 159

    matrix. Out of 68 FaMLO genes, 61 were predicted to localize within plasma membrane while 160

    nine were predicted to localize in other organelles: four in the chloroplast, two in ER, two in the 161

    nucleus and one in Golgi bodies (Table S2). Thirteen out of 20 FveMLO proteins have seven TM 162

    domains, while seven have between three to six TM domains. The FaMLO gene family has a 163

    high degree of variation in TM domain composition (Table S2). Only 35 FaMLO proteins have 164

    seven TM domains while the remaining FaMLO proteins have TM domains ranging between 165

    zero and eight (Fig. 3). 166

    The domain organization of some FaMLO proteins showed high levels of conservation 167

    with the diploid ancestor species, F. vesca. For example, there were at least two homoeologs of 168

    FaMLO2, FaMLO4, FaMLO5, FaMLO9, FaMLO11, FaMLO12, FaMLO15, FaMLO16, 169

    FaMLO17 and FaMLO20 that showed F. vesca-like domain structures (Fig. 3). However, one to 170

    three homoeologs of some FaMLO proteins including FaMLO3, FaMLO4, FaMLO5, FaMLO10, 171

    FaMLO 12, FaMLO13, FaMLO414, FaMLO16, FaMLO17, FaMLO18, FaMLO19 and 172

    FaMLO20 showed more diverse protein structures which caused by either protein sequence 173

    truncation or extension (Fig. 3). 174

    Variation in protein characteristics of FaMLO gene family, especially between 175

    homoeologous proteins, could imply that some MLO proteins are unique and might not share the 176

    same function. MLO proteins that were localized in the plasma membrane and contain conserved 177

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 9

    transmembrane protein domains were selected for further phylogenetic analysis of functional 178

    divergence. 179

    180

    Phylogenetic relationship analysis of MLO genes in strawberry 181

    To study the evolutionary relationships between strawberry MLO genes and MLO genes 182

    from other plant species, we aligned the deduced amino acid sequence of 20 MLO genes from F. 183

    vesca (FveMLOs) and 68 from F. ×ananassa (FaMLOs) with previously characterized MLO 184

    from Arabidopsis (AtMLOs), rice (OsMLO1), corn (ZmMLO1), barley (HvMLO), tomato 185

    (SlMLO1), pepper (CaMLO2) and other rosaceous crops such as apple (MdMLOs) and peach 186

    (PpMLOs). Phylogenetic analysis using MUSCLE and FastTree divided MLO genes into eight 187

    different clades (Fig. 4). Functionally characterized MLO genes that are associated with PM 188

    resistance from selected monocots and dicots were clustered in clade IV and V, respectively. 189

    Among strawberry MLO genes, FveMLO12 and FaMLO12 together with HvMLO, OsMLO1 and 190

    ZmMLO1 were grouped in clade IV while FveMLO10, FveMLO17, FveMLO20, FaMLO10, 191

    FaMLO17 and FaMLO20 together with SlMLO1, CaMLO1, AtMLO2, AtMLO6 and AtMLO12 192

    were grouped in clade V. Remaining putative MLO sequences were distributed to six other 193

    groups. The results showed that three FveMLO and ten FaMLO proteins have a close 194

    evolutionary relationship with the MLO proteins known to confer PM susceptibility. 195

    To further investigate the candidate susceptibility-conferring FaMLO genes, namely 196

    FaMLO10-1, FaMLO10-4, FaMLO17-1, FaMLO17-2, FaMLO17-3, FaMLO17-4, FaMLO20-1, 197

    FaMLO20-2, FaMLO20-3 and FaMLO20-4, their deduced amino acid sequences were aligned 198

    with their orthologs from F. vesca and AtMLO2, AtMLO6 and AtMLO12 from Arabidopsis (Fig. 199

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 10

    S3). We identified conserved domains including seven TM, CaMB and two C-terminal protein (I 200

    and II) domains of MLO genes7,28. FaMLO17-2, FaMLO17-3, FaMLO20-1, FaMLO20-2, 201

    FaMLO20-3 and FaMLO20-4 proteins possessed seven TM domains and conserved CaMB and 202

    C-terminal I and II domains; however, FaMLO10-1, FaMLO10-4, FaMLO17-2 and FaMLO17-4 203

    showed truncated protein sequences at C-terminal end, resulting in the loss of these conserved 204

    domains. Among candidate MLO proteins, FaMLO17-2 and FaMLO17-3 protein sequences are 205

    most identical to known susceptibility-conferring Arabidopsis MLO proteins, and therefore could 206

    have potential association with PM resistance in octoploid strawberry. 207

    208

    Expression profiling of FaMLO genes in cultivated strawberry 209

    Expression patterns of previously characterized MLO genes suggested diverse biological 210

    functions for the MLO gene family29. To investigate functional strawberry MLO genes in fruit, 211

    we performed RNA sequencing analysis to evaluate transcript profiles in selected strawberry 212

    varieties, ‘Mara de Bois’, ‘Florida Elyana’, ‘Winter Dawn’, ‘Florida Radiance’ and ‘Strawberry 213

    Festival’. Variation in transcript abundance was observed among MLO genes and 27 FaMLO 214

    genes were constitutively expressed (>1 Transcript per Million, TPM) across samples (Fig. 5A 215

    and Table S3). We found that FaMLO14 homeologs had the highest fruit transcript abundance 216

    among FaMLO genes. Meanwhile, FaMLO1, FaMLO13 and FaMLO15-18, including their 217

    homeologs, were not detected in all varieties (Table S3). MLO genes clustered in clade V also 218

    showed variation in gene expression among them and between their homoeologous genes. 219

    Transcripts from at least two homeologs of FaMLO10, FaMLO17 and FaMLO20 were detected 220

    suggesting that genes are functional. To characterize MLO transcript accumulation patters in 221

    octoploid strawberry, raw data from the strawberry gene expression atlas study30 were re-222

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 11

    assembled using the recently published octoploid strawberry genome23. The steady-state 223

    accumulation of several MLO transcripts are specific to different tissues, with MLO genes such 224

    as FaMLO1 and FaMLO17 showing tissue-specific gene expression. The gene FaMLO17 and 225

    its two homeologs are preferentially expressed in root tissues (Fig. 5B and Table S4). 226

    227

    Identification of eQTL associated with FaMLO transcripts in strawberry mature fruit 228

    RNA-seq analysis identified FaMLO transcript level variation across fruit developmental 229

    stages (Fig. 5B), supporting a potential role of MLO gene family in the fruit. To characterize 230

    FaMLO fruit expression variation associated with genetic differences between individuals 231

    (eQTL), we performed eQTL analysis using 61 octoploid IStraw35 genotypes and mature fruit 232

    transcriptomes and identified at least three eQTL. These include maker-Fvb4-4-augustus-gene-233

    48.50 (FaMLO12-7), snap_masked-Fvb3-2-processed-gene-33.29 (FaMLO5-2) and 234

    snap_masked-Fvb3-3-processed-gene-28.38 (FaMLO5-3), the latter two of which are 235

    homoeologous genes (Fig. 5C). Single marker analysis of MLO eQTL also shows a general 236

    association between homozygous marker genotypes and lower MLO transcript levels (Fig. S4). 237

    Because the ‘Camarosa’ genomic locus of each transcript is known, it could be shown that some 238

    of the identified transcript eQTL coincide with the location of their corresponding MLO gene 239

    locus, and hence were considered as cis-eQTL (Table 1). The most significant IStraw35 SNP 240

    marker name and position for each MLO gene is provided with the eQTL phase, minor allele 241

    frequency, p-value (FDR-adjusted) and heritability estimates (Table 1). 242

    243

    244

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 12

    Discussion 245

    The MLO gene family is an important target in many agricultural crops for the 246

    improvement of resistance against PM pathogens. The previous characterization identified that 247

    some MLO genes are involved in PM susceptibility, and that loss-of-function of those genes can 248

    confer durable and broad-spectrum resistance. In this study, it was found that variants of F. vesca 249

    MLO genes and F. ×ananassa MLO genes were distributed across the genomes. However, we 250

    observed copy number variation of FaMLO homeologs showing differences in composition 251

    among subgenomes, supporting the distinct origin of each subgenome during the evolution of 252

    octoploid strawberry29. The unique FaMLO sequences in chromosome four (Fig. 1B and Fig. 2) 253

    that are not present in F. vesca might suggest an acquisition of chromosome segments from other 254

    diploid progenitors. Also, we did not find any significant homologs for FveMLO6, FveMLO7 255

    and FveMLO8 using the F. ×ananassa ‘Camarosa’ reference genome (Fig. 1B). The loss of 256

    genes could be evolutionary significant in the development of polyploid species31 or other 257

    possible functional divergence during domestication. Additional octoploid reference genomes 258

    would explain such phenomena, as the chromosomal localization of FaMLO genes and possible 259

    misorientation in genome sequence for some subgenomes can be identified. 260

    The average MLO protein length (~500 amino acids) was similar in both F. vesca and F. 261

    ×ananassa, although few FaMLO proteins had more structural variations (SV) as compared with 262

    other homologous genes (Fig. 3). SV is defined as a change in genome sequence such as 263

    deletion, insertion and inversion caused by mutations which in turn can affect biological gene 264

    function32. Furthermore, SV between homeologs of FaMLO10 and FaMLO12 causes changes in 265

    their gene expression profile (Fig. 5A-B) and subcellular localization (Table S2). SV between 266

    homeologs might result in different gene regulation of gene expression and consequently lead to 267

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 13

    novel functions33. Diverse gene expression patterns among FaMLO genes across different 268

    tissues (Fig. 5B) suggested that MLO proteins might be specialized for different biological 269

    responses. This may be analogous to the unique expression patterns observed for Arabidopsis 270

    MLO genes, where transcription of each MLO gene is distinct and regulated differently by 271

    various biotic and abiotic stimuli34 272

    In the present study, we found that number of the TM domains of FveMLOs and FaMLOs 273

    varied between zero to eight. This variation was also observed in previous genome-wide MLO 274

    studies in other Rosaceous crops such as apple and peach35. Phylogenetic analysis grouped 275

    strawberry MLO genes into eight different clades, which is consistent with previously classified 276

    MLO gene families in diploid strawberry25 and other Rosaceous crops35. Functional MLO genes 277

    associated with PM resistance are grouped in clade IV and V for monocots and dicots, 278

    respectively13. Interestingly, we found FveMLO12, FaMLO12-1, FaMLO12-2 and FaMLO12-4 279

    grouped together in clade IV with previously characterized HvMLO, OsMLO, ZmMLO1 and 280

    PpMLO12 35. In the present study, we found at least three clade V FveMLO and FaMLO genes, 281

    which are potential candidate genes involved in PM resistance. MLO-based resistance to the PM 282

    pathogen P. aphanis in a strawberry was successfully tested through RNAi-induced gene 283

    silencing of FaMLO genes using antisense MLO from peach (PpMLO1)36. In Arabidopsis, 284

    mutation of AtMLO2 showed only partial resistance while complete resistance was obtained after 285

    knocking-out both AtMLO6 and AtMLO1237. Redundancy of gene function among homologs 286

    could arise from gene duplication for adaptation to varying selection pressures and changing 287

    environments38,39. High gene copy-number is commonly observed in polyploid crop species due 288

    to the presence of homoeologous genes, posing a considerable challenge for determining the 289

    biological contribution of specific genes. The genes FaMLO17-2 and FaMLO17-3 showed the 290

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 14

    highest sequence identity with known Arabidopsis MLO genes, suggesting that they might 291

    contribute to strawberry MLO susceptibility. Overall, we identified a total of 10 FaMLO genes 292

    with high homology to functional disease-susceptible MLO, representing candidate genes for 293

    breeding new cultivars with improved PM resistance. 294

    Despite considerable research into MLO as a susceptibility gene, the endogenous function 295

    of the MLO family remains largely unknown. It appears most likely that specific MLO genes 296

    have diverse and specialized functions29. The diverse pattern of expression for FaMLO genes in 297

    different fruit developmental stages (Fig. 5B), and the presence of certain signal transduction 298

    domains, supports one possible role for MLO as a ripening regulator. The conserved CaMB 299

    domain in FaMLO gene family could be a binding site for CaM-dependent modulation during 300

    fruit ripening. Calmodulin (CaM) is a well-characterized protein known as Ca2+ sensor that 301

    triggers response to different developmental and biotic stimuli7,40. A previously-studied CaMB 302

    domain-containing gene family in tomato showed fruit-ripening regulation activity, via 303

    developmental network interaction with Ca2+ and calmodulin40. Furthermore, generated ripening 304

    mutant altered expression of those genes indicating its important role in tomato fruit 305

    development. It is possible that the conserved CaMB domain in strawberry MLO gene family, 306

    together with CaM and other unknown regulators, could play a similar role in the fruit-ripening 307

    process. Findings from this study presented the functional diversity of MLO gene family, 308

    although further studies are needed to validate the integral role of strawberry MLO genes in 309

    response to biotic and developmental stimuli. 310

    311

    312

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 15

    Conclusions 313

    Here, we identified a total of 20 MLO homologues in F. vesca and 68 in F.×ananassa. 314

    Three FveMLO genes and ten FaMLO genes were clustered with previously characterized MLO 315

    genes known to be in PM resistance/susceptibility in other plant species. The deduced amino 316

    acid sequences of putative strawberry MLO genes showed conserved protein characteristics, 317

    including transmembrane and calmodulin-binding domains that have been previously described. 318

    The considerable amino acid-level variation between MLO homoeologous copies was observed, 319

    suggesting possible non-redundant functions of MLOs in different subgenomes. FaMLO17-2 and 320

    FaMLO17-3 are the most identical to functionally characterized MLO genes associated with PM 321

    susceptibility in other plants. These genes could be functionally characterized via CRISPR gene 322

    editing in future studies. eQTL analysis also identified MLO genes whose differential 323

    transcription is governed by genetic segregation, which could be an indication of functional 324

    diversity. These findings support and expand upon the potential roles for MLO-regulated fruit 325

    development. Taken together, these data are a critical first step in understanding the allele 326

    function of straberry MLO gene family, and should be useful for future functional studies to 327

    better understand their role on powdery mildew resistance in strawberry. 328

    329

    Materials and Methods 330

    Identification of the strawberry MLO genes in diploid and octoploid strawberry 331

    To identify MLO gene orthologs in diploid strawberry, F. vesca, Arabidopsis MLO genes 332

    were searched to the latest F. vesca genome (v4.0.a1)24. Consequently, each FveMLO gene 333

    identified was used to search for predicted MLO genes in octoploid strawberry using the 334

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 16

    octoploid strawberry reference genome (F. ×ananassa v1.0.a1)23. The deduced diploid and 335

    octoploid strawberry protein sequences were validated by reciprocal BLAST searches obtained 336

    from NCBI data sets of Arabidopsis reference genome. 337

    338

    Gene structure and synteny analysis of strawberry MLO genes 339

    The chromosomal localization of each predicted MLO genes in F. vesca and F. 340

    ×ananassa was identified using available information in GDR database and visualized using 341

    MapChart 2.3 software41. Gene structure featuring introns, exons and UTR of predicted MLO 342

    genes were constructed using Gene Structure Display Server 2.0 (GSDS) 343

    (http://gsds.cbi.pku.edu.cn/)42. Pairwise comparisons of coding DNA sequence (CDS) between 344

    predicted FveMLO and FaMLO genes were obtained using ClustalW2 multiple sequence 345

    alignment via Geneious software43 followed by heat map visualization to determine closely 346

    related MLO genes using R-package “Lattice”44. Synteny analysis of MLO genes between F. 347

    vesca and F. ×ananassa were summarized using R-package “Circlize”45. 348

    349

    Phylogenetic analysis of strawberry and other plant MLO genes 350

    The protein sequences of MLO genes from F. vesca and F. ×ananassa were aligned to 351

    available sequences and phylogenetic relationship was constructed using FastTree consensus tree 352

    protein alignment via Geneious Tree Builder software43. Phylogenetic tree was constructed by 353

    adding 15 MLO genes from Arabidopsis, six MLO from other Rosaceous crops, Malus 354

    domesticus (MdMLOs) and Prunus persica (PpMLOs), barley (HvMLO), corn (ZmMLO1), rice 355

    (OsMLO1), tomato (SlMLO1) and pepper (CaMLO2). 356

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 17

    Protein characterization and domain prediction 357

    The deduced amino acid sequences of putative FveMLO and FaMLO genes were 358

    analyzed by different prediction software to identify functional domains and determine protein 359

    topologies and sub-cellular localizations. Functional MLO domain of protein sequences was 360

    predicted using CDD: NCBI's conserved domain database (https://www.ncbi.nlm.nih.gov 361

    /Structure/cdd)46. Protein topology and number of transmembrane domains were predicted using 362

    online software CCTOP Prediction Server26 while protein sub-cellular localization was analyzed 363

    using WoLF PSORT program27. Default setting was used to run for all prediction software. 364

    Visualization of proteins domains were constructed using IBS 1.0.3 (Illustrator for Biological 365

    Sequences) program47. To analyze conserved amino acids of MLO genes associated with PM 366

    resistance, protein sequences of FveMLO10, FveMLO17 and FveMLO20 from F. vesca and 367

    FaMLO10, FaMLO17 and FaMLO20 from F. ×ananassa were aligned against functionally 368

    characterized AtMLO2, AtMLO6 and AtMLO12 from Arabidopsis using MultAlin software 369

    (http://multalin.toulouse.inra.fr/multalin/ )48. 370

    371

    Expression profile of putative strawberry MLO genes 372

    To examine transcript accumulation patterns of putative strawberry MLO genes, we 373

    performed RNA-seq analysis in selected strawberry cultivars including ‘Florida Elyana’, ‘Mara 374

    de Bois’, ‘Florida Radiance’, ‘Strawberry Festival’ and ‘Winter Dawn’. Also, RNA-seq libraries 375

    from various ‘Camarosa’ tissues30 with the study reference PRJEB12420 were downloaded from 376

    the European Nucleotide Archive (https://www.ebi.ac.uk/ena). The complete 54 libraries RNA-377

    seq experiment consisted of six independent green receptacle libraries, six white receptacle 378

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 18

    libraries, six turning receptacle libraries, six red receptacle libraries, three root libraries, three 379

    leaf libraries, and six achene libraries each for all corresponding fruit stages. For both libraries, 380

    raw RNA-seq reads were assembled to the ‘Camarosa’ reference genome using CLC Genomic 381

    Workbench 11 (mismatch cost of 2, insertion cost of 3, deletion cost of 3, length fraction of 0.8, 382

    similarity fraction of 0.8, 1 maximum hit per read). Reads that mapped equally well to more than 383

    one locus were discarded from the analysis. RNA-seq counts were quantified in Transcripts Per 384

    Million (TPM). 385

    386

    Plant materials for eQTL analysis 387

    Three pedigree-connected and segregating strawberry populations were created from 388

    crosses ‘Florida Elyana’ × ‘Mara de Bois’, ‘Florida Radiance’ × ‘Mara des Bois’, and 389

    ‘Strawberry Festival’ × ‘Winter Dawn’ (Figure S7). These cultivars and 54 progenies were 390

    selected for RNA-seq and SNP genotyping analysis and were used to identify eQTL. 391

    392

    Fruit transcriptomes analysis of strawberry MLO genes for eQTL analysis 393

    Sixty-one fruit transcriptomes were sequenced via Illumina paired-end RNA-seq (Avg. 394

    65million reads, 2x100bp), and consisted of parents and progeny from crosses of ‘Florida 395

    Elyana’ × ‘Mara de Bois’, ‘Florida Radiance’ × ‘Mara des Bois’, and ‘Strawberry Festival’ × 396

    ‘Winter Dawn’. Reads were trimmed and mapped to the F. ×ananassa octoploid ‘Camarosa’ 397

    annotated genome using CLC Genomic Workbench 11 (mismatch cost of 2, insertion cost of 3, 398

    deletion cost of 3, length fraction of 0.8, similarity fraction of 0.8, 1 maximum hit per read). 399

    Reads that mapped equally well to more than one locus were discarded from the analysis. RNA-400

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 19

    seq counts were calculated in Transcripts Per Million (TPM). Three-dimensional principle 401

    component analysis (PCA) was performed on all RNA-seq assemblies, including two replicates 402

    of ‘Mara des Bois’ fruit harvested three years apart and sequenced independently (Figure S3). 403

    Transcript abundances were normalized via the Box-Cox transformation algorithm performed in 404

    R49 prior to eQTL analysis. The BLAST2GO pipeline was used to annotate the full ‘Camarosa’ 405

    predicted gene complement. 406

    407

    Genome-wide association study (GWAS) of strawberry MLO gene expression in fruit 408

    The Affymetrix IStraw35 Axiom® SNP array50 was used to genotype 60 individuals, 409

    including six parental lines from three independent biparental RNAseq populations. Sequence 410

    variants belonging to the Poly High Resolution (PHR) and No Minor Homozygote (NMH) 411

    marker classes were included for association mapping. Mono High Resolution (MHR), Off-412

    Target Variant (OTV), Call Rate Below Threshold (CRBT), and Other marker quality classes, 413

    were discarded and not used for mapping. Individual marker calls inconsistent with Mendelian 414

    inheritance from parental lines were removed. The F. vesca physical map was used to orient 415

    marker positions as current octoploid maps do not include a majority of the available IStraw35 416

    markers. A genome-wide analysis study (GWAS) was performed using GAPIT v251 performed 417

    in R49. MLO eQTL were evaluated for significance based on the presence of multiple co-locating 418

    markers of p-value < 0.05 after false discovery rate correction for multiple comparisons. MLO 419

    genes demonstrating evidence of an eQTL were then associated using the ‘Holiday’ × ‘Korona’ 420

    and family ‘14.95’ octoploid genetic maps to evaluate subgenomic location. Linkage Group in 421

    the family ‘14.95’ map was corresponded to the ‘Camarosa’ genome physical chromosomes 422

    (Michael Hardigan, UCD, personal communication). Cis vs trans eQTL determinations were 423

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 20

    made by corroborating known ‘Camarosa’ physical gene position with marker positions in the 424

    ’14.95’ map, and by BLAST of eQTL markers to the ‘Camarosa’ genome. 425

    426

    ReferencesUncategorized References 427

    1. Peries, O.S. Studies on strawberry mildew, caused by Sphaerotheca macularis (Wallr. ex 428

    Fries) Jaczewski*. Annals of Applied Biology 50, 211-224 (1962). 429

    2. Xiao, C.L. et al. Comparison of Epidemics of Botrytis Fruit Rot and Powdery Mildew of 430

    Strawberry in Large Plastic Tunnel and Field Production Systems. Plant Dis 85, 901-909 431

    (2001). 432

    3. Paulus, A. Fungal Disease of Strawberry. HortScience 25(1990). 433

    4. Peres, N.A., Mertely. J.C. Powdery Mildew of Strawberries. Electronic Data Information 434

    Source UF/IFAS Extension (2018). 435

    5. Nelson M.D., G.W.D., Shaw D.V. Relative Resistance of 47 Strawberry Cultivars to 436

    Powdery Mildew in California Greenhouse and Field Environments. Plant Disease 437

    80(1996). 438

    6. Devoto, A., Piffanelli, P., Nilsson, I., Wallin, E. Panstruga, R., & Heijne, G.V., Schulze-439

    Lefert, P. Topology, Subcellular, Localization, and Sequence Diversity of the Mlo Family 440

    in Plants. The Journal of Biological Chemistry 274, 13 (1999). 441

    7. Kim, M.C. et al. Mlo, a modulator of plant defense and cell death, is a novel calmodulin-442

    binding protein. Isolation and characterization of a rice Mlo homologue. J Biol Chem 443

    277, 19304-14 (2002). 444

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 21

    8. Kusch, S. & Panstruga, R. mlo-Based Resistance: An Apparently Universal "Weapon" to 445

    Defeat Powdery Mildew Disease. Mol Plant Microbe Interact 30, 179-189 (2017). 446

    9. Pastruga, R. Serpentine plant MLO proteins as entry portals for powdery mildew fungi. 447

    Plant Signalling from Genes to Biochemistry 33(2005). 448

    10. Jørgensen, I.H. Discovery, characterization and exploitation of Mlo powdery mildew 449

    resistance in barley. Euphytica 63, 141-152 (1992). 450

    11. Acevedo-Garcia, J. et al. The powdery mildew-resistant Arabidopsis mlo2 mlo6 mlo12 451

    triple mutant displays altered infection phenotypes with diverse types of phytopathogens. 452

    Sci Rep 7, 9319 (2017). 453

    12. Devoto, A. et al. Molecular phylogeny and evolution of the plant-specific seven-454

    transmembrane MLO family. J Mol Evol 56, 77-88 (2003). 455

    13. Acevedo-Garcia, J., Kusch, S. & Panstruga, R. Magical mystery tour: MLO proteins in 456

    plant immunity and beyond. New Phytol 204, 273-81 (2014). 457

    14. Acevedo-Garcia, J. et al. mlo-based powdery mildew resistance in hexaploid bread wheat 458

    generated by a non-transgenic TILLING approach. Plant Biotechnol J 15, 367-378 459

    (2017). 460

    15. Zheng, Z. et al. Loss of function in Mlo orthologs reduces susceptibility of pepper and 461

    tomato to powdery mildew disease caused by Leveillula taurica. PLoS One 8, e70723 462

    (2013). 463

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 22

    16. Nekrasov, V. et al. Rapid generation of a transgene-free powdery mildew resistant 464

    tomato by genome deletion. Sci Rep 7, 482 (2017). 465

    17. Njuguna, W., Liston, A., Cronn, R., Ashman, T.-L. & Bassil, N. Insights into phylogeny, 466

    sex function and age of Fragaria based on whole chloroplast genome sequencing. 467

    Molecular Phylogenetics and Evolution 66, 17-29 (2013). 468

    18. Shulaev, V. et al. The genome of woodland strawberry (Fragaria vesca). Nat Genet 43, 469

    109-16 (2011). 470

    19. Anciro, A. et al. FaRCg1: a quantitative trait locus conferring resistance to 471

    Colletotrichum crown rot caused by Colletotrichum gloeosporioides in octoploid 472

    strawberry. Theor Appl Genet 131, 2167-2177 (2018). 473

    20. Mangandi, J. et al. Pedigree-Based Analysis in a Multiparental Population of Octoploid 474

    Strawberry Reveals QTL Alleles Conferring Resistance to Phytophthora cactorum. G3 475

    (Bethesda, Md.) 7, 1707-1719 (2017). 476

    21. Salinas, N., Verma, S., Peres, N. & Whitaker, V.M. FaRCa1: a major subgenome-specific 477

    locus conferring resistance to Colletotrichum acutatum in strawberry. Theor Appl Genet 478

    132, 1109-1120 (2019). 479

    22. Roach, J.A. et al. FaRXf1: a locus conferring resistance to angular leaf spot caused by 480

    Xanthomonas fragariae in octoploid strawberry. Theor Appl Genet 129, 1191-201 (2016). 481

    23. Edger, P.P. et al. Origin and evolution of the octoploid strawberry genome. Nat Genet 51, 482

    541-547 (2019). 483

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 23

    24. Edger, P.P. et al. Single-molecule sequencing and optical mapping yields an improved 484

    genome of woodland strawberry (Fragaria vesca) with chromosome-scale contiguity. 485

    Gigascience 7, 1-7 (2018). 486

    25. Miao, L.X. et al. Genomic identification, phylogeny, and expression analysis of MLO 487

    genes involved in susceptibility to powdery mildew in Fragaria vesca. Genet Mol Res 488

    15(2016). 489

    26. Dobson, L., Reményi, I. & Tusnády, G.E. CCTOP: a Consensus Constrained TOPology 490

    prediction web server. Nucleic acids research 43, W408-W412 (2015). 491

    27. Horton, P. et al. WoLF PSORT: protein localization predictor. Nucleic acids research 35, 492

    W585-W587 (2007). 493

    28. Panstruga, R. Discovery of Novel Conserved Peptide Domains by Ortholog Comparison 494

    within Plant Multi-Protein Families. Plant Molecular Biology 59, 485-500 (2005). 495

    29. Nguyen, V.N.T., Vo, K.T.X., Park, H., Jeon, J.-S. & Jung, K.-H. A Systematic View of 496

    the MLO Family in Rice Suggests Their Novel Roles in Morphological Development, 497

    Diurnal Responses, the Light-Signaling Pathway, and Various Stress Responses. 498

    Frontiers in Plant Science 7(2016). 499

    30. Sánchez-Sevilla, J.F. et al. Gene expression atlas of fruit ripening and transcriptome 500

    assembly from RNA-seq data in octoploid strawberry (Fragaria��ananassa). Scientific 501

    Reports 7, 13737 (2017). 502

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 24

    31. Jiao, Y. & Paterson, A.H. Polyploidy-associated genome modifications during land plant 503

    evolution. Philosophical transactions of the Royal Society of London. Series B, 504

    Biological sciences 369, 20130355 (2014). 505

    32. Tao, Y., Zhao, X., Mace, E., Henry, R. & Jordan, D. Exploring and Exploiting Pan-506

    genomics for Crop Improvement. Mol Plant 12, 156-169 (2019). 507

    33. Mutti, J.S., Bhullar, R.K. & Gill, K.S. Evolution of Gene Expression Balance Among 508

    Homeologs of Natural Polyploids. G3 (Bethesda) 7, 1225-1237 (2017). 509

    34. Chen, Z. et al. Expression analysis of the AtMLO Gene Family Encoding Plant-Specific 510

    Seven-Transmembrane Domain Proteins. Plant Molecular Biology 60, 583-597 (2006). 511

    35. Pessina, S. et al. Characterization of the MLO gene family in Rosaceae and gene 512

    expression analysis in Malus domestica. BMC Genomics 15, 618 (2014). 513

    36. Jiwan, D., Roalson, E.H., Main, D. & Dhingra, A. Antisense expression of peach mildew 514

    resistance locus O (PpMlo1) gene confers cross-species resistance to powdery mildew in 515

    Fragaria x ananassa. Transgenic Res 22, 1119-31 (2013). 516

    37. Consonni, C. et al. Conserved requirement for a plant host cell protein in powdery 517

    mildew pathogenesis. Nat Genet 38, 716-20 (2006). 518

    38. Leister, D. Tandem and segmental gene duplication and recombination in the evolution of 519

    plant disease resistance gene. Trends Genet 20, 116-22 (2004). 520

    39. Panchy, N., Lehti-Shiu, M. & Shiu, S.H. Evolution of Gene Duplication in Plants. Plant 521

    Physiol 171, 2294-316 (2016). 522

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 25

    40. Yang, T., Peng, H. & Bauchan, G.R. Functional analysis of tomato calmodulin gene 523

    family during fruit development and ripening. Horticulture research 1, 14057-14057 524

    (2014). 525

    41. Voorrips, R.E. MapChart: Software for the Graphical Presentation of Linkage Maps and 526

    QTLs. Journal of Heredity 93, 77-78 (2002). 527

    42. Guo, A.-Y. et al. GSDS 2.0: an upgraded gene feature visualization server. 528

    Bioinformatics 31, 1296-1297 (2014). 529

    43. Kearse, M. et al. Geneious Basic: an integrated and extendable desktop software platform 530

    for the organization and analysis of sequence data. Bioinformatics (Oxford, England) 28, 531

    1647-1649 (2012). 532

    44. Sarkar, D. Lattice: Multivariate Data Visualization with R. Springer (2008). 533

    45. Brors, B., Gu, L., Schlesner, M., Eils, R. & Gu, Z. circlize implements and enhances 534

    circular visualization in R. Bioinformatics 30, 2811-2812 (2014). 535

    46. Marchler-Bauer, A. et al. CDD: a Conserved Domain Database for the functional 536

    annotation of proteins. Nucleic acids research 39, D225-D229 (2011). 537

    47. Liu, W. et al. IBS: an illustrator for the presentation and visualization of biological 538

    sequences. Bioinformatics 31, 3359-61 (2015). 539

    48. Mitchell, C. MultAlin–multiple sequence alignment. Bioinformatics 9, 614-614 (1993). 540

    49. Team, R.D.C. R: A Language and Environment for Statistical Computing. R Foundation 541

    for Statistical Computing, Vienna, Austria (2014). 542

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 26

    50. Verma, S. et al. Development and evaluation of the Axiom® IStraw35 384HT array for 543

    the allo-octoploid cultivated strawberry Fragaria ×ananassa. 1156 edn 75-82 544

    (International Society for Horticultural Science (ISHS), Leuven, Belgium, 2017). 545

    51. Tang, Y. et al. GAPIT Version 2: An Enhanced Integrated Tool for Genomic Association 546

    and Prediction. The Plant Genome 9(2016). 547

    548

    Acknowledgement 549

    This work was supported by the Florida Strawberry Research and Education Foundation 550

    (FSREF), And the ‘‘Next-generation Disease Resistance Breeding and Management Solutions 551

    for Strawberry’’ under award no. 2017-51181-26833). We thank all the members of the 552

    strawberry genetics and breeding program, and the strawberry molecular genetics and genomics 553

    program for their technical assistance. 554

    555

    Author contributions 556

    557

    Competing Financial Interest Statement 558

    The authors declare that they have no competing financial interests. 559

    560

    561

    562

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 27

    563

    564

    565

    566

    567

    568

    569

    570

    571

    572

    573

    574

    575

    Figure Legends 576

    Figure 1. Chromosomal localization and distribution of FveMLO (A) and FaMLO (B) genes in 577

    F. vesca and F. ×ananassa genomes, respectively. The seven chromosomes of F. vesca 578

    (yellow) were named as Fvb1 to Fvb7 while F. ×ananassa (blue) were named as Fvb1-1 579

    to Fvb1-7, Fvb2-1 to Fvb2-7, Fvb3-1 to Fvb3-7 and Fvb4-1 to Fvb4-7, respectively to 580

    indicate four subgenomes within each chromosome. Putative strawberry MLO genes 581

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 28

    chromosome locations were visualized using MapChart 2.3 software41. Relative 582

    chromosome size was indicated by unit, Mbp. 583

    Figure 2. Synteny analysis of MLO genes between F. vesca and F. ×ananassa. Syntenic regions 584

    present in each chromosome of F. vesca was filled with red, light blue, green, dark blue, 585

    yellow and pink sequentially. A total of 68 connecting lines between two genomes 586

    denote syntenic chromosomal regions. The F. vesca and F. ×ananassa chromosomes are 587

    highlighted in orange and light blue. Chromosomes Fvb4 from F. vesca, and Fvb4-1 and 588

    Fvb4-3 from F. ×ananassa were not included since no putative MLO genes was 589

    identified in those regions. Relative chromosome size was indicated by unit, Mbp. 590

    Circular visualization of syntenic regions between F. vesca and F. ×ananassa were 591

    constructed using an R-package “Circlize”. 592

    Figure 3. Domain organization of deduced MLO protein sequences of F. vesca and F. 593

    ×ananassa. Visualization of protein domains was constructed using IBS 1.0.3 (Illustrator 594

    for Biological Sequences) program47. The positions of transmembrane (blue) and MLO 595

    (yellow) domains were predicted using online CCTOP prediction server 596

    (http://cctop.enzim.ttk.mta.hu/) and CDD: NCBI's conserved domain database 597

    (https://www.ncbi.nlm.nih.gov /Structure/cdd). 598

    Figure 4. Phylogenetic tree of MLO protein sequences of strawberry., F. vesca and F. 599

    ×ananassa, and other plant species such as Arabidopsis, tomato, pepper, rice, barley, 600

    maize, apple and peach. Strawberry MLO genes together with other MLOs were grouped 601

    into eight different clades. Unrooted phylogenetic tree of 15 AtMLOs, 20 FveMLOs, 59 602

    FaMLOs, five PpMLOs, six MdMLOs, CaMLO2, SlMLO1, HvMLO, OsMLO1 and 603

    ZmMLO1 was constructed by using FastTree via Geneious software. 604

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • 29

    Figure 5. Transcript and GWAS analysis of FaMLO genes. A) Expression profile of putative 605

    strawberry MLO genes in mature fruit of selected strawberry varieties, ‘Mara de Bois’, 606

    ‘Elyana’, ‘Winter Dawn’, ‘Radiance’ and ‘Festival’. B) Expression profile in different 607

    plant tissues e.g., leaf, roots, green (GR), turning (TR), white (WR) and red (RR) 608

    receptables, and green (GA), turning (TA), white (WA) and red (RA) achenes extracted 609

    from RNA-seq data published by Sánchez-Sevilla, et al (2017). The mean gene 610

    expression level was normalized using Transcripts Per Million (TPM). Gene expression 611

    profile was visualized with heatmap using R-package “ggplot2”. C) Manhattan plots of 612

    GWAS showing significant SNPs associated with FaMLO transcript in fruits. X- axis 613

    corresponds to the distribution of SNPs across seven chromosomes of F. vesca while y-614

    axis shows FDR-adjusted probability values. 615

    616

    Table Legends 617

    Table 1. Summary genome-wide association analysis of loci significantly associated with MLO 618

    transcript in strawberry fruit. The most significant IStraw35 SNP marker name and 619

    position for each MLO gene is provided with the eQTL phase, minor allele frequency, p-620

    value (FDR-adjusted), and heritability estimates. 621

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • A

    B Chromosome 1 Chromosome 2

    Chromosome 3 Chromosome 4

    Chromosome 5 Chromosome 6

    Chromosome 7

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • FveMLO1FaMLO1-1FaMLO1-2FaMLO1-3

    FveMLO5FaMLO5-1FaMLO5-2FaMLO5-3FaMLO5-4

    FveMLO11FaMLO11-1FaMLO11-2FaMLO11-3FaMLO11-4

    FveMLO14FaMLO14-1FaMLO14-2FaMLO14-3FaMLO14-4

    FveMLO18FaMLO18-1FaMLO18-2

    FveMLO2FaMLO2-1FaMLO2-2FaMLO2-3FaMLO2-4

    FveMLO3FaMLO3-1FaMLO3-2FaMLO3-3FaMLO3-4

    FveMLO9FaMLO9-1FaMLO9-2FaMLO9-3

    FveMLO6

    FveMLO7

    FveMLO8

    FveMLO12FaMLO12-1FaMLO12-2FaMLO12-3FaMLO12-4FaMLO12-5FaMLO12-6FaMLO12-7

    FveMLO15FaMLO15-1FaMLO15-2FaMLO15-3FaMLO15-4

    FveMLO19FaMLO19-1FaMLO19-2FaMLO19-3FaMLO19-4

    FveMLO20FaMLO20-1FaMLO20-2FaMLO20-3FaMLO20-4

    FveMLO16FaMLO16-1FaMLO16-2FaMLO16-3FaMLO16-4

    FveMLO17FaMLO17-1FaMLO17-2FaMLO17-3FaMLO17-4

    FveMLO13FaMLO13-1FaMLO13-2FaMLO13-3FaMLO13-4

    FveMLO10FaMLO10-1FaMLO10-2FaMLO10-3FaMLO10-4

    FveMLO4FaMLO4-1FaMLO4-2FaMLO4-3FaMLO4-4FaMLO4-5

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • Clade II

    Clade VI

    Clade IV

    Clade III

    Clade V

    Clade I

    Clade VIIIClade VII

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764

  • FaMLO5-2

    FaMLO5-3

    FaMLO12-7

    Linkage Group

    A B C

    (which was not certified by peer review) is the author/funder. All rights reserved. No reuse allowed without permission. The copyright holder for this preprintthis version posted February 4, 2020. ; https://doi.org/10.1101/2020.02.03.932764doi: bioRxiv preprint

    https://doi.org/10.1101/2020.02.03.932764