advances in genomics for the improvement of quality in coffee381677/uq381677... · 2019-10-09 ·...
TRANSCRIPT
Acc
epte
d A
rticl
eAdvances in genomics for the improvement of quality in Coffee
Hue T.M. Tranab, L. Slade Leec, Agnelo Furtadoa, Heather Smytha, Robert Henrya*
a Queensland Alliance for Agriculture and Food Innovation (QAAFI), The University of
Queensland, Australia
b Western Highlands Agriculture & Forestry Science Institute (WASI), Vietnam
c Southern Cross University, Australia
* Correspondence to: Robert Henry, Queensland Alliance for Agriculture and Food
Innovation (QAAFI), The University of Queensland, St Lucia QLD 4072, Australia, Tel.
+61 7 3346 0551, Email: [email protected]
This article has been accepted for publication and undergone full peer review but has not been through the copyediting, typesetting, pagination and proofreading process, which may lead to differences between this version and the Version of Record. Please cite this article as doi: 10.1002/ jsfa.7692
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eAbstract
Coffee is an important crop that provides a livelihood to millions of people living in
developing countries. Production of genotypes with improved coffee quality attributes
is a primary target of coffee genetic improvement programs. Advances in genomics are
providing new tools for analysis of coffee quality at the molecular level. The recent
report of a genomic sequence for robusta coffee, Coffea canephora, is a major
development. However, a reference genome sequence for the genetically more
complex arabica coffee (C. arabica) will also be required to fully define the molecular
determinants controlling quality in coffee produced from this high quality coffee
species. Genes responsible for control of the levels of the major biochemical
components in the coffee bean that are known to be important in determining coffee
quality can now be identified by association analysis. However, the narrow genetic
base of arabica coffee suggests that genomics analysis of the wild relatives of coffee
(Coffea spp.) may be required to find the phenotypic diversity required for effective
association genetic analysis. The genomic resources available for the study of coffee
quality are described and the potential for the application of next generation
sequencing and association genetic analysis to advance coffee quality research are
explored.
Key words: Coffee quality, genetics, genomics, biochemical compounds, next
generation sequencing, association studies.
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eGENERAL INTRODUCTION
Coffee is an important crop and the second most traded commodity in the world (after
petroleum) providing a living to more than 125 million people 1. Coffee belongs to the
Rubiaceae family and the Coffeeae tribe and consists of more than 124 species spread
across two genera Coffea L. and Psilanthus Hook.f, each of which in turn consist of two
sub-genera, Coffea and Baracoffea (J.-F. Leroy) J.-F. Leroy, and Psilanthus and
Afrocoffea (Moens) summarized by Anthony et al. 2 respectively. The grouping
of Coffea and Psilanthus genera has been examined in several studies 3 4 5 6 7.
Commercial coffee production is dominated by only two species belonging to the
Coffea genus: C. arabica and C. canephora (the latter generally referred to as robusta
coffee). All coffee species are diploid (2n=2x=22) and generally self-incompatible,
except for C. arabica which is a self-fertile tetraploid (2n=4x=44) derived from a
spontaneous hybridization between C. canephora (as paternal progenitor) and C.
eugenioides (as maternal progenitor) 8 9.
Although C. arabica is considered to have better cupping quality than C. canephora,
improving the quality of both commercial species remains a target for most coffee
improvement programs. With advances in genomic and sequencing technology, it is
feasible to understand the coffee genome and the molecular inheritance underlying
coffee quality, thereby helping improve the efficiency of breeding programs.
This review will discuss current knowledge regarding the genetics of those biochemical
compounds which are considered quality determinants, for use as a foundation for
improving coffee quality by breeding. Available genomic resources for the study of the
genetics of biochemical compounds that are likely to be playing a role in coffee flavour
will also be discussed. In addition, the potential value of the study of genetics of coffee
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
equality using next generation sequencing and association genetic analysis will be
considered.
BIOCHEMICAL CONTROL OF COFFEE QUALITY AND ITS VARIATION
Coffee quality is assessed by evaluating both the physical attributes of the coffee beans
and the organoleptic properties of the coffee. Physical quality reflects moisture content,
defects (e.g. sticks, stones, damaged beans and black beans ), bean size and bean colour;
while organoleptic quality reflects aroma, taste, flavour, body (a feeling of the heaviness or
richness on the tongue), acidity and the preference of tasters 10. A range of biochemical
compounds in coffee beans are important contributors to the quality of the coffee in the
cup.
Biochemical compounds influencing the organoleptic quality of coffee beans include
non-volatile and volatile components. Important non-volatile components of the bean
include carbohydrates and fibre (sucrose, reducing sugars, cell wall polysaccharides,
lignin), nitrogenous compounds (protein, free amino acids, caffeine, trigonelline), lipids
(coffee oil, diterpene esters), minerals (potassium and phosphorus), acids and esters
(total chlorogenic acids, aliphatic acids and quinic acid) 11. All of these biochemical
compounds are considered to play a role in roasting chemistry 12. Proteins and amino
acids, for example, react with reducing sugars to produce aroma precursors.
Chlorogenic acids (CGAs) and caffeine, are responsible for bitterness 13. Among these
components, sucrose, caffeine, trigonelline, lipids and CGAs are major biochemical
compounds that contribute to the flavour of the beverage after the roasting of the
beans 11. While sucrose, trigonelline and lipid have a positive correlation with coffee
cup quality, caffeine and some sub classes of CGAs have a negative correlation with
coffee cup quality (often referred to simply as ‘cup’). There are about three hundred
volatiles detected so far in green coffee bean 14 of which twenty one volatiles,
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eincluding 3-isobutyl-2-methoxypyrazine and 2-methoxy-3,5-dimethylpyrazine, have
been identified as key compounds for coffee cup 14. In the roasted coffee bean, there
are more than a thousand volatiles which can be grouped into twenty key aroma
compound-groups 15. The aroma of the brew is different from that of roasted coffee.
Several notes in roasted coffee become more intensive in the brew such as caramel and
phenolic odour. This change in aroma profile of coffee is not caused by the formation of
new odorants, but by a shift in the concentrations of existing ones as summarised by
Grosch 16.
Coffee flavour and its formation is extremely complex; however, numerous studies on
coffee flavour and its constituents have been reported. How the volatile and non-
volatile compounds contribute to coffee quality and relationships between sensory
properties and the composition of coffee has been thoroughly reviewed by Flament 12
and more recently by Sunarharum et al.17. The reactions involved in the formation of
coffee aroma and its mechanisms have been reviewed by Buffo & Cardelli-Freire 18.
However, the change in bean composition, especially in aroma profile, from green to
roasted bean is complicated and not all formation pathways are fully understood
under coffee roasting conditions 19. The use of green bean or roasted bean in the study
of coffee quality genetics therefore should be considered carefully since aroma in
roasted bean and brew is of customer's interest, while the green bean attributes are
more directly affected by genetics. This is a challenge in the study of genetics of bean
composition, that is, ascribing links to coffee quality. However, with the progress in
functional genomic approaches, the identification of molecular determinants of coffee
quality characteristics is feasible and it is possible to select coffee varieties with
superior beverage quality 20.
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eThe bean chemical composition and organoleptic characteristics are significantly
different between species and to a lesser extend within species 10. A number of studies
on the variation in the biochemical composition of the bean are summarised in Table 1
21-24.
Fatty acid and CGAs were used as indicators to differentiate arabica varieties 25 26.
Similarly, Tessema et al. 27 also found that the quality traits (organoleptic quality) and
biochemical constituents (sucrose, fat, crude protein and minerals) were diverse in the
arabica germplasm collections from Ethiopia. This is an encouraging result for the
study of variation of quality traits in C. arabica for association mapping. In general,
compounds that have a positive correlation with quality such as sucrose, trigonelline
and lipids are at a higher concentration in C. arabica than in C. canephora and some
other Coffea species.
Significant differences in flavour between different coffee types have also been reported.
Robusta coffee beans have a bitter, full bodied taste, but low acidity while those of arabica
coffee are more aromatic with more perceptible acidity but less body. Within the robusta
coffees, Moschetto et al 28 found important differences in cup quality between the two
main genetic groups studied (Congolese and Guinean), for preference, aroma, acidity,
body and bitterness. Within the arabica coffees, different varieties can be associated with
specific flavour profiles. For example, genotype SL28 produces a milder brew than Kent,
genotypes Blue Mountain and Bourbon produce finer coffees than Catimor, while
genotype Liberica yields coffee that is bitter and without finesse 29.
The composition of the beans that determines the properties and quality of coffee in
the cup depends not only on genetic factors (species, varieties) but also on non-genetic
factors such as the cultivation conditions (soil, air temperature, altitude, sun exposure
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eand rainfall), plant condition (plant age, bean maturity), cultivation practices (shade
management, fertilisation and irrigation) and harvest and processing methods. In
general, volcanic soils, low temperature, high altitude and growing under shade have
positive correlation to coffee compositions which relates to coffee quality 30-34. Post-
harvest techniques, roasting and the preparation of the beverage also have a strong
influence on coffee quality 30 35. Understanding the influence of each factor and their
interaction would help with defining realistic breeding strategies. Since there have
been many reports presented in detail on these issues 26 33 29 36 37 38 20, especially the
most recent reviews of Joet et al. 39 on the relationships between coffee quality, bean
chemical composition and environmental effects, or Santos et al. 40 on climate change
impacts on bean quality, or Sunarharum et al. 17 on the impacts of processing, roasting
and preparation methods to aroma profile, they are not reviewed in this paper.
Another factor that may influence coffee quality is epigenetic effects. Epigenetics is
modified gene expression not caused by alteration in the gene sequence or DNA code.
This change is due to factors that control how and when each gene is expressed.
Epigenetics is extremely complex and is not yet fully understood. Although there have
been several studies on factors affecting coffee quality, there has not been research on
epigenetics and coffee quality. Only a very few studies have reported on epigenetics in
relation to size and shape of the tree 41, and epigenetic mechanisms (especially in
phenolic compounds) in somatic embryo regulation in arabica coffee species 42 43.
Being an allotetraploid species formed by a recent natural hybridization between two
diploid ancestors, C. canephora and C. eugenioides, and which may have gone through
a single allopolyploidization event 44, arabica may have genetic redundancy allowing
for sequence divergence resulting in development of functional variations between
duplicated genes (homoelogues). Proteins with different properties may be encoded
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eby homoelogues leading to new genetic diversity in breeding populations. However,
homoelogues are also subject to extensive epigenetic control and are therefore not
always equally expressed 45. Landey et al. 43confirmed the very limited genetic and
epigenetic alternations in somatic embryogenesis-derived plants during coffee somatic
embryogenesis while embryogenic suspensions have a high risk of genome and
epigenome instability. However, Vidal et al. 46 found homoeologous genes displaying
differential expression for arabica using Single Nucleotide Polymorphisms (SNPs) from
Expressed Sequence Tags (ESTs) and gene expression from transcript redundancy
between C. arabica and C. canephora. The divergence of function in homoeologous
genes may result in phenotypic novelty with useful agronomic traits. Homoeologous
gene expression in C. arabica grown in two contrasted conditions was also used to
compare with those of its progenitors and to examine its ability to adapt to
environmental variations 47. Recently, Lashermes et al. 44 confirmed that the genomic
rearrangements involving homoeologous exchanges as well as gene silencing,
elimination and conversion appear to have occurred in C. arabica and to be a major
source of genetic diversity. This on one hand will make a potent new source of genetic
diversity for different traits, but on the other hand make it difficult to conduct
association genetics in coffee.
GENETICS OF BIOCHEMICAL COMPOUNDS KNOWN TO BE RELATED TO COFFEE
QUALITY
Improving both yield and quality is the target for coffee breeding programs. In a study
by Montagnon et al 48, genetic correlations between coffee yield and quality
determinants and cup tasting components measured in two groups of C. canephora
were not significant, indicating that the coffee quality is not negatively associated
with yield in C. canephora. Understanding the genetic inheritance of quality traits can
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
ebenefit breeding programs, but there appears little research that has focussed on
understanding the genetic control of key biochemical compounds that relate to coffee
beverage quality. Known inheritance mode and heritability values of the key
biochemical compounds determining coffee quality in intraspecific and interspecific
hybrids are detailed in Table 2 48-56.
Male/female additive inheritance in sucrose and trigonelline was contrasted in two
studies using intraspecific hybrids 51, 56. The heritability measured in interspecific
hybrids was different from that of intraspecific hybrids due to the differences among
species in relation to their fixation during evolution of specific alleles of some genes
relating to the variability of quality components. As a result, the genetic variation in
quality at the interspecific level is not similar to that in intraspecific hybrids 10. The high
narrow-sense heritability of caffeine and fat content in intraspecific hybrids implies that
these traits can be improved at the intraspecific level. The negative correlation between
fat content and sucrose content indicates that care should be taken in selecting for a
combination of fat and sucrose content 48. Both major and minor genes with additive
gene effects appear to be involved in the variation of caffeine content in the seeds 57
implying the complexity of this trait.
The inheritance of coffee cup quality was also reported by Leroy et al 51 where for
bitterness of robusta coffee, female additive effects were predominant, while for
acidity, only a male additive effect was observed 51. This information will assist the
breeding (e.g. parent selection) for robusta coffee quality improvement. The
inheritance of quality characters in introgressed lines, which resulted during arabica
improvement to introduce the disease resistant genes from Timor hybrid to arabica,
was mentioned by Bertrand et al. 58 31 and also reviewed by Leroy et al. 10.
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eIn summary, highly heritable traits like fat or caffeine content could be improved
effectively by crossing between parents of different favourable values. For traits with
lower heritability like trigonelline, CGAs or sucrose, Marker Assisted Selection (MAS)
would be necessary 4. With of the advances in sequencing technology and marker
development coupled with trait phenotyping, it should be possible to easily identify
trait-linked markers for use in MAS.
For coffee genetic improvement, most of the past and recent research has focused on
the improvement of canephora quality due its known lower quality compared to
arabica. The focus of arabica improvement is more on transfer of disease resistance
from Timor hybrid 10 or climate change adaptation 59; quality improvement is the
second priority in arabica as its quality is considered superior. However, this does not
mean that arabica quality improvement is unnecessary. Quality determinants are not
the same between arabica and canephora. For canephora, apart from caffeine
reduction, the bean composition of galactomannans 60or other compounds (other
polysaccharides) which influence the extraction during the industrial process of
lyophilisation (i.e. instant coffee processing) is probably the most important target.
Fresh brewing is popular for arabica, rather than instant coffee processing – although it
may be applied to arabica in the future. For arabica, the increasing consumer demand
for speciality coffee has intensified research aimed at pinpointing small differences in
quality factors as well as to quantify these organoleptic features 61. For more detail on
the trend and strategies of canephora and arabica improvement breeding, there are
excellent reviews by Leroy et al. 10 and Vossen et al. 20.
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eCOFFEE GENOMIC RESOURCES
Coffee genetic diversity in C. arabica and C. canephora
A number of works on the assessment of arabica diversity have been done with
differing results. In general, among three main types of material (cultivar/varieties;
accessions/introgression/hybrids; and spontaneous/sub-spontaneous) almost all
studies show a very low genetic variation amongst arabica cultivars using different
marker systems (RAPD, AFLP, SSR, ISSR, SRAP, TRAP) and coffee collected from
different regions (Brazil, Yemen, Indian, Australia and Vietnam, Tanzania, Hawaii) 62-69.
The second group (accessions/introgression/hybrids) shows low to significant
variation, more diverse in the introgressed lines 70 27 71-74. The last group, in which one
may expect to see the most diversity, has diversity ranging from low 75 72 76 to
moderate 77 to high 78. The correlation of genetics with geography/origin clearly shows
in some studies 77-79 but not in the others 75 72 70. In the most recent work presented in
the World Coffee Research annual report 59, genetic diversity assessment of 800
arabica coffee accessions from the arabica collection at CATIE, Costa Rica shows the
least genetic diversity of this species compared to other major crops. This study also
found that cultivated coffee varieties contain approximately 45% of the genetic
diversity found in the 800 aforementioned accessions (while only 2% of tomato
diversity found in cultivated tomato) indicating the limitation of variability for breeding
programs. This also is problematic for association mapping work in arabica for
breeding improvement, and thus the selection of appropriate germplasm for
association mapping is very critical. Therefore it is essential to understand the
population structure and its genetic diversity.
In contrast to C. arabica, C. canephora possess a high genetic diversity due to its origin,
reproduction method and dissemination 80, and it is structured in two main groups
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e(Guinean and Congolese) with six subgroups 81-83. The linkage disequilibrium assessed
in C. canephora breeding populations was variable, indicating the possibility of
applying association studies within C. canephora species and using it to improve C.
arabica due to its parental relationship to this species 84. Understanding the genetic
diversity of the two dominant species is critical for any genetic improvement scheme
for any traits in coffee.
Coffee whole genome and synteny
The genome size of coffee has been estimated to be 1.3 Gb for C. arabica 85 and 710Mb
for C. canephora 86. Using flow cytometry, the DNA content of arabica was estimated as
2.47 pg (equivalent to 1.2 Gb) and canephora as 1.43 pg (equivalent to 700 Mb), while
other diploid coffee species varied from 0.96 pg (469 Mb) (C. mauritiana) to 1.84 pg (900
Mb) (C. humilis) 87 88. Despite its economic importance, the first high-quality draft genome
of robusta coffee has just been completed and reported in 2014 86. To generate a whole
genome sequence of C. canephora, Roche 454 single and mate-pair reads and Sanger
BAC-end reads were used, representing 29.5X coverage of the genome size (710 Mb).
In addition, Illumina sequence data, corresponding to 60X of coverage of the Coffee
genome (710 Mb), was also used to correct sequencing errors and fill gaps. The
resulting genome assembly consists of 25,216 contigs and 13,345 scaffolds with a total
length of 568.6 Mb (Table 3). Using a high-density genetic map, a total of 349 scaffolds
covering approximately 364 Mb were anchored to the 11 C. canephora chromosomes,
among which 139 were both anchored and oriented. Coffee displays the most
conservative chromosomal gene order among asterid angiosperms. Transposable
elements (TEs) in the C. canephora genome were identified and classified accounting
for almost half (49.2%) of the reference genome length. Organelle to nuclear genome
transfers was also analysed with a typical amount of chloroplast-derived fragments
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e(2,014 insertions, accounting for 0.16% of the draft genome) and an unusually large
750kb mitochondrion-derived fragment more than 100 Kb longer than those known
from other plant genomes. In total, 25,574 protein-coding gene models were obtained
in which several gene functions involved in secondary compound biosynthesis and
coffee flavour and aroma were characterized. Species-specific gene family expansions
were examined such as N-methyltransferases (NMTs), defence-related genes, and
alkaloid and flavonoid enzymes involved in secondary compound synthesis. This is the
most valuable genomics resource for further genomics study in coffee, especially as all
of the genomics data, transciptomic data, SNP polymorphisms and genotyping data,
gene families and metabolic pathways are publicity available via the Coffee Genome
Hub 89.
The whole chloroplast genome sequence of Coffea arabica L. was first reported by Samson
et al. 90 (Table 3) and this also provides an important reference for studying other species
and helps build up the phylogeny among members of the Coffeeae tribe which is still
questionable. Several research groups are working on an C. arabica genome sequence
using different biological materials and sequencing platforms 91 92 93 94. This genomic
resource is extremely important for the study of genetics and genomics of the high quality
coffee species, C. arabica.
Several studies 95-97 have compared genomes between coffee and other crops. The
most recent study reported that coffee chromosomal regions showed unique one-to-
one correspondences with grape chromosomes, proving the lack of any events
changing ploidy intervening in their separate histories. Coffee and grape genomes
show one-to-three correspondence with the tomato genome 86.
Among coffee species, comparison of genomes showed a high gene synteny between
C. arabica (EaEaCaCa) and C. canephora (CC) and C. arabica (EaEaCaCa), C. eugenioides
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e(EE) and C. liberica (LL) 98-100, but numerous chromosomal rearrangements were
detected 98. The two sub-genomes of C. arabica (Ca and Ea) indicated sufficient
sequence differences, and thus obtaining a whole-genome sequence would be an
important resource for the allotetraploid genome of C. arabica. Results also revealed
that C. eugenioides was more closely related to C. arabica than to C. liberica, which
was in agreement with the ancestral history of the allotetraploid C. arabica 100.
Other genomic resources
Molecular markers
As for other crops, determination of molecular predictors for coffee quality traits
would help reduce the length of breeding selection cycles and thus phenotypic
evaluation cost. However, the use of DNA technology in coffee quality improvement is
still in its infancy and therefore limited. Pot et al. 101 used polymorphisms generated
from SNPs, INDELs (insertions and deletions) and SSRs (Simple Sequence Repeats) to
identify the nucleotide diversity of four sucrose metabolism enzymes in C. canephora
genotypes using direct sequencing. The variation of these genes was also analysed
between different Coffea species to allow the identification of more polymorphic sites
using parallel in silico analysis of EST resources 101. AFLP (Amplified Fragment Length
Polymorphism) and SSR markers were used to construct a genetic map of an F2
population between C. arabica and C. canephora (artificial tetraploid). The number of
markers associated with quality traits identified was nineteen for sugar content, eight
for caffeine, eight for CGAs and one for caffeine and CGAs 102. These markers need to
be validated in other genotypes for consistency before they can be used in marker
assisted selection in coffee breeding. Recently, a total of 33,239 SNPs specific to C.
arabica and 87,271 SNPs specific to C. canephora were developed using targeted
genome capture strategies and next-generation sequencing, and were evaluated on 72
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
esamples from C. canephora and 72 from C. arabica. These genomic resources will
support for genome assemblies, accelerate breeding of interested traits as well as
manage genetic diversity in coffee species 103.
Bacterial artificial chromosome (BAC) libraries.
Three BAC libraries were developed for each species of C. canephora and C. arabica in
relation to sucrose metabolism 104, genome composition and evolution 105, nematode
and leaf rust resistant 106, bean size and cup quality 107 and mannose-6-phosphate
reductase gene (CaM6PR) 108 (Table 4). These available BAC libraries of Coffea are an
important resource to support genomic studies such as comparative genomics and the
identification of genes and regulatory elements controlling important traits in coffee.
Expressed sequence tags (ESTs)
The development of large EST sequencing projects has supported the identification of
potential genes contributing to quality traits and their interplay with other genes
through essential biochemical pathways. ESTs facilitate the development of whole
transcriptome analysis and the identification of the biosynthetic pathways associated
with the expression of quality and the genes within these pathways 10. The EST
resources available in coffee have been reviewed by Kochko et al. 85 and are
summarised in Table 5.
A total of 246,500 ESTs have been reported with 69,801 for C. canephora, 166,133 for C.
arabica and 10,566 for C. racemosa using different organs of the plant (leaves, flower,
fruits and roots) at different stages of development and maturation 109-115. However, not
all these resources are published or available which limits their use. A number of studies
have used the available resource of ESTs for further research such as microsatellite marker
discovery 116, analysis of transcriptome divergence and analysis of genes involved in the
biosynthesis pathways of lipids and main storage proteins 117, gene structure prediction
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eand functional annotation with the detection of a total of 345 pathways in C. arabica and
300 pathways in C. canephora 118. This genomic resource will be very important for further
research on coffee quality improvement.
QTL identification and genetic mapping
The construction of genetic maps is often the first step for molecular dissection of
complex traits (yield components, quality and disease). Genetic maps enable the
mapping of quantitative trait loci (QTLs) controlling these traits and thus provide a
framework for isolation of candidate genes followed by manipulation of genes for yield
increase and quality improvement 119. In many cases, markers linked to QTLs are
directly used in MAS without the need of understanding gene functionality underlying
the QTLs.
Linkage mapping in coffee in general requires more effort and cost than in annual crops
due to its long life cycle, low polymorphism (in the case of C. arabica – with the exception
of Ethiopian cultivars and germplasm), and the absence of a large collection of DNA
markers as well as genomic resources 119. The first genetic maps were constructed in the
diploid C. canephora by Paillard et al. 120 followed by several others constructed for
different coffee populations 51, 56, 96, 102, 120-129 130 (Table 6).
To date, a number of QTLs analyses relating to quality compounds and cup have been
performed on C. canephora and other species, but none has been reported for C.
arabica as indicated in Table 7 50, 51, 56, 131, 132
Co-location of some QTLs and genes involved in different metabolic pathways related
to coffee quality were noted 51. Using such candidate genes in association mapping
may determine the allelic variation of coffee quality 133. The co-localisation between
QTLs relating to organoleptic traits and genes associating with caffeine or CGA
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
ebiosynthesis may explain the fact that both CGA and caffeine are involved in conferring
bitterness in the coffee cup 51.
Gene identification
The identification of genes relating to quality is one of the main objectives of several
coffee research groups around the world. Thanks to the concerted efforts on coffee
genomics, a number of coffee candidate genes have been identified and some of them
have been cloned and characterised. These results are useful to the coffee genetics
community, especially those on genes encoding the enzymes of key metabolic
processes. These genes are candidate genes which may control the variability of coffee
quality 10. Genes regulating the main chemical components that are thought to be
involved in the flavour and sensory quality of coffee are listed in Table 8 13, 51, 86, 104, 117,
131, 134-141. Recently, 36,935 unigenes from leaves and fruits of C. eugenioides, one of C.
arabica’s ancestors, were identified by Yuyama et al. 142. A sub-set of these genes
related to sugar metabolism and fruit ripening were examined for their expression.
This will be a valuable resource for molecular and genetics studies of commercial
coffee species. Currently there are works on transcriptomics to understand the
transcriptional regulations during seed development 39 13 or more specifically for
caffeine accumulation 143 and chlorogenic acids 144 or galactomannan 60. These could
facilitate the development of robust genomics/metabolomics fingerprints of coffee
bean quality and ultimately be useful in breeding programs for coffee quality.
Although several genes encoding the biosynthesis of biochemical compounds in coffee
have been identified in C. arabica and C. canephora, there are no genes identified for
trigonelline synthesis and no studies on allelic variation (e.g., SNPs) associated with
low and high levels of the key biochemical compounds which can be utilised in MAS.
However, sequences from the genes that are known can serve as useful references in
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
ere-sequencing to detect polymorphisms for genetic mapping of the control of
biochemical compounds in C. arabica.
ASSOCIATION GENOMICS APPROACH USING NEXT GENERATION SEQUENCING (NGS)
AND PERSPECTIVES FOR THE STUDY OF GENETICS OF COFFEE QUALITY
Research on genetics and genomics for coffee, especially in relation to quality is
relatively limited compared to other crops and thus belies its potential and economic
contribution. The reasons could possibly be due to the lack of funding – most coffee
growing regions being developing countries – the complexity of the quality traits and
the limitations of the technology used. More studies have been conducted for robusta
coffee than arabica coffee due to its lower ploidy level and greater genetic diversity.
Previous studies showed considerable genetic variation in compounds defining coffee
quality among coffee species, especially in diploids. In addition, a number of genes
relating to the synthesis of sucrose, caffeine, several sub-groups of CGAs and lipids
have also been identified. These studies provided gene sequences via ESTs that will
facilitate further studies on bean composition relating to coffee quality. However,
there has been no research focusing on allelic variation of these genes in natural
populations except bi-parental mapping of QTLs for trigonelline and CGA contents.
A number of previous studies involving genome-wide association studies (GWAS) using
next generation sequencing (NGS) have used these genetic approaches to detect
candidate genes/QTLs of complex traits such as physical wood properties, resistant genes,
drought tolerance, cold hardiness and timing of bud set summarised by Sexton et al. 145
and successfully applied in various crops including annuals (rice, maize, soybean, wheat) or
perennial crops and forestry trees (pine, cottonwood) summarised by Hall et al. 146. Based
on knowledge of biochemical compounds (and their reference gene sequences) that are
thought to be involved in the flavour and sensory quality, it should be possible to apply
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eNGS and GWAS combined with bi-parental QTL mapping to detect valuable haplotypes
from natural populations for use in breeding coffee varieties with better quality.
Application of NGS in plant genomics studies has been extensively reviewed by a
number of authors 147-150. NGS lowers the cost of sequencing and generates a large
amount of data in a single sequencing run, making it feasible to study genetics at the
whole genome level. The steps involved in the NGS strategies (shearing of DNA, ligation,
library preparation) as well as different sequencing platforms (Illumina, Life Technologies,
PacBio) have been described and their advantages and disadvantages have been reviewed
by several authors 151, 152. NGS can be used via an approach of whole genome sequencing or
targeted sequencing.
Two methods of whole genome sequencing can be used including (1) de novo
sequencing - applied for species that do not have pre-existing sequence data, followed
by the use of bioinformatics tools to assemble the sequences and obtain the genomic
map for that species; and (2) re-sequencing performed on individuals of a species that
has already known genome sequence152.
Targeted sequencing includes two approaches, amplicon sequencing and target
enrichment. Amplicon sequencing will sequence amplified regions (PCR amplicons) of
small and selected regions of the genome with lengths of hundreds of base pairs and
with a high depth of coverage to identify common and rare sequence variations.
Sequencing PCR amplicons of sets of candidate genes from DNA bulks will help to
identify the available variation in these genes exploited in a population 148. Target
enrichment is quite similar to amplicon sequencing in terms of using only selected
regions or genes enriched in the library from genomic DNA. However, target
enrichment allows for larger DNA insert sizes and enables a greater amount of total
DNA to be sequenced per sample. For this method, regions of interest in the genome
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
ecan be targeted, making it an ideal approach for examining specific gene pathways, or
as a follow-up analysis to GWAS 153.
When making decisions on the approaches of candidate gene or of whole-genome,
one of the most critical factors is the extent of linkage disequilibrium (LD) in the
organism of interest. In genome-wide studies, the extent of LD determines the
mapping resolution that is achieved and the numbers of markers covered 154. Genetic,
biochemical, physiological and phenotypic information must be used to support the
selection of candidate genes 155. Where a developmental or biochemical pathway is
well-understood, candidate gene selection is straightforward; but the researcher is
typically limited to the field of known genes, which presents the risk of overlooking
causal gene mutations not previously identified 146
NGS technology has been used to assemble the whole genome sequence of C.
canephora 86 and to examine genetic change in allopolyploidisation of arabica 44. To
obtain the genomic sequence of C. arabica, as only the draft genome of C. canephora
is available, a de novo approach can be applied. The other approach for obtaining the
genomic sequence of C. arabica is whole genome re-sequencing as the available draft
genomic sequence of C. canephora can be used as reference since it is one of
progenitors of C. arabica. The whole genome sequence will be a key resource of data
to identify gene sequences and SNP markers for most genes including novel and
published genes. EST and BAC sequences can be used as reference sequences to
assemble sequence reads from NGS data to identify genes/alleles and their positions
on the chromosome. In addition, whole-genome re-sequencing and SNP genotyping
data can be used to generate a dense SNP genetic map of the population for QTL
mapping.
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eAmplicon sequencing in a larger set of samples could be an appropriate approach to genetic
association study of quality traits, since the biosynthesis pathways, QTLs and candidate
genes of sucrose, caffeine and several subgroups of CGAs are known. This will help to
confirm or validate the candidate genes from the literature and identify the alleles that are
useful for breeding. For those compounds for which candidate genes have never been
identified or very few identified, for example trigonelline and some compounds
belonging to the lipid group, the approach could be to use whole genome sequencing
via NGS to detect SNPs, then using an association genomics approach to detect the link
between SNPs and the traits of interest.
Association studies differ from quantitative trait locus mapping by utilising samples
from genetically diverse populations that contain short stretches of linkage
disequilibrium due to generations of recombination 145. Such studies thus enable
relationships between phenotypic traits and underlying DNA sequence variation to be
understood. Association studies may be suitable for perennial species such as coffee
because they can be carried out on pre-existing populations, in collections or in
selection trials. This approach does not require specific populations created by
controlled crossing (like conventional genetic mapping approaches) which is very
difficult for perennial crops 155. Association or linkage disequilibrium mapping has
become a very popular method for dissecting the genetic basis of complex traits in
plants 146. For the study on volatile and non-volatile compounds in coffee and that are
likely to be playing a role in coffee flavour, once the data on biochemical compounds
and SNPs variation are available, association genomics will be applied to identify key
genes associating coffee quality. The integration of NGS technologies and association
study to analyse complex traits will be a powerful strategy complementing the
traditional method of parent-hybrid map construction 152.
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eCONCLUSIONS
While whole genome technology will provide a powerful resource, a good start has
been made with AFLP and SSR markers associated with quality traits identified for
sugar content, caffeine and CGAs in C. arabica. These are already helpful in studies
aimed at genetic improvement of coffee quality, but importantly, they will provide
useful reference points as the genomic approach is developed. Existing coffee BAC
libraries are beneficial in comparative genomics, the identification of genes and
regulatory elements controlling important traits in coffee, and will play an important role
in validating new C. arabica genomic data as it emerges. With the current knowledge of
biochemical compounds that govern coffee bean composition that relates to beverage
quality and available genetic resources, it is possible to apply an association genomics
approach using both whole genome sequencing and targeted sequencing to detect the link
between SNPs and the traits of quality determinants from natural populations for use in
quality improvement via breeding programs. However, caution should be taken when
using this approach for C. arabica since this species has a narrow genetic base, lacks
a reference genome, is a polyploid (4n) and lacks populations from controlled crosses
for analysis of specific traits and the validation of markers or genes once detected. The
aforementioned difficulties can be overcome by the use of wild arabica accessions
from Ethiopia, development of a reference genome sequence for C. arabica, and the
complementary use of the two approaches of targeted re-sequencing and whole
genome re-sequencing in variant discovery respectively. There is no doubt that the
availability of whole genome sequences will provide the greatest stimulus yet to the
understanding of the genetic basis of coffee quality traits, and indeed of many other
aspects of the plant’s biology.
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e
ACKNOWLEDGEMENTS
This research was supported under Australian Research Council's Linkage Projects
(project number LP130100376). We acknowledge Australia Awards Scholarship (AAS)
and QAAFI/UQ for their ongoing support of this research.
REFERENCES
1. ICO, ICO Annual Review 2011/12, Ed (2013).
2. Anthony F, Bertrand B, Etienne H and Lashermes P, Coffea and Psilanthus, in Wild Crop Relatives: Genomic and Breeding Resources. Springer, pp 41-61 (2011).
3. Maurin O, Davis AP, Chester M, Mvungi EF, Jaufeerally-Fakim Y and Fay MF, Towards a phylogeny for coffea (rubiaceae): Identifying well-supported lineages based on nuclear and plastid DNA sequences. Annals of Botany 100:1565–1583 (2007).
4. Davis AP, Chester M, Maurin O and Fay MF, Searching for the relatives of Coffea (Rubiaceae, Ixoroideae): The circumscription and phylogeny of Coffeeae based on plastid sequence data and morphology. American Journal of Botany 94:313–329 (2007).
5. Tosh J, Davis AP, Dessein S, Block PD, Huysmans S, Fay MF, Smets E and Robbrecht E, Phylogeny of Tricalysia (Rubiaceae) and its relationships with allied genera based on plastid DNA data: Resurrection of the Genus Empogona. Annals of the Missouri Botanical Garden 96:194-213 (2009).
6. Anthony F, Diniz LEC, Combes M-C and Lashermes P, Adaptive radiation in Coffea subgenus Coffea L. (Rubiaceae) in Africa and Madagascar. Plant Systematics and Evolution 285:51–64 (2010).
7. Davis AP, Tosh J, Ruch N and Fay MF, Growing coffee: Psilanthus (Rubiaceae) subsumed on the basis of molecular and morphological data; implications for the size, morphology, distribution and evolutionary history of Coffea. Botanical Journal of the Linnean Society 167:357-377 (2011).
8. Charrier A and Eskes AB, Botany and Genetics of Coffee, in Coffee: Growing, Processing, Sustainable Production - A Guidebook for Growers, Processors, Traders, and Researchers, Ed by Wintgens JN. WILEY-VCH Verlag GmbH & Co. KCaA (2004).
9. Lashermes P, Combes M-C, Robert J, Trouslot P, D'Hont A, Anthony F and Charrier A, Molecular characterisation and origin of the Coffea arabica L. genome. Molecular Genetics and Genomics 261:259-266 (1999).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e10. Leroy T, Ribeyre F, Bertrand B, Charmetant P, Dufour M, Montagnon C, Marraccini P and Pot D, Genetics of coffee quality. Brazilian Journal of Plant Physiology 18:229-242 (2006).
11. Farah A, Coffee as a speciality and functional beverage, in Functional and speciality beverage technology, ed. by Paquin P. Woodhead Publishing in Food Science, Technology and Nutrition, pp 370-395 (2009).
12. Flament I, Coffee flavour chemistry. Chichester, UK (2002).
13. Joët T, Laffargue A, Salmona J, Doulbeau S, Descroix F, Bertrand B, Kochko Ad and Dussert S, Metabolic pathways in tropical dicotyledonous albuminous seeds: Coffea arabica as a case study. New Phytologist 182:146–162 (2009).
14. Holscher H and Steinhart H, Aroma compounds in green coffee, in Food flavours – Generation, Analysis and Process Influence, ed. by Charalambous G. Elsevier Science, Amsterdam, pp 785-803 (1995).
15. Oestreich-Janzen S, Chemistry of coffee, in Comprehensive Natural Products II: Chemistry & Biology, Ed by L. M and Liu H-W. CAFEA GmbH, Hamburg, Germany (2010).
16. Grosch W, Chemistry III: Volatile compounds, in Coffee Recent development, ed. by Clarke RJ and Vitzthum OG. Blackwell Science (2001).
17. Sunarharum WB, Williams DJ and Smyth HE, Review: Complexity of coffee flavor: A compositional and sensory perspective. Food Research International 62:315–325 (2014).
18. Buffo RA and Cardelli-Freire C, Coffee flavour: an overview. Flavour And Fragrance Journal 19:99–104 (2004).
19. Mestdagh F, Wocheslander S, Rodriguez A, Egli A, Davidek T and Blank I, Conversion of green coffee precursors into flavour during roasting of arabica and robusta coffees, in ASIC: The 25th International conference on coffee and science, Ed, Colombia (2014).
20. Vossen Hvd, Bertrand B and Charrier A, Next generation variety development for sustainable production of arabica coffee (Coffea arabica L.): a review. Euphytica (2015).
21. Campa C, Ballester JF, Doulbeau S, Dussert S, Hamon S and Noirot M, Trigonelline and sucrose diversity in wild Coffea species. Food Chemistry 88:39-43 (2004).
22. Campa C, Doulbeau S, Dussert S, Hamon S and Noirot M, Diversity in bean caffeine content among wild Coffea species: evidence of a discontinuous distribution. Food Chemistry 91:633–637 (2005a).
23. Ky C-L, Louarn J, Dussert S, Guyot B, Hamon H and Noirot M, Caffeine, trigonelline, chlorogenic acids and sucrose diversity in wild Coffea arabica L. and C. canephora P. accessions. Food Chemistry 75:223–230 (2001b).
24. Mazzafera P, Trigonelline in coffee. Phytochemistry 30:2309 -2310 (1991).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e25. Bertrand B, Villarreal D, Laffargue A, Posada H, Lashermes P and Dussert S, Comparison of the effectiveness of fatty acids, chlorogenic acids, and elements for the chemometric discrimination of coffee (Coffea arabica L.) varieties and growing origins. Agricultural and Food chemistry 56:2273–2280 (2008).
26. Villarreal D, Laffargue A, Posada H, Bertrand B, Lashermes P and Dussert S, Genotypic and environmental effects on coffee (Coffea arabica L.) bean fatty acid profile: impact on variety and origin chemometric determination. Journal of Agricultural and Food Chemistry 57:11321-11327 (2009).
27. Tessema A, Alamerew S, Kufa T and Garedew W, Genetic diversity analysis for quality attributes of some promissing Coffea arabica germplasm collections in Southwestern Ethiopia. Journal of Biological Sciences 11:236-244 (2011).
28. Moschetto D, Montagnon C, Guyot B, Perriot JJ, Leroy T and Eskes A, Studies on the effect of genotype on cup quality of Coffea canephora. Tropical Science 36:18‐31 (1996).
29. Wintgens JN, Factors influencing the Quality of Green Coffee, in Coffee: Growing, Processing, Sustainable Production - A Guidebook for Growers, Processors, Traders, and Researchers, Ed by Wintgens JN. WILEY-VCH Verlag GmbH & Co. KCaA (2012).
30. Bertrand B, Vaast P, Alpizar E, Etienne H, Davrieux F and Charmetant P, Comparison of bean biochemical composition and beverage quality of Arabica hybrids involving Sudanese-Ethiopian origins with traditional varieties at various elevations in Central America. Tree Physiology 26:1239-1248 (2006).
31. Bertrand B, Boulanger R, Dussert S, Ribeyre F, Berthiot L, Descroix F and Joet T, Climatic factors directly impact the volatile organic compound fingerprint in green Arabica coffee bean as well as coffee beverage quality. Food Chemistry 135:2575-2583 (2012).
32. Joët T, Laffargue A, Descroix F, Doulbeau S, Bertrand B, kochko Ad and Dussert S, Influence of environmental factors, wet processing and their interactions on the biochemical composition of green Arabica coffee beans. Food Chemistry 118:693-701 (2010).
33. Barbosa JN, Borém FM, Cirillo MÂ, Malta MR, Alvarenga AA and Alves HMR, Coffee quality and its interactions with environmental factors in Minas Gerais, Brazil. Journal of Agricultural Science 4 (2012).
34. Muschler RG, Shade improves coffee quality in a sub-optimal coffee-zone of Costa Rica. Agroforestry systems 51:131-139 (2001).
35. Duarte GS, Pereira AA and Farah A, Chlorogenic acids and other relevant compounds in Brazilian coffees processed by semi-dry and wet post-harvesting methods. Food Chemistry 118:851-855 (2010).
36. Avelino J, Barboza B, Araya JC, Fonseca C, Davrieux F, Guyot B and Cilas C, Effects of slope exposure, altitude and yield on coffee quality in two altitude terroirs of Costa Rica, Orosi and Santa Mar´ıa de Dota. Journal of the Science of Food and Agriculture 85:1869–1876 (2005).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e37. Decazy F, Avelino J, Guyot B, Perriot JJ, Pineda C and Cilas C, Quality of different Honduran coffees in relation to several environments. Journal of Food Science 68:2356-2361 (2003).
38. Borém FM, Ribeiro DR, Taveira JHS, Prado MVB, Ferraz V, Tosta MF, Luz MPS and Cortez RM, Genotype and environment Interaction in chemical composition and sensory quality of natural coffees, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).
39. Joët T, Bertrand B and Dussert S, Environmental effects on coffee seed biochemical composition and quality attributes: a genomic perspective, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014b).
40. Santos CAFd, Leitão AE, Pais IP, Lidon FC and Ramalho JC, Perspectives on the potential impacts of climate changes on coffee plant and bean quality. Emir J Food Agric 27:152-163 (2015).
41. Lecolier A, Verdeil J-L, Escoute J, Chrestin H and Noirot M, Laurina mutation affected Coffea arabica tree size and shape mainly through internode dwarfism. Trees 23:1043–1051 (2009).
42. Nic-Can GI and Pena CDl, Chapter 6: Epigenetic Advances on Somatic Embryogenesis of Agronomical and Important Crops, in Epigenetics in plants of agronomic importance: Fundamentals and applications transcriptional regulation and chromatin remodelling in plants, ed. by Alvarez-Venegas R, Peña CDl, Armando J and Casas-Mollano. Springer (2014).
43. Landey RB, Cenci A, Georget F, Bertrand B, Camayo G, Dechamp E, Herrera JC, Santoni S, Lashermes P, Simpson J and Etienne H, High genetic and epigenetic stability in Coffea arabica plants derived from embryogenic suspensions and secondary embryogenesis as revealed by AFLP, MSAP and the phenotypic variation rate. PLOS ONE 8 (2013).
44. Lashermes P, Combes M-C, Hueber Y, Severac D and Dereeper A, Genome rearrangements derived from homoeologous recombination following allopolyploidy speciation in coffee. The Plant Journal 78:674–685 (2014).
45. Bottley A, Chapter 3: Epigenetic variation amongst polyploidy crop species, in Epigenetics in plants of agronomic importance: Fundamentals and applications transcriptional regulation and chromatin remodelling in plants, ed. by Alvarez-Venegas R, Peña CDl, Armando J and Casas-Mollano. Springer (2014).
46. Vidal RO, Mondego JM, Pot D, Ambrosio AB, Andrade AC, Pereira LF, Colombo CA, Vieira LG, Carazzolle MF and Pereira GA, A high-throughput data mining of single nucleotide polymorphisms in Coffea species expressed sequence tags suggests differential homeologous gene expression in the allotetraploid Coffea arabica. Plant Physiol 154:1053-1066 (2010).
47. Combes M-C, Dereeper A, Severac D, Bertrand B and Lashermes P, Contribution of subgenomes to the transcriptome and their intertwined regulation in the allopolyploid Coffea arabica grown at contrasted temperatures. New phytologist 200:251–260 (2013).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e48. Montagnon C, Guyot B, Cilas C and Leroy T, Genetic parameters of several biochemical compounds from green coffee, Coffea canephora. Plant Breeding 117:576-578 (1998).
49. Ky C-L, Louarn J, Guyot B, Charrier A, Hamon H and Noirot M, Relations between and inheritance of chlorogenic acid contents in an interspecific cross between Coffea pseudozanguebariae and Coffea liberica var ‘dewevrei’. Theoretical and Applied Genetics 98:628-637 (1999).
50. Ky C-L, Guyot B, Louarn J, Hamon S and Noirot M, Trigonelline inheritance in the interspecific Coffea pseudozanguebariae × C. liberica var. dewevrei cross. Theoretical and Applied Genetics 102:630–634 (2001).
51. Leroy T, Bellis F, Legnate H and Kananura E, Improving the quality of African robustas: QTLs for yield- and quality-related traits in Coffea canephora. Tree Genetics & Genomes 7:781-798 (2011).
52. Mazzafera P and Carvalho A, Breeding for low seed caffeine content of coffee (Coffea L.) by interspecific hybridization. Euphytica 59:55–60 (1992).
53. Barre P, Akaffou S, Louarn J, Charrier A, Hamon S and Noirot M, Inheritance of caffeine and heteroside contents in an interspecific cross between a cultivated coffee species Coffea liberica var dewevrei and a wild species caffeine-free C. pseudozanguebariae. Theoretical and Applied Genetics 96:306-311 (1998).
54. Akaffou DS, Hamon P, Doulbeau S, Keli J, Legnate H, Campa C, Hamon S, Kochko A and Zoro BIA, Inheritance and relationship between key agronomic and quality traits in an interspecific cross between Coffea pseudozanguebariae Bridson and C. canephora Pierre. Tree Genetics & Genomes 8:1149-1162 (2012).
55. Ky CL, Doulbeau S, Akkaffou S, Charrier A, Hamon S, Louarn J and Noirot M, Inheritance of coffee bean sucrose content in the interspecific cross Coffea pseudozanguebariae × Coffea liberica dewevrei. Plant Breeding 119:165-168 (2000b).
56. Mérot-L’Anthoëne V, Mangin B, Lefebvre-Pautigny F, Jasson S, Rigoreau M, Husson J, Lambot C and Crouzillat D, Comparison of three QTL detection models on biochemical, sensory, and yield characters in Coffea canephora. Tree Genetics & Genomes (2014).
57. Priolli RHM, Paulo, Siqueira WJ, Möller M, Zucchi MI, Ramos LCS, Gallo PB and Colombo CA, Caffeine inheritance in interspecific hybrids of Coffea arabica x Coffea canephora (Gentianales, Rubiaceae). Genetics and Molecular Biology 31:498-504 (2008).
58. Bertrand B, Guyot B, Anthony F and Lashermes P, Impact of the Coffea canephora gene introgression on beverage quality of C. arabica. Theor Appl Genet 107:387–394 (2003).
59. WCR, Assessment of Genetic Diversity in Coffea arabica. World Coffee Research 2014 Annual Report (2014).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e60. Joët T, Laffargue A, Salmona J, Doulbeau S, Descroix F, Bertrand B, Lashermes P and Dussert S, Regulation of galactomannan biosynthesis in coffee seeds. Journal of Experimental Botany 65:323–337 (2014a).
61. Dias EC, Paiva LC, Leal RS, Pimenta C and Kieckbusch TG, Effect of processing and roasting conditions on the quality and chemical composition of coffee produced from different regions in Brazil, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).
62. Motta LB, Soares TCB, Ferrão MAG, Caixeta ET, Lorenzoni RM and Neto JDdS, Molecular characterization of arabica and conilon coffee plants genotypes by SSR and ISSR markers. Brazilian archives of biology and technology 57:728-735 (2014).
63. Al-Murish TM, Elshafei AA, Al-Doss AA and Barakat MN, Genetic diversity of coffee (Coffea arabica L.) in Yemen via SRAP, TRAP and SSR markers. Journal of Food, Agriculture & Environment 11:411-416 (2013).
64. Mishra MK, Sandhyarani N, Suresh N, Kumar SS, Soumya PR, Yashodha MH, Bhat A and Jayarama, Genetic diversity among Indian coffee cultivars determined via molecular markers. Journal of Crop Improvement 26:727–750 (2012b).
65. Vieira ESN, Von Pinho EVD, Carvalho MGG, Esselink DG and Vosman B, Development of microsatellite markers for identifying Brazilian Coffea arabica varieties. Genetics and Molecular Biology 33:507-U120 (2010).
66. Tran HT, Genetic variation in cultivated coffee (Coffea arabica L.) accessions in northern New South Wales, Australia, Ed. Southern Cross University, NSW, Australia (2005).
67. Maluf MP, Silvestrini M, Ruggiero LMdC, Filho OG and Colombo CA, Genetic diversity of cultivated coffea arabica inbred lines assessed by RAPD, AFLP and SSR marker systems. Sci Agric (Piracicaba, Braz) 62:366-373 (2005).
68. Masumbuko LI, Bryngelsson T, Mneney EE and Salomon B, Genetic diversity in Tanzanian Arabica coffee using Random Amplified Polymorphic DNA (RAPD) markers. Hereditas 139:56–63 (2003).
69. Steiger DL, Nagai C, Morden PHMCW, Osgood RV and Ming R, AFLP analysis of genetic diversity within and among Coffea arabica cultivars. Theor Appl Genet 105:209–215 (2002).
70. Geleta M, Herrera I, Monzon A and Bryngelsson TV, Article ID 939820, Genetic diversity of arabica coffee (Coffea arabica L.) in Nicaragua as estimated by Simple Sequence Repeat Markers. The Scientific World Journal 2012 (2012).
71. Teressa A, Crouzillat D, Petiard V and Brouhan P, Genetic diversity of Arabica coffee (Coffea arabica L.) collections. EJAST 1:63-79 (2010).
72. Dessalegn Y, Herselman L and Labuschagne MT, AFLP analysis among Ethiopian arabica coffee genotypes. African Journal of Biotechnology 7:3193-3199 (2008b).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e73. Dessalegn Y, Herselman L and Labuschagne M, Comparison of SSR and AFLP analysis for genetic diversity assessment of Ethiopian arabica coffee genotypes. S Afr J Plant Soil 26:119-125 (2009).
74. Lashermes P, Andrzejewski S, Bertrand B, Combes MC, Dussert S, Graziosi G, Trouslot P and Anthony F, Molecular analysis of introgressive breeding in coffee (Coffea arabica L.). Theor Appl Genet 100:139–146 (2000).
75. Anthony F, Bertrand B, Quiros O, Wilches A, Lashermes P, Berthaud J and Charrier A, Genetic diversity of wild coffee (Coffea arabica L.) using molecular markers. Euphytica 118:53–65 (2001).
76. Aerts R, Berecha G, Gijbels P, Hundera K, Glabeke SV, Vandepitte K, Muys B, Roldan-Ruiz I and Honnay O, Genetic variation and risks of introgression in the wild Coffea arabica gene pool in South-Western Ethiopian montane rainforests. Evolutionary Applications:243-252 (2012).
77. Aga E, Bryngelsson T, Bekele E and Salomon B, Genetic diversity of forest arabica coffee (Coffea arabica L.) in Ethiopia as revealed by Random Amplified Polymorphic DNA (RAPD) analysis. Hereditas 138:36–46 (2003).
78. Silvestrini M, Junqueira MG, Favarin AC, Guerreiro-Filho O, Maluf MP, Silvarolla MB and Colombo CA, Genetic diversity and structure of Ethiopian, Yemen and Brazilian Coffea arabica L. accessions using microsatellites markers. Genet Resour Crop Evol 54:1367–1379 (2007).
79. López-Gartner G, Cortina H, McCouch SR and Moncada MDP, Analysis of genetic structure in a sample of coffee (Coffea arabica L.) using fluorescent SSR markers. Tree Genetics & Genomes 5:435-446 (2009).
80. Ferrao LFV, Caixeta ET, Souza FdF, Zambolim EM, Cruz CD, Zambolim L and Sakiyama NS, Comparative study of different molecular markers for classifying and establishing genetic relationships in Coffea canephora. Plant Syst Evol 299:225–238 (2013).
81. Leroy T, Bellis FD, Legnate H, Musoli P, Kalonji A, Solorzano RGL and Cubry P, Developing core collections to optimize the management and the exploitation of diversity of the coffee Coffea canephora. Genetica 142:185–199 (2014).
82. Cubry P, Bellis FD, Pot D, Musoli P and Leroy T, Global analysis of Coffea canephora Pierre ex Froehner (Rubiaceae) from the Guineo-Congolese region reveals impacts from climatic refuges and migration effects. Genet Resour Crop Evol 60:483–501 (2013a).
83. Musoli P, Cubry P, Aluka P, Billot C, Dufour M, Bellis FD, Bleysse P, D. , Charrier A and Leroy T, Genetic differentiation of wild and cultivated populations: diversity of Coffea canephora Pierre in Uganda. Genome 52:634-646 (2009).
84. Cubry P, de Bellis F, Avia K, Bouchet S, Pot D, Dufour M, Legnate H and Leroy T, An initial assessment of linkage disequilibrium (LD) in coffee trees: LD patterns in groups of Coffea canephora Pierre using microsatellite analysis. BMC genomics 14:10 (2013b).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e85. Kochko AD, Akaffou S, Andrade AC, Campa C, Crouzillat D, Guyot R, Hamon P, Ming R, Mueller LA, Poncet V, Tranchant-Dubreuil C and Hamon S, Advances in Coffea Genomics Advances in Botanical Research 53:23-63 (2010).
86. Denoeud F, Carretero-Paulet L, Dereeper A and Droc G, The coffee genome provides insight into the convergent evolution of caffeine biosynthesis. Science 345:1181-1184 (2014).
87. Noirot M, Poncet V, Barre P, Hamon P, Hamon S and Kochko AD, Genome size variations in diploid African Coffea species. Annals of Botany 92:709-714 (2003).
88. Razafinarivo NJ, Rakotomalala J-J, Brown SC, Bourge M, Hamon S, Kochko A, Poncet V, Dubreuil-Tranchant C, Couturon E, Guyot R and Hamon P, Geographical gradients in the genome size variation of wild coffee trees (Coffea) native to Africa and Indian Ocean islands. Tree Genetics & Genomes 8:1345-1358 (2012).
89. Dereeper A, Bocs S, Rouard M, Guignon V, Ravel S, Tranchant-Dubreuil C, Poncet V, Garsmeur O, Lashermes P and Droc G, The coffee genome hub: a resource for coffee genomes. Nucleic Acids Research (2014).
90. Samson N, Bausher MG, Lee S-B, Jansen RK and Daniell H, The complete nucleotide sequence of the coffee (Coffea arabica L.) chloroplast genome: organization and implications for biotechnology and phylogenetic relationships amongst angiosperms. Plant Biotechnology Journal 5:339–353 (2007).
91. Mueller L, Strickler S, Somingues D, Pereira L, Andrade A, Marraccini P, Ming R, Wai J, Albert V, Giuliano G, Descombes P, Moine D, Guyot R, Poncet V, Hamon P, Hamon S, Tranchant C, De kochko A, Lepelley M, Rigoreau M and Crouzillat D, Towards a better understanding of the Coffea arabica genome structure, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).
92. Morgante M, Scalabrin S, Scaglione D, Cattonaro F, Magni F, Jurman I, Cerutti M, Liverani FS, Navarini L, Terra LD, Pellegrino G, Graziosi G, Vitulo N and Valle G, Progress report on the sequencing and assembly of the allotetraploid Coffea arabica var. Bourbon genome, in Plant and Animal Genome XXIII, Ed, San Diego, CA (2015).
93. Kochko Ad, Crouzillat D, Rigoreau M, Lepelley M, Bellanger L, l'Anthoene VM, Vandecasteele C, Guyot R, Poncet V, Tranchant-Dubreuil C, Hamon P, Hamon S, Couturon E, Descombes P, Moine D, Mueller L, Strickler SR, Andrade A, Luiz-Filipe, Marraccini P, Giuliano G, Fiore A, Pietrella M, Aprea G, Ming R, Wai J, Domingues DS, Paschoal A, Kuhn G, Korlach J, Chin J, Sankoff D, Zheng C and Albert VA, Dihaploid Coffea arabica genome sequencing and assembly, in Plant and Animal Genomes XXIII, Ed, San Diego, CA (2015).
94. Yepes M, Gaitan A, Cristancho MA, Rivera LF, Correa JC, Maldonado CE, Gongora CE, Villegas AM, Posada H, Zimin A, Yorke JA, Mockaitis K and Aldwinckle H, Building high quality reference genome assemblies using PacBio long reads for the allotetraploid Coffea arabica and its diploid ancestral maternal species Coffea eugenioides, in Plant and Animal Genome XXIV, Ed, San Diego, CA (2016).
95. Mahe L, Combes MC and Lashermes P, Comparison between a coffee single copy chromosomal region and Arabidopsis duplicated counterparts evidenced high level
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
esynteny between the coffee genome and the ancestral Arabidopsis genome. Plant Molecular Biology 64:699-711 (2007).
96. Lefebvre-Pautigny F, Wu F, Philippot M, Rigoreau M, Priyono, Zouine M, Frasse P, Bouzayen M, Broun P, Pétiard V, Tanksley SD and Crouzillat D, High resolution synteny maps allowing direct comparisons between the coffee and tomato genomes. Tree Genetics & Genomes 6:565-577 (2010).
97. Guyot R, Lefebvre-Pautigny F, Tranchant-Dubreuil C, Rigoreau M, Hamon P, Leroy T, Hamon S, Poncet V, Crouzillat D and de Kochko A, Ancestral synteny shared between distantly-related plant species from the asterid (Coffea canephora and Solanum Sp.) and rosid (Vitis vinifera) clades. BMC Genomics 13:103 (2012).
98. Yu Q, Guyot R, Kochko Ad, Byers A, rez RN-P, Langston BJ, Dubreuil-Tranchant C, Paterson AH, Poncet Vr, Nagai C and Ming R, Micro-collinearity and genome evolution in the vicinity of an ethylene receptor gene of cultivated diploid and allotetraploid coffee species (Coffea). The Plant Journal 67:305–317 (2011).
99. Cenci A, Combes M-C and Lashermes P, Genome evolution in diploid and tetraploid Coffea species as revealed by comparative analysis of orthologous genome segments. Plant Molecular Biology 78:135–145 (2012).
100. Herrera J-C, Romero J-V, Camayo G-C, Caetano C-M and Cortina H-A, Evidence of intergenomic relationships in triploid hybrids of coffee (Coffea sp.) as revealed by meiotic behavior and genomic in situ hybridization. Tropical Plant Biology 5:207-217 (2012).
101. Pot D, Bouchet S, Marraccini P, De bellis F, Cubry P, Jourdan I, Pereira LFP, Vieira LGE, Ferreira LP, Musoli P, Legnate H and Leroy T, Nucleotide diversity of genes involved in sucrose metabolism. Towards the identification of candidates genes controlling sucrose variability in Coffea sp., in International conference on coffee science - ASIC, Ed, Montpellier, France (2007).
102. Priolli RHG, Ramos LCS, Pot D and Moller M, Construction of a genetic map based on an interspecific F2 population between Coffea arabica and Coffea canephora and its usefulness for quality related traits, in 22nd International Conference on Coffee Science, ASIC 2008, Ed, Campinas, SP, Brazil, pp 882-890 (2009).
103. Resende M, Caixeta E, Alkimin ER, Sousa TV, Resende MDV, Chamala S and Neves LG, High-throughput targeted genotyping of Coffea arabica and Coffea canephora using next generation sequencing, in Plant and Animal Genome XXIV, Ed, San Diego, CA (2016).
104. Leroy T, Marraccini P, Dufour M, Montagnon C, Lashermes P, Sabau X, Ferreira LP, Jourdan I, Pot D, Andrade AC, Glaszmann JC, Vieira LG and Piffanelli P, Construction and characterization of a Coffea canephora BAC library to study the organization of sucrose biosynthesis genes. Theoretical and Applied Genetics 111:1032-1041 (2005).
105. Dereeper A, Guyot R, Tranchant-Dubreuil C and Anthony Fo, BAC-end sequences analysis provides first insights into coffee (Coffea canephora P.) genome composition and evolution. Plant Molecular Biology 83:177-189 (2013).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e106. Noir S, Patheyron S, Combes M-C, Lashermes P and Chalhoub B, Construction and characterisation of a BAC library for genome analysis of the allotetraploid coffee species (Coffea arabica L.). Theoretical and Applied Genetics 109:225-230 (2004).
107. Jones MR, Byers A, Skelton RL, Yu Q, Nagai C and Moore PH, Construction of an arabica coffee BAC library for molecular dissection of an allotetraploid genome., in 21st International Conference on Coffee Science (ASIC), Ed, Montpellier, France, p 49 (2006).
108. Cacao SMB, Silva NV, Domingues DS and Vieira LGE, Construction and characterization of a BAC library from the Coffea arabica genotype Timor Hybrid CIFC 832/2. Genetica 141:217–226 (2013).
109. Fernandez D, Santos P, Agostini C, Bon M and Petito A, Coffee (Coffea arabica L.) genes early expressed during infection by the rust fungus (Hemileia vastatrix). Molecular Genetics and Genomics 5:527–536 (2004).
110. Lin C, Mueller LA, Carthy JM, Crouzillat D, Pétiard V and Tanksley S, Coffee and tomato share common gene repertoires as revealed by deep sequencing of seed and cherry transcripts. Theoretical and Applied Genetics 112:114-130 (2005).
111. Vieira LGE, Andrade AC and Colombo CA, Brazilian coffee genome project: an EST-based genomic resource. Brazilian Journal of Plant Physiology 18:95-108 (2006).
112. Montoya G, Vuong H, Cristancho M, Moncada P and Yepes M, Sequence analysis from leaves, flowers and fruits of Coffea arabica var. Caturra, in 21st International Conference on Coffee Science (ASIC), Ed, Montpellier, France (2006).
113. De Nardi B, Dreos R, Del Terra L and Martellossi C, Differential responses of Coffea arabica L. leaves and roots to chemically induced systemic acquired resistance. Genome 49:1594–1605 (2006).
114. Poncet V, Rondeau M, Tranchant C, Cayrel A, Hamon S, de Kochko A and Hamon P, SSR mining in coffee tree EST databases: potential use of EST–SSRs as markers for the Coffea genus. Molecular Genetics and Genomics 436–449 (2006).
115. Salmona J, Dussert S, Descroix F, de Kochko A, Bertrand B and Joet T, Deciphering transcriptional networks that govern Coffea arabica seed development using combined cDNA array and real-time RT-PCR approaches. Plant Molecular Biology 66:105-124 (2008).
116. Aggarwal R, Hendre P, Varshney R, Bhat P, Krishnakumar V and Singh L, Identification, characterization and utilization of EST-derived genetic microsatellite markers for genome analyses of coffee and related species. Theoretical and Applied Genetics 114:359–372 (2007).
117. Privat I, Bardil A, Gomez AB and Severac D, The 'PUCE CAFE' Project: the first 15K coffee microarray, a new tool for discovering candidate genes correlated to agronomic and quality traits. BMC Genomics 12:5 (2011).
118. Mondego JM, Vidal RO, Carazzolle MF and Tokuda EK, An EST-based analysis identifies new genes and reveals distinctive gene expression features of Coffea arabica and Coffea canephora. BMC Plant Biology 11:30 (2011).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e119. Vega FE, Ebert AW and Ming R, Coffee germplasm resources, genomics, and breeding, in Plant Breeding Reviews, Ed by Janick J. John Wiley & Sons, Inc. (2008).
120. Paillard M, Lashermes P and Pétiard V, Construction of a molecular linkage map in coffee. Theoretical and Applied Genetics 93:41-47 (1996).
121. Pearl H, Nagai C, Moore P, Steiger D, Osgood R and Ming R, Construction of a genetic map for arabica coffee. Theoretical and Applied Genetics 108:829-835 (2004).
122. Nagai C, Jones MR, Byers AE, Adamski DJ and Ming R, Development and characterization of a true F2 population for genetic and QTL mapping in Arabica, in 21st International Conference on Coffee Science Ed, Montpellier, France, pp 771-777 (2007).
123. Lashermes P, Combes MC, Prakash NS, Trouslot P, Lorieux M and Charrier A, Genetic linkage map of Coffea canephora: effect of segregation distortion and analysis of recombination rate in male and female meioses. Genome 44:589-595 (2001).
124. Coulibaly I, Revol B, Noirot M, Poncet V, Lorieux M, Carasco-Lacombe C, Minier J, Dufour M and Hamon P, AFLP and SSR polymorphism in a Coffea interspecific backcross progeny [(C. heterocalyx × C. canephora) × C. canephora]. Theoretical and Applied Genetics 107:1148-1155 (2003).
125. Crouzillat D, Rigoreau M, Bellanger L, Priyono P, Mawardi S, Syahrudi, McCarthy J, Tanksley S, Zaenudin I and Pétiard V, A Robusta consensus genetic map using RFLP and microsatellite markers for the detection of QTL, in 20th International Conference on Coffee Science (ASIC 2004), Ed, Bangalore, India, pp 546-553 (2005).
126. Amidou ND, Michel N, Serge H and Valérie P, Genetic basis of species differentiation between Coffea liberica Hiern and C. canephora Pierre: Analysis of an interspecific cross. Genetic Resources and Crop Evolution 54:1011-1021 (2007).
127. Ky C-L, Barre P, Lorieux M, Trouslot P, Akaffou S, Louarn J, Charrier A, Hamon S and Noirot M, Interspecific genetic linkage map, segregation distortion and genetic conversion in coffee (Coffea sp.). Theoretical and Applied Genetics 101:669-676 (2000).
128. Gartner GAL, McCouch SR and Moncada MDP, A genetic map of an interspecific diploid pseudo testcross population of coffee. Euphytica 192:305-323 (2013).
129. Teixeira-Cabral TA, Sakiyama NS, Zambolim L, Pereira AA and Schuster I, Single-locus inheritance and partial linkage map of Coffea arabica L. Crop Breeding and Applied Biotechnology 4:416-421 (2004).
130. Moncada P, Tovar E, Montoya JC, Gonzalez A, Spindel J and McCouch S, A genetic of linkage map of coffee (Coffea arabica L) and QTL for yield, plant height and bean size, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).
131. Campa C, Noirot M, Bourgeois M, Pervent M, Ky C, Chrestin H, Hamon S and de Kochko A, Genetic mapping of a caffeoyl-coenzyme A 3-0-methyltransferase gene
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
ein coffee trees. Impact on chlorogenic acid content. Theoretical and Applied Genetics 107:751–756 (2003).
132. Ky C-L, Barre P and Noirot M, Genetic investigations on the caffeine and chlorogenic acid relationship in an interspecific cross between Coffea liberica dewevrei and C. pseudozanguebariae. Tree Genetics & Genomes 9:1043-1049 (2013).
133. Hendre PS and Aggarwal RK, DNA markers: Development and application for genetic improvement of coffee, in Genomics Assisted Crop Improvement: Vol 2: Genomics Applications in Crops, Ed by Varshney RK and Tuberosa R. Springer, pp 399–434 (2007).
134. Geromel C, Ferreira LP, Guerreiro SMC, Cavalari AA, Pot D, Pereira LFP, Leroy T, Vieira LGE and Marraccini PMaP, Biochemical and genomic analysis of sucrose metabolism during coffee (Coffea arabica) fruit development. Journal of Experimental Botany 57:3243–3258 (2006).
135. Privat I, Foucrier S, Prins A, Epalle T, Eychenne M, Kandalaft L, Caillet V, Lin C, Tanksley S and Foyer C, Differential regulation of grain sucrose accumulation and metabolism in Coffea arabica (Arabica) and Coffea canephora (Robusta) revealed through gene expression and enzyme activity analysis. New Phytologist 178:781-797 (2008).
136. Mizuno K, Okuda A, Kato M, Yoneyama N, Tanaka H, Ashihara H and Fujimura T, Isolation of a new dual-functional caffeine synthase gene encoding an enzyme for the conversion of 7-methylxanthine to caffeine from coffee (Coffea arabica L.). FEBS letters 534:75-81 (2003b).
137. Simkin AJ, Qian T, Caillet V, Michoux F, Ben Amor M, Lin C, Tanksley S and McCarthy J, Oleosin gene family of Coffea canephora: Quantitative expression analysis of five oleosin genes in developing and germinating coffee grain. Journal of plant physiology 163:691-708 (2006).
138. Mahesh V, Rakotomalala JJ, Le Gal L, Vigne H, De Kochko A, Hamon S, Noirot M and Campa C, Isolation and genetic mapping of a Coffea canephora phenylalanine ammonia-lyase gene (CcPAL1) and its involvement in the accumulation of caffeoyl quinic acids. Plant cell reports 25:986-992 (2006).
139. Lepelley M, Mahesh V, McCarthy J, Rigoreau M, Crouzillat D, Chabrillange N, de Kochko A and Campa C, Characterization, high-resolution mapping and differential expression of three homologous PAL genes in Coffea canephora Pierre (Rubiaceae). Planta 236:313-326 (2012).
140. Perrois C, Strickler S, G M, M L, L B, S M, J H, L M and I. P, Differential regulation of caffeine metabolism in Coffea arabica (Arabica) and Coffea canephora (Robusta). Planta (2014).
141. Mizuno K, Matsuzaki M, Kanazawa S, Tokiwano T, Yoshizawa Y and Kato M, Conversion of nicotinic acid to trigonelline is catalyzed by N-methyltransferase belonged to motif B' methyltransferase family in Coffea arabica. Biochemical and Biophysical Research Communications 452:1060–1066 (2014).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e142. Yuyama PM, Ivamoto ST, Carazzolle MF, Reis Júnior O, Pereira GAG, Sakuray LM, Ruiz M, Leroy T, Charmetant P, Domingues DS and Pereira LFP, Transcriptome analysis of leaves and fruits of Coffea eugenioides, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).
143. Privat I, Perrois C, Mathieu G, Strickler SR, Lepelley M, Bedon L, Michaux S, Husson J and Mueller L, Differential regulation of caffeine metabolism in Coffea arabica and Coffea canephora, in The 25th International conference on coffee and science ASIC 2014, Ed, Colombia (2014).
144. Joët T, Salmona J, Laffargue A, Descroix F and Dussert S, Use of the growing environment as a source of variation to identify the quantitative trait transcripts and modules of co‐expressed genes that determine chlorogenic acid accumulation. Plant, cell & environment 33:1220-1233 (2010b).
145. Sexton TR, Henry RJ, Harwood CE, Thomas DS, McManus LJ, Raymond C, Henson M and Shepherd M, Pectin Methylesterase genes influence solid wood properties of Eucalyptus pilularis. Plant Physiology 158:531–541 (2012).
146. Hall D, Tegstrom C and Ingvarsson PK, Using association mapping to dissect the genetic basis of complex traits in plants. Briefings in functional genomics 9:157-165 (2010).
147. Henry RJ, Next-generation sequencing for understanding and accelerating crop domestication. Briefings in functional genomics 11:51-56 (2011).
148. Henry RJ, Edwards M, Waters DL, Bundock P, Sexton TR, Masouleh AK, Nock CJ and Pattemore J, Application of large-scale sequencing to marker discovery in plants. Journal of biosciences 37:829-841 (2012).
149. Varshney RK, Terauchi R and McCouch SR, Harvesting the Promising Fruits of Genomics: Applying Genome Sequencing Technologies to Crop Breeding. PLOS biology 12 (2014).
150. Kumar S, Banks TW and Cloutier S, SNP discovery through next-generation sequencing and its applications. International journal of plant genomics 2012 (2012).
151. Edwards M, Whole-Genome Sequencing for Marker Discovery, in Molecular Markers in Plants, Ed by Henry RJ. Wiley-Blackwell (2013).
152. Gao Q, Yue G, Li W, Wang J, Xu J and Yin Y, Recent Progress Using High‐throughput Sequencing Technologies in Plant Molecular Breeding. Journal of integrative plant biology 54:215-227 (2012).
153. Illumina, An introduction to Next-generation sequencing technology. (2013).
154. Whitt SR and Buckler IV ES, Using natural allelic diversity to evaluate gene function, in Plant Functional Genomics: Methods and Protocols, ed. by Grotewold E. Humana Press, Totowa, NJ, pp 123-139 (2003).
155. Neale DB and Savolainen O, Association genetics of complex traits in conifers. Trends in Plant Science 9:325-330 (2004).
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e
Table 1. Variation of the main biochemical compounds determining quality in coffee
Compound Level C. arabica
(dmb – g kg-1)
C. canephora
(dmb – g kg-1)
Other species
(dmb – g kg-1)
Sucrose Lowest 7.40 4.05 3.80
Highest 11.10 7.05 10.70
Caffeine Lowest 0.96 1.51 0.00
Highest 1.62 3.30 2.64
Trigonelline Lowest 0.88 0.75 0.39
Highest 2.90 1.24 1.77
Lipids Ave. 15 10 8-30
CGAs Lowest 4.34 7 0.8
Highest 4.80 14.4 11.9
Source: (15-18)
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e
Table 2: Inheritance mode and heritability of coffee quality determinants
Trait Intraspecific hybrids Interspecific hybrids
Sucrose content Polygenic inheritance
NSH = 0.11
Male & female additive &
dominant effects
Polygenic & additive inheritance
Higher heritability value
Caffeine Additive inheritance
NSH = 0.33; BSH = 0.76
NSH = 0.80
Female additive effects
Polygenic & additive inheritance
One major gene with two alleles
No caffeine = recessive gene
(caf1)
High BSH
Trigonelline Male & female additive and
dominant effects
NSH = 0.38
Not additive, maternal
inheritance
heritability = 0.71
Fat content NSH = 0.74
Male additive
CGAs NSH = 0.36
Female additive effects
Nuclear inheritance
3-FQA isomer controlled by one
major gene
NSH = narrow-sense heritability; BSH = broad-sense heritability
Source: (41-49)
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e
Table 3. Genome sequence data available for the two main domesticated coffee
species
Whole genome sequence of C. canephora
Genome
size
(Mb)
Coverage
No of
contigs
No of
scaffolds
No of
genes
MicroRNA
precursors
Organellar-to-
nuclear genome
transfers
Est. 710 30 x 25,216 13,345 25,574 92 2,573
Chloroplast genome sequence of C. arabica
Genome
size (bp)
IRs No of
genes
Coding region
protein
genes
tRNA
genes
rRNA
genes
genes containing
introns
155,189 25,943 130 79 29 4 18
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e
Table 4: Bacterial Artificial Chromosome libraries of two main coffee species
Genotypes No of
clones
Ave. size of
clones (kb)
Coverage Authors
C. canephora
IF 126 55,296 135 10.6 x Leroy et al., 2005
IF 200 DH 36,864 166 8.6 x Dereeper et al., 2013
IF 200 DH 36,864 121 6.3 x Dereeper et al., 2013
C. arabica
IAPAR 59 80,813 130 8 x Noir et al., 2004
Hybrid (Mokka x Catimor) 52,416 94 4 x Jones et al., 2006
Timor Hybrid CIFC 832/2 56,832 118 5-6 x Cacao et al., 2013
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e
Table 5: Expressed Sequence Tags from the three main coffee species
Project/
organisation
Tissues
ESTs of species Unigenes/
genes C.
arabica
C.
canephora
C.
racemosa
Fernandez et al. (98) leaves 527
Nestle and Cornell
Uni
Lin et al. (99)
seeds
47,000 13,175
Brazilian Genome
Vieira et al. (100)
leaves and
fruits
130,792 12,381 10,566 33,000
CENICAFE
Montoya et al. (101)
leaf, flower
and fruit
32,961 10,799
De Nardi et al. (102) leaves and
roots
1,587
French IRD
Poncet et al. (103)
leaves and
fruits
10,420 5,534
Salmona et al. (104) seeds 266
Total (246,500) 166,133 69,801 10,566 62,508
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e
Table 6. Genetic linkage maps of coffee
Markers Population used Length
(cM)
No of
linkage
groups
Traits/
purpose
Authors
C. arabica
AFLP pseudo-F2
population
(Mokka hybrid x
Catimor)
1,802.8 31 source-sink traits (110)
AFLP F2 of Tall Mokka and
Catimor
1,042.4 40 cupping quality
and morphology
(111)
AFLP and SSR F2 of C. arabica x C.
canephora
1,011 37 quality and
productivity
(92)
RAPD F2 of (Mundo Nov x
Hybrido de Timor) x
Hybrido de Timor
540.6 8 Partial linkage
map
(118)
SSR F2 and F3 of Caturra
x CCC1046
3800 22 yield, plant height and fruit size
(119)
C. canephora
RFLP and
RAPD
doubled haploid
(IF200)
1,402 15 QTL analysis (109)
RFLP and SSR doubled haploid and 1,041 11 segregation (112)
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eTC (IF200 x DH) distortion
RFLP and SSR (C. heterocalyx x C.
canephora) x C.
canephora
1,360 15 QTL analysis (113)
RFLP and SSR BP409 x Q121 1,258 11 yield and vigour (114)
AFLP and
ISSR
(C.canephora x C.
liberica) x C. liberica
1,502 16 morphological
traits
(115)
Conserved
Ortholog Set
(COS)
Intraspecific hybrid
of C. canephora
1,331 11 synteny maps
between coffee
and tomato
(86)
SSR pseudo-backcross of
C. canephora
1,290 11 yield and quality (44)
SSR 2 populations of
intraspecific hybrid
of C. canephora
1,201 11 yield and quality (49)
Other species
AFLP and
RFLP
(C.
pseudozanguebariae
x C. liberica var.
dewevrei (DEW)) x
DEW
1,144 14 biochemical
traits (caffeine,
CGAs, sucrose)
(116)
SSR C. liberica x C.
eugenioides
798.68 11 QTL analysis (117)
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
e
Table 7: Quantitative Trait Loci and Linkage Groups in coffee
Traits No of QTLs and LGs
C. canephora Other species
Sucrose 9 QTLs/A, E, I, J
Caffeine 14 QTLs/A, C, E, I, K 2 QTLs/A, G
Trigonelline 5 QTLs/F, G, I, K 1 QTL/G
Lipids 7 QTLs/B, C, E, G, I
CGAs 23 QTLs/A, B, D, E, F, I, J, K 1 QTL/A
Cup quality 14 QTLs/A, B, D, E, G, H, I
Source: (43, 44, 49, 120, 121)
This article is protected by copyright. All rights reserved.
Acc
epte
d A
rticl
eTable 8: Genes controlling biochemical compounds determining quality traits in coffee
Species Sucrose Caffeine Trigonelline Fatty acids CGAs
C. arabica CaSUS1 &
CaSUS2,
CaInv3
CmXRS1, CTS2, CCS1, CaXMT1-2, CaMXMT1-2, and CaDXMT1-2
CTgS1 &
CTgS2
OLE2,
DGAT
4CL8, HQT,
F5H1 & POD
C.
canephora
CcSUS1,
CcSUS2
& 8 new
genes
CmXRS1, CTS2,
CCS1 &
23 genes
relating to
alkaloid
catalytic
N/A CcOLE1-5,
CcSTO1
& 15 other
genes
Many genes
involved in
phenylpropa
-noid
pathway
Source: (7, 44, 79, 93, 106, 120, 123-130)
This article is protected by copyright. All rights reserved.