adaptive evolution of the retinoid x receptor in vertebratesadaptive evolution of the retinoid x...

9
Adaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a, b , L. Filipe C. Castro a , Rute R. da Fonseca a , Maria A. Reis-Henriques a , Vítor Vasconcelos a, b , Miguel M. Santos a, b , Agostinho Antunes a, b, a CIMAR/CIIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177; 4050-123 Porto, Portugal b Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal abstract article info Article history: Received 7 October 2011 Accepted 5 December 2011 Available online 14 December 2011 Keywords: Adaptive evolution Retinoid X receptor Positive selection Retinoid X receptors (RXR) are transcription factors with important roles in development, reproduction, homeostasis, and cell differentiation. Different types of vertebrate RXRs (α (RXRA), ß (RXRB) and γ (RXRG)) have arisen from multiple duplication events. The adaptive evolution mechanism that has preserved duplicate RXR paralogs, as well as their role in development and adaptation, is thus far unknown. In this work, we have investigated different aspects of vertebrate RXR evolution. Codon based tests of positive selection identied that RXR was under signicant positive selection immediately after the whole genome duplications in verte- brates. Amino acid based rate shift analysis also revealed signicant rate shifts immediately after the whole ge- nome duplications and functional divergence between all the pairs of RXRs. However, the extant RXR genes are highly conserved, particularly the helix involved in dimerization and the DNA-binding domain, but positively se- lected sites can nevertheless be found in domains for RXR regulation. © 2011 Elsevier Inc. All rights reserved. 1. Introduction Retinoid X receptors (RXR; NR2B), are transcription factors that mediate an array of extracellular signals in a ligand dependent manner, to regulate the target gene by binding to response elements within the promoter region of those genes. RXR regulates many biological func- tions in vertebrates such as development, reproduction, homeostasis and cell differentiation [13]. Their disruption has been associated with a vast array of developmental and reproductive abnormalities (e.g. reduced testicular development and fertility, masculinization of female gastropodsimposex) in wildlife and humans [4]. RXRs are members of the nuclear receptor super-family. The canonical structure of a nuclear receptor follows a common pattern including the N-terminal A/B domain, a DNA-binding domain and a ligand-binding domain. RXRs bind to their targets often called response elements, which may be single elements or repeats, arranged in a direct, inverted or everted manner, of a consensus sequence 'AGGTCA'. These repeat elements require the formation of dimers, and RXR is an important heterodimerization partner for many other nuclear receptors [5]. RXR can also form homodimers, suggesting an indepen- dent signaling pathway, but its exact biological role remains elusive [5]. The key role of RXR in the heterodimerization with other nuclear receptors, and thus its interference in multiple signaling pathways, makes it an interesting therapeutic target for treatment of diseases like cancer and metabolic syndrome [6]. The other major roles of RXRs are in the embryonic development, differentiation, organogenesis and cell proliferation. Due to its ubiquitous presence within metazoans and its activation by low molecular weight ligands, RXR is a prime target of environmental pollutants, both in vertebrates and invertebrates, which may be a cause of cancer and endocrine disruption [7,8]. RXR has been found to bind 9-cis retinoic acid (9-cis RA) with high afnity [9], suggesting a role in the retinoic acid signaling pathway in addition to its heterodimerization partner status. Although 9-cis RA has not been clearly detected in mammalian cells [10], it has recently been identied in teleost sh [11] and invertebrate tissues [12]. Since 9-cis RA can also act as a ligand to the retinoic acid receptors (RARs) its role as the natural ligand of the RXR has been questioned. Also phytanic acid and docosahexaenoic acid have been proposed as ligands of the RXR [13,14]. One or more RXR genes are found in most metazoan taxa from placozoans to vertebrates [15]. In vertebrates, there are typically three copies of the gene, RXRA (NR2B1/RXRα), RXRB (NR2B2/RXRβ) and RXRG (NR2B3/RXRγ), which arose due to the two rounds of genome duplications (2R) in the vertebrate ancestor [16]. In shes, owing to a specic whole genome duplication (3R) there is an additional RXRB gene [16], but only in zebrash (among the sequenced teleost genomes), additional copies of the RXRA and the RXRG genes are found [17] (Fig. 1). In vertebrates, RXRB is the ubiquitously expressed subtype [9], RXRA is mainly expressed in the liver, kidney, epidermis, intestine and dominates the RXR expression in the skin, while RXRG shows a restricted expression in muscles, pituitary gland and certain regions of the brain (see www.nursa.org/10.1621/datasets.02001 for more details about expression of different RXR subtypes). Genomics 99 (2012) 8189 Corresponding author at: CIMAR/CIIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177; 4050-123, Porto, Portugal. Fax: +351 22 339 06 08. E-mail address: [email protected] (A. Antunes). 0888-7543/$ see front matter © 2011 Elsevier Inc. All rights reserved. doi:10.1016/j.ygeno.2011.12.001 Contents lists available at SciVerse ScienceDirect Genomics journal homepage: www.elsevier.com/locate/ygeno

Upload: others

Post on 19-Jan-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Adaptive evolution of the Retinoid X receptor in vertebratesAdaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca

Genomics 99 (2012) 81–89

Contents lists available at SciVerse ScienceDirect

Genomics

j ourna l homepage: www.e lsev ie r .com/ locate /ygeno

Adaptive evolution of the Retinoid X receptor in vertebrates

Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca a, Maria A. Reis-Henriques a, Vítor Vasconcelos a,b,Miguel M. Santos a,b, Agostinho Antunes a,b,⁎a CIMAR/CIIMAR, Centro Interdisciplinar de Investigação Marinha e Ambiental, Universidade do Porto, Rua dos Bragas, 177; 4050-123 Porto, Portugalb Departamento de Biologia, Faculdade de Ciências, Universidade do Porto, Rua do Campo Alegre, 4169-007 Porto, Portugal

⁎ Corresponding author at: CIMAR/CIIMAR, Centro InMarinha e Ambiental, Universidade do Porto, Rua dosPortugal. Fax: +351 22 339 06 08.

E-mail address: [email protected] (A. Antunes)

0888-7543/$ – see front matter © 2011 Elsevier Inc. Alldoi:10.1016/j.ygeno.2011.12.001

a b s t r a c t

a r t i c l e i n f o

Article history:Received 7 October 2011Accepted 5 December 2011Available online 14 December 2011

Keywords:Adaptive evolutionRetinoid X receptorPositive selection

Retinoid X receptors (RXR) are transcription factors with important roles in development, reproduction,homeostasis, and cell differentiation. Different types of vertebrate RXRs (α (RXRA), ß (RXRB) and γ (RXRG))have arisen from multiple duplication events. The adaptive evolution mechanism that has preserved duplicateRXR paralogs, as well as their role in development and adaptation, is thus far unknown. In this work, we haveinvestigated different aspects of vertebrate RXR evolution. Codon based tests of positive selection identifiedthat RXR was under significant positive selection immediately after the whole genome duplications in verte-brates. Amino acid based rate shift analysis also revealed significant rate shifts immediately after the whole ge-nome duplications and functional divergence between all the pairs of RXRs. However, the extant RXR genes arehighly conserved, particularly the helix involved in dimerization and the DNA-binding domain, but positively se-lected sites can nevertheless be found in domains for RXR regulation.

© 2011 Elsevier Inc. All rights reserved.

1. Introduction

Retinoid X receptors (RXR; NR2B), are transcription factors thatmediate an array of extracellular signals in a ligand dependent manner,to regulate the target gene by binding to response elements within thepromoter region of those genes. RXR regulates many biological func-tions in vertebrates such as development, reproduction, homeostasisand cell differentiation [1–3]. Their disruption has been associatedwith a vast array of developmental and reproductive abnormalities(e.g. reduced testicular development and fertility, masculinization offemale gastropods—imposex) in wildlife and humans [4].

RXRs are members of the nuclear receptor super-family. Thecanonical structure of a nuclear receptor follows a common patternincluding the N-terminal ‘A/B domain’, a DNA-binding domain and aligand-binding domain. RXRs bind to their targets often calledresponse elements, which may be single elements or repeats, arrangedin a direct, inverted or everted manner, of a consensus sequence'AGGTCA'. These repeat elements require the formation of dimers, andRXR is an important heterodimerization partner for many other nuclearreceptors [5]. RXR can also form homodimers, suggesting an indepen-dent signaling pathway, but its exact biological role remains elusive[5]. The key role of RXR in the heterodimerization with other nuclearreceptors, and thus its interference in multiple signaling pathways,makes it an interesting therapeutic target for treatment of diseases

terdisciplinar de InvestigaçãoBragas, 177; 4050-123, Porto,

.

rights reserved.

like cancer and metabolic syndrome [6]. The other major roles of RXRsare in the embryonic development, differentiation, organogenesis andcell proliferation. Due to its ubiquitous presence within metazoansand its activation by lowmolecularweight ligands, RXR is a prime targetof environmental pollutants, both in vertebrates and invertebrates,which may be a cause of cancer and endocrine disruption [7,8].

RXR has been found to bind 9-cis retinoic acid (9-cis RA) with highaffinity [9], suggesting a role in the retinoic acid signaling pathway inaddition to its heterodimerization partner status. Although 9-cis RAhas not been clearly detected in mammalian cells [10], it has recentlybeen identified in teleost fish [11] and invertebrate tissues [12]. Since9-cis RA can also act as a ligand to the retinoic acid receptors (RARs)its role as the natural ligand of the RXR has been questioned. Alsophytanic acid and docosahexaenoic acid have been proposed asligands of the RXR [13,14].

One or more RXR genes are found in most metazoan taxa fromplacozoans to vertebrates [15]. In vertebrates, there are typically threecopies of the gene, RXRA (NR2B1/RXRα), RXRB (NR2B2/RXRβ) andRXRG (NR2B3/RXRγ), which arose due to the two rounds of genomeduplications (2R) in the vertebrate ancestor [16]. In fishes, owing to aspecific whole genome duplication (3R) there is an additional RXRBgene [16], but only in zebrafish (among the sequenced teleost genomes),additional copies of the RXRA and the RXRG genes are found [17] (Fig. 1).

In vertebrates, RXRB is the ubiquitously expressed subtype [9],RXRA is mainly expressed in the liver, kidney, epidermis, intestineand dominates the RXR expression in the skin, while RXRG shows arestricted expression in muscles, pituitary gland and certain regionsof the brain (see www.nursa.org/10.1621/datasets.02001 for moredetails about expression of different RXR subtypes).

Page 2: Adaptive evolution of the Retinoid X receptor in vertebratesAdaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca

Fig. 1. Schematic RXR gene family tree showing the orthologous relationships of the Teleost RXR genes: the post-duplication branches tested are shown in red, positive selectedbranches are marked with a star symbol and rate-shifting branches are marked with a blue circle, the values for functional divergence (θ) between different clades of RXRs arealso shown.

82 S. Philip et al. / Genomics 99 (2012) 81–89

The main goal of this study was to assess the adaptive evolution ofthe RXR genes in vertebrates using a comparative genomics framework.We compared the rates of synonymous (silent; dS) and non-synonymous (amino acid replacement; dN) substitutions andconducted functional divergence analyses of the vertebrate RXRgenes.Moreover, we studied the synteny of teleost RXR genes to retraceits evolutionary history in the vertebrates and we further evaluated thechanges of the RXR expression patterns across vertebrates. We foundthat all the branches immediately following the first round (1R) and2R of genome duplication were under positive selection and all thepairs of RXRs produced from these two rounds of duplication werefunctionally divergent. Similarly, the paralogs that resulted from the3R (third round/fish specific) genomeduplicationwere also functionallydivergent. Positive selected sites were identified mainly in the N-terminal region and the Ligand-Binding domain (LBD), which harborsregions responsible for the expression of the protein. To clarify the effectof genome duplication on the RXR expression we evaluated expressiondatabases for the patterns of zebrafish RXRs (duplicates) comparativelyto the mouse RXRs (singletons), which suggested expression shufflingamong the paralogs of zebrafish. Finally, we provide evidence that theasymmetric distribution of RXR genes in teleosts (in comparison tozebrafish) was due to secondary gene loss events.

2. Results

2.1. Analysis of synteny: Gene loss in medaka, fugu, stickleback andtetraodon

The synteny analysis between the fish (medaka and zebrafish)chromosomes containing the RXR genes and their human

counterparts (Sl Fig. 1a and b), confirmed the orthologous relation-ship between the genes from both groups. The existence of otherco-orthologous genes in the fish chromosomes (four chromosomesin the case of medaka and six in the case of zebrafish) containing theRXR gene when comparing them to the three human chromosomes(each with one RXR) suggests that the fish chromosomes are productsof an ancient duplication event, supporting that the additional RXRgenes in the teleost genomes are actually products of the ancestralteleost specific genome duplication [16,17].

The teleost chromosome evolutionary model [18] suggests thatthe ancestral teleost had 24 chromosomes post-teleost specificgenomeduplication. Themedaka and the fugu genomes have preservedthat same condition till date (for ~350 million years [18]; Sl Fig. 2)while the zebrafish genome suffered lineage-specific chromosomalrearrangements after the ancestral teleost fish genome duplicationevent. The chromosome 20 of the zebrafish and the chromosome 24of the medaka genome are thought to be the products of a same ances-tral chromosome and no ancestral rearrangement events have beendocumented leading to the chromosome 20 in zebrafish [18]. However,while the chromosome 20 of zebrafish harbors one RXR gene (rxrgb),the chromosome 24 of themedaka genome does not (Fig. 2a). Our anal-ysis of conserved synteny for the chromosomes, between the region33.5 Mb to 36.7 Mb of the chromosome 20 of the zebrafish (rxrgb isfound on 33.90–33.94 Mb) and the chromosome 24 of the medaka ge-nome revealed several neighboring orthologous (co-orthologous)genes but the RXRG was missing from medaka chromosome 24(Fig. 2b and Sl Fig. 3), signaling a gene loss event. This trend of con-served synteny of co-orthologs (genes in the neighboring regions) isalso evident in the comparison between the zebrafish chromosome 20and the stickleback group XVIII (Sl Fig. 3). The searches in the synteny

Page 3: Adaptive evolution of the Retinoid X receptor in vertebratesAdaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca

Fig. 2. Syntenic analyses providing evidences of gene number variation in teleosts: A) The hypothesis of chromosome evolution in teleosts [18] (note that the chromosomal blocksare not up to scale), the ancestor to vertebrates possessed 13 chromosomes which duplicated during the teleost specific genome duplication event producing 24 chromosomes inthe teleost ancestor, following the genome duplication there was 8 major genome rearrangements in the teleost ancestor, the medaka genome is almost unchanged after that event~350 million years ago, whereas the zebrafish ancestor had some major and minor genome rearrangements which gave rise to the present day zebrafish genome; The presentchromosomes in the species are placed directly below the ancestral chromosomes in the figure. The major rearrangements are marked by solid lines and minor rearrangementsare marked by dotted lines, Chromosome 20 of zebrafish (which possess the RXRGb) and Chromosome 24 of medaka are products of the same parental chromosome, but themedaka lack and additional copy of RXRG; B) The synteny (gene-trace image—a scale free representation of the orthologous clusters) showing the chromosome 20 of zebrafish(33.5 Mb to 36.7 Mb) to the chromosome 24 of medaka, RXRGb (33.9 Mb) has no ortholog in the medaka genome, whereas there are several neighboring co-orthologs, highlightinga geneloss event.

83S. Philip et al. / Genomics 99 (2012) 81–89

database [19] for syntenic clusters of both rxrga and rxrgb of zebrafishwith medaka and stickleback identified only, the chromosome 4 of me-daka, which contains RXRG genes (clusters: 351475 and 349229) andthe group VII of stickleback, which contains RXRG gene (clusters:372199 and 374691), consistently proving the absence of an additionalgene in these species. Thus, we conclude that some teleosts, such as themedaka, the three-spine stickleback, the fugu and the tetraodon (allwith only 4 RXR genes) have lost an additional copy of the RXRG pro-duced during the whole genome duplication.

Chromosomes 5 and 21 of the zebrafish are products of genomerearrangements in the zebrafish ancestor and these are the chromo-somes harboring the rxrab and rxraa, respectively. The reciprocalblast hit based synteny analysis between the zebrafish and the medakaand stickleback genomes consistently identified chromosome 12 ofmedaka and group XIV of stickleback using both rxrab and rxraa asquery. The chromosomes of these two species share several neighboringco-ortholog genes, for example: the co-orthologous genes of rxraa, gsna,vav2 andwdr5, and co-orthologous genes of rxrab, gsnb and anxawhereidentified on these chromosomes (see Sl Fig. 4), but no other chromo-somes containing an additional RXRA gene could be identified. Theabsence of a second copy of RXRA gene in medaka and stickleback (onany other chromosome), suggests that an additional copy of RXRA hasbeen secondarily lost in these species. The circular plots of the chromo-somes containing RXR genes for zebrafish and medaka also show that

there are several genes orthologous on these chromosomes(Sl Fig. 1c), enabling us to conclude that the teleost specific genomeduplication, not lineage-specific gene gains, has been the reason foradditional RXR genes in teleosts, while there has been lineage-specificgene losses in some of the acanthopterygiian teleosts (medaka, fugu,stickleback and tetraodon) analyzed in this study, there was retentionin the only ostariophysan teleost (zebrafish) analyzed.

2.2. Positive selection on post-duplication branches

The topology testing revealed that the correct trichotomy of theRXR genes was ((RXRG,RXRB),RXRA), which was selected aheadof the ((RXRB,RXRA),RXRG) and ((RXRA,RXRG),RXRB) topologies(Sl Table 1). This topology was used for the branch site and branchmodels implemented in PAML [20]. For the branchmodels, the likelihoodratio test between the alternate and null model likelihoods from PAMLshows that the two ratiomodel fits the data better (Table 1). The averageω value (ω0) identified by the one ratio model was 0.048. The compari-son between the unconstrained two ratiomodel and the constrained tworatio model supported the null model favoring the post-duplicationbranches not to be under positive selection. Thus the branch modelssuggest that the post-duplication branches are under relaxed selectionconstraints. However, the unconstrained two ratio model identified

Page 4: Adaptive evolution of the Retinoid X receptor in vertebratesAdaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca

Table 1Likelihood parameter estimates under lineage-specific models of post-duplication branches of vertebrates.

Model ℓ ω0 ωPDa LRT (2Δ ℓ)

One ratio −32,986.908 0.049 NATwo ratio (ωPD=1)–null model −32,951.918 0.046 1Two ratio −32,951.912 0.047 1.110 Vs one ratio=69.99 (pbb0.01)

Vs two ratio (ωPD=1)=0.013 (NS)

Likelihood parameters of the Branch models of post-duplication branches leading to RXRBa and RXRBb in teleosts

Model ℓ ω0 ωPD LRT (2Δ ℓ)

One ratio—null model −10,522.007 0.057 NATwo ratio (ωPD=1)–null model −10,565.773 0.049 1Two ratio—alternate model −10,516.565 0.053 0.109 Vs one ratio=10.88 (pbb0.01)

Vs two ratio (ωPD=1)=98.4 (pbb0.01)

Branch site analysis of post-duplication branches leading to RXRA, RXRB and RXRG

Model ℓ Proportion of sites ωPDa(2a=2b) LRT (2Δ ℓ)

Branch site null model for the whole vertebrate tree −32,495.157 p0=0.78391 1.0 NAp1=0.05835p2a=0.14681p2b=0.01093

Branch site test for positive selection on ancestral branch of RXRA −32,501.170 p0=0.83949 999.00 12.027 (Pb0.01)p1=0.06194p2a=0.09180p2b=0.00677

Branch site test for positive selection on ancestral branch of RXRB −32,485.585 p0=0.85621 999.00 19.14 (Pb0.01)p1=0.06275p2a=0.07550p2b=0.00553

Branch site test for positive selection on ancestral branch of RXRG −32,482.212 p0=0.77362 999.00 25.89 (Pb0.01)p1=0.05798p2a=0.15666p2b=0.01174

Branch site test for positive selection on ancestral branch of RXRB & RXRG −32,501.170 p0=0.83949 999.00 12.027 (Pb0.01)p1=0.06194p2a=0.09180p2b=0.00677

Branch site analysis of post-duplication branches leading to RXRBa and RXRBb in teleosts

Model ℓ Proportion of sites ωPD (2a=2b) LRT (2Δ ℓ)

Branch site null model for the RXRB tree −10,470.676 p0=0.84682 1.0 NAp1=0.03646p2a=0.11190p2b=0.00482

Branch site test for positive selection on ancestral branch of RXRBa −10,472.312 p0=0.89881 33.97754 3.27 (Pb0.05)p1=0.03430p2a=0.06443p2b=0.00246

Branch site test for positive selection on ancestral branch of RXRBb −10,469.822 p0=0.88937 2.41428 1708 (NS)p1=0.03803p2a=0.06963p2b=0.00298

ωPD=omega value of the post-duplication branches; ω0=omega ratio of all other branches except post-duplication branches; NA=not allowed.a Values of 999 for dN/dS indicate dS=0, so dN/dS is undefined.

84 S. Philip et al. / Genomics 99 (2012) 81–89

significantly increased ω values (ωPD=1.11) in the post-duplicationbranches. For the RXRB gene duplication in the teleosts, initially thetwo ratio model was found to fit the data better against the one ratiomodel (LRT=10.88). The analysis for positive selection against the con-strained Two Ratio (ωPD=1), also favored the unconstrained two ratiomodel (LRT=98.4), however the ωPD value for the unconstrained tworatio model was 0.19 (opposed to ωPD=0.048 in constrained two ratiomodel) which supports a scenario of relaxed selection constraints inthe post-duplication branches (Table 1), similar to the post-duplicationbranches resulting from 2R.

The branch site analysis also revealed that each of the ancestralbranches of the RXR genes resulting from the second round ofwhole genome duplication (2R-WGD), RXRA, RXRG and RXRB werestrongly positive selected using the LRTs (Table 1), and an ω valueof 999 (infinity in the case of each of the post-duplication branch tested,Values of 999 for dN/dS indicate dS=0, so dN/dS is undefined) which

signals that there are no synonymous substitutions at the few codons(~10%) that appear to have come from site classes 2a and 2b (positivelyselected site class in the foreground branches). However, only theancestral branch leading to the rxrba gene that resulted from the thirdround of whole genome duplication in fishes (3R) was positivelyselected (Table 1). In addition, the branch site test segregates theamino acid positions/codons into four different categories. Twodescribesites for which selective pressure does not change over time, eitherunder purifying selection (site class 0,ω0b1) or under neutral evolution(site class 1, ω1=1). The two other categories (site classes 2a and 2b)are sites potentially evolving under positive selection only in theforegroundbranches (ω2>1), and evolving in the background branchesunder purifying selection (site class 2a, background branches ω0b1) orneutral evolution (site class 2b, background branches ω1=1). Onaverage ~10% of the sites were under the site class 2a and ~1% of siteswere under site class 2b (Table 1 and Sl Table 2), which signals that

Page 5: Adaptive evolution of the Retinoid X receptor in vertebratesAdaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca

85S. Philip et al. / Genomics 99 (2012) 81–89

majority of the sites were under the influence of strong purifyingselection during the evolution of RXR, while immediately after duplica-tion a handful of sites were positive selected (on the foregroundbranches), signaling episodic events of positive selection during theevolution of RXR. Most of the positive selected sites (P>0.95) werelocated either in the N-terminal region of the protein or the ligand-binding region of the protein (Fig. 3 and Sl Table 2), except one site inthe RXRB ancestral branch (204-L relative to human RXRB), whichwas located just one site before the beginning of the DNA-bindingdomain. In the post-duplication branch leading to rxrba only one sitewas found to be under positive selection, which was in the N-terminalregion (Sl Table 2).

A)

B)

C)

Fig. 3. Selected sites plotted on the 3D structures of the RXR DBD–LBD: Positive selectedsites in the DNA-binding domain and the ligand-binding domain are plotted on the RXRDBD–LBD modeled in SWISS-PROT; A) positive selected sites on the ancestral branchleading to RXRA plotted on the human RXRA DBD–LBD model; B) positive selectedsites on the ancestral branch leading to RXRB plotted on the human RXRB DBD–LBDmodel; C) positive selected sites on the ancestral branch leading to RXRG plotted onthe human RXRG DBD–LBD model.

2.3. Significant rate shifts after RXR duplication

Codon models can suffer from the saturation of substitutionsespecially in deep branches, like the branches immediately afterwhole genome duplication [21], to eliminate any such problems andto give confidence to our codon based analysis we employed aminoacid based rate shift analysis, since amino acid based analysis isrecommended for divergent sequences [22]. To identify functionaldivergence between the RXRs we used DIVERGE [23]. Significantevidence for functional divergence (altered evolutionary rate) wasobserved between all the pairs of RXRA, RXRB and RXRG (Fig. 1 andSl Table 3), and between rxrba and rxrbb the paralogs that resultedfrom the teleost specific whole genome duplication.

Using RASER2 [24], we detected significant rate shifts on all theancestral branches (post-duplication), which lead to the major RXRlineages. In the case of zebrafish specific duplicates, rxrgb was theonly gene that showed a significant rate shift (Table 2). The ancestralbranch leading to the rxrbb teleost gene was also found to be undersignificant rate shift; however, rxrbb branch had only weak signalsof positive selection using the codon models (Table 1). In additionto the lineage-specific models RASER2 implements the empiricalBayes test to identify the sites evolving under a covarion-like modelor heterotachy, which is similar to the site class 2a in the branchsite test of PAML. The details of the sites under rate shift are presentedin supplementary Table 2.

2.4. Changes in the expression pattern of RXR paralogs

We evaluated the expression dataset of BGEE [25] to assess theshifts in the RXR expression patterns during the embryonic stages ofDanio rerio (paralogous duplicates) and Mus musculus (orthologoussingletons). Our assessment of the RXR expression patterns duringthe embryonic development stage in the various homologousanatomical structures (Fig. 4) was in agreement with an earlierstudy [26] (however, they had not specifically tested the duplicatesof RXRG in zebrafish) and further revealed that: (i) each of the zebra-fish duplicates (paralogs) has at least one distinct expression patternwhen compared to the sister paralog and RXRB and RXRG havedistinct expression patterns (neofunctionalization) when comparedto their singleton mouse ortholog, and (ii) the zebrafish genes (eachof the duplicates) had their expression in a lesser number of anatomicstructures (subfunctionalization)when compared to the correspondingmouse singleton ortholog. We suggest that this ‘functional shuffling’among the duplicate genes in zebrafish after the whole genome dupli-cation, due to the positive selection on the ancestor, contributed to

Table 2Likelihood parameter estimates under lineage-specific models for rate shifts amongsites between different RXRs.

Model ℓ LRT (2Δ ℓ)a

Null model −10,355.6Rate shift on ancestral branch of RXRA −10,305.8 99.6Rate shift on ancestral branch of RXRB −10,285.5 140.2Rate shift on ancestral branch of RXRG −10,285.5 140.2Rate shift on ancestral branch of RXRB and RXRG −10,285.5 140.2

Analysis of post-duplication branches leading to teleost paralogsNull model RXRB −4218.13Rate shift on Ancestral branch of RXRBa −4217.14 1.98Rate shift on Ancestral branch of RXRBb −4213.46 9.34Null model for RXRA −3564.79Rate shift on Ancestral branch of RXRAa −3565.87 2.16Rate shift on Ancestral branch of RXRAb −3563.48 2.62Null model for RXRG −4493.22Rate shift on Ancestral branch of RXRGa −4493.22 0Rate shift on Ancestral branch of RXRGb −4485.59 15.26

a Significant LRT (Pb0.01) values are shown in boldface.

Page 6: Adaptive evolution of the Retinoid X receptor in vertebratesAdaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca

Fig. 4. Comparison of the RXR gene expression patterns during the embryonic develop-ment stages of Mus musculus and Danio rerio: Comparison of the RXR gene expressionpatterns during the embryonic development stages of M. musculus and D. rerio formultiple homologous anatomical structures (BGEE, retrieved on 10/12/2009), showingsubfunctionalization and neofunctionalization of RXR genes in zebrafish followingduplication events; blue=presence of expression; red=absence of expression;green=new expression in zebrafish paralogs; orange=loss of expression in duplicatesbut corresponding singleton is expressed.

Fig. 5. Evolutionary conservation of the RXR ligand-binding domain: The 3D structure ofthe amphioxus RXR-LBD 3D structure (PDB ID—3EYB) was used to map the evolutionaryconservation of the metazoan RXRs in PROPHYLER, the helix-10 of the ligand-bindingdomain is the most conserved cooler colors (blue–green) specify evolutionaryconservation and warmer colors (yellow–red) specify lesser constraints.

86 S. Philip et al. / Genomics 99 (2012) 81–89

the conservation of additional gene copies during the course ofevolution.

2.5. Different domains, different constraints: Asymmetric selective pressuresin the vertebrate RXR protein

The 3D structure of the ligand-binding domain of amphioxus’ RXR(PDB ID: 3EYB) was used to map ProPhylER's [27] results, revealingthat helix 10 of the ligand-binding domain is the most conservedsubstructure (Fig. 5). Helix 10 interacts with RXRs’ dimerizationpartners, along with the helices 7 and 9. The conservation of helix10 suggests that the dimerization partners of RXRs are well preservedthroughout evolution.

In vertebrates, the ancestral genome duplications have producedthree RXR genes that are different in overall protein length andspecific domain size (Sl Fig. 5). The median P values obtained fromthe analysis for each column of the alignment, from multivariateanalysis of protein polymorphism (MAPP) [28], was plotted (Sl Fig. 5).The DNA-binding domain and the ligand-binding domains were foundto be more constrained than the N-terminal region.

3. Discussion

RXR gene is found in most metazoan taxa from placozoans tovertebrates [15]. In vertebrates, there are three copies, RXRA, RXRBand RXRG, which are due to the two rounds of genome duplications

(1R and 2R-WGD) in the vertebrate ancestor. In the teleost ancestorthere has been another whole genome duplication (3R-WGD) and,accordingly, we identified a higher number of RXR genes in fishes.In all the fish species studied, we could identify two copies of theRXRB gene, but the zebrafish showed additional copies of the RXRAand the RXRG. Thus, to insightfully evaluate if the additional RXRgene copies in teleosts have resulted from a scenario of wholegenome duplications, we performed detailed synteny analyses toascertain if the teleost RXRs were indeed products of whole genomeduplication. Since we did not find evidences of additional RXR genecopies other than the four genes reported here, in medaka, stickle-back, tetraodon and fugu genomes, themost parsimonious explanationretrieved from the synteny analysis is that these fishes have underwenttrough events of gene loss. Thus, the simplest explanation for the reten-tion of duplicates in vertebrates would be the dosage balance model[29] pertaining to the preservation of duplicates following wholegenome duplications.

In addition to the duplicate retention and gene loss patterns, ourmajor finding was the presence of positive selection in the vertebrateRXR right after 2R-WGD and 3R-WGD. This wave of positive selectionimpacted all of the duplication branches resulting from 2R-WGD andone of the ancestral branches resulting from 3R-WGD; in addition~10% of the sites were under positive selection (site classes 2a and2b). Interestingly, most of the significantly positive selected siteswere located in the LBD and the N-terminal region of the proteinwhich are known to be the variable regions in the nuclear receptors[30], and most of the sites in RXRwere evolving under strong purifyingselection which is in concordance with earlier findings [31]. While allthe branches following 2R-WGD were evolving under positive selec-tion, only one of the paralog was evolving under positive selectionafter the 3R-WGD, similarly significant rate shift was observed in oneparalog of RXRG in zebrafish and only weak signals of rate shift wasobserved between the paralogs of RXRA. The presence of positiveselection/rate shift on post-duplication branches could signal to amechanism of escaping from adaptive conflict [32] in the case of the2R gene copies, and neofunctionalization models explains positiveselection only in one post-duplication branch [29] in teleosts, to bethe mechanism for preservation of the duplicates following geneduplication.

Page 7: Adaptive evolution of the Retinoid X receptor in vertebratesAdaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca

87S. Philip et al. / Genomics 99 (2012) 81–89

It has been also suggested that retention of duplicates after WGDis favored by dosage balancing selection, expression (or regulation)divergence and subfunctionalization, and that duplicates (fromWGDs) least commonly diverge in their biochemical function [29].Our results corroborate this hypothesis since the presence of positiveselection is only found in the N-terminal region and the LBD of theRXR protein, which are thought to harbor the activation function 1and 2 domains (AF-1 and AF-2) [33], containing phosphorylationsites for proline-dependent kinases. The presence of positive selectiononly in the N-terminal region and ligand-binding domain makes itpossible that the positive selection would have affected the activa-tion function 1 and 2, which is responsible for the spatial andtemporal variation of the RXR expression. Thus, significant shiftsin DNA-binding function have not occurred due to the positiveselection.

When looking at the expression data in homologous anatomicstructures of embryonic stages of zebrafish and mouse, expressionshuffling is observed between the zebrafish paralogs. An earlierstudy [26], has found RXRB to be neofunctionalized and RXRA to besubfunctionalized in teleosts, but they have not analyzed the duplicatesof RXRG in zebrafish. We found evidence for subfunctionalization andneofunctionalization events in the zebrafish paralogs and we furtherdetected a novel expression (neofunctionalization) pattern in RXRG(Fig. 4). These results demonstrate that the effect of positive selectionor rate shifts on the RXR genes should have enabled the altered expres-sion pattern, which in turn may explain the retention of the duplicatesfollowing the increase in the dosage of the protein. This suggests thatperhaps the evolution of RXR transcription factors is linked to anincreased complexity of its regulation in different cell types. Thiscould be a strategy to fine tune the triggering of the same pathwayby different agents and/or with different intensities in different celltypes.

The different regions of the RXR proteins are under different selectivepressures. Helix 10, located at the dimerization interface, is the mostconstrained helix in RXRs, which is consistent with RXR binding manydifferent partners. A mutation in this interface could interfere withvarious cellular processes [31]. It is known that retinoic acid has a centralrole in basic biological processes such as cell's fate, survival and growthor the apoptosis, depending on the heterodimeric partners that isactivated, i.e., PPAR beta or delta and RAR (alpha, beta or gamma)respectively, in which RXR is the “indispensible dimerization partner”of the nuclear receptors involved [34]. The conservation of the dimeriza-tion interface is also consistent with the recent findings thatmany of theRXR heterodimeric partners, such as RAR, TR, VDR, LXR are not chordatenovelties, but were already present in the ancestor of all bilateria [15].While the ability of RXR to dimerize with other nuclear receptors inbasal bilateria remains to be elucidated, the findings of the presentstudy favor this hypothesis. We conclude that the ability of RXR tobind a conserved set of partners is one of its most important functions,along with its roles in RA signaling [5] and as a lipid sensor [6]. A recentstudy in insects [35] identified positive selection on the ancestral branchleading to lepidoptera and diptera, specifically in the helix-9 of RXR,which is another helix involved in the dimerization interface (alongwith helices 7 and 10) with the ecdysone receptor. In our study, wedetected positive selection on the post-duplication branch leading toRXRG in one site (405T respective to the human RXRG) of the helix-9(Sl Table 2).

While dN/dS methods may sometimes retrieve false positives, thepower of branch site models is well recognized [36], showing that ourresults should not be misinterpreted. Although a large dataset likeours encompassing representatives of the major vertebrate lineagesposes challenges in alignment (Sl Fig. 6), we overcame such difficultiesby removing the non-aligned regions using Gblocks [37], with theresulting dataset showing ample phylogenetic signal (Sl Fig. 7) andno saturation bias (Sl Fig. 8 and Sl Table 4). The use of dN/dS methodsallowed us to detect positive selected sites, most of them in the

hitherto known variable regions of the nuclear receptors, consistentwith earlier findings [30,31]. In this study our main goal was to findthe major evolutionary factor during the preservation phase of theduplicated RXR paralogs. We provide multiple evidences for thepost-duplication evolutionary mechanism of RXR genes. Using theamino acid basedmethods the retrieved results support altered evolu-tionary rates in the post-duplication branches (paralogs), while thedN/dS methods find instances of positive selection, both likelycontributing to the observed expression pattern changes and functionaldivergence of the two post-duplication gene copies. Finally, our resultsfrom the amino acid based rate shift/conservation detection methodsand the codon models are consistent, which are further supported bythe expression patterns of the genes, as well as by other studies[30,31,35].

Further investigation to characterize the RXR from the differentorders of Ostariophysi (here represented by the zebrafish) would beinteresting to assess the level of retention of the six RXR genes acrossthis super-order, which could provide valuable insights into theevolution and adaptation of this group of fishes. Since RXR is knownto exert its action in the development of the organisms and 28% ofthe known freshwater fish species belong to Ostariophysi, theremarkable diversity, adaptability and morphological variationsamong these fish species, especially cypriniformes and siluriformes,makes it an interesting target group to study the evolution of theRXR gene. Interestingly, at least two fish species displaying differentnumber of RXR genes, i.e., medaka and zebrafish, are known torespond differently to the high affinity RXR agonist tributyltin (TBT)[38,39]. Whereas TBT exposure during the sex differentiation periodleads to an almost 100% male zebrafish population, no effects on thesex ratio of medaka were observed upon TBT exposure. Althoughthe experimental demonstration of a link between RXR and thereported differences is still lacking, the data indicates that somecaution should be taken in cross-teleost extrapolations. During thelast decade, the need of detailed chemical hazard assessment of alarge group of compounds, together with ethical concerns of animalwelfare, has prompted the use of zebrafish and other teleosts inlarge scale chemical risk assessment and drug discovery. However,the fact that teleosts possess more RXR genes than humans shouldbe treated as a major point while generalizing the findings since theexpression of the genes and the response of the teleosts could bemisleading if generalized to a mammalian context. In contrast tovertebrates, most invertebrates display a single RXR gene. Hence,invertebrate-specific impacts of environmental pollutants actingthrough RXR and their heterodimeric partners cannot be excluded.Indeed, the TBT-induced imposex observed in female prosobranchgastropods and the synergistic impact of TBT and 20-hydroxyecdysoneacting through RXR/EcR in daphnids seems to support this hypothesis[40].

3.1. Conclusion

Our results indicated highest constraint in the dimerization helix,allowing us to conclude that the dimerization partners are main-tained throughout the evolution of this nuclear receptor. The DNA-binding domain is highly conserved, however, the N-terminal andligand-binding domain which harbor the phosphorylation sitesresponsible for activation function shows lower constraints andharbors positive selected sites. Thus the evolution of RXR could belinked to an increase in complexity of the organism, where differenttypes of cells that require the same basic biochemical process aretriggered by different agents and/or with different intensities. Thepresence of positive selection/accelerated rates in the paralogs,coupled with the evidences of altered expression of paralogousgenes explains the preservation of the additional RXR copies followinggenome duplication.

Page 8: Adaptive evolution of the Retinoid X receptor in vertebratesAdaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca

88 S. Philip et al. / Genomics 99 (2012) 81–89

4. Materials and methods

4.1. Sequence alignment and phylogenetic analyses

Vertebrate RXR sequences were downloaded from ENSEMBLdatabase (www.ensembl.org). Sequence alignment was done usingMUSCLE [41] and the alignments were viewed and edited in SEAVIEW[42]. Given our large dataset, with representatives of all majorvertebrate lineages, we increased the alignment quality by removingthe non-aligned regions using Gblocks [37]. A total of 84 sequences ofvertebrates were selected for the final analyses. Before proceeding tothe evolutionary analyses the dataset was checked for phylogeneticsignal with likelihood mapping in TREE-PUZZLE [43] and for saturationbias in DAMBE [44]. The phylogeny was estimated using maximumlikelihood (ML) method as implemented in PHYML v.3.0 [45] . Thebest fit model for the phylogenetic analyses (GTR+G+I) was chosenwithMrAIC [46]. The correct topology of the RXR geneswas ascertainedusing the various topology testing methods implemented in TREE-PUZZLE [43]. For the branch site tests of RXRB teleost paralogs andrate shift analysis of RXRA and RXRG zebrafish paralogs, independentML phylogenies for each genes were produced.

4.2. Synteny analysis: Gene gain and gene loss

The synteny analyses, circular plots and the related (orthologs’)analyses were done using the Synteny database [19]. The toolsprovided in this database, such as the Reciprocal Best Hit BLASTalgorithm and the sliding window analysis, allow the detection ofgene loss or gene gain in chromosomes by providing evidences of tightlylinked genes on the chromosomal segments. The gene orientation andorder, clusters connecting between the query and the outgroup speciesare used to denote orthologous syntenic conservation.

4.3. Detection of positive selection

Two different methods were used to evaluate adaptive evolutionin the nucleotide sequences. First, a likelihood ratio test betweenthe branch models implemented in PAML 4 [20] was employed.Initially, we compared the log likelihood values of, a two ratiomodel (where all the post-duplication branches have a differentevolutionary rate relative to other branches; ωPD≠ω0) against aone ratio model (where all branches are supposed to evolve at asame rate; ωPD=ω0) to find out which model fitted the data better.Then the two ratio model (unconstrained two ratio model) if foundto fit the data better was tested against another null (constrainedtwo ratio) model where the ω value in the foreground branch wasconstrained to 1 (ωPD=1) to check for the prevalence of positiveselection. Post-duplication branches leading to RXRBa and RXRBbwere tested for positive selection, with a smaller alignment forRXRB gene and the corresponding gene tree, similarly to the abovementioned branch models.

Second, to find out if positive selection acted on a specific post-duplication branch and to identify the sites that were positive selected,we used the modified branch site model implemented in PAML 4. Themodified branch site model (branch site test 2) [47] has been found tobe a conservative test of positive selection, which allows the omegavalue (dN/dS ratio) to vary among the branches and the sites, support-ing the hypothesis that the substitutionsmay vary according to the timeand space, rather than averaging the values along all the sites as in thebranch models. In the branch site test 2, the alternate model assignstwo ω values (0bω0>1 and ω1=1) for the background branches (allother branches except the foreground branches) and the foregroundbranches (the branch of interest) is assigned an additional ω value(ω2>1), the alternate model is compared to the null model where theω value in the foreground branch is constrained to 1, a likelihood ratiotest (LRT) is used to check if the foreground branches are evolving

under the influence of positive selection. The sites under the influenceof positive selection are identified by a Bayes Empirical Bayes [48](BEB) test. We tested the post-duplication branches (one branch at atime) to detect episodic events of positive selection, care was taken toensure that at least four branches were present on either side of thelabeled foreground branch to minimize false positives (due to thisrequirement the RXRA and RXRG paralogs of zebrafish were not testedusing the branch site models). Post-duplication branches leading torxrba and rxrbb were tested for positive selection with a smaller align-ment for RXRB gene and the corresponding gene tree.

4.4. Detection of rate shifts among sites using protein-based methods

We used two methods to check for rate shifts on the RXR genes.First, a likelihood ratio test based method implemented in DIVERGE[23] was used to check for functional divergence in the duplicated/paralogous proteins. DIVERGE calculates a coefficient of functionaldivergence (Θ) between two clades. A value of Θ>0 indicates analtered evolutionary rate at some sites between those clusters. It isof interest (if Θ>0) to identify which sites were evolving at an al-tered rate; however, we did not look at the site-specific rate shiftsat this stage since we were interested in the post-duplicationbranches and the sites involved in rate shifts at those branches.

Secondly, rate shifts of evolution in branches immediately aftergenome duplication (ancestral branches of RXRA, RXRB, RXRBa,RXRBb, RXRG, RXRG-RXRB, and zebrafish paralogs of RXRA andRXRG) were checked using the RASER2 (RAte Shift EstimatoR version2) [24]. RASER2 uses the stochastic mapping of mutations [49] tocalculate the probability that a rate shift occurred at a specific branch.We used RASER2 to compare the alternate lineage-specific model tothe null model, which does not enable rate shifts. The program wasrun separately for each of the post-duplication branches. Likelihoodratio tests (LRT) were performed to determine whether the lineage-specific model fit the data significantly better than the null model.

4.5. Expression dataset analysis

To assess the action of whole genome duplication in the expressionpatterns of the RXR genes we compiled a dataset of RXR expression,for mouse (three RXR genes produced during the first two rounds ofwhole genome duplications in the vertebrate ancestor or 2R-WGD)and zebrafish (where the RXR compliment is six following a thirdround of whole genome duplication, in the teleost ancestor, or 3R-WGD), during the embryonic stages from the BGEE [25] database.The mouse genes were considered as singletons which retain theancestral functions in comparison to the zebrafish genes, whichwere considered as duplicates, for which the whole genome duplica-tion (3R-WGD) has increased the dosage of the RXR protein. Thisdataset was further checked for any altered expression patternsmanually.

4.6. Evolutionary conservation analysis

ProPhylER (Protein Phylogeny and Evolutionary Rates) [27] wasused to evaluate the evolutionary conservation of the RXR protein(the cluster 1731 represents the RXR genes on the server). ProPhylERis a curated database that allows the identification of the evolutionaryconservation of protein sequences. This program relies on theassumption that the closely related homologs have not changed infunction over time. The evolutionary conservation of the ligand-binding domainwasmapped on the three-dimensional (3D) structureof the amphioxus RXR [PDB: 3EYB].

We thenusedMAPP (multivariate analysis of protein polymorphism)[28], to evaluate the evolutionary variation in single columns of thealign-ment, predicting the impact of all possible variants on the structure andfunction of the RXR protein. Since the three RXR genes are different in

Page 9: Adaptive evolution of the Retinoid X receptor in vertebratesAdaptive evolution of the Retinoid X receptor in vertebrates Siby Philip a,b, L. Filipe C. Castro a, Rute R. da Fonseca

89S. Philip et al. / Genomics 99 (2012) 81–89

overall protein length and specific domain size, each protein alignmentwas analyzed separately. MAPP generates impact scores based on sixphysicochemical properties for all possible variants from the observedevolutionary variation and provides the correspondent P values—thelower the P value, the higher the chance that the substitution will bedeleterious for the structure or the function of the protein.

4.7. Protein tertiary structure modeling

Relevant amino acid sites were mapped on the tertiary structures(homology model of the RXR DBD-LBD region) using SWISS-MODEL(http://swissmodel.expasy.org/). The visualization and editing of the3D structures were performed in PyMOL [50].

Supplementary materials related to this article can be foundonline at doi:10.1016/j.ygeno.2011.12.001.

Acknowledgments

SP thanks João Paulo Machado for insightful discussion and helpfulsuggestions. This work was supported by the Portuguese Fundaçãopara a Ciência e a Tecnologia (Foundation for Science and Technology)with projects PTDC/BIA-BDE/69144/2006 (FCOMP-01-0124-FEDER-007065) and PTDC/MAR/68106/2006 (FCOMP-01-0124-FEDER-007379), and a PhD grant (SFRH/BD/47938/2008) to SP.

References

[1] D.J. Mangelsdorf, R.M. Evans, The RXR heterodimers and orphan receptors, Cell 83(1995) 841–850.

[2] J.M. Olefsky, Nuclear receptor minireview series, J. Biol. Chem. 276 (2001)36863–36864.

[3] J. Bowles, D. Knight, C. Smith, D. Wilhelm, J. Richman, S. Mamiya, K. Yashiro, K.Chawengsaksophak, M.J. Wilson, J. Rossant, H. Hamada, P. Koopman, Retinoid sig-naling determines germ cell fate in mice, Science 312 (2006) 596–600.

[4] L.F.C. Castro, D. Lima, A. Machado, C. Melo, Y. Hiromori, J. Nishikawa, T. Nakanishi,M.A. Reis-Henriques, M.M. Santos, Imposex induction is mediated through theRetinoid X Receptor signalling pathway in the neogastropod Nucella lapillus,Aquat. Toxicol. 85 (2007) 57–66.

[5] P. Germain, P. Chambon, G. Eichele, R.M. Evans, M.A. Lazar, M. Leid, A.R. De Lera, R.Lotan, D.J. Mangelsdorf, H. Gronemeyer, International union of pharmacology.LXIII. Retinoid X receptors, Pharmacol. Rev. 58 (2006) 760–772.

[6] A.I. Shulman, D.J. Mangelsdorf, Retinoid X receptor heterodimers in the metabolicsyndrome, N. Engl. J. Med. 353 (2005) 604–615.

[7] J. Novák, M. Benísek, K. Hilscherová, Disruption of retinoid transport, metabolismand signaling by environmental pollutants, Environ. Int. 34 (2008) 898–913.

[8] D. Lima, M.A. Reis-Henriques, R. Silva, A.I. Santos, L. Filipe, C. Castro, M.M. Santos,Tributyltin-induced imposex in marine gastropods involves tissue-specific modu-lation of the retinoid X receptor, Aquat. Toxicol. 101 (2011) 221–227.

[9] D.J. Mangelsdorf, U. Borgmeyer, R.A. Heyman, J.Y. Zhou, E.S. Ong, A.E. Oro, A.Kakizuka, R.M. Evans, Characterization of three RXR genes that mediate the actionof 9-cis retinoic acid, Genes Dev. 6 (1992) 329–344.

[10] S.M. Ulven, T.E. Gundersen, A.K. Sakhi, J.C. Glover, R. Blomhoff, Quantitative axialprofiles of retinoic acid in the embryonic mouse spinal cord: 9-Cis retinoic acidonly detected after all-trans-retinoic acid levels are super-elevated experimental-ly, Dev. Dyn. 222 (2001) 341–353.

[11] M. Gesto, L.F.C. Castro, M.A. Reis-Henriques, M.M. Santos, Tissue-specific distribu-tion patterns of retinoids and didehydroretinoids in rainbow trout Oncorhynchusmykiss, Comp. Biochem. Physiol. B Biochem. Mol. Biol. 161 (2012) 69–78.

[12] S.M. Nowickyj, J.V. Chithalen, D. Cameron, M.G. Tyshenko, M. Petkovich, G.R.Wyatt, G. Jones, V.K. Walker, Locust retinoid X receptors: 9-Cis-retinoic acid inembryos from a primitive insect, Proc. Natl. Acad. Sci. 105 (2008) 9540–9545.

[13] A. Lampen, S. Meyer, H. Nau, Phytanic acid and docosahexaenoic acid increase themetabolism of all-trans-retinoic acid and CYP26 gene expression in intestinalcells, Biochim. Biophys. Acta, Gene Struct. Expression 1521 (2001) 97–106.

[14] A.M. de Urquiza, S. Liu, M. Sjoberg, R.H. Zetterstrom, W. Griffiths, J. Sjovall, T.Perlmann, Docosahexaenoic acid, a ligand for the retinoid X receptor in mousebrain, Science 290 (2000) 2140–2144.

[15] J.T. Bridgham, G.N. Eick, C. Larroux, K. Deshpande, M.J. Harms, M.E.A. Gauthier,E.A. Ortlund, B.M. Degnan, J.W. Thornton, Protein evolution by molecular tinker-ing: diversification of the nuclear receptor superfamily from a ligand-dependentancestor, PLoS Biol. 8 (2010) e1000497.

[16] A. Tallafuss, L.A. Hale, Y.-L. Yan, L. Dudley, J.S. Eisen, J.H. Postlethwait, Character-ization of retinoid-X receptor genes rxra, rxrba, rxrbb and rxrg during zebrafishdevelopment, Gene Expr. Patterns 6 (2006) 556–565.

[17] J.S. Waxman, D. Yelon, Comparison of the expression patterns of newly identifiedzebrafish retinoic acid and retinoid X receptors, Dev. Dyn. 236 (2007) 587–595.

[18] M. Kasahara, K. Naruse, S. Sasaki, Y. Nakatani, W. Qu, B. Ahsan, T. Yamada, Y.Nagayasu, K. Doi, Y. Kasai, T. Jindo, D. Kobayashi, A. Shimada, A. Toyoda, Y. Kuroki,A. Fujiyama, T. Sasaki, A. Shimizu, S. Asakawa, N. Shimizu, S.-i. Hashimoto, J. Yang,Y. Lee, K. Matsushima, S. Sugano, M. Sakaizumi, T. Narita, K. Ohishi, S. Haga, F.Ohta, H. Nomoto, K. Nogata, T. Morishita, T. Endo, T. Shin-I, H. Takeda, S.Morishita, Y. Kohara, The medaka draft genome and insights into vertebrategenome evolution, Nature 447 (2007) 714–719.

[19] J.M. Catchen, J.S. Conery, J.H. Postlethwait, Automated identification of conservedsynteny after whole-genome duplication, Genome Res. 19 (2009) 1497–1505.

[20] Z. Yang, PAML 4: phylogenetic analysis by maximum likelihood, Mol. Biol. Evol.24 (2007) 1586–1591.

[21] M. Anisimova, J.P. Bielawski, Z. Yang, Accuracy and power of bayes prediction ofamino acid sites under positive selection, Mol. Biol. Evol. 19 (2002) 950–958.

[22] J. Bielawski, Z. Yang, Maximum likelihood methods for detecting adaptive evolu-tion after gene duplication, J. Struct. Funct. Genomics 3 (2003) 201–212.

[23] X. Gu, K. Vander Velden, DIVERGE: phylogeny-based analysis for functional-structural divergence of a protein family, Bioinformatics 18 (2002) 500–501.

[24] O. Penn, A. Stern, N. Rubinstein, J. Dutheil, E. Bacharach, N. Galtier, T. Pupko,Evolutionary modeling of rate shifts reveals specificity determinants in HIV-1subtypes, PLoS Comput. Biol. 4 (2008) e1000214.

[25] F. Bastian, G. Parmentier, J. Roux, S. Moretti, V. Laudet, M. Robinson-Rechavi,Bgee: integrating and comparing heterogeneous transcriptome data among spe-cies, in: A. Bairoch, S. Cohen-Boulakia, C. Froidevaux (Eds.), Data Integration inthe Life Sciences, Springer, Berlin / Heidelberg, 2008 124–131.

[26] S. Bertrand, B. Thisse, R. Tavares, L. Sachs, A. Chaumot, P.-L. Bardet, H. Escrivà, M.Duffraisse, O. Marchand, R. Safi, C. Thisse, V. Laudet, Unexpected novel relationallinks uncovered by extensive developmental profiling of nuclear receptor expres-sion, PLoS Genet. 3 (2007).

[27] J. Binkley, K. Karra, A. Kirby, M. Hosobuchi, E.A. Stone, A. Sidow, ProPhylER: acurated online resource for protein function and structure based on evolutionaryconstraint analyses, Genome Res. 20 (2010) 142–154.

[28] E.A. Stone, A. Sidow, Physicochemical constraint violation by missense substitu-tions mediates impairment of protein function and disease severity, GenomeRes. 15 (2005) 978–986.

[29] G. Conant, K. Wolfe, Turning a hobby into a job: how duplicated genes find newfunctions, Nat. Rev. Genet. 9 (2008) 938–950.

[30] M. Krasowski, K. Yasuda, L. Hagey, E. Schuetz, Evolutionary selection across thenuclear hormone receptor superfamily with a focus on the NR1I subfamily (vita-min D, pregnane X, and constitutive androstane receptors), Nucl. Recept. 3(2005) 2.

[31] Z. Zhang, P.E. Burch, A.J. Cooney, R.B. Lanz, F.A. Pereira, J. Wu, R.A. Gibbs, G.Weinstock, D.A. Wheeler, Genomic analysis of the nuclear receptor family:new insights into structure, regulation, and evolution from the rat genome, GenomeRes. 14 (2004) 580–590.

[32] A.L. Hughes, The evolution of functionally novel proteins after gene duplication,Proc. R. Soc. Lond. B Biol. Sci. 256 (1994) 119–124.

[33] J. Bastien, C. Rochette-Egly, Nuclear retinoid receptors and the transcription ofretinoid-target genes, Gene 328 (2004) 1–16.

[34] L. Michalik, W. Wahli, Guiding ligands to nuclear receptors, Cell 129 (2007)649–651.

[35] E.F. Hult, S.S. Tobe, B.S.W. Chang, Molecular evolution of ultraspiracle protein(USP/RXR) in insects, PLoS One 6 (2011) e23416.

[36] Z. Yang, M. dos Reis, Statistical properties of the branch-site test of positive selec-tion, Mol. Biol. Evol. 28 (2011) 1217–1228.

[37] J. Castresana, Selection of conserved blocks frommultiple alignments for their usein phylogenetic analysis, Mol. Biol. Evol. 17 (2000) 540–552.

[38] A.J. Kuhl, M. Brouwer, Antiestrogens inhibit xenoestrogen-induced brain aroma-tase activity but do not prevent xenoestrogen-induced feminization in Japanesemedaka (Oryzias latipes), Environ. Health Perspect. 114 (2006) 500–506.

[39] B.G. McAllister, D.E. Kime, Early life exposure to environmental levels of the aro-matase inhibitor tributyltin causes masculinisation and irreversible sperm dam-age in zebrafish (Danio rerio), Aquat. Toxicol. 65 (2003) 309–316.

[40] Y.H. Wang, G. Kwon, H. Li, G.A. LeBlanc, Tributyltin synergizes with 20-hydroxyecdysone to produce endocrine toxicity, Toxicol. Sci. 123 (2011) 71–79.

[41] R.C. Edgar, MUSCLE: multiple sequence alignment with high accuracy and highthroughput, Nucleic Acids Res. 32 (2004) 1792–1797.

[42] M. Gouy, S. Guindon, O. Gascuel, SeaView version 4: a multiplatform graphicaluser interface for sequence alignment and phylogenetic tree building, Mol. Biol.Evol. 27 (2010) 221–224.

[43] H.A. Schmidt, K. Strimmer, M. Vingron, A. von Haeseler, TREE-PUZZLE: maximumlikelihood phylogenetic analysis using quartets and parallel computing, Bioinfor-matics 18 (2002) 502–504.

[44] X. Xia, Z. Xie, DAMBE: software package for data analysis in molecular biology andevolution, J. Hered. 92 (2001) 371–373.

[45] S. Guindon, F. Delsuc, J.-F. Dufayard, O. Gascuel, Estimating maximum likelihoodphylogenies with PhyML, Methods Mol. Biol. 537 (2009) 113–137.

[46] J.A.A. Nylander, MrAIC.pl. Program distributed by the author, Evolutionary BiologyCentre, Uppsala University, 2004.

[47] J. Zhang, R. Nielsen, Z. Yang, Evaluation of an improved branch-site likelihoodmethod for detecting positive selection at the molecular level, Mol. Biol. Evol.22 (2005) 2472–2479.

[48] Z. Yang, W.S.W. Wong, R. Nielsen, Bayes empirical Bayes inference of amino acidsites under positive selection, , 2005.

[49] R. Nielsen, Mapping mutations on phylogenies, Syst. Biol. 51 (2002) 729–739.[50] L.L.C. Schrodinger, The PyMOL Molecular Graphics System, Version 1.3r1, 2010.