comparison of the btb superfamilies in physcomitrella

1
Comparison of the BTB Superfamilies in Physcomitrella patens and Arabidopsis thaliana Jacob P. Janssen, Dr. Derek Gingerich, University of Wisconsin-Eau Claire K Target BTB Motif CUL3 Rbx Ub C E2 BTB The Arabidopsis and Physcomitrella BTB families encoded many similar BTB proteins Figure 2. Grouping of the BTB subfamilies based on the nature of additional motifs outside of the BTB domain along with a protein diagram of representative members. 135987 226178 234547 98728 114857 106616 52205 At5g64330 NPR1 At3g48360 At5g19330 At5g55000 NPH3 Ankyrin TPR Armadillo Pentapeptide Other 30 At3g43700 MATH 6 7 5 3 2 1 26 17 3 6 3 Physcomitrella Arabidopsis 2 2 5 Other 27 1 0 6 6 1 6 1 S D V T F L - V E G R R F Y A H R I A L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - L A S S D A F R A M F D - - G G 2 2 6 1 7 8 1 A D V K F E - V D G E L Y H A H K L V L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A R S P V F K A Q L F - - G P 1 7 0 4 5 1 1 S D V V F E S L D G R K V Y G H R A V L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A S Q S D V F K A M F S - - M P 2 3 1 8 9 4 1 V D V E L Q G V D G D R V H A H K A I L S V G W V L V D P S L C S E L G Q V L L S T I H E S V R R R C P Y V I A E K K S V A S K S I E F E A M F R - - A D 1 3 0 7 6 9 1 T D V T I C S A T G - C V G A H R A I L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A R S P V F D S M F S - - H N 1 6 5 6 6 0 1 A D V C F L - V G N E K F R C H R F V L G - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A R S E Y F K A R F S R T T G 1 8 9 9 2 2 1 M D T S V V - L R S K T I H I S S A I L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A A K S P Y F Y K L F S - - N G 9 8 7 2 8 1 A D L T V C T D D G S Q I H V H S M V L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - M A S S P V F K F Y L Q - - K E 1 1 4 8 5 7 1 N D I V F H - L G G D A V P C N R E K I - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A G L S M P F N T M L N - - G V 1 3 5 9 8 7 1 S D I V V E - A G G M N F S L H K F P L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V A R S G R I R K L V A N L A D 1 8 7 7 0 7 1 - S V V H L N V G G M M F A T T V D T L T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Q R D T D S M L A V M F S G R H R 1 0 4 5 4 5 1 - G R A K I E V G G R V F V T T M Q T V - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - E K A G K D S H L Y K - - - T 1 0 6 6 1 6 K R L C E Y S M A Q N L - 2 2 6 1 7 8 R L L C E S K L C E N V - 1 7 0 4 5 1 K S L C E Q E I L Q S L - 2 3 1 8 9 4 L E V V C E D S L V K N M 1 3 0 7 6 9 K E A C E S S L V D D I - 1 6 5 6 6 0 K R A V T D A L L P Q L - 1 8 9 9 2 2 M R Y C S - R L L K N M P 9 8 7 2 8 K Q T C T V H F E Q G L - 1 1 4 8 5 7 K D A C D Q N L A T F V - 1 3 5 9 8 7 V A R T E A F L S E V V - 1 8 7 7 0 7 V E N I T P F L C K K D - 1 0 4 5 4 5 L R N A H G R A P L N G - 1 0 6 6 1 6 2 2 6 1 7 8 1 7 0 4 5 1 2 3 1 8 9 4 1 3 0 7 6 9 1 6 5 6 6 0 1 8 9 9 2 2 9 8 7 2 8 1 1 4 8 5 7 1 3 5 9 8 7 1 8 7 7 0 7 1 0 4 5 4 5 Y R E K E A - - - - - - - - - - - L D I E I P M R D R N S - - - - - - - - - - - D H I E I K M A E G T P - - - - - - - - - - - L H M E M N T D R K V V - - - - - - - - - - - W N L E M P L M E K Q S - - - - - - - - - - - A T V R I N F R E G V A G L A T N S G A D S L L I L Q E N M R E S E Q - - - - - - - - - R D V T L R I T V R K S R F P - - - - - - - - - - - A I D I F F L E A R M C D - - - - - - - - - I G F S K N S E N P N L - - - - - - - - - - - L Q L T D V L H M D S K - - - - - - - - - - - E G A V F I V S Q S G Q - - - - - - - - - - - - - - G W F N I S W K V F E L M M R F I Y - - - - - - - - - - T G N V D M A T D N A Q D L L R A A D Q Y L L E D I E P P V F K A L L H F I Y R D S L P D T K E L I G A P S T S T L L A Q H L L A A A D R Y G L D - - - Y P A L D A F I F F F Y R - - - - - - - - A T V N Q A V L R N H L V E L L Q A A D K Y G I E - - - Y V A L K A F V T F F Y T - - - - - - - - G R V S S S V L I E Y L T T L L D A A H K Y N V Q D M C L E S C R A L L S F L Y G - - - - - - - - - N L K Y P D F R K H R V A L V R A A H K Y D I V D L S A S A F E K V L E Y I Y T - - - - - - - - - D S V K T V D L D E A E E L F D A A S R Y L L F Q A E E S P L M D L L Q F M Y S - - - - - - - - - G R V Q A N T P A T V L D V L M A A D K Y E V A G V P F H A V R C F L R Y L Y S - - - - - - - - S R C E R S D M E E C A L H L V V L A H T Y M V P G I S V T G M R A V D H F S K T - - - - - - - - - G R L A R L S P E M L L E I L S F A N R F C C D P G G A E A F D L A A K F C Y G - - - - I N F E I T T S N V A V L R C A A E Y L D M T E S Y G E N D R D G T H F R H I L N W L R D - - - - - - - - - G V I P M L E I S A Y Q E L H R E A E Y Y Q L M D R D P D M F N I L L N M L R T G - - - - - - - V V Y P K P E S L G F L D R L I E E A T F Y G I E G R A F D P T A T N G E L L L L L C L L L L C - Sequence Alignment of Representatative Physcomitrella BTB Domains. Introduction Following their synthesis, proteins in cells can be modified in a variety of ways. These modifications may modulate the protein’s activity, localization, interactions with other molecules, or stability. One such modification is the covalent attachment of a small globular protein called ubiquitin (Ub). Ubiquitination can change the activity or localization of a protein, if one Ub moiety is attached, or can lead to degradation of the protein if chains of Ubs are added. Selective degradation of key proteins by ubiquitination has been shown to be an important regulatory mechanism in many signaling pathways in eukaryotes. The attachment of Ub to target proteins is catalyzed by E3 Ub-ligases. One family of ligases are the BTB/Cullin 3 (CUL3) E3s. These complexes are composed of three proteins: CUL3, which acts as a backbone, RBX, which helps recruit the Ub, and a BTB (Bric-a-Brac/Tramtrack/Broad Complex) domain- containing protein (Fig. 1). The BTB motif allows the BTB protein to bind CUL3. Besides the BTB motif, BTB proteins contain other sequences (such as MATH or NPH3 domains) which recruit the target. BTB gene families are very diverse in eukaryotes. The sets of BTB proteins encoded in animal genomes are very different from those encoded in plants, for instance. We are interested in how the BTB gene family has changed during plant evolution. Dr. Gingerich has previously characterized the BTB family in a dicotyledenous flowering plant, Arabidopsis thaliana. In this studied, I have begun to characterize the family in a simple nonvascular moss, Physcomitrella patens. Significant differences in the BTB families might suggest that the two plants use the BTB/CUL3 E3s to target different proteins for degradation. Conclusions 9 9 1 3 7 1 0 0 7 9 9 9 1 0 2 1 0 2 1 0 2 1 0 2 1 0 8 1 0 4 7 9 B A1 D2 R L B P E C J A2 A2 39 known eukaryotic BTB domains used as initial queries 69 loci identified with < 0.0004 E-value analysis and reannotation of gene sequences 4 loci predicted to be nonfunctional 65 loci identified with < 0.0004 E-value Figure 1 BTB/Cullin 3 E3 Ubiquitin-Ligases 77289 93266 61156 118125 130769 66017 97923 74560 124356 150882 85405 87316 50789 42628 159693 219111 134565 30287 164153 100 100 100 100 93 100 63 65 100 97 35 94 78 100 100 100 100 100 82 100 100 69 100 100 77 95 32 39 53 90 52 99 100 100 58 45 100 92 56 100 96 96 99 90 67 35 72 56 33 29 43 20 5 6 9 24 6 1 6 0 1 1 NPH3 TAZ-CaM B D TPR Armadillo Pentapeptide AP2 Back 3 Physcomitrella Arabidopsis Subfamily size 6 2 6 30 5 3 1 13 80 Scop Domain 1 6 2 5 1 0 "A2" 4 3 "D2" 2 1 "F" 3 0 "H" 5 6 Ankyrin 170451 106616 106476 165660 143618 226178 217835 98728 150321 223881 234547 231894 189922 122867 3260 121620 119190 31431 67811 231552 93641 41851 154574 21814 135987 127110 134520 12061 106949 12163 31592 2869 1322 22014 161287 114857 80508 50218 167201 111151 104545 148281 125607 187707 52205 52179 17 3 3 2 MATH Other 5 0 0 Total 65 1 RPT2 NPH3, RPT3 PRL1-IFG NPR1 AtPOB1 AtBT5 AtBT4 AtBT3 AtBT2 AtBT1 At5g21010 At3g43700 At3g03740 At2g39760 At5g19000 At3g06190 At5g55000 At5g41330 At3g09030 At4g30940 At2g24240 At5g19330 At5g13060 At4g02680 EOL1 At5g58550 EOL2 At3g51770 ETO1 At2g05330 At5g48510 At4g04090 At2g40450 At3g29740 At2g40440 At3g56230 At1g01640 At1g55760 At1g21780 At4g08455 At4g37590 At2g23050 At5g67440 At2g14820 At4g31820 At5g47800 At1g67900 At3g26490 At3g44820 At5g10250 At3g50840 At5g66560 At5g64330 At1g30440 At5g03250 At5g13600 At3g49970 At5g67385 At5g48130 At2g30520 At3g08570 At3g08660 At5g63160 At3g48360 At1g05690 At5g67480 At4g37610 At2g30600 At4g01160 At3g61600 At2g46260 At5g45110 At4g19660 At4g26120 At1g64280 At3g57130 At2g41370 At5g60050 At1g63850 At3g50780 At2g13690 At3g05675 At5g48800 At1g03010 At2g47860 At3g22104 At1g50280 At3g19850 At3g49900 At3g03510 At5g17580 At2g04740 At1g04390 100 99 85 81 65 48 100 26 46 26 11 100 99 71 52 13 93 32 29 91 97 99 46 46 37 42 6 100 99 54 98 97 100 96 69 100 100 0 90 97 60 58 2 5 100 99 40 91 100 66 43 27 99 28 10 3 10 19 97 33 26 12 8 11 28 51 100 79 75 61 85 86 80 78 100 100 ARIA BOP1 BOP2 Phylogenetic Analysis of the BTB Superfamilies in Physcomitrella and Arabidopsis Physcomitrella Arabidopsis Identification of BTB domain-containing proteins encoded by the Physcomitrella genome BLAST search Physcomitrella patens genome v1.1 predicted protein sequences (from the DOE Joint Genome Institute Physcomitrella genome sequencing project) Domain # of Genes Protein ID TAZ zinc finger ETO1 # of Genes Protein ID B A1 D2 R L B P E C J A2 A2 B A1 D2 R L B P E C J A2 A2 Figure 3. Sequence alignment of representative Physcomitrella BTB domains. The approximately 100 amino acid core BTB sequence from 12 representative Physcomitrella BTB proteins were aligned with ClustalW and displayed with Boxshade using a threshhold of 55%. Identical and conserved amino acids are shown with black and grey, respectively. Dashes denote gaps. Desginations to the left indicate the BTB family and the JGI number for each protein. Some data presented in this poster was generated by Dr. Gingerich when he was funded by an NIH NRSA post-doctoral fellowship (#GM068361) 100 4 3 0 0 “K” “Q” J C S T A2 Q I G H P R M N E A1 B L D2 K J I H G F E D2 D1 C B A2 A1 Figure 4. Phylogenetic trees of the complete BTB protein superfamilies from Arabidopsis and Physcomitrella. The approximately 100 amino acid core BTB domain from all Arabidopsis and Physcomitrella BTB proteins were aligned byClustalW. The alignments were used to generate non-rooted phylogenetic trees with MEGA 4.0, using the Poisson distance method and a bootstrap value of 1000. The subfamilies identified from the phylogenetic analysis are marked on the bottom of the trees. Individual members of the tree are color-coded by the nature of the domain(s) either N- or C-terminal to the BTB domain. Where possible, designations for proteins previously identified by other methods are used. -Physcomitrella patens encodes 65 putative BTB domain-containing proteins in its genome. -There are a variety of different motifs/domains attached to BTB domains in the Physcomitrella BTB family, suggesting a number of different types of target proteins. -The Physcomitrella and Arabidopsis BTB families are similar in overall size and share many similar BTB types, suggesting the two plants use the BTB family in similar ways. -However, there are also several Physcomitrella and Arabidopsis-specific types, suggesting some specialization in each lineage. -The BTB-NPH3 family in Arabidopsis is significantly larger than that in Physcomitrella (30 vs. 17), suggesting that there may have been an expansion of this subfamily as more complex plants evolved.

Upload: others

Post on 22-Jan-2022

1 views

Category:

Documents


0 download

TRANSCRIPT

Comparison of the BTB Superfamilies in Physcomitrella patens and Arabidopsis thaliana

Jacob P. Janssen, Dr. Derek Gingerich,University of Wisconsin-Eau Claire

K

Target

BTBMotif

CUL3

RbxUb

C E2

BTB

The Arabidopsis and Physcomitrella BTB families encoded many similar BTB proteins

Figure 2. Grouping of the BTB subfamilies based on the nature of additional motifs outside of the BTB domain along with a protein diagram of representative members.

135987

2261782345479872811485710661652205

At5g64330

NPR1At3g48360

At5g19330At5g55000

NPH3

Ankyrin

TPRArmadilloPentapeptideOther

30At3g43700MATH 6

75321

26

17363

PhyscomitrellaArabidopsis

225

Other27

1 0 6 6 1 6 1 S DV T F L - V E GR R F Y AHR I AL - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - L AS S DAF R AMF D- - GG2 2 6 1 7 8 1 ADV K F E - V DGE L Y HAHK L V L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAR S P V F K AQL F - - GP1 7 0 4 5 1 1 S DV V F E S L DGR K V Y GHR AV L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AS QS DV F K AMF S - - MP2 3 1 8 9 4 1 V DV E L QGV DGDR V HAHK AI L S V GWV L V DP S L C S E L GQV L L S T I HE S V R R R C P Y V I AE K K S V AS K S I E F E AMF R - - AD1 3 0 7 6 9 1 T DV T I C S AT G- C V GAHR AI L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAR S P V F DS MF S - - HN1 6 5 6 6 0 1 ADV C F L - V GNE K F R C HR F V L G- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AR S E Y F K AR F S R T T G1 8 9 9 2 2 1 MDT S V V - L R S K T I HI S S AI L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AAK S P Y F Y K L F S - - NG9 8 7 2 8 1 ADL T V C T DDGS QI HV HS MV L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - MAS S P V F K F Y L Q- - K E1 1 4 8 5 7 1 NDI V F H- L GGDAV P C NR E K I - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - AGL S MP F NT ML N- - GV1 3 5 9 8 7 1 S DI V V E - AGGMNF S L HK F P L - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - V AR S GR I R K L V ANL AD1 8 7 7 0 7 1 - S V V HL NV GGMMF AT T V DT L T - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - QR DT DS ML AV MF S GR HR1 0 4 5 4 5 1 - GR AK I E V GGR V F V T T MQT V - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - E K AGK DS HL Y K - - - T

1 0 6 6 1 6 K R L C E Y S MAQNL -2 2 6 1 7 8 R L L C E S K L C E NV -1 7 0 4 5 1 K S L C E QE I L QS L -2 3 1 8 9 4 L E V V C E DS L V K NM1 3 0 7 6 9 K E AC E S S L V DDI -1 6 5 6 6 0 K R AV T DAL L P QL -1 8 9 9 2 2 MR Y C S - R L L K NMP9 8 7 2 8 K QT C T V HF E QGL -1 1 4 8 5 7 K DAC DQNL AT F V -1 3 5 9 8 7 V AR T E AF L S E V V -1 8 7 7 0 7 V E NI T P F L C K K D-1 0 4 5 4 5 L R NAHGR AP L NG-

1 0 6 6 1 62 2 6 1 7 81 7 0 4 5 12 3 1 8 9 41 3 0 7 6 91 6 5 6 6 01 8 9 9 2 29 8 7 2 81 1 4 8 5 71 3 5 9 8 71 8 7 7 0 71 0 4 5 4 5

Y R E K E A- - - - - - - - - - - L DI E I P MR DR NS - - - - - - - - - - - DHI E I K MAE GT P - - - - - - - - - - - L HME MN T DR K V V - - - - - - - - - - - WNL E MP L ME K QS - - - - - - - - - - - AT V R I N F R E GV AGL AT NS GADS L L I L QE N MR E S E Q- - - - - - - - - R DV T L R I T V R K S R F P - - - - - - - - - - - AI DI F F L E AR MC D- - - - - - - - - I GF S K N S E NP NL - - - - - - - - - - - L QL T DV L HMDS K - - - - - - - - - - - E GAV F I V S QS GQ- - - - - - - - - - - - - - GWF

NI S WK V F E L MMR F I Y - - - - - - - - - - T GNV DMAT DNAQDL L R AADQY L L EDI E P P V F K AL L HF I Y R DS L P DT K E L I GAP S T S T L L AQHL L AAADR Y GL D- - - Y P AL DAF I F F F Y R - - - - - - - - AT V NQAV L R NHL V E L L QAADK Y GI E- - - Y V AL K AF V T F F Y T - - - - - - - - GR V S S S V L I E Y L T T L L DAAHK Y NV QDMC L E S C R AL L S F L Y G- - - - - - - - - NL K Y P DF R K HR V AL V R AAHK Y DI VDL S AS AF E K V L E Y I Y T - - - - - - - - - DS V K T V DL DE AE E L F DAAS R Y L L FQAE E S P L MDL L QF MY S - - - - - - - - - GR V QANT P AT V L DV L MAADK Y E V AGV P F HAV R C F L R Y L Y S - - - - - - - - S R C E R S DME E C AL HL V V L AHT Y MV PGI S V T GMR AV DHF S K T - - - - - - - - - GR L AR L S P E ML L E I L S F ANR F C C DP GGAE AF DL AAK F C Y G- - - - I NF E I T T S NV AV L R C AAE Y L DMT E S Y GE NDR DGT HF R HI L NWL R D- - - - - - - - - GV I P ML E I S AY QE L HR E AE Y Y QL MDR DP DMF NI L L NML R T G- - - - - - - V V Y P K P E S L GF L DR L I E E AT F Y GI E

GRAFDPTATNGE

LLL

LLCLLLLC

-

Sequence Alignment of Representatative Physcomitrella BTB Domains.

Introduction Following their synthesis, proteins in cells can be modified in a variety of ways. These modifications may modulate the protein’s activity, localization, interactions with other molecules, or stability. One such modification is the covalent attachment of a small globular protein called ubiquitin (Ub). Ubiquitination can change the activity or localization of a protein, if one Ub moiety is attached, or can lead to degradation of the protein if chains of Ubs are added. Selective degradation of key proteins by ubiquitination has been shown to be an important regulatory mechanism in many signaling pathways in eukaryotes. The attachment of Ub to target proteins is catalyzed by E3 Ub-ligases. One family of ligases are the BTB/Cullin 3 (CUL3) E3s. These complexes are composed of three proteins: CUL3, which acts as a backbone, RBX, which helps recruit the Ub, and a BTB (Bric-a-Brac/Tramtrack/Broad Complex) domain-containing protein (Fig. 1). The BTB motif allows the BTB protein to bind CUL3. Besides the BTB motif, BTB proteins contain other sequences (such as MATH or NPH3 domains) which recruit the target. BTB gene families are very diverse in eukaryotes. The sets of BTB proteins encoded in animal genomes are very different from those encoded in plants, for instance. We are interested in how the BTB gene family has changed during plant evolution. Dr. Gingerich has previously characterized the BTB family in a dicotyledenous flowering plant, Arabidopsis thaliana. In this studied, I have begun to characterize the family in a simple nonvascular moss, Physcomitrella patens. Significant differences in the BTB families might suggest that the two plants use the BTB/CUL3 E3s to target different proteins for degradation.

Conclusions

9 9

1 3 71 0 0

7999

1 0 2 1 0 2 1 0 2 1 0 2 1 0 8 1 0 4

79

BA1

D2RL

BPECJ

A2A2

39 known eukaryotic BTB domains used as initial queries

69 loci identified with < 0.0004 E-value

analysis and reannotation of gene sequences4 loci predicted to be nonfunctional

65 loci identified with < 0.0004 E-value

Figure 1 BTB/Cullin 3 E3 Ubiquitin-Ligases

77289

93266 61156

118125

130769

66017

97923

74560 124356150882

85405

87316

50789

42628 159693

219111

134565

30287

164153

100

100

100

100

93

100

6365

100

97

35

94

78

100

100

100

100

100

82

100

100

69100

10077

95

32

39

53

90

52

99

100

100

58

45

100

92

56

100

96

96

99

90

67

35

72 56

33

29 43

205 6

9

24 6

1

6

0

11

NPH3TAZ-CaM B DTPR

Armadillo

PentapeptideAP2Back 3

Physcomitrella ArabidopsisSubfamily size

626

3053

1

1380

Scop Domain 162

51

0

"A2" 43"D2" 21"F" 30"H" 56

Ankyrin 170451

106616

106476

165660

143618

226178

217835 98728

150321

223881

234547

231894 189922

122867

3260

121620 119190

31431

67811

231552

93641

41851

154574

21814

135987 127110

134520

12061 106949

12163

31592

2869 1322

22014

161287

114857

80508

50218

167201

111151

104545

148281

125607

187707 52205

52179

173

3

2

MATH

Other 5

00

Total 65

1

RPT2

NPH3, RPT3

PRL1-IFG

NPR1

AtPOB1

AtBT5AtBT4

AtBT3AtBT2AtBT1

At5g21010At3g43700

At3g03740At2g39760At5g19000

At3g06190At5g55000

At5g41330At3g09030At4g30940At2g24240

At5g19330At5g13060

At4g02680 EOL1At5g58550EOL2

At3g51770 ETO1At2g05330

At5g48510At4g04090

At2g40450At3g29740

At2g40440At3g56230

At1g01640At1g55760At1g21780

At4g08455

At4g37590 At2g23050At5g67440 At2g14820

At4g31820At5g47800

At1g67900At3g26490

At3g44820At5g10250

At3g50840At5g66560

At5g64330At1g30440

At5g03250 At5g13600At3g49970

At5g67385At5g48130

At2g30520At3g08570

At3g08660

At5g63160At3g48360

At1g05690At5g67480At4g37610

At2g30600At4g01160

At3g61600At2g46260

At5g45110At4g19660

At4g26120At1g64280

At3g57130At2g41370

At5g60050At1g63850At3g50780

At2g13690At3g05675

At5g48800At1g03010

At2g47860At3g22104

At1g50280At3g19850

At3g49900At3g03510

At5g17580At2g04740

At1g04390

10099

85

81

65

48

10026

4626

11100

9971

5213

933229

91

97

99

4646

37

42

6

100

9954

98

97

10096

69100

100

090

9760

58

2

5100

99

40

91100

6643

2799

2810

3

101997

3326

12

811

2851

100

79

7561

8586

8078

100

100

ARIA

BOP1BOP2

Phylogenetic Analysis of the BTB Superfamilies in Physcomitrella and Arabidopsis

Physcomitrella

Arabidopsis

Identification of BTB domain-containingproteins encoded by the Physcomitrella genome

BLAST search Physcomitrella patens genome v1.1 predicted protein sequences (from the DOE Joint Genome Institute Physcomitrella genome sequencing project)

Domain # of Genes Protein ID

TAZ zinc fingerETO1

# of Genes Protein ID

BA1

D2RL

BPECJ

A2A2

BA1

D2RL

BPECJ

A2A2

Figure 3. Sequence alignment of representative Physcomitrella BTB domains. The approximately 100 amino acid core BTB sequence from 12 representative Physcomitrella BTB proteins were aligned with ClustalW and displayed with Boxshade using a threshhold of 55%. Identical and conserved amino acids are shown with black and grey, respectively. Dashes denote gaps. Desginations to the left indicate the BTB family and the JGI number for each protein.

Some data presented in this poster was generated by Dr. Gingerich when he was funded by an NIH NRSA post-doctoral fellowship (#GM068361)

100

43

00

“K”“Q”

JCSTA2

Q I G H P R MN E A1

B L D2

K

J I H G F E D2

D1

C B A2

A1

Figure 4. Phylogenetic trees of the complete BTB protein superfamilies from Arabidopsis and Physcomitrella. The approximately 100 amino acid core BTB domain from all Arabidopsis and Physcomitrella BTB proteins were aligned byClustalW. The alignments were used to generate non-rooted phylogenetic trees with MEGA 4.0, using the Poisson distance method and a bootstrap value of 1000. The subfamilies identified from the phylogenetic analysis are marked on the bottom of the trees. Individual members of the tree are color-coded by the nature of the domain(s) either N- or C-terminal to the BTB domain. Where possible, designations for proteins previously identified by other methods are used.

-Physcomitrella patens encodes 65 putative BTB domain-containing proteins in its genome.

-There are a variety of different motifs/domains attached to BTB domains in the PhyscomitrellaBTB family, suggesting a number of different types of target proteins.

-The Physcomitrella and Arabidopsis BTB families are similar in overall size and share many similarBTB types, suggesting the two plants use the BTB family in similar ways.

-However, there are also several Physcomitrella and Arabidopsis-specific types, suggesting some specialization in each lineage.

-The BTB-NPH3 family in Arabidopsis is significantly larger than that in Physcomitrella (30 vs. 17), suggesting that there may have been an expansion of this subfamily as more complex plants evolved.