platelet-derived growth factor a chain: gene structure, chromosomal

5
Proc. Nail. Acad. Sci. USA Vol. 85, pp. 1492-1496, March 1988 Biochemistry Platelet-derived growth factor A chain: Gene structure, chromosomal location, and basis for alternative mRNA splicing DAVID T. BONTHRON*t, CYNTHIA C. MORTONO, STUART H. ORKIN*, AND TUCKER COLLINS*§ *Division of Hematology-Oncology, Children's Hospital and Dana-Farber Cancer Institute, Howard Hughes Medical Institute, Harvard Medical School, and *Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115 Communicated by Earl P. Benditt, November 23, 1987 (received for review July 9, 1987) ABSTRACT Genomic clones encoding the A chain of platelet-derived growth factor (PDGF) have been isolated. The gene contains seven exons spanning about 24 kilobases of DNA. The positions of intervening sequences closely match those of the related B-chain (c-sis) gene on chromosome 22. In situ hybridization was used to localize the PDGF A-chain gene to the distal portion of the short arm of chromosome 7 (7p2l-p22). Within the (G+C)-rich 5' region, a single tran- scriptional start site was identified =%36 base pairs downstream of a TATAA consensus promoter element. The three size classes of A-chain mRNA probably arise by selection of alternative poly(A) sites in exon 7, but only a single consensus AATAAA signal was identified in this region. Two functionally different A-chain precursors, which differ by the presence or absence of a basic C terminus, are generated as a result of alternative mRNA splicing events, which include or exdude exon 6. This and other structural features of the A-chain gene suggest that PDGF expression may be modulated at transcrip- tional and post-transcriptional levels. Platelet-derived growth factor (PDGF) is a cationic glyco- protein of Mr 30,000 (for recent review, see ref. 1). As a potent mitogen for a variety of connective tissue cells, it induces competence to enter S phase, which is accompanied by transient rises in the levels of mRNA for the protoonco- genes c-fos and c-myc (2, 3). PDGF acts by way of a specific cell-surface receptor, a member of the transmembrane tyro- sine kinase family (4). As purified from platelets, PDGF is a disulfide-bonded dimer of two polypeptides, designated A and B, which are the products of related but separate genes (5). cDNA encoding the A chain of PDGF was first isolated from a malignant glioma cell line; the A-chain gene was assigned to chromosome 7 (7pter-7q22) (5). The B-chain precursor is encoded by the c-sis gene, located in chromo- somal region 22ql2.3-ql3.1 (6). The product of the simian sarcoma virus oncogene (v-sis) is a B-chain-related mitogen that accounts for the viral transforming activity (7-9). Although a variety of tumor-derived cells also secrete PDGF-like mitogens, some express only A- or B-chain mRNA (5). This implies that A- and B-chain precursors can associate to form functional homodimers. Cultured human endothelial cells express A- and B-chain mRNAs (10, 11). The coding region of B-chain mRNA from endothelial cells is identical to that of tumor-derived cDNA (10). In contrast, we and others have shown (12, 13) that A-chain cDNA clones derived from nontransformed cultured human endothelial cells lack an internal 69-base-pair (bp) region present in the reported glioma cDNA sequence. This region encodes an extremely basic C-terminal portion of the A-chain precursor, which appears to be required for efficient assembly of a mitogenic A-A homodimer. To define the nature of A-chain mRNA processing and to compare the organization of A- and B-chain genes, we have isolated and characterized the normal human PDGF A-chain gene.¶ The internal 69-bp region of the glioma-derived cDNA is a single exon of the normal A-chain gene, confirm- ing the suggestion (5, 12) that the two forms of mRNA arise by alternative splicing. Consistent with the homologous primary structures of A and B chains, the exon-intron structures of the two PDGF genes are similar. We have further localized the A-chain gene to the distal end of the short arm of chromosome 7 by in situ hybridization. MATERIALS AND METHODS Cell Culture. Growth of human passaged umbilical vein endothelial cells and RNA preparation have been described (11, 12). HepG2 cells were grown in Dulbecco's modified Eagle's medium containing 10%o fetal bovine serum (GIBCO). Isolation of Genomic Clones. A library of human peripheral blood DNA partially digested with Sau3A was constructed in the vector AEMBL3 (14). It was screened by plaque hybridization with 5' end [EcoRI-Mst II of clone D-1 (5), 290 bp] and 3' end [Bal I-EcoRI of clone dT1.1 (12), 724 bp] cDNA probes. DNA Sequencing. Sequencing was performed mostly by the Sanger method, using M13 and plasmid subclones of isolated restriction fragments (15, 16). The nucleotide ana- logue 7-deazaguanosine trisphosphate (17) was helpful in some (G + C)-rich 5' regions, but others required use of chemical degradation techniques (18). Primer Extension. Oligonucleotides DB1 (5' CTGCGAGCT- GCGAGCCGCCCTG 3') and DB2 (5' CACACCGATCA- CCTGCCTGAAC 3') were labeled with [32PJATP and poly- nucleotide kinase to a specific activity 1-2 x 109 cpm/pug; 5 x 105 cpm were hybridized to each RNA sample at 580C for 1 hr in 12 Al1 of 100 mM KCI/10 mM MgCl2/25 mM Tris'HCI, pH 8.5. The reverse transcription reactions (40 ,ul) contained 30 mM KCl, 8 mM MgCI2, 50 mM Tris HCl (pH 8.5), 500 .uM (each) dNTP, 50 ,ug of actinomycin D per ml, 4 units of RNasin (Promega Biotec, Madison, WI), and 50 units of avian myeloblastosis virus reverse transcriptase (Molecular Genetic Resources, Tampa, FL) at 42°C for 30 min. Filter Hybridization. Restriction fragments were labeled by random priming (19). A 69-bp probe specific for exon 6 was made from overlapping complementary oligonucleotides PDGF-1 (5' CTGGTTGGCTGCTTTAGGTGGGTTTTAAC- Abbreviations: PDGF, platelet-derived growth factor; IVS, inter- vening sequence. tPresent address: Department of Paediatrics, Royal Postgraduate Medical School, Hammersmith Hospital, Du Cane Road, London W12 OHS, United Kingdom. §To whom reprint requests should be addressed. IThis sequence is being deposited in the EMBL/GenBank data base (Bolt, Beranek, and Newman Laboratories, Cambridge, MA, and Eur. Mol. Biol. Lab., Heidelberg) (accession no. J03638). 1492 The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Upload: nguyenngoc

Post on 09-Feb-2017

216 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Platelet-derived growth factor A chain: Gene structure, chromosomal

Proc. Nail. Acad. Sci. USAVol. 85, pp. 1492-1496, March 1988Biochemistry

Platelet-derived growth factor A chain: Gene structure,chromosomal location, and basis for alternative mRNA splicingDAVID T. BONTHRON*t, CYNTHIA C. MORTONO, STUART H. ORKIN*, AND TUCKER COLLINS*§*Division of Hematology-Oncology, Children's Hospital and Dana-Farber Cancer Institute, Howard Hughes Medical Institute, Harvard Medical School, and*Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA 02115

Communicated by Earl P. Benditt, November 23, 1987 (received for review July 9, 1987)

ABSTRACT Genomic clones encoding the A chain ofplatelet-derived growth factor (PDGF) have been isolated. Thegene contains seven exons spanning about 24 kilobases ofDNA. The positions of intervening sequences closely matchthose of the related B-chain (c-sis) gene on chromosome 22. Insitu hybridization was used to localize the PDGF A-chain geneto the distal portion of the short arm of chromosome 7(7p2l-p22). Within the (G+C)-rich 5' region, a single tran-scriptional start site was identified =%36 base pairs downstreamof a TATAA consensus promoter element. The three sizeclasses of A-chain mRNA probably arise by selection ofalternative poly(A) sites in exon 7, but only a single consensusAATAAA signal was identified in this region. Two functionallydifferent A-chain precursors, which differ by the presence orabsence of a basic C terminus, are generated as a result ofalternative mRNA splicing events, which include or exdudeexon 6. This and other structural features of the A-chain genesuggest that PDGF expression may be modulated at transcrip-tional and post-transcriptional levels.

Platelet-derived growth factor (PDGF) is a cationic glyco-protein of Mr 30,000 (for recent review, see ref. 1). As apotent mitogen for a variety of connective tissue cells, itinduces competence to enter S phase, which is accompaniedby transient rises in the levels of mRNA for the protoonco-genes c-fos and c-myc (2, 3). PDGF acts by way of a specificcell-surface receptor, a member of the transmembrane tyro-sine kinase family (4). As purified from platelets, PDGF is adisulfide-bonded dimer of two polypeptides, designated Aand B, which are the products of related but separate genes(5). cDNA encoding the A chain of PDGF was first isolatedfrom a malignant glioma cell line; the A-chain gene wasassigned to chromosome 7 (7pter-7q22) (5). The B-chainprecursor is encoded by the c-sis gene, located in chromo-somal region 22ql2.3-ql3.1 (6). The product of the simiansarcoma virus oncogene (v-sis) is a B-chain-related mitogenthat accounts for the viral transforming activity (7-9).

Although a variety of tumor-derived cells also secretePDGF-like mitogens, some express only A- or B-chainmRNA (5). This implies that A- and B-chain precursors canassociate to form functional homodimers. Cultured humanendothelial cells express A- and B-chain mRNAs (10, 11).The coding region of B-chain mRNA from endothelial cells isidentical to that of tumor-derived cDNA (10). In contrast, weand others have shown (12, 13) that A-chain cDNA clonesderived from nontransformed cultured human endothelialcells lack an internal 69-base-pair (bp) region present in thereported glioma cDNA sequence. This region encodes anextremely basic C-terminal portion of the A-chain precursor,which appears to be required for efficient assembly of amitogenic A-A homodimer.

To define the nature of A-chain mRNA processing and tocompare the organization of A- and B-chain genes, we haveisolated and characterized the normal human PDGF A-chaingene.¶ The internal 69-bp region of the glioma-derivedcDNA is a single exon of the normal A-chain gene, confirm-ing the suggestion (5, 12) that the two forms of mRNA ariseby alternative splicing. Consistent with the homologousprimary structures of A and B chains, the exon-intronstructures of the two PDGF genes are similar. We havefurther localized the A-chain gene to the distal end of theshort arm of chromosome 7 by in situ hybridization.

MATERIALS AND METHODSCell Culture. Growth of human passaged umbilical vein

endothelial cells and RNA preparation have been described(11, 12). HepG2 cells were grown in Dulbecco's modifiedEagle's medium containing 10%o fetal bovine serum (GIBCO).

Isolation of Genomic Clones. A library of human peripheralblood DNA partially digested with Sau3A was constructedin the vector AEMBL3 (14). It was screened by plaquehybridization with 5' end [EcoRI-Mst II of clone D-1 (5), 290bp] and 3' end [Bal I-EcoRI of clone dT1.1 (12), 724 bp]cDNA probes.DNA Sequencing. Sequencing was performed mostly by

the Sanger method, using M13 and plasmid subclones ofisolated restriction fragments (15, 16). The nucleotide ana-logue 7-deazaguanosine trisphosphate (17) was helpful insome (G +C)-rich 5' regions, but others required use ofchemical degradation techniques (18).

Primer Extension. Oligonucleotides DB1 (5' CTGCGAGCT-GCGAGCCGCCCTG 3') and DB2 (5' CACACCGATCA-CCTGCCTGAAC 3') were labeled with [32PJATP and poly-nucleotide kinase to a specific activity 1-2 x 109 cpm/pug; 5x 105 cpm were hybridized to each RNA sample at 580C for1 hr in 12 Al1 of 100 mM KCI/10 mM MgCl2/25 mM Tris'HCI,pH 8.5. The reverse transcription reactions (40 ,ul) contained30 mM KCl, 8 mM MgCI2, 50 mM Tris HCl (pH 8.5), 500 .uM(each) dNTP, 50 ,ug of actinomycin D per ml, 4 units ofRNasin (Promega Biotec, Madison, WI), and 50 units of avianmyeloblastosis virus reverse transcriptase (Molecular GeneticResources, Tampa, FL) at 42°C for 30 min.

Filter Hybridization. Restriction fragments were labeledby random priming (19). A 69-bp probe specific for exon 6was made from overlapping complementary oligonucleotidesPDGF-1 (5' CTGGTTGGCTGCTTTAGGTGGGTTTTAAC-

Abbreviations: PDGF, platelet-derived growth factor; IVS, inter-vening sequence.tPresent address: Department of Paediatrics, Royal PostgraduateMedical School, Hammersmith Hospital, Du Cane Road, LondonW12 OHS, United Kingdom.§To whom reprint requests should be addressed.IThis sequence is being deposited in the EMBL/GenBank data base(Bolt, Beranek, and Newman Laboratories, Cambridge, MA, andEur. Mol. Biol. Lab., Heidelberg) (accession no. J03638).

1492

The publication costs of this article were defrayed in part by page chargepayment. This article must therefore be hereby marked "advertisement"in accordance with 18 U.S.C. §1734 solely to indicate this fact.

Page 2: Platelet-derived growth factor A chain: Gene structure, chromosomal

Proc. Nati. Acad. Sci. USA 85 (1988) 1493

CTTT 3') and PDGF-2 (5' GAAGGCCTAGGGAGTCAGG-TAAAAAACGGAAAAGAAAAAGGTTAAAACCC 3') asdescribed (20).Chromosomal in Situ Hybridization. The 1.75-kilobase (kb)

endothelial cell-derived cDNA clone dTl.1 (12) was labeledto a specific activity of 1.2 x 107 cpm/,ug by nick-translationwith all four tritiated nucleotides (New England Nuclear)(21). In situ hybridization to metaphase chromosome prep-arations of normal male peripheral blood lymphocytes wasperformed as described (21).

RESULTSStructure of the PDGF A-Chain Gene. The restriction map

of a 47-kb region containing the A-chain gene (Fig. 1) wasconstructed by analysis of overlapping bacteriophage clonesisolated from a human genomic library. The gene spans -24kb. Exons and their corresponding intron boundaries werelocated and sequenced (Fig. 2). Like the related B-chaingene (22), the A-chain gene contains seven exons, whichcorrelate with functional subdivisions of the growth factor.The unusually long 5' untranslated region and signal peptideconstitute exon 1. Exons 2 and 3 encode the N-terminalpropeptide; exons 4 and 5 encode the mature PDGF subunit.The long exon 7 consists almost entirely of 3' untranslatedsequences.The 69-bp region present in a glioma-derived cDNA clone

but missing in normal endothelial cell-derived cDNAs cor-responds to exon 6. This demonstrates that the two observedforms of A-chain cDNA are indeed the consequence ofalternative processing of transcripts of the normal A-chaingene. Hybridization of genomic Southern blots with a syn-thetic 69-bp probe corresponding to exon 6 established thatno gross rearrangement had occurred in the vicinity of thisexon (not shown).

All splice junction sequences conform to the GT/AG rule(23). Unusual features were seen near the splice donor sitesof intervening sequences 3 and 4 (IVS3 and IVS4). In theformer case, there is a perfect 41-bp repeat (underlined inFig. 2). IVS4 begins with an unbroken stretch of 100 pyrim-idine residues that includes multiple repeats of the sequenceCCCT.Assignment of the Transcriptional Start Site. To define the

transcription initiation site(s) of the A-chain gene, primerextension analysis was performed by using RNA from pas-saged human endothelial cells and from the hepatoblastomaline HepG2 (Fig. 3). The latter has been reported to secretea PDGF-like mitogen (24), and we have confirmed by RNAtransfer blotting (not shown) that it expresses a low level ofA- and B-chain mRNAs. Single bands, of 352 and 389nucleotides, were generated by using each of two oligonu-

cleotide primers, designated DB1 and DB2, respectively,spaced 39 bp apart on the mRNA. This placed the start site36 bp (+ 5 bp) downstream of a consensus TATAA (Gold-berg-Hogness box) sequence (Fig. 3). This result is consis-tent with the TATAA motif being a functional element of theA-chain promoter. No other TATAA sequences were iden-tified in 1 kb of upstream sequence (data not shown). Sinceno other extension products were seen in the 370 bp down-stream of this site, multiple transcriptional start sites down-stream from the consensus TATAA are unlikely to accountfor the heterogeneous size of A-chain mRNAs (approxi-mately 2.6, 2.3, and 1.6 kb).Exon 7: Poly(A) Sites. In view of the unique transcriptional

start site, mRNA size heterogeneity is likely to reflect use ofalternative poly(A) sites. Exon 7 extends at least as far as thearrowheads at position 3157 in Fig. 2, which represents theend of the cDNA clone dTM.1 (12) and corresponds to amRNA size of at least 2.0 kb. The shortest form of mRNApresumably results from cleavage and polyadenylylationfarther upstream, though the closest match to a consensusAATAAA signal upstream of this point is a number of copiesof AACAAA, which is not thought to be an efficient substi-tute (25). The short (750 bp) form of mouse dihydrofolatereductase mRNA is also derived from an alternative noncon-sensus poly(A) signal within the 3' untranslated region of thegene (26). Farther beyond the end of the endothelial cDNAclone, the only close matches to AATAAA in the genomicsequence are CATAAA at position 3201 and a perfect match,AATAAA, at position 3372. The latter predicts a mRNA of2.25 kb plus poly(A) tail, which may correspond to the largeor intermediate mRNA species seen on RNA transfer blots.An (A+ U)-rich sequence, including repeats of AUUUA,

is characteristic of the 3' untranslated regions of somerapidly degraded mRNAs for inflammatory mediators andoncogenes (27). Such a sequence is not present in theexisting A-chain exon 7 sequence (Fig. 2) but is present inthe 3' untranslated region of the B-chain mRNA (22). This isconsistent with the short half-life of the B-chain mRNA(35-40 min) compared to that of the A chain (>3 hr) incultured rat pup smooth muscle cells (28).Chromosomal Localization of the PDGF A-Chain Gene.

The genes for A and B chains of PDGF have been localizedto chromosomes 7 (5) and 22 (6), respectively. To map theA-chain gene more precisely, we performed in situ hybrid-ization to metaphase chromosomes. The results of analysisof 150 metaphase spreads are shown in Fig. 4. Of the 383silver grains recorded, 29 (7.6%) were located on or besidebands 7p21-p22 (most likely within 7p22). We note that 7p22is the site of one of a defined subset of mutagen-inducedfragile sites, many of which coincide with the locations ofknown oncogenes (29), and is also the site of one breakpoint

0 10 20 30 40ia I i I iI

K K KK H S K HHI I I I

I I I II I

RR R XXR X R X kkX k kg

1 2 3 4 5 6 7- m Aap-;

1 XAC 10

XAC 23 r

XAC 27 r- 1

XAC41I

FIG. 1. Map of the cloned region of the PDGF A-chain gene. The scale at the top is in kilobases. The restriction map was derived fromfour overlapping bacteriophage clones, as shown beneath. All sites are shown for the enzymes HindIII (H), Kpn I (K), EcoRI (R), Sal I (S),and Xba I (X). Exons are numbered 1-7.

Biochemistry: Bonthron et al.

Page 3: Platelet-derived growth factor A chain: Gene structure, chromosomal

1494 Biochemistry: Bonthron et al.

1IGAGGGAGGGGCGCGGAGCCCCGGCGCGGAGCCGGGCGCGGGGCTTTGATGGATTTAGCTGCTTGCGCGAGCGCGTGTGTGCTCCCTGCCGCAGCGGCGGCGCCCGGGCCCTGCCGGGTCC121 GCACGAACCCCGAGCGCTTCCAAGGTGCGGGTCCCAGGCCCGGAATCCGGGGGAGGCGGGGGGGGGGGGGCGGGGGCGGGGGCGGGGGAGGGGCGCGGCGGCGGCGT cCCTCTCC241 CCGCCGCCGGCCGGCTCCAC jSCGCGCGCCCTGCGGAGCCCGCCCAACTCCGGCGAGCCGGGCCTGCGCCTACTCCTCCTCCTCCTCTCCCGGCGGCGGCTGCGGCGGAGGCGCCGACTC360 GGCCTTGCGCCCGCCCTCAGGCCCGCGCGGGCGGCGCAGCGAGGCCCCGGGCGGCGGGTGGTGGCTGCCAGGCGGCTCGGCCGCGGGCGCTGCCCGGCCCCGGCGAGCGGAGGGCGGAGC480 GCGGCGCCGGAGCCGAGGGCGCGCCGCGGAGGGGGTGCTGGGCCq:GCTGTGCCCGGCCGGGCGGCGGCTGCAAGAGGAGGcCGGAGGCGAGCGCGGGGCCGGCGGTGGGCGCGCAGGGC600 GGCTCGCAGCTCGCAGCCGGGGCCGGGCCAGGCGTTCAGGCAGGTwATCGGTGTGGCGGCGGCGGCGGCGGCGGCCCCAGACTCCCTCCGGAGTTCTTCTTGGGGCTGATGTCCGCAAAT720 ATGCAGAATTACCGGCCGGGTCGCTCCTGAAGCCAGCGCGGGGAGCGAGCGCGGCGGCGGCCAGCACCGGGAACGCACCGAGGAAGAAGCCCAGCCCCCGCCCTCCGCCCCTTCCGTCCC840 CACCCCCTACCCGGCGGCCCAGGAGGCTCCCCGCGCTGCGGGCGCGCACTCCCTGTTTCTCCTCCTCCTGGCTGGCGCTGCCTGCCTCTCCGCACTCACTGCTCGCGCCGGGCGCGCTCC960 GCCAGCTCCGTGCTCCCCGCGCCACCCTCCTCCGGGCCGCGCTCCCTAAGGGATGGTACTGAATTTCGCCGCCACAGGAGACCGGCTGGAGCGCCCGCCCCGCGGCCTCGCCTCTCCTCC

1080 GAGCAGCCAGCGCCTCGGGACGCGATGAGGACCTTGGCTTGCCTGCTGCTCCTCGGCTGCGGATACCTCGCCCATGTTCTGGCCGAG GTTGGTGCCGCCCCCGCGCCCCGTCCCTGCGCMetArgThrLeuAlaCysLeuLeuLeuLeuGlyCysGlyTyrLeuAlaHisValLeuAlaGluj

1199 CGGCTCCTCCG.......... IVS 1 (1.5 kb) .........GGCCGCGGGCGCTGACCGTGTGGCCTCTGCTTGCAG rGAAGCCGAGATCCCCCGCGAGGTGATCGAGAGGCTGGCCLGluAlaGluIleProArgGluValIleGluArgLeuAla

1285 CGCAGTCAGATCCACAGCATCCGGGACCTCCAGCGACTCCTGGAGATAGACTCCGTAG1 GTAAATCGCGCCCCTTCCCTCCGCGCGCGGGGAGGGCGCGGGC.... IVS 2 (5.8 kb)ArgSerGlnIleHisSerIleArgAspLeuGlnArgLeuLeuGluIleAspSerValGj

1386. GCGGGAGCTGTGAGGGGCTGTGCCGCCAGGTGCCTGTTCCCCAGTGGCTCCCAAAGCTGGTCTGTGGGAAGTGCGGCTGGACAGGCCCAGGGCACAGCGCACGGGCCTGGGGCA1500 TTCACGGTGTTCTCCTTCCGCCTGCAG rGGAGTGAGGATTCTTTGGACACCAGCCTGAGAGCTCACGGGGTCCACGCCACTAAGCATGTGCCCGAGAAGCGGCCCCTGCCCATTCGGAGG

LlySerGluAspSerLeuAspThrSerLeuArgAlaHisGlyValHisAlaThrLysHisvalProGluLysArgProLeuProIleArgArg1619 AAGAGAAGCATCG1 GTGAGTCCAGGAGGCCGCGATGGGCAGGGCAGGGCCGGQTCGGGGTGAGTCCAGGAGGCCGCGATGGGCAGGGCAGGGCCGGGTGGGGAGGAGGTGCCC .......

LysArgSer IleGJ1731 . . IVS 3 (1.4 kb) ... GAGGCCGGTCCCCGCTCACTGTGCCCCCGCCGTTGCAG FAGGAAGCTGTCCCCGCTGTCTGCAAGACCAGGACGGTCATTTACGAGATTCCTCGGAGTCAG

LluGluAlaValProAlaValCysLysThrArgThrVal IleTyrGlu IleProArgSerGln18 31 GTCGACCCCACGTCCGCCAACTTCCTGATCTGGCCCCCGTGCGTGGAGGTGAAACGCTGCACCGGCTGCTGCAACACGAGCAGTGTCAAGTGCCAGCCCTCCCGCGTCCACCACCGCAGC

ValAspProThrSerAla~snPheLeuIleTrpProProCysValGluValLysArgCysThrGlyCysCysAsnThrSerSerValLysCysGlnProSerArgVal isHiSArgSer1951 GTCAAG GTGAGCCCTCCCCTCCCCTCCCCT... IVS 4 (10.0 kb) .. .CCGAAGCTCCATGCAGGCATTCATGGCCGGGCTCTGTTCTCTCTGGCAG GTGGCCAAGGTGGAATAC

ValLysj ValAlaLy sValGluTyr2048 GTCAGGAAGAAGCCAAAATTAAAAGAAGTCCAGGTGAGGTTAGAGGAGCATTTGGAGTGCGCCTGCGCGACCACAAGCCTGAATCCGGATTATCGGGAAGAGGACACG GTGAGTGGCT

ValArgLysLysProLysLeuLysGluvalGlnvalArgLeuGluGluH isLeuGluCysAlaCysAlaTh rThrSerLeuAsnProAspTyrA rgGluGl uAspThrG2167 GCCTTCGTCGGCATCGGTGTTGGAGAACAGGTCTTCAGAGCCTTGCTTTTGGGGTGTTAGGTGGCCCCCTT...IvS 5 (0.5 kb) ... GCCGTAGGTATTTGTTGCTTCAGTTCTTC2267 ACCCCGACGGCCGTCCCTCGGCCCACTCACCGCCCTGCCCTTTTGTTAACAG jAAGGCCTAGGGAGTCAGGTAAAAAACGGAAAAGAAAAAGGTTAAAACCCACCTAAAGCAGCCAACC

LlyArgProArgGluSerGlyLysLysArgLysArgLysArgLeuLysProThr***2386 AG] GTAGGACTGTCTGCCGGACACACTGAGTCCTGCAGGGCGGGGAGGTGCAACTGACTGATCACACCAGCAAATCACGCTTCCTGTCTCGCC... IVS 6 (1.9 kb) .. .TTTTCCT2485 CCTGTGCAGCGATGTGTGTGCTGGTGGTTCACTTGGTCGGTGGTCTCAGTGTCTGTCTTTCCCTTCTCCCTGCAG fATGTGAGGTGAGGATGAGCCGCAGCCCTTTCCTGGGACATGGAT

LspValArg* * *26C0 4 GTACATGGCGTGTTACATTCCTGAACCTACTATGTACGGTGCTTTATTGCCAGTGTGCGGTCTTTGTTCTCCTCCGTGAAAAACTGTGTCCGAGAACACTCGGGAGAACAAAGAGACAGT27 24 GCACATTTGTTTAATGTGACATCAAAGCAAGTATTGTAGCACTCGGTGAAGCAGTAAGAAGCTTCCTTGTCAAAAAGAGAGAGAGAGAGAGAGAGAGAGAAAACAAAACCACAAATGACA2844 AAAACAAAACGGACTCACAAAAATATCTAAACTCGATGAGATGGAGGGTCGCCCCGTGGGATGGAAGTGCAGAGGTCTCAGCAGACTGGATTTCTGTCCGGGTGGTCACAGGTGCTTTTT2964 TGCCGAGGATGCAGAGCCTGCTTTGGGAMACGACTCCAGAGGGGTGCTGGTGGGCTCTGCAGGGGCCCGCAGGAAGCAGGAATGTCTTGGAAACCGCCACGCGAACTTTAGAAACCACACC3084 TCCTCGCTGTAGTATTTAAGCCCATACAGAAACCTTCCTGAGAGCCTTAAGTGGTTTTTTTTTTTTGTTTTTGqrTTGTTTTTTTTTTTTTTGTTTTTTTTl'-TTTTTTTTTTTACACCAl'3204 AAAGTGATTATTAAGCTTTCCTTTTTACTCTTTGGCTAGCTTTTTTTTTTTTTTTTTTTAATTATCTCTTGGATGACATTTACACCGATAACACACAGGCTGCTGTAACTGTCAGGACAG3324 TGCGACGGTATTTTTCCTAGCAAGATGCAAACTAATGAGATGTATT TGGTATACCTACCTATGCATCATTICCTAAATGTTTCTGGCTTTGTGTTTCTCCCTTACCCTGC3444 TTTATTTGTTAATTTAAGCCATTTTGAAAGAACTATGCGTCAACCAATCGTACGCCGTCCCTGCGGCACCTGCCCCAGAGCCCGTTTGTGGCTGAGTGACAACTTGTTCCCCGCAGTGCA3564 CACCTAGAATGCTGTGTICCCACGCGGCACGTGAGATGCATTGCCGCTTCTGTCTGTGTTGTTGGTGTGCCCTGGTGCCGTGGTGGCGGTCACTCCCTCTGCTGCCAGTGTTTGGACAGA3684 ACCCAAATTCTTTATTTTTGGTAAGATATTGTGCTTTACCTGTATTAACAGAAATGTGTGTGTGTGGTTTrGTTTTTTTT 3762

FIG. 2. Nucleotide sequence of the PDGF A-chain gene. Approximate sizes of introns are indicated. The 5' and 3' limits of the cDNA clonedT1.1 are shown by upturned arrowheads. The three ATG codons upstream of the A-chain coding region are underlined. The TATAA andAATAAA sequences are boxed; exons are bracketed; the indicated transcriptional start site is only approximate, due to the large size of theprimer extension products. A 41-bp direct repeat at the start of IVS3 is indicated by long arrows.

of a deletion (7pll.2-p22) seen in myelodysplastic syndrome(29). Monosomy for 7p22 appears to result in a syndrome ofcraniosynostosis and other dysmorphic features (30).

DISCUSSION

Consistent with their presumed common evolutionary ori-gin, the intron-exon structures of the PDGF A- and B-chain

DB-1 DB-2 DB-1 DB-2M Tb A b pM M '; b b -n MFabca bcab cM

||WUNIX-H :: .* :: - 389

E~~~~~~~~~~~~~l :f`352

FIG. 3. Mapping of the PDGF A-chain cap site. Primer exten-sion products were run for 2 hr (Left) or 4.5 hr (Right) on a 6%acrylamide/8 M urea gel. Markers: end-labeled pBR322 digestedwith Hinfl (lanes M) or Hpa II (lanes M'). Two micrograms ofpoly(A)+ RNA from passaged human umbilical vein endothelialcells (lanes a), 4 ,ug of poly(A) + RNA from HepG2 cells (lanes b), or10 jig of tRNA (lanes c) was used with each of the oligonucleotides,DB1 or DB2. Exposure: 14 days at - 70 C with intensifying screen.Nucleotide positions are indicated on the right.

genes are very similar (see refs. 22 and 31). In regions ofsubstantial amino acid homology-for example, exons 1 and2-the sequences and splice junctions can be preciselyaligned (Fig. 5). Exons 4 and 5 are also well conserved.Exons 3 and 6, in contrast, have few residues in commonbetween A and B chains. Nonetheless, exon 6 of A and Bchains share the feature of encoding a very basic C-terminaldomain [predicted net charge + 11 (A chain), + 9 (B chain)].Exon 6 is the exon "skipped" during splicing of the endo-thelial-type A-chain mRNA; we have provided evidence thatan active A-chain homodimer can be readily assembled onlyfrom a precursor containing the exon 6 basic domain (12).Whether the exon 6 domain of the B-chain precursor servesa similar function and is required for the assembly of B-Bhomodimers and B-small A heterodimers is undefined.There is currently no evidence for alternative splicing ofexon 6 of the PDGF B chain, but it will be important to assayfor this event specifically, by nuclease protection.The 5' end of the A-chain gene is strikingly (G +C)-rich

(78.6% over the first 1209 bp of Fig. 2) and fits the criteria fora hypomethylated "HTF island" (32). The primer extensionexperiment suggests that the TATAA sequence within thisregion is likely to be a functional component of the A-chainpromoter. A- and B-chain genes also contain more than onecopy of the consensus binding sequence for transcriptionfactor Spl (GGGCGG) upstream and downstream of the capsite. This feature is often associated with multiple transcrip-tional start sites [e.g., epidermal growth factor receptor (33),hydroxymethylglutaryl CoA reductase (34)]. The A- andB-chain promoters may resemble the (G +C)-rich promoterregion of the c-myc gene, in which each of two start sites isdefined by a TATAA element (35, 36). The TATAA, but notthe upstream GGGCGG, sequences of the B-chain promoterare required for promoter activity in transient expressionexperiments (D.T.B., T.C., and S.H.O., unpublished).

Proc. Natl. Acad. Sci. USA 85 (1988)

Page 4: Platelet-derived growth factor A chain: Gene structure, chromosomal

Proc. Natl. Acad. Sci. USA 85 (1988) 1495

201

I -. I

p q I I P p"i I I pI Iq lpI qp I lpi 16 7 a 9 10 1112

q I I q I PI II I q IP] pI'q1 2 3 4. .

4 !,Xw-i.0-,Io . 4 0

2

p1

-Uq 2

3

*b.~J k~so La *- [L.. .. --

I IIiI I

13 14 15 16 17 18'19 20 21 2 X Y

Chromosome Number

FIG. 4. In situ hybridization of PDGF A-chain cDNA to metaphase chromosomes. The histogram displays data for all grains counted in150 metaphases. The ideogram of chromosome 7 shows the distribution of grains on that chromosome and the assignment of PDGF A chainto 7p2l-p22.

An unusual feature conserved between A- and B-chainmRNAs is the presence of three ATG triplets, upstream ofthe authentic initiator codon, within a long 5' untranslatedregion. In the A chain, the first two of these are closetogether in the same reading frame and are followed shortlyby a stop codon. The third heads an open reading frame(ORF) that could encode a 31-residue peptide; the terminatorfor this reading frame overlaps the authentic ATG initiatorcodon in such a way that if this ORF were translated (37),reinitiation would have to follow a backward "jump" of onenucleotide. Backward movement of the ribosome is believedto explain a -1 frame shift in the translation of a retroviralmRNA (38). The 5' untranslated regions of A- and B-chainmRNAs would be expected to have substantial secondary

structure due to their high G-C content; this and the presenceof upstream ATGs would be expected to decrease transla-tional efficiency (39). The possibility that the upstreamATGs are involved in translational control of PDGF expres-sion (as demonstrated for the yeast GCN4 product) (40)requires analysis by site-directed mutagenesis of these re-

gions.Since alternative splicing can generate functionally dis-

tinct PDGF A-chain mRNAs, developmental and cell-type-specific regulation of this process may have important con-sequences for physiological and pathological processes,such as embryogenesis and atherosclerosis. At present, theonly direct evidence that exon 6 is utilized remains theisolation of the original glioma clone D-1 of Betsholtz et al.

A MRTLACLLLLGCGYLAHVLAE EAEIPREVIERLARSQIHSIR... DLQRLLEIDSV GSEDSLDTSLRAHGVHATKHVPEKRPLPIRRKRSIB MNRCWALFLSLCCYLRLVSAE GDPIPEELYEMLSD ...HSIRSFDDLQRLLHGDPG .EEDGAELDLNMTRS .... HSGGELESLARGRRSL

L.* ** *** * ** ** ****,*******J ** * * **,*L4

A ......EEAVPAVCKTRTVIYEI PRSQVDPTSANFLIWPPCVEVKRCTGCCNTSSVKCNPSRVHHRSVKB GSLTIAEPAIIAECKTRTEVFEISRRLIDRTNANFLVWPPCVEVQRCSGCCNNRNVQCRPTQVQLRPVQ

.5 6 -7A VAKVEYVRKKPKLKEVQVRLEEHLECACATT.SLNPDYREEDT.................GRPRESGKKRKRKRLKPT........I..DVRB VRK IEIVRKKPIFKKATVTLEDHLACKCETVAAARPVTRSPGGSQEQ AKTPQTRVTIRTVRVRRPPKGKHRKFPHTHDKTALKETLGA ...

A CHAIN 1.5 5.2 1.4 10.0 0.5 1.9

B CHAIN 8.8 2.0 1.5 1.5 6.8 0.3

FIG. 5. Intron-exon structures of PDGF A and B chains. (Upper) The alignment is similar to that of Betsholtz et al. (5) but shows that thenonhomologous regions of A and B chains are largely confined to exons 3 and 6. Dots represent gaps introduced to improve alignment. TheN-terminal propeptide cleavage sites are shown by arrows toward the right-hand end of exon 3, and each exon is numbered above. Asterisksindicate matching residues. (Lower) The diagram compares the splicing patterns ofA and B chains. Sizes of exons are to scale; those of introns(from ref. 22 in the case of the B chain) are indicated (in kb).

10I

300

X, 20-0 10.0

.0Ez

2010

em

I0I.

0000

S.

00

307

Biochemistry: Bonthron et al.

.In I

Page 5: Platelet-derived growth factor A chain: Gene structure, chromosomal

1496 Biochemistry: Bonthron et al.

(5). However, several tumor cell lines appear to secreteA-chain homodimers. If, as implied by our cDNA expressionexperiments (12), the A-chain precursor lacking exon 6cannot efficiently form such homodimers, use of this exonmay not be rare in malignant cells. Cells cultured fromhuman atheromatous plaques may likewise produce aPDGF-like growth factor composed of A but not B chains(41). Therefore, in a variety of circumstances, alteration ofthe splicing pattern to generate the longer form of A-chainprecursor, and consequent assembly of an A-A homodimer,may provide the cell with an autocrine growth advantage.

Steady-state mRNA levels of A and B chains of PDGFvary in an inducible fashion in some cell lines (42, 43).Together with the structural data discussed above, thissuggests that regulation of transcription, alternative mRNAsplicing, differing stabilities of A- and B-chain mRNAs, andtranslational control could all be important determinants of acomplex pattern of PDGF expression.

We thank Michael Gimbrone for human endothelial cell cultures,Christer Betsholtz for supplying the cDNA clone D-1, and RobinBrown for help with labeling the PDGF 1 and 2 oligonucleotides.This work was partly supported by grants from the NationalInstitutes of Health (to T.C., C.C.M., and S.H.O.). T.C. is a PewScholar in the Biomedical Sciences. S.H.O. is an Investigator of theHoward Hughes Medical Institute.

1. Ross, R., Raines, E. W. & Bowen-Pope, D. F. (1986) Cell 46,155-169.

2. Greenberg, M. E. & Ziff, E. B. (1984) Nature (London) 311,433-438.

3. Kelly, K., Cochran, B. H., Stiles, C. D. & Leder, P. (1983)Cell 35, 603-610.

4. Yarden, Y., Escobedo, J. A., Kuang, W.-J., Yang-Feng,T. L., Daniel, T. O., Tremble, P. M., Chen, E. Y., Ando,M. E., Harkins, R. N., Francke, U., Fried, V. A., Ullrich, A.& Williams, L. T. (1986) Nature (London) 323, 226-232.

5. Betsholtz, C., Johnsson, A., Heldin, C.-H., Westermark, B.,Lind, P., Urdea, M. S., Eddy, R., Shows, T. B., Philpott, K.,Mellor, A. L., Knott, T. J. & Scott, J. (1986) Nature (London)320, 695-699.

6. Bartram, C. R., de Klein, A., Hagemeijer, A., Grosveld, G.,Heisterkamp, N. & Groffen, J. (1984) Blood 63, 223-225.

7. Huang, J. S., Huang, S. S. & Deuel, T. F. (1984) Cell 39,79-87.

8. Waterfield, M. D., Scrace, G. T., Whittle, N., Stroobant, P.,Johnsson, A., Wasteson, A., Westermark, B., Heldin, C.-H.,Huang, J. S. & Deuel, T. F. (1983) Nature (London) 304,35-39.

9. Doolittle, R. F., Hunkapiller, M. W., Hood, L. E., Devare,S. G., Robbins, K. C., Aaronson, S. S. & Antoniades, H. N.(1983) Science 221, 275-277.

10. Collins, T., Ginsburg, D., Boss, J. M., Orkin, S. H. & Pober,J. S. (1985) Nature (London) 316, 748-750.

11. Collins, T., Pober, J. S., Gimbrone, M. A., Hammacher, A.,Betsholtz, C., Westermark, B. & Heldin, C.-H. (1987) Am. J.Pathol. 126, 7-12.

12. Collins, T., Bonthron, D. T. & Orkin, S. H. (1987) Nature(London) 328, 621-624.

13. Tong, B. D., Auer, D. E., Jaye, M., Kaplow, J. M., Ricca, G.,McConathy, E., Drohan, W. & Deuel, T. F. (1987) Nature(London) 328, 619-621.

14. Frischauf, A.-M., Lehrach, H., Poustka, A. & Murray, N.(1983) J. Mol. Biol. 170, 827-842.

15. Biggin, M. D., Gibson, T. J. & Hong, G. F. (1983) Proc. Natl.Acad. Sci. USA 80, 3963-3965.

16. Chen, E. Y. & Seeburg, P. H. (1985) DNA 4, 165-170.17. Mizusawa, S., Nishimura, S. & Seela, F. (1986) Nucleic Acids

Res. 14, 1319-1324.18. Maxam, A. M. & Gilbert, W. (1980) Methods Enzymol. 65,

499-553.19. Feinberg, A. P. & Vogelstein, B. (1983) Anal. Biochem. 132,

6-13.20. Bos, J. L., Verlaan-de-Vries, M., Jansen, A. M., Veeneman,

G. H., van Boom, J. H. & van der Eb, A. J. (1984) NucleicAcids Res. 12, 9155-9163.

21. Morton, C. C., Kirsch, 1. R., Taub, R., Orkin, S. H. &Brown, J. A. (1984) Am. J. Hum. Genet. 36, 576-585.

22. Rao, C. D., Igarashi, H., Chiu, I.-M., Robbins, K. C. &Aaronson, S. A. (1986) Proc. Natl. Acad. Sci. USA 83, 2392-2396.

23. Mount, S. M. (1982) Nucleic Acids Res. 10, 459-471.24. Bowen-Pope, D. F., Vogel, A. & Ross, R. (1984) Proc. Natl.

Acad. Sci. USA 81, 23%-2400.25. Birnstiel, M. L., Busslinger, M. & Strub, K. (1985) Cell 41,

349-359.26. Setzer, D. R., McGrogan, M., Nunberg, J. H. & Schimke,

R. T. (1980) Cell 22, 361-370.27. Shaw, G. & Kamen, R. (1986) Cell 46, 659-667.28. Majesky, M. W., Benditt, E. P. & Schwartz, S. M. (1988)

Proc. Natl. Acad. Sci. USA 85, 1524-1528.29. Yunis, J. J., Soreng, A. L. & Bowe, A. E. (1987) Oncogene 1,

59-69.30. Schomig-Spingler, M., Schmid, M., Brosi, W. & Grimm, T.

(1986) Hum. Genet. 74, 323-325.31. Van den Ouweland, A. M. W., Van Groningen, J. J. M.,

Schalken, J. A., Van Neck, H. W., Bloemers, H. P. J. & Vande Ven, W. J. M. (1987) Nucleic Acids Res. 15, 959-970.

32. Bird, A. P. (1986) Nature (London) 321, 209-213.33. Ishii, S., Xu, Y.-H., Stratton, R. H., Roe, B. A., Merlino,

G. T. & Pastan, 1. (1985) Proc. Natl. Acad. Sci. USA 82,4920-4924.

34. Reynolds, G. A., Basu, S. K., Osborne, T. F., Chin, D. J.,Gil, G., Brown, M. S., Goldstein, J. L. & Luskey, K. L.(1984) Cell 38, 275-285.

35. Watt, R., Nishikura, K., Sorrentino, J., Ar-Rushdi, A., Croce,C. & Rovera, G. (1983) Proc. Natl. Acad. Sci. USA 80,6307-6311.

36. Battey, J., Moulding, C., Taub, R., Murphy, W., Stewart, T.,Potter, H., Lenoir, G. & Leder, P. (1983) Cell 34, 779-787.

37. Kozak, M. (1986) Cell 47, 481-483.38. Jacks, T. & Varmus, H. E. (1985) Science 230, 1237-1242.39. Hunt, T. (1985) Nature (London) 316, 580-581.40. Fink, G. R. (1986) Cell 45, 155-156.41. Libby, P. (1987) Clin. Res. 35, 297A (abstr.).42. Papayannopoulou, Th., Raines, E., Collins, S., Nakamoto, B.,

Tweeddale, M. & Ross, R. (1987) J. Clin. Invest. 79, 859-866.43. Alitalo, R., Andersson, L. C., Betsholtz, C., Nilsson, K.,

Westermark, B., Heldin, C.-H. & Alitalo, K. (1987) EMBO J.6, 1213-1218.

Proc. Natl. Acad. Sci. USA 85 (1988)