
Leucokinin 1 MVVNKSLLLILLQVFAGFFLTCSGQGGCNGACNNMETFEDQSLIDSEPIEEKRKFSPWAGKRGGPVLPSYIIRTRTNSKPNVITHKKSRFSPWHGKRSAAGISDPELSDNLLERLLLQYTDSDIGRLQKLLEKRGFNPWAGK*

Leucokinin 2 MQRNKRILLIVVQVVAGLLVSCSAQIGCQGSTDCHNGGDEQSLINSESTEDKRKFNPWGGKRAGPVFSVYDESQSDGTSEYSAESKRSFSPWAGKRTAQSSLGYVVRARDARADRKKTRFHPWHGKRSGSPLGSEVPENLLRQLVFEYIDLDSNKLSDKRGFNPWAGK*

Luqin or cardioexcitatory peptide MKISELILSITAVLLIALTIADGSPAPKWRPQGRFGKRLSELNDDPLWVLLSSESEKREDILPYGVANKPVHFENLLCVPVGVKNAYKCTRPES*

Myomodulin MNLTLTLICVLLCLQQLRQGACEDNESNNNNNNAVETGATPALRERRAVGMLRLGRGVQMLRLGKRAPYDDLKTIVATILGRQEQQFNRQAPLPRYGKEEGLLVDAYPADSPSQVSQILRRSYPSYFEDEDLLHQEAPLPHLGYLQKRSAEIRHAPLPRYGKGPDYEDLTNEVSSEEGGDDHEDENVSDTVSSEMFERQPPLPRYGKDEIAGIDCEAYDESGNCLRFEDIEKKDVRMLRMGRQVNMLRMGKRALSMLRMGRNGGDKRAVSMLRLGRSDNVEDSKRALAMLRLGRSGEKRAVSMLRLGRSGLDEESMPSEEQKRALAMLRLGRSGSEEQKRAVSMLRLGRSGEGDQKRAVSMLRLGRSGPDGQKRAVSMLRLGRNFPVSDAEKRAVSMLRLGRSGSDEEKRAVSMLRLGRSAEEEAEKRAVSMLRLGRSGADKRAVSMLRLGRRNQGADDEKRAVSMLRLGRSGPESDESKRAVSMLRLGRSGPETDESKRAVSMLRLGRSGPETEESKRAVSMLRLGRSDEKFGADKRAVSMLRLGRSDKNTDADKRAVSMLRLGRSAEEEAEKRAVSMLRLGRSGADKRAVSMLRLGRRNQGADEEKRAVSMLRLGRSGPESDESKRAVSMLRLGRSGPETDESKRAVSMLRLGRSG……

NKY 1 MQNLTSHIVIFALCCIGLTISGDLWQGNRPHADKNLLSLITRATAARDNALMMPPSGYQGRLPSYYDRPLSKREPLWIWMPAQGYVPVPRTSNINNSDGSGSSVIRYG*

NKY 2 MAKVVFFMLLSAMVAILSPFCRASSEQMNQPQAVASFRTDHEKEALASLLLHVLIQRSIAPVYSSHPHWSNLASKAEMPSQLKKKDTRYRYRGIDSRVPAFGSFFSPSPSDNSDTSKIFRYG*

NKY 3 MTVNAVHVLCIFALLFACAHSLPKRTDHASTLRYLQQSGLSDSDSRALLQAYLLGKLSNGDGSIGKELETSEYPTIKRKAFWRPMGYLPFENHVGSGASSSNDNAAGTGSASAVFRYG*

Characterization of the neuropeptidome of the cuttlefish Sepia officinalis: identification of neuropeptides and neurohormones involved in the regulation of egg-laying.

Céline Zatylny-Gaudin 1,2, Valérie Cornet 1,2, Alexandre Leduc 1,2, Bruno Zanuttini 3, Erwan Corre 4, Gildas Le Corguillé 4, Benoît Bernay 5, Alexandra Kraut 6 and Joël Henry 1,2,5

Like many other cephalopods, the cuttlefish Sepia officinalis exhibits a wide variety of behaviors such as prey capture, communication, camouflage and reproduction, thanks to a complex central nervous system (CNS) divided into several functional lobes that express a wide range of neuropeptides. The diversity of these neuropeptides is crucial for modulating the behaviors and physiological mechanisms associated with the main stages of the life cycle. This work focuses on the neuropeptidome expressed during egg-laying, the last step of reproduction. We first identified neuropeptide transcripts through de novo construction of the CNS transcriptome using an RNA-seq approach (Illumina sequencing). We then complemented the in silico analysis of the transcriptome by characterizing and tissue-mapping neuropeptides by mass spectrometry. To identify neuropeptides involved in the egg-laying process, we determined (1) the neuropeptide contents of the neurohemal area, haemolymph (blood) and nerve endings in mature females, and (2) the expression levels of these peptides. Among the 38 neuropeptide families identified from 54 transcripts, 30 were described for the first time in Sepia officinalis, 5 were described for the first time in the animal kingdom, and 14 were strongly over-expressed in egg-laying females as compared to mature males. Mass spectrometry screening of haemolymph and nerve ending contents allowed us to clarify the functional status of many neuropeptides as neuromodulators and/or neurohormones. Beyond the data concerning egg-laying regulation in cephalopods, this work provides new structural and expression data on the neuropeptidome of S. officinalis.

1 Normandy University, Caen, France. E-mail: [email protected]
2 Université de Caen Basse-Normandie, UMR BOREA MNHN, UPMC, UCBN, CNRS-7208, IRD-207, F-14032 Caen, France
3 Normandy University, GREYC, UMR CNRS 6072, F-14032 Caen, France
4 UPMC, CNRS, FR2424, ABiMS, Station Biologique, 29680, Roscoff, France
5 Post Genomic platform PROTEOGEN, Université de Caen Basse-Normandie, SF ICORE 4206, F-14032 Caen, France
6 Proteomics platform EDyP / BGE / iRTSV / CEA / INSERM U1038 / UJF, F-38054 Grenoble, France

Sepia officinalis

Phylum: Mollusca
Class: Cephalopoda
Order: Sepiida
Family: Sepiidae
Genus: Sepia

In silico data mining using the homemade software PEPTRAQ

38 neuropeptide families expressed by 58 protein precursors

5 novel neuropeptide families not yet described in the animal kingdom

14 neuropeptide families overexpressed in egg-laying females

8 neuropeptide families detected in the nerve endings of female accessory sex glands

Large stocks of neuropeptide mRNAs in the ovary and female accessory sex glands
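As a rough illustration of what in silico mining of translated transcripts can look like, the sketch below counts candidate amidation sites (a glycine followed by a dibasic convertase site) in each protein and keeps those that carry several of them. This is not PEPTRAQ's actual algorithm, which is not described here; the function names, the `min_sites` threshold and the motif itself are assumptions made for the example. The two test sequences are the Cerebrin precursor and an excerpt of the APGWamide precursor from this poster.

```python
import re

# Hypothetical screening motif: Gly (amide donor) followed by a dibasic site.
AMIDATION_SITE = re.compile(r"G(?:KR|KK|RR|RK)")


def count_amidation_sites(protein):
    """Count non-overlapping 'G + dibasic' motifs in a protein sequence."""
    return len(AMIDATION_SITE.findall(protein))


def screen(proteins, min_sites=3):
    """Return {name: site count} for proteins with at least min_sites motifs."""
    hits = {}
    for name, seq in proteins.items():
        n = count_amidation_sites(seq)
        if n >= min_sites:
            hits[name] = n
    return hits


proteins = {
    # Excerpt of the APGWamide precursor (C-terminal end).
    "APGWamide_excerpt": "KRAPGWGKRAPGWGKRDTEKRAPGWGKRDTQGHNIAATN",
    # Full Cerebrin precursor.
    "Cerebrin": "MQKAIIIFAALTLFSTAFVTDVFASQEEISEIVSLASRILKIAMLMDKPRRIDKRNHGLIDTIINLPDLDKIGKK",
}

print(screen(proteins))
```

A screen of this kind favours precursors with many tandem copies and would miss single-copy precursors such as Cerebrin, so it can only serve as a first filter.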

Achatin 2 MVKVTSVCLCVLIGLVVLFDSWTDASCAKPCLINFFKCVRGENDGCCSVYGGCMKESCGSATVQCDDSMGKRGSWNKRGSWNKRGSWDKRGSWNKRDAAEEKRGSWNKRAEDIEISQRGSWNKRADANAEDYSEAILRRLLLENYGARL*

Allatostatin A1 or buccalin 1 MTSVGVWSRALLCSLFISVHMVDCADSLKTNLSDEQILSDSSDNEEHKVEKKRSADPTLFTDTLKKRNVPERILTNGKDIHVISRGMDPMMFGHLGKRPDPQMFDNINQRMDPFMFGNLGKRFDPMLYGNLGKREHSIALGPLNKKMDPMMFGGLGKKMDPLMFGGLGKKMDPMMFGGLGKKMDPMMF……

Allatostatin A2 or buccalin 2 MTSVGVWSRALLCSLFISVHMVDCADSLKTNLSDEQILSDSSDNEEHKVEKKRSADPTLFTDTLKKRNVPERILTNGKDIHVISRGMDPMMFGHLGKRPDPQMFDNINQRMDPFMFGNLGKRFDPMLYGNLGKREHSIALGPLNKKMDPMMFGGLGKKMDPMMFGSLGKRDVSEDTLDESTD*

Allatotropin MNVSGARGLCTLCCLGLLVLLASSDAHASVVVPNRPQRGFKDNVSNRIAHGFGKRTFQDTYDLAPSLDDSKNLITPRKLAELIMYDRNLAYIVALKLDSNGDGVISMNELILKDYV*

Cholecystokinin MNHHFVEIGMSFALLAIIVTHLTTAAPSPYVESDMTHSLNNYKQQLMNALRKRSEDEATNHVLTASARHFMSSAFHKKRSGLAAQKKRDLDLAGRDDDASKRRGAWYYDYGLGGGRFGKRKDYGYTDDYGIGGGRFGRDVDHVDLLDA*

FaRPs MRCWSPCSLLVVIAIYCLSSHTSEAFDLAQACVESQRLSLLPICDTIFAVQQEGAQQSADDGLRSKRFIRFGRALSGDAFLRFGKNVPDLPFEDKRFLRFGRAAPQLDDLLKQALQRVESLQKSDDTSVRRKRSTDAAPQSNTDSAEQKNDSAKITKRYVDDVEDSDVKRFMRFGKRFMRFGRNPSDVGSKLTEKRFMRFGRDPEKRFMRFGKSDDKRFMRFGRNPGDAEDELEEDKRFMRFGRGDEEDEEEAEKRFMRFGRDPEKKFMRFGKNGEEKRFMRFGRNPEEPEADKRFMRFGRGGEEDDVNTEEKRFMRFGRSAEKCKGCLEG*

FLGamide MAFSQILVLLLGVSYVMTAPKSAADEAPVDLKKRGAESGEAHVFDSLGGGHVPYYKRYLEDNDVKRVFDTLGGGHVPYYKRSFDSLGGGAFLGGKRSFDSLGGGAFLGGKRSFDSLGGGAFLGGKRSFDSLGGGAFLGGKRSFDSLGGGAFLGGKRSFDSLGGGAFL……… ……KRSFDSLGGGSFLGGKRSFDSLGGGAFLGGKRSFDSLGGGAFLGGKRTFDSLGGGSFLGGKRSFDSLGGGAFLGGKRSFDSLGGGAFLGGKRGFDSLGGGSFIGVKRIHELDNMDGGSGDN* ……KRSFDSLGGGSFLGGKRSFDSLGGGAFLGGKRSFDSLGGGAFLGGKRTFDSLGGGSFLGGKRSFDSLGGGAFLGGKRSFDSLGGGAFLGGKRGFDSLGGGSFIGVKRIHELDNMDGGSGHNPPCCSGTS*

FVRIamide MVSFWKILILGTIGVLVLMTQWASFVRAESPSNGEDLVNAAGAAVESADEPSGRSVSDSPYDIKRTNQFLRIGRGSHFIRIGRGGASSFLRIGRNPLSQFVRIGKAPSSMFLRIGKSSAAGNPELGDLAAGPSSLGDNSIDEDEVLKRASSFLRIGRSNPSTFLRIGKSAGNLDEETAANEDIVTDDIDVPSESMEKRANAFLRIGKIPASSFVRIGRGPYGIDNRSNPRGFLSVGSRFVRIGKREAIPSETGPTHARLLPNLHDQAQ*

GnRH MSTSALSSNLRKMAFLTCAILLLSFCMQIQAQNYHFSNGWHPGGKRSGLPDMQCHFRPQTKALIEKLLDEEITRIITTCTNTVNDIADLQ*

Insulin MKSSTVCMGTFLLATLLSIVNWQVVNAGLEHTCNEETIRQGPAQGAHCGVEIPNILQLLCAPAGYNERMSDRQRRNLVPTSRAIGRRGNGLRDIIISKRQAKSYLTKRDRNWTGIVCECCYNKCILEELLDYCKDPSYFKSQKLRS*

LASGLXamide MWKLGIFLLGIWLVPQSVLCKDHIEKRSLDPLASGLIGKKRDIDDEKEEIDQKSKRQFDHLASGLIGKRPF…………

…………SLASGLIGKRPFDHLASGLIGKRPFDPLASGLIGKRPFDPLASGLIGKRPFDSLASGLIGKRPFDHLASGLIGKRPFDSLASGLIGKRPFDSLASGLIGKRPFDHLASGLIGKRPFDPLASGLIGKRSFDPLASGLIGKKSFDPLASGLIGKRR*

……………RPFDSLASGLIGKRELDTLASGLIGKRDNDSEMEDKRSFDPLASGLIGKKSFDPLASGLIGKRR*

LFRFamide METKVMSLLATVLTVFIVQINCEDLHKIQTDTSGISNFIGLPDGEEGELVRSPIVDESALGIDDVDKRNSLFRFGKRGNLFRFGKRGNLFRFGKRGNLFRFGRGGNKDDPENEGLKRTIFRFGKRDGLEDLYDYEDPSVQQVAPTAGDKRGSFFRYGRSRTFFRYGRSTDKNAEKRPHTPFRFGREEE*

Neuropeptide Y3 (NPF3 or NPY3) MQKSFFVILLIAVMFTGQVFSQEGLLAAPKIPGELSEYLKALSDYYAIAARPRFGRSLKQRTSYRTLADDA*

Neuropeptide L11 or elevenin MLKLRSSTFQKFLIWTFVLLLLNLHVNAQNKFELTKKLNCRKFIFAPRCRGVAAKRSLNMATSYPPVNTQMADERNYITGNHDSTIREILLNYILSRLEARSLFQNDANTDSAEDLTSYR*

Pedal Peptide 2 MISFHNRLFVVILSLLGQNLLISTYADIQGTSHVSYVHENPEEKRQLDSIGSGLIKKKNIDSIGSGLIKKNLDSIGAGLIKKNLDSIGAGLIKKNLDSIGSGLVRRNFDSIGSGLIKKNFDSIGSGLIKKNLDLIGSGLVRRNFDLIGSGLIKKNFDTVGSGLVRRNFDSVGSGLIKKNLDAIGSGLVRRNFDSIGSGLIKKNLDRIGSGLVRRNFDSIGSGLIKKNLDSIGSGLVRRNFDSIGSGLIKKNLDSIGSG……………………………DFDSIGSGLIKKNLDSIGSGLVRRNFDSIGSGLIKKNLNLDSIGSGLVDSVGAGLIKKNLDTIGMGLIRRQYENGMEMGDDNGYEGDKRHLDYIGSGLIKRSM*

Pro-Sepiatocin MGSGRFLFSSTKCQVACVLFNFCVFLICTTDACFFRNCPPGGKRAVAMNDGVAHKQCMACGPEGKGRCAGPNICCQKEGCIIGDMAKECMQEDEGTTVCEVKGIPCGAEGQGRCVAAGVCCDTSACSTNSHCGSALPRTSSRRQELFSLLKRLINKVN*

PRQFVamide MRIYWPMYVVALLCASGQDFNAAEPDAVKNLVKDTVLKPLADSSYLNLPDEHTQDNALSEIVSGTEDGDIPEPLIGYSEEDLPKRPMEFLGKRAMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLG…………………

MRIYWPMYVVALLCASGQGQGKGKNAQHFNAAEPDAVKNLVKDTVLKPLADSSYLNLPDEHTQDNALSEIVSGTEDGDIPEPLIGYSEEDLPKRPMEFLGKRAMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLG……

……………………RPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRSDESEDKRRMEFLGKRRMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRGMEFLGKRPMEFLGKRRMEFLGKRPMEFLGKTSLNSLDPRPWHNSSETLAGQADREHHDFLDDSLNALEPDRKHHFLDEMIDKGALGRYLQEDMSILDQGDKPLPEGHDINKVSDEEIPTRMHLGSNPVSVKHKREIPIFVGKREQSISLNKRNVDPVLEEMLTIKPSERDREKENIMSNSVREILKNIQMNEKTDNQKADTDFEGTAGESDFHDEQFGEKPSIYEKKAYQMFVGKKNKPMFVGKKNKPMFVGKKNNPMFVGKRDNPMFVGKKTNPMFEEEMANPMFAGKRDDPMFVGKRNNLMIFDEKARPVDVTKKAKPLFVGKKEKIGPENKKDSMFPLNSNLILAEKKDPMFVGRKEDPILFSINGNPLLMDDTKNPTFTDENDKPVSVSKKDDPMFVGKKSKPMFVGKKNHPISISKKYSPRFVSKKEFPLVVNRERISLVGDNRNTPIFVSKVYSKKVLGKKNNPAFIVANEDPMSVDKKYLKVSTESRNPTNTNGYKIPLAEKDYPQPSLHDYGVFSSSPIYVDHSLMKNKKNKPMFVGKRKLPRLSTNWNESMTFKDHFDEPSPTHKRFSKRSIMLYRLRTLPPHHRRLNYLDVLHFILAREELIKKDGMKWHNQQTPIFVGKREDNPMFV………

………GKKSKPMFVGKKSKPMFVGKKSKPMFVGKKNNPMFVGKKSKPMFVGKKNNPMFVGKKSNPIFLDNPMLTGKRSKPMFVGKKNNLMFLALKDNPRFVGKKSKPMFVGKKNHPISISKKYSPRFVSKKEFPLVVNRERISLVGDNRNTPIFVSKVYSKKVLGKKNNPAFIVANEDPMSVDKKYLKVSTESRNPTNTNGYKIPLAEKDYPQPSLHDYGVFSSSPIYVDHSLMKNKKNKPMFVGKRKLPRLSTNWNESMTFKDHFDEPSPTHKRFSKRSIMLYRLRTLPPHHRRLNYLDVLHFILAREELIKKDGMKWHNQQTPIFVGKREDNPMFV… …

……KREFQPMLMGKRDDNPMFVGKRDLQPMFVGKRDDNPMFVGRRVDNPIFLGNRGLQPMFAVEKELQPMLVGKRDDNPMFLRTGVLREIGKRDNKPMFVG……

……KRDDNPMFIGKRDPSPMFLGKRTDNPMISSKAGLQPMFVGRRMDNPMFVGKRNPQVMFVGKRVHNPMFVGKRDPQPIFVGKKNNKPMFVGKRENNQMFAGKRKDHQIFVGQRIAQPLTLERTYSRRMFVGERGPKIMSVRRGGDNPMFIGKKVPEPMFVGKKENKPMFVGKRDFLPMFVGKMDPQPMFMGKREDKPMFIGKREPQTMFLKISEPEPMFVGKRQTQTMFVEKRGTFPSFVEKTDTQPMLIWKRGLQPLFVQRRISMPRFFFGSYKNMKYWHKRQPIPLFVGKRETESLFRGNAERQSLFVGKKEVMPMFVGKKNNQPSAWGKHLSQPMFIRNRNVPMFVGKRGGILAPETKRLPNQSLFDVADFDSSIEEIDNSLNKIQHRRDTSGVQDRLTNDISHKELDQPIVKASFKEKHMNDNFNQGIVSNNSSQSILRNRNDLSTDGNSRQVIDFLNPNKIERNIVKRSTGKYYNQMPVSKESNVSKRSRYSEENFIGQRYLHDDAPSEILSINKRNTPLTFGKSNFPHRSQQRSNSLSLVLDQSSPHKSRFVHSEMALSGPYPADEQTHFLKRNDLPNYEFRLAAAGTKPEPMIETGQTKNDHFGKMESEDIPFGSKHQSTNKSFRANGSDTIVVASRYWKPAGVAVNVRDFRNSAVSAPASQVR*

……RPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRPMEFLGGKRSDESEDKRRMEFLGKRRMEFLGKRPMEFLGKRPMEFLGKRPMEFLGKRGMEFLGKRPMEFLGKRRMEFLGKRPMEFLG*

Samide MTRNLLVVVLVAILVSTLANGRYIADTKLRSKRQTGDLKAAAYQAWLALGRTLPPDCPEVACGVVDVEASGKKKRTDTNSVDFRRSLLVERLLRLAAEGLVNSV*

TAamide MWSSQIVVFVLTVCVCGSLCLTRTKRAVLTGDENPYDLREALRVLERERRRLELRTLDPVSQREVRFEQQPVIVDSENDDPDFLARAVAEKGDSVFYSSDQANDVENLLREIEDQKEAENEASEDDLQTVYGRPLPDTEMDVDVDNYPSVVEKRTPTKKAVSRKRVTIKGLDLTAGKRSAETLKTVFNSLSPEEIHKLLLLEGRLQTKEMNKRQRKARPLVVETRAQSDTGLEDAMEVEDAAAIPITKEELKKLIQGQGEADQIEDPNDINNFNQMNEIIQAESAPADPSLDQPVDSDIFKSALQDLASLQMLQKQENDGRSEEQEALTQWGPAAMTKVVPVLLRPVPVLTQSRTAPVRVDEGRKMVKGIAAKPEPSLTTTPSLAEMMAKLWIKHKLEGIEIDYLADALNAATLTQSGVSPGVNNIPVEISSLQKAIRIEKLMKELGGNDADIEEDGMLDQLALKYLQRHQLQAEDSGIALSKKSFFDDEIEAEARKEKEEEEEESAAAAAAAAARQQQEEEEEEEEEEEEYRDNLLRGDVNRQIPLLYEIPDNKINQIPSLPRPDQQNRIPYQLMDIPDDSDREEDKESDEIPLSPDIRLLNTEDFY*

Tachykinin MHTVRVSTFQLLNCVCYVVLISGGLGNTAWGVLASYLPQTPESQQDKTAQVDESPVEKGFIDNNDILKSLSKLHMLAIGEPNDASSDPSSGLVLPIYEQEQEETADLVDDIAGDESPDEMKRFSPYAFQGSRGKKMMTSIEDKKAHASLGFVGSRGKRQPASLGFVGSRGKKQLNFIPSRGRKQMSTAFVGSRGRRISAEAFAPSRGRRLSSQAFFGSRGKKYSALGFMGSRGKKDSDFENLLLKNEDGWTQDNYNRRAAPFYHGFVASRGKRSAETNSA*

Achatin 1 MAGQCKCLSSLVVILLITLSARGGFSNYVQNYNSRQLLSLHSFNDDQMNKDRLTALEESFKSDGAGKRGFGDKRSIHVPHIIVETRGFGDKRDEKNDESIDTGKFRPIPRGFGDKRGFGDKRGFGDKKSLAQMDNYEPKWIHLVNRGFGDKRGFGDKRYFLDRRGFGDKRGFGDKRGFGDKRGFGDKRGFGDKRAFDDDTGFDDKRGFGDK*

Allatostatin A3 MNAYILSAILLFVNAQHIKCDSKNLADFYPKLRDEYVKPYMQLDGGKYFPWKLLDLVPSREKFSRTAFPIIAANKKIDPMLFKLGIGKRNSGSVDPFLFKIGLGKKSVRRVGEDGEQMWQMKDRRKGIESFQNGFGTEARPHVFLSSKNKEDINAENADIFSDVNM*

APGWamide MKNQELFCKGTSLCLSRHSLLIVLLLVLPVFSTNSGQSSRNPADNEASSLVQESISRLLEASVDGSLSDEDDSDDDDYGSLTNLDDRTVKRAPGWGKRAPGWGKRAPGWGKRAPGWGKRAPGWGKRTSPKLDSDEIATMLFSLENADNGEDTNAFEKRAPGWGKRAPGWGKRTVMNEDLASFLASLNAAETAEASSDTDKRAPGWGKRAPGWGKRAPGWGKRAPGWGKRDQEKRAPGWGKRAPGWGKRDTEKRAPGWGKRDTQGHNIAATN*

Bursicon A MMERNSNLAMYNSQFVTTCLFIFVLLHPIASIRRFSTSPSLGIISEGDVFDDTNQWCHLRKFIHTINFRRCLPQNFRSYVCSGKCHSYTGADPNITTYGNSEMSLMLRACRCCQKLSVMSTRVVLKCRDDETGIYRRMQVKLQVPTRCICRHCSI*

Bursicon B MHLIPLASLLMFAIRHVSSQDCETLGSEIRVNKFLNVEHHGRQMNVRCGVQLSLNKCEGTCWSYESPSVIDPRGFRKKCNCCRERELVDRSVVLDECHDAVSGLIVTGLHPVIVIKEPMNCECRECANFI*

Cerebrin MQKAIIIFAALTLFSTAFVTDVFASQEEISEIVSLASRILKIAMLMDKPRRIDKRNHGLIDTIINLPDLDKIGKK*

Clionin MVKPTAFFTVVLVIAAVFMKPANLYPTPSVPLEEMDPRDACLFQCSNCFGDQYQPLLDCANDVCPKVKSFSEVGYECSTILRHPNLGRRSLFKF*

Crustacean Cardioactive Peptides (CCAPs) MQSVSSSLSTGLFFLTVNVLCVSTLIRVTSSQPQENTFQMKPSESDILNMKIRHLIRRAVQRQPTANDFSNEELASLPDKRVFCNSFGGCTNIKRLFDEENDFKLVASDSIQPMEQETAIDKRVFCNSYGGCKSFKRVSANDIKKPEMIKNKQRHLIRRLKLQPIKISRRVFCNSFGGCQN*

FFamide MSGRLAFHSSLVLLICFALFFGQDSAVEAVYAPTRGQQNPHSYGRRGLNPNVNSLFFGKRAGAEQEALSNTDMGRKCMAAMSMCNMYFESNNVNES*

GGNamide MGRHFILFVLTSALLFCISNAVKCKWYDHICLGGNGKRSAIGQSEQDQQLLKILTQEFIRQPHKRDSFITDVNDDDDDDAIFPHGKPFDQDYGQTNLSPTIKRLAKLMLPDERQN*

Neuropeptide Y1 (NPY1 or NPF1) MQKATIILLLLVAMFSADAYSQNNGGAAPQSPEELTNYLKALNEYYAIVARPRFGRSIIQKRSFLGSLADAA*

Neuropeptide Y2 (NPY2 or NPF2) MLSPMLTIFLIAVMLAANVSGQNGLLGPPNRPGDLKNPGVLNNYLKALNEYQDALSRPRFGRSSSKRFSFNEFANNLPERIE*

Neuropeptide Y4 (NPY4 or NPF4) MRKSFVIVFVIAVVLVIQISSQEIMLSPPSRPAEFRNPKELREYMKALNEYYAIVGRPRFGRSIFNKRFGSNSFNEDLKSDENKE*

Neuropeptide Y5 (NPY5 or NPF5) MQKIVIASLLVVLFTLNVSSQDSLLAPPSRPSEFRTPEELRQYLKALNEYYAIVGRPRFGRSAVNRFTRTAIAKARTDP*

NdWFamide-like MKPACICLIVILAASIFQTNANYYGKRDEKLDGFREFLSKRMGELNEQEIEARDVLRTISSLVQAWEARQRKYDTLQKAA*

Orcokinin B MRYWFCACFVLLQNSLISVSGVEKVNKKSHHSENEATLAHHGGNDGHGSTISEKRSFDSIDGGMFRTMGKRPFDSISDSAFGGMGKRQFDSISHSSFRQMGKRSFDSIDGSAFGGMGKKSFDSIASSGFGGMGKRPFDSIDSSAFGGMGKKSFDSIASSGFGGMGK……

PKYMDT (Proctolin) MDSRLLAFVSVCLFLLTSPVFSAPAADPKPHLEQSKDLPISKRPKYMDTREPQDIFKDLVFLTLQQLVSDGKVNPEAITDTDAGVPNKRGYQGLCLRRTANQRYIAYPCWRTGSK*

PTSP-like Peptide MASIFSHFLVIMILALTQTRRLIAEEKNDIGSSKSLKPESVVKSENLKSWSKRSTTNGKALNAIRQAVARGAFSGPANPLDSSDERYWNLMMLWLKENGYPSTTVNAGGRRTGLRSRVARETDGTDEELLEKKDRPDTWNSMNTWGKRSPNTWDSMAAWGKRNPNTWDSMAAWGKRNGD………

……NPDTWDSMSAWGKRSPDTWDSMSAWGKRGADTWDSMSAWGKRGADTWDSMSAWGKRNGDSKDKRDWDSLQAWGKRANAKNKKDWDSLAAWGKRDIAGDGNDEIGSQVQSLMKRSSGKSSKSRR* 

……KRNPNTWDSMAAWGKRNPNTWDSMAAWGKRNGDTWDSMSAWGKRNPDTWDSMSAWGKRGADTWDSMSAWGKRGADTWDSMSAWGKRNGDSKDKRDWDSLQAWGKRANAKNKKDWDSLAAWGKRDIAGDGNDEIGSQVQSLMKRSSGKSSKSRR* 

PXXXamide MTRNLLVVVLVAILVSTLANGRYIADTKLSSYKRTSSDQRIAELQALIALSNTIGHGQVNPEEIGKKKRTDTNSVDFRRSLLVERLLRLAAEGLVNSV*

Small Cardioactive Peptide (SCP) MFSQNLSVLAFSVCILLTMANTSYGYLVLPRQGRSDDRAEPSCCGMPLMKATGLCPIGMECCPGLKKVLQKSGQKTVYSICIADLY*

Sepiatocin MASYRWGSWALLLLIVVLPLVSLVEGCFWTTCPIGGKRSASEFRECMACGPEGKGRCAGPNICCQKEGCIIGDMAKECMQEDEGTTVCEVKGIPCGAEGQGRCVAAGVCCDTSACSTNSHCGSALPRTSSRRQELFSLLKRLINKVN* 

SPamide 1 MTRVVLLLLMFIQLVQIRANYPFLEQVEKVEEKNLENLLLRMLMERTNKFGRPVKRTCQIEATGECRSEEAAEVADKYHYLLSSKSPGRKRNLFS*

SPamide 2 MAPLQYILPLLLVLPIIAAWNPVLRSNEINSIRRAALLKHNEDSSDFGVYQRNGRDASDLQRAFSDYLKSSVEDSKSAWSDPCRLNLGGRCATEIASDLVKAWHYLNSSNSPGRKRRDVREALRTILRHSAAAAAAAAADNR*

Urotensin II MDSQLQTKQFLTLFCFCLCFVAIAKAMPAPQDPSQEEILAKRWLYRMLDREGRATYPLNLAALRELESNLRLGMSNVKRGSNVPSRASRGGMGLCLWKVCPTAPWMRST*

Transcripto-peptidomic strategy

neurohormones neuromodulators

XXXXXX: Signal peptide
RR KK RK KR: Convertase cleavage sites
Q: Predicted N-terminal pyroglutamic acid
G: Predicted C-terminal amidation
PMEFLG: C-terminal amidated neuropeptide detected by mass spectrometry
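The legend rules can be applied mechanically to a precursor region. The sketch below is a minimal illustration, not the authors' pipeline: it splits a sequence at the four dibasic convertase sites listed in the legend, treats a C-terminal G as an amidation signal, and flags an N-terminal Q as a potential pyroglutamate. The function name `predict_peptides` and the output fields are assumptions made for the example; signal peptides, monobasic sites and runs of three or more basic residues are deliberately not handled. The input is an excerpt from the C-terminal end of the APGWamide precursor shown on this poster.

```python
import re

# Dibasic convertase cleavage sites named in the legend.
DIBASIC = re.compile(r"KR|KK|RR|RK")


def predict_peptides(region):
    """Split a precursor region at dibasic sites and annotate each fragment."""
    peptides = []
    for fragment in DIBASIC.split(region):
        if not fragment:
            continue  # empty piece when the region starts or ends with a site
        # A C-terminal Gly is read as the amide donor and removed.
        amidated = len(fragment) > 1 and fragment.endswith("G")
        core = fragment[:-1] if amidated else fragment
        peptides.append({
            "sequence": core,
            "amidated": amidated,
            # An N-terminal Gln can cyclise to pyroglutamic acid.
            "pyroglutamate": core.startswith("Q"),
        })
    return peptides


# Excerpt of the APGWamide precursor (hypothetical choice of window).
excerpt = "KRDQEKRAPGWGKRAPGWGKRDTEKRAPGWGKRDTQGHNIAATN"

for peptide in predict_peptides(excerpt):
    suffix = "-NH2" if peptide["amidated"] else ""
    print(peptide["sequence"] + suffix)
```

On this excerpt the three APGWG fragments are reported as amidated APGW, while the spacer fragments are returned unmodified; deciding which fragments are genuine neuropeptides still requires the mass spectrometry evidence described in the poster.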

Subesophageal mass
Supraesophageal mass
Optic lobes
Optic glands

Previtellogenic follicles
Vitellogenic follicles
Mature oocytes

Oviduct gland (OG)
Accessory nidamental glands
Main nidamental glands

Ventral view of egg-laying female

16 transcriptomes

Posterior salivary glands

CNS (M/F)

Ovary

ASGs (F)

(M/F)

RNAseq de novo

- Allatostatin 2 and 3
- Crustacean Cardioactive Peptides (CCAPs)
- FaRPs
- FLGamide
- LFRFamide
- Myomodulin
- PTSP-like peptide neurotransmitter
- Small Cardioactive Peptide

- Allatostatin 3
- FaRPs
- FLGamide
- Myomodulin

Oviduct Gland

Main Nidamental Glands

Ovarian stroma
- FLGamide

Neuropeptides detected in the nerve endings of ASGs of egg-laying females

Neuropeptides overexpressed in the subesophageal mass of egg-laying females versus mature males

Neuropeptide mRNAs recovered in ovary and ASGs, expressed as a ratio of CNS mRNAs