comparison test-data/AASequences.fasta @ 25:ea9ae5b25ce1 draft default tip
planemo upload for repository https://github.com/iracooke/protk-galaxytools/blob/master/omssa/.shed.yml commit 24e0fef2496984648a8a5cd5bff4d6b9b634a302-dirty
| author | iracooke |
|---|---|
| date | Tue, 20 Oct 2015 20:31:23 -0400 |
| parents | |
| children | |
| 24:11db804a50b9 | 25:ea9ae5b25ce1 |
|---|---|
| 1 >tr|O70238|O70238_MOUSE Homeobox protein PSX OS=Mus musculus GN=Rhox6 PE=2 SV=2 | |
| 2 METPQDSRQSIQKPPSPAAEEDKEEQPGGNAVVSGAPEERIDKKELVLNWLAQGEFDQGE | |
| 3 GAQGEVAGGEQAQEEPAPLSPAQEATGGEEEGENKEGEMEGRHAGDGASSSEDDSILEEG | |
| 4 GENIDQQPPQQEAASPDSIRNPHVLNRLAQLRYRRTRFTHSQLHDLERLFQETRYPSLRA | |
| 5 RRDLARWMGVDECDVQNWFRMRRALFQRNRRVLMFCELPPLPQSDSP | |
| 6 >sp|P20269|HM05_CAEEL Homeobox protein ceh-5 OS=Caenorhabditis elegans GN=ceh-5 PE=2 SV=4 | |
| 7 MPSADTEFIRVIRIKSANGSEKMLEIPAKLDLERPKRPRTVFTDEQLEKLEESFNTSGYL | |
| 8 SGSTRAKLAESLGLSDNQVKVWFQNRRTKQKKIDSRDPIKPETLKPAENYQNVYQNYQNY | |
| 9 WTAAAFLSNNVISS | |
| 10 >sp|P34663|HM23_CAEEL Homeobox protein ceh-23 OS=Caenorhabditis elegans GN=ceh-23 PE=2 SV=1 | |
| 11 MDTHLPFQTLPVSTPLPVSSSSLTDVLQTIAALQACPTSCIPSTSTGMLSPNLPFSATIP | |
| 12 RVNLFPPSQPANSLILPTIPAQPFIPNPSLLQANPSAVEALANALFATTSRRASCPEPPA | |
| 13 SSQATVTLQVPSTGSPERRRYSETNMEVLLREQLAQLMPPTSQLPGMPGCYYQHVPAAGT | |
| 14 SGIQGSLDAALMGAVPLAMNSMAHSRRAANHRKARTIYGTTQTQQLEDMFKGQMYVVGAE | |
| 15 RENLAQRLGLSPSQVRIWFQNRRSKHRRKQQEEQQSTTLEEKSEEIGKDEEEDDEEDEDD | |
| 16 VKVLN | |
| 17 >sp|P52955|LBX1_MOUSE Transcription factor LBX1 OS=Mus musculus GN=Lbx1 PE=1 SV=2 | |
| 18 MTSKEDGKAAPGEERRRSPLDHLPPPANSNKPLTPFSIEDILNKPSVRRSYSLCGAAHLL | |
| 19 AAADKHAPGGLPLAGRALLSQTSPLCALEELASKTFKGLEVSVLQAAEGRDGMTIFGQRQ | |
| 20 TPKKRRKSRTAFTNHQIYELEKRFLYQKYLSPADRDQIAQQLGLTNAQVITWFQNRRAKL | |
| 21 KRDLEEMKADVESAKKLGPSGQMDIVALAELEQNSEASGGGGGGGCGRAKSRPGSPALPP | |
| 22 GAPQAPGGGPLQLSPASPLTDQRASSQDCSEDEEDEEIDVDD | |
| 23 >sp|Q26604|SMOX5_SCHMA Homeobox protein SMOX-5 OS=Schistosoma mansoni GN=SMOX-5 PE=2 SV=1 | |
| 24 MTTSTMQQLKHDGDFSDELNETSTIQFYNKVSQQRKRRKTRTTFSNCQLNELENNFNRQR | |
| 25 YLTPTDRDRIAKHLGLTNTQVITWFQNRRAKLKREAEELERDVMALRKQKQQKFTCLSLS | |
| 26 DHDHEETQIDDENEQGDNNNDDDGDDNDVEEDDGEEQEKNHTKYLTQPPSISNILPSSLK | |
| 27 HFPSSTLNTLEIDNKHETLNMNLFINPFSNEKCLKRNKDLIRQQCYLFNHHINNYCTVNN | |
| 28 DNNINNNNNNNNRKNSIDGMNKGRSIKKGNKIWCPALELEQEIH | |
| 29 >tr|O77024|O77024_EPHMU EmH-3 (Fragment) OS=Ephydatia muelleri GN=EmH-3 PE=3 SV=1 | |
| 30 MDNCRGDKKPLLSTNQQSFRIDNLLTRKVIEQQQQPDHYTMYPPSKVENHDILSLTTGPS | |
| 31 HDDMISDGTEIYEQGRESTSSTSGNDAEDDLLTRRKKARTAFSREQVAELEKKFQDKKYL | |
| 32 SSAERGELAEKLKLSDMQVKTWFQNRRMKYKRQSEETEMEMKSPKY | |
| 33 >sp|O93367|TLX3_CHICK T-cell leukemia homeobox protein 3 OS=Gallus gallus GN=TLX3 PE=2 SV=1 | |
| 34 MEPAAGAQGPHQHEPISFGIDQILSGPEQDGAPPPPPPPPPPPPPPPPPPRGPDGAAFLG | |
| 35 GPRGGAPYPALPGPFPAIAAPFEESGPYGVNLSLAPGGVIRVPAHRPIPGAVPPPVPSAI | |
| 36 PAVPGLGGLSFPWMESSRRFVKERFTAAAALTPFTVTRRIGHPYQNRTPPKRKKPRTSFS | |
| 37 RVQICELEKRFHRQKYLASAERAALAKSLKMTDAQVKTWFQNRRTKWRRQTAEEREAERQ | |
| 38 QASRLMLQLQHDAFQKSLNESIQPDPLCLHNSSLFALQNLQPWEEESAKIPPVTSLV | |
| 39 >sp|P56407|HM09_CAEEL Homeobox protein ceh-9 OS=Caenorhabditis elegans GN=ceh-9 PE=4 SV=2 | |
| 40 METDLLFQLLQPYFALLTSDVKPQRRTSHLIKDILDLPTVNGEIDEFGRCKSSLDQAKES | |
| 41 PIEKCQKTKRKKARTTFSGKQVFELEKQFEAKKYLSSSDRSELAKRLDVTETQVKIWFQN | |
| 42 RRTKWKKIESEKERSGEIPDDQIVKPQ | |
| 43 >tr|Q24786|Q24786_9METZ Homeobox-containing protein (Fragment) OS=Ephydatia fluviatilis GN=prox1 PE=3 SV=1 | |
| 44 NSDEDKDRYASDLDTDRASSAGGALQMSRHKKRRPRALFSHAQVYELERRFAVQKYLTAH | |
| 45 EQSKLATVLHLTETQVKIWFQNRRYKSKRQQIEQTRVSPKVVKTSRMVRCSSGYITAI | |
| 46 >sp|O35767|NKX25_RAT Homeobox protein Nkx-2.5 OS=Rattus norvegicus GN=Nkx2-5 PE=2 SV=1 | |
| 47 MFPSPALTHTPFSVKDILNLEQQQRSLAAGDLSARLEATLAPASCMLAAFKPDGYSGPEA | |
| 48 AAPGLAELRAELGPAPSPPKCSPAFPTAPTFYPRAYGDPDPAKDPRADKKELCALQKAVE | |
| 49 LDKAETDGAERRRPRRRRKPRVLFSQAQVYELERRFKQQRYLSPAERDQLASVLKLTSTQ | |
| 50 VKIWFQNRRYKCKRQRQDQTLELLGPPPPPARRIAVPVLVRDGKPCLGDSAAYAPAYGLG | |
| 51 LNAYGYNAYPYPGYGGAACSPAYSCAAYPAAPPAAHAPAASANSNFVNFGVGDLNTVQSP | |
| 52 GMPQGNSGVSTLHGIRAW | |
| 53 >tr|Q9YH59|Q9YH59_CHICK Homeodomain protein NKx2.1 OS=Gallus gallus GN=NKx2.1 PE=2 SV=1 | |
| 54 MSMSPKHTTPFSVSDILSPWEESYKKVGMEGSNLGAPLSAYRQSQVSQPAMQQHPMGHNG | |
| 55 TVTAAYHMTAAGVPQLSHATMGGYCNGNLGNMSELPPYQDTMRNSASATGWYGTNPDPRF | |
| 56 SSISRFMAPSSGMNMGGMGGLSSLGDVSKSMAPLQSTPRRKRRVLFSQAQVYELERRFKQ | |
| 57 QKYLSAPEREHLASMIHLTPTQVKIWFQNHRYKMKRQAKDKAAHEMQQENGSCQQQQSPR | |
| 58 RVAVPVLVKDGKPCQAGSNTPTAAIQSHPQQAATTITVATNGNSLGQHQSHQTNSAGQSP | |
| 59 DMGQHSASPSSLQSQVSSLSHLNSSTSDYGTAMSCSTLLYGRTW | |
| 60 >tr|O35455|O35455_MOUSE Homeobox protein Nkx2.6 (Fragment) OS=Mus musculus GN=Nkx2-6 PE=2 SV=1 | |
| 61 VGAPGRQSWRWARILWGSHVKTPPGTISRLGARNPMTDRGVGNLSGDMRRGGPVSTRTRP | |
| 62 QRKSRVLFSQAQVLALERRFKQQRYLTAPEREHLASALQLTSTQVKIWFQNRRYKSKSQR | |
| 63 QDQNLELAGHPLAPRPGSSASTGTGRQPLPGSDVAAFLVPTKPPRPIPASVATRALPTTL | |
| 64 AMRAAAPAPAPAPGRSHHWPALASAQVAKVRLRRAICPLRLRESRPGEKPELTYCHSVPD | |
| 65 AWSPPLPAGGRGAGKHCPPY | |
| 66 >sp|P22711|TIN_DROME Muscle-specific homeobox protein tinman OS=Drosophila melanogaster GN=tin PE=2 SV=2 | |
| 67 MLQHHQQQAQSGGYYDHYTQSPSPGSLTNADALNTTPFSVKDILNMVNQTEAYEGSYGHI | |
| 68 DGAATASALFAAGEYQNPHQYLNHQQHQQSELPIPQQQLHHQHLDDGATTSSSLSPLLPP | |
| 69 PPHQLYGGYQDYGMPAHMFQHHHGHPHQSFQHSASAYNMSASQFYAGASATAYQTPATYN | |
| 70 YNYAGSGEVYGGATPSAVGIKSEYIPTPYVTPSPTLDLNSSAEVDSLQAPTQKLCVNPLS | |
| 71 QRLMETASNSSSLRSIYGSDEGAKKKDNSQVTSSRSELRKNSISGNSNPGSNSGSTKPRM | |
| 72 KRKPRVLFSQAQVLELECRFRLKKYLTGAEREIIAQKLNLSATQVKIWFQNRRYKSKRGD | |
| 73 IDCEGIAKHLKLKSEPLDSPTSLPPPIPNHVMWPPTMQQSQQQQQHHAQQQQMQHM | |
| 74 >tr|O16132|O16132_HYDVU NK-2 class homeobox transcription factor OS=Hydra vulgaris GN=CnNK-2 PE=2 SV=1 | |
| 75 MDFSILPVNSSFLVDDILRRKHYENKIHQSNFSQFSVLSDEISIKTRLSAFPIYNKGMHK | |
| 76 NKELVNKPFQMNDKNITETERDFNKSSISFDMTSNVEYSFGDKRMNNRHSFQGLSCRVAE | |
| 77 AEMYARGKREDNSSDENSPKCESPSLTAKTEYHNASGDAMHVTSESLIQQNLLNIKSSRK | |
| 78 KPRILFSQSQVMELGKKFKDQKYLSASERDQIANKLNLTPTQVKIWFQNKRYKCKKQTIE | |
| 79 SRTRPPPYEWLHFQHRNVPVLVQNNQVSSDVCLPYCNRPTYLPSNSPVDMNYPPFYPDPY | |
| 80 NGHNHHYSNSYNTPSQTSTYPNSWPFYK | |
| 81 >sp|O93590|ZAX_XENLA Homeobox protein zampogna OS=Xenopus laevis GN=zax PE=2 SV=1 | |
| 82 MSLTSFSIQDILARTGGNRGKDTRTDGNNISPPPSPSADEGHNEWPRAENPPLTPEKEKT | |
| 83 DTDSGTEDFHWERDTETANNGAFTDPSSGDRLADSPKSSKKRSRAAFSHAQVYELERRFS | |
| 84 LQRYLSGPERADLAASLKLTETQVKIWFQNRRYKTKRKLIATQTAPKSSLVPTRKVAVRV | |
| 85 LVKDDQRQYCPEDMLSPSLLSLYHAYQYYPYMYCLPAWVPHLPL | |
| 86 >tr|Q90853|Q90853_CHICK Homeobox protein OS=Gallus gallus GN=GH6 PE=2 SV=1 | |
| 87 MAQDRECLCSAGFQRGDYTQGNTDRSTAAGNCRRRGSGEPRSHPPAEADPPSRSCFTDDA | |
| 88 GRSDGKRRLHICPRLVLFHRGPAGHRSGGGTRRAAAGGGGGRRTSRCGPHSPLRLGASGC | |
| 89 PLRDAAVGWYRRAFLGCAAPTPATGTRRSCPEDTERAGGGGRAAGGAAGGRQSSGGREEE | |
| 90 EERGEEAGEAEQRAAGRKKKTRTVFSRSQVFQLESTFDVKRYLSSSERAGLAASLHLTET | |
| 91 QVKIWFQNRRNKWKRHVAADLEAANLSHAAQRIVRVPILYHENSPASALGFGLPHMSPPL | |
| 92 VGFSGGVSYPPGHLPRRLPSLPSLADDGTRLSAHLCRDRGPEPPPLASSFTLGLFLSTFT | |
| 93 IFRFSTFI | |
| 94 >tr|O97671|O97671_RABIT Homeobox protein (Fragment) OS=Oryctolagus cuniculus GN=HEX PE=2 SV=1 | |
| 95 GKPLLWSPFLQRPLHKRKGGQVRFSNDQTIELEKKFETQKYLSPPERKRLAKMLQLSERQ | |
| 96 VKTWFQNRRAKWRRLKQENPQSGKKEQENLDSSCEQRPDLPGDQHKGASLDSSQCSQSPA | |
| 97 SQEDLESEISEDSDQEVD | |
| 98 >tr|Q9YHC2|Q9YHC2_CHICK Homeodomain protein (Fragment) OS=Gallus gallus GN=Nkx-6.1 PE=2 SV=1 | |
| 99 PPWRDARIGCAPHQGSILLDKDGKRKHTRPTFSGQQIFALEKTFEQTKYLAGPERARLAY | |
| 100 SLGMTESQVKVWFQNRRTKWRKKHAAEMATAKKKQDSETERLKGASDNEDDDDDYNKPLD | |
| 101 PNSDDEKIAQLLKKHKPGAGGLLPHPAEGEASA | |
| 102 >sp|P28468|HOX1_HALRO Homeobox protein AHox1 OS=Halocynthia roretzi GN=AHOX1 PE=2 SV=1 | |
| 103 MEKMHSKSVSPVPFNNSNNTSLGGLRKSSSIPTLAVPECESMGNKHIEEERTNNITTMAM | |
| 104 KRRLLDPQNKKKQNRFERYSSSNHAQEQSSEENFCRSKKDSTVLKFGIDSILKNKNAEKV | |
| 105 PKGISNAGRIQDERFTEACTNTSSNVNPLSKYFKPSSNDQLGARRTATSFSSSSEASDSK | |
| 106 SCCTNNNEEARYKYRVIDKRKSADSDWSEDATGNEADDPDDHINQDNCDLASTLEQSRIV | |
| 107 ALEILKNKRLRLDSSEALNDLTPYDQLSRTEDQQISRRVEMMNHQAFARENNEWPRSFSS | |
| 108 GLQDPFAKNLPNAFLPFYMQPYLRAYYNIQKYIYHKKLLNRNDRFYREANVENDNYKTEE | |
| 109 SLRSPSETKQYSPDASTFYPIRTEDSNGSRNLKVDVEEGDKEANKLFKDLCVSVGDRLSN | |
| 110 ALSYGRKDYNGLSTSQTSGNRFLNFSDKGIQAGSYYQTGERNDSLAGPLKNSGMSFDFPP | |
| 111 KFGSNNSSTDKPEQEDNNPQTIGSEYQINTQRSMKDNLLTAKLLENEAKLRYGNIVTQYP | |
| 112 RPFSWPFAASVRKSYDPALRSYFSRFNNSDAPHYGAAQVNPTAGNNFKSMLPGNFENPYF | |
| 113 FNELNTLDTTGFLSRQYGHMSSSQNPHSETQNRSEEVRGTVKKRRKWNRAVFSLMQRRGL | |
| 114 EKSFQSQKYVAKPERRKLADALSLTDAQVKIWFQNRRMKWRQEIKMKNRGLVPVHILGQD | |
| 115 HEIEKEKTQTPSDEGEVINVD | |
| 116 >sp|O88181|BARH2_RAT BarH-like 2 homeobox protein OS=Rattus norvegicus GN=Barhl2 PE=2 SV=1 | |
| 117 MTAMEGASGSSFGIDTILSGAGSGSPGMMNGDFRSLGEARTTDFRSQATPSPCSEIDTVG | |
| 118 TAPSSPISVTLEPPEPHLVTDGPQHHHHLHHGQQPPPPSAPPAQSLQPSPQQQPPPQPQS | |
| 119 AAQQLGSAAAAPRTSTSSFLIKDILGDSKPLAACAPYSTSVSSPHHTPKQECNAAHESFR | |
| 120 PKLEQEDSKTKLDKREDSQSDIKCHGTKEEGDREITSSRESPPVRAKKPRKARTAFSDHQ | |
| 121 LNQLERSFERQKYLSVQDRMDLAAALNLTDTQVKTWYQNRRTKWKRQTAVGLELLAEAGN | |
| 122 YSALQRMFPSPYFYHPSLLGSMDSTTAAAAAAAMYSSMYRTPPAPHPQLQRPLVPRVLIH | |
| 123 GLGPGGQPALNPLSNPIPGTPHPR | |
| 124 >sp|Q24255|BARH1_DROME Homeobox protein B-H1 OS=Drosophila melanogaster GN=B-H1 PE=2 SV=2 | |
| 125 MKDSMSILTQTPSEPNAAHPQLHHHLSTLQQQHHQHHLHYGLQPPAVAHSIHSTTTMSSG | |
| 126 GSTTTASGIGKPNRSRFMINDILAGSAAAAFYKQQQHHQQLHHHNNNNNSGSSGGSSPAH | |
| 127 SNNNNNINGDNCEASNVAGVGVLPSALHHPQPHPPTHPHTHPHALMHPHGKLGHFPPTAG | |
| 128 GNGLNVAQYAAAMQQHYAAAAAAAAARNNAAAAAAAAAAAAAAGVAAPPVDGGVDGGVGL | |
| 129 APPAGGDLDDSSDYHEENEDCDSGNMDDHSVCSNGGKDDDGNSVKSGSTSDMSGLSKKQR | |
| 130 KARTAFTDHQLQTLEKSFERQKYLSVQERQELAHKLDLSDCQVKTWYQNRRTKWKRQTAV | |
| 131 GLELLAEAGNFAAFQRLYGGSPYLGAWPYAAAAGAAHGATPHTNIDIYYRQAAAAAAMQK | |
| 132 PLPYNLYAGVPSVGVGVGVGVGPAPFSHLSASSSLSSLSSYYQSAAAAASAANPGGPHPV | |
| 133 APPPSVGGGSPPSGLVKPIPAHSASASPPPRPPSTPSPTLNPGSPPGRSVDSCSQSDDED | |
| 134 QIQV | |
| 135 >sp|Q22909|HM30_CAEEL Homeobox protein ceh-30 OS=Caenorhabditis elegans GN=ceh-30 PE=2 SV=2 | |
| 136 MSLLDPRQFLLPAFYLDPTTQALLAQAASTSPCNKISSSSSFRISDILEQSPNNSSHSND | |
| 137 HDPSPQSIKSDFSTSPRASSPGGDRMGSPGSCKKSRKARTIFTDKQLQELENTFEKQKYL | |
| 138 SVQDRMDLAHRMGLTDTQVKTWYQNRRTKWKRQATSGMDLLSEPGNLSAVQNLIRSSPYW | |
| 139 ANYITALPMGTQLPMMGLPMSMIVPPAHAFQPSSSSNSPSTHISSESPQLDVSSNSE | |
| 140 >sp|P26797|HM19_CAEEL Homeobox protein ceh-19 OS=Caenorhabditis elegans GN=ceh-19 PE=2 SV=2 | |
| 141 MAFNIESLLEKKSNPVEEGNDFEEENDSEKNGEEDEEEEEKNVIDGWTNMATSQLAMFAI | |
| 142 ANDLRTPTLVELQMLLGVSARKHDYKRSRKSVCERKPRQAYSARQLDRLETEFQTDKYLS | |
| 143 VNKRIQLSQTLNLTETQIKTWFQNRRTKWKKQLTSSIRQMVKDAPTSTSVGVPFQSLLTP | |
| 144 PTPPTTLACHVNSLFACEQ | |
| 145 >sp|P22807|SLOU_DROME Homeobox protein slou OS=Drosophila melanogaster GN=slou PE=2 SV=1 | |
| 146 MVMLQSPAQKASDSASAQNTAVGGLMSPNSNPDSPKSNTSPDVASADSVVSGTGGGSTPP | |
| 147 AAKIPKFIISANGAAVAGKQEQELRYSLERLKQMSSESGSLLSRLSPLQEDSQDKEKPNH | |
| 148 NNNNSLTNHNANSNTRRSQSPPASVGSVSFSSPAQQRKLLELNAVRHLARPEPLQHPHAA | |
| 149 LLQQHPHLLQNPQFLAAAQQHMHHHQHQHHQHPAHPHSHQHPHPHPHPHPHPHPSAVFHL | |
| 150 RAPSSSSTAPPSPATSPLSPPTSPAMHSDQQMSPPIAPPQNPPHSSQPPQQQQVAAPSDM | |
| 151 DLERIKLVAAVAARTTQASSTSALASASNSVSNASISISNSSSGSPSGRDLSDYGFRIQL | |
| 152 GGLAAAAAAAAATSRQIAAATYARSDTSEELNVDGNDEDSNDGSHSTPSVCPVDLTRSVN | |
| 153 SSAAANPSSASTSASSDRDAATKRLAFSVENILDPNKFTGNKLPSGPFGHPRQWSYERDE | |
| 154 EMQERLDDDQSEDMSAQDLNDMDQDDMCDDGSDIDDPSSETDSKKGGSRNGDGKSGGGGG | |
| 155 GGSKPRRARTAFTYEQLVSLENKFKTTRYLSVCERLNLALSLSLTETQVKIWFQNRRTKW | |
| 156 KKQNPGMDVNSPTIPPPGGGSFGPGAYASGLLYSHAVPYPPYGPYFHPLGAHHLSHSHS | |
| 157 >sp|Q04787|BSH_DROME Brain-specific homeobox protein OS=Drosophila melanogaster GN=bsh PE=2 SV=5 | |
| 158 MAMLNEASLSPADAHAHANATTPTHSKAAAMASATTMLTTKTPFSIEHILFQNLNSASNN | |
| 159 NNSSDTNGIAANTNNYAPKSSRNAVKSARSAFAHDNNPHKHPSQHSHPPQSHPPASASAS | |
| 160 ATATARSNQAASGYAGEDYGKSMHSTPRSNHHSRHGTSHYNGDQISQQLGSGAAQHPPVP | |
| 161 TTQPQPPPPPPLNGGSGASNGVLYPNAPYTDHGFLQMTLGYLSPSSGTYKSVDPYFLSQA | |
| 162 SLFGGAPFFGAPGCVPELALGLGMGVNALRHCRRRKARTVFSDPQLSGLEKRFEGQRYLS | |
| 163 TPERVELATALGLSETQVKTWFQNRRMKHKKQLRRRDNANEPVDFSRSEPGKQPGEATSS | |
| 164 SGDSKHGKLNPGSVGGTPTQPTSEQQLQMCLMQQGYSTDDYSDLEADSGDEDNSSDVDIV | |
| 165 GDAKLYQLT | |
| 166 >sp|O08686|BARX2_MOUSE Homeobox protein BarH-like 2 OS=Mus musculus GN=Barx2 PE=2 SV=2 | |
| 167 MHCHAELRLSSPGQLKAARRRYKTFMIDEILSKETCDYFEKLSLYSVCPSLVVRPKPLHS | |
| 168 CTGSPSLRAYPLLSVITRQPTVISHLVPTGSGLTPVLTRHPVAAAEAAAAAAETPGGEAL | |
| 169 ASSESETEQPTPRQKKPRRSRTIFTELQLMGLEKKFQKQKYLSTPDRLDLAQSLGLTQLQ | |
| 170 VKTWYQNRRMKWKKMVLKGGQEAPTKPKGRPKKNSIPTSEEIEAEEKMNSQAQSQELLES | |
| 171 SERQEEPCDTQEPKACLVPLEVAEPIHQPQELSEASSEPPPLS | |
| 172 >tr|Q23819|Q23819_HYDVD Cnox3 protein (Fragment) OS=Hydra viridissima GN=cnox3 PE=2 SV=1 | |
| 173 NLYPILNTDQNHCTYAKEDSLIPEVEEPSTYLQLKQNNAKGSGIKCRKPRTVFSDLQLMV | |
| 174 LEREFNNRKYLSTPQRTNLADRLGLNQTQVKTWYQNRRMKWKKETFESEDKEPKIS | |
| 175 >sp|Q01702|DLX3B_DANRE Homeobox protein Dlx3b OS=Danio rerio GN=dlx3b PE=2 SV=1 | |
| 176 MSGPTYDRKIPGISTDLSGSMSCHPTSKDSPTLPESSATDMGYYSSHHEYYQSPPYPQQM | |
| 177 NSYHQFNLSGMGATPGAYPTKTEYPYNTYRQYGHFNRDLQTPPQSAVKEEPETEVRMVNG | |
| 178 KPKKIRKPRTIYSSYQLAALQRRFQKAQYLALPERAELAAQLGLTQTQVKIWFQNRRSKF | |
| 179 KKLYKNGEVPLEHSPNASDSMACNSPPSPAVWDNNAHSSQVNRGQIPQPPLSSTPPYMED | |
| 180 YSNHWYQQGSHLQHPVHHPGPPQSVGAVY | |
| 181 >sp|Q18273|HM43_CAEEL Homeobox protein ceh-43 OS=Caenorhabditis elegans GN=ceh-43 PE=2 SV=1 | |
| 182 MDPSKGFEYVAGDYYQTSGVAPPTSNGAGSNVSPYFPYHAYPTSSTNGATGGSMYGTPQQ | |
| 183 TSAYAMYPPGPGSSPEEAFPEHTTTKIVEGCEAKYNVKGKKMRKPRTIYNSSQLQMLQKK | |
| 184 FQKTQYLALPDRAALAHELGLSQTQVKIWFQNRRSKQKKQKGGSSDHASDEEDDDTEESK | |
| 185 PESPPMGESVMIQESSEPRTLVSSSIKTEMKEEYPPMTLNEQYASPYLYGSDFSTILPPS | |
| 186 QGFPNNALYNTAGAYPSIDYTNGVYQNTLYKYV | |
| 187 >tr|Q23824|Q23824_HYDVD Msh protein (Fragment) OS=Hydra viridissima GN=msh PE=2 SV=1 | |
| 188 EFQFDLSKCFLRKHKANRKPRTPFSVNQLLTLEQKFKRKQYLSISERAELSELLRLTETQ | |
| 189 IKIWFQNRRAKQKRSKEAEIEESVRNRLPLSAADYRSLDHLTLLSSFIAFIPIEYEVKIF | |
| 190 MNVQRGIE | |
| 191 >tr|Q24785|Q24785_9METZ Homeobox-containing protein (Fragment) OS=Ephydatia fluviatilis GN=prox3 PE=3 SV=1 | |
| 192 PHSSGSNASTINKQKKDRKPRTPFTSTQLIALERKFRQQKYLSVAERAEFAEYLKLTETQ | |
| 193 VKIWFQNRRAKEKRLHEAEAERAARSLGFHFHMPMQSKMNTFRHPYCNSQYQTLCLCQFR | |
| 194 HKIGIGTFPARTTDSISSNSSQPTLPWFLTCNSSTPL | |
| 195 >sp|P70354|MSX3_MOUSE Homeobox protein MSX-3 OS=Mus musculus GN=Msx3 PE=1 SV=1 | |
| 196 MARATFDMNAAGLEARGGGHTEHGPLPFSVESLLEAERVPGSESGELGVERPLGASKPGA | |
| 197 WPPPVAHSCPPRAPSPPPCTLRKHKTNRKPRTPFTTAQLLALERKFHQKQYLSIAERAEF | |
| 198 SSSLSLTETQVKIWFQNRRAKAKRLQEAELEKLKLAAKPLLPAAFALPFPLGTQLHSSAA | |
| 199 TFGGNAVPGILAGPVAAYGMYYLS | |
| 200 >tr|Q90263|Q90263_DANRE Empty spiracles homeobox 3 OS=Danio rerio GN=emx3 PE=2 SV=1 | |
| 201 MFQHNKKCFTIESLVGKDSNSSNAAADEPIRPTALRFTESIHPSPFGSCFQNSGRTLYSS | |
| 202 SPEMMFTDPSTHSTNSGLSLRHLQIPTQPFFSPHQRDTLNFYPWVLRNRYLGHRFQGDDS | |
| 203 SPENLLLHGPFSRKPKRIRTAFSPSQLLRLERAFEKNHYVVGAERKQLANGLCLTETQVK | |
| 204 VWFQNRRTKHKRQKLEEESPDPQQKRKGSQHVSRWRVATQQGSPEDIDVISED |
