TCONS_00002711-protein (polypeptide) - C. hemisphaerica

Overview
Name: TCONS_00002711-protein
Unique Name: TCONS_00002711-protein
Type: polypeptide
Organism: Clytia hemisphaerica (Jellyfish)
Sequence length: 2603

Sequence
The following sequences are available for this feature:

polypeptide sequence

>TCONS_00002711-protein ID=TCONS_00002711-protein|Name=TCONS_00002711-protein|organism=Clytia hemisphaerica|type=polypeptide|length=2603bp
MKTITALLFLLAAQANAQTEINLRREQNMCSVWGSGHYQTFDGTTFNYPG
QCTYALAIDCRPGKDDFAVHIENGVNCTAGKSCSRAVVVYLNDIPHRISF
DSTFTKDGKKVTLPYADQDVAVSRHAGYTYLDAWNGKLLVKYDGNNGVYV
QVGDEFHNGAVCGLCGNNNGPTDDFLKPGGQQAKNPTEYGNSWAKPIFGQ
TCKQVIGEVDYCNGTKDLVRLLSETKCKELKRSEVFTKCRQMVNPEPYYK
ACVQEACKCQGDHKCTCGAFEQYSRECLRHGITANWRSEDQCPVQCTDGK
VFMECGPSCSAECGQKGVISNCKTGCIDGCHCPKGTVLSKGVCIPETQCP
CLHNGNSYNHGVTVKMPGGCKNCTCNGGEWDCNNMECDGTASLYGNGHYN
TFDGKSFSYRNTCPSILVSHRGKAPFSVITDRQSCASKTCYMTIIIVYKT
NQIKLSTELGKFLASVNDLGTKLPIVSNKGFRVEAVSSMIRIETTEGLRV
TWDGKSRVNIKIPPSFKNQVSGLLGNFNGKTIDDFTEVSGDLAHSEIEFG
NSWLLPSKCKPLSSTDFSLTPCEANHQNAQVAETKCDILNSGPFKKCHAM
VDTVKYFDNCKQDVCGCGNHGSECFCAVLSAYAAECAMAGVEMKGWRKTS
GCEVTCPVGQSYQECSSSCTHSCSDVITPDAKCNEDCIEGCACPKGMVKK
DDISNVCVAKTACGCQVEDKLIANGGRVQKGCNTCVCREGSLVCTTLKCK
SDIVKCKPDMVYTTCLPTCPNTCATKQLGQTCLADKQKCVSGCTCPDGTI
EHDGKCITPEQCPCFHDGKTYEENSKMSRGCNDCICKSGKWKCTEDNCPG
VCSVFGDPHYKTFDGKIFDYQGRCKHTMVSDTCAGQPSKYKKEQIHVEVN
TQACGSQEVTCAKEITAIIHGSTFILKKGDKKAVIKPALEDKPTFKVYDY
AGSYVHIVTDHGISLMWDNGTRLYITVQPALAGKLCGLCGNYDGSEANDF
VTIQGDTTASATIFGDSWADDDSCPKAKEIEDTCKARPHRLDPSKKECSI
IKSDVFKQCHHAVDPSIYYDRCVFDVCGCDQLGDCDCICDAVGAYQKACQ
DEGIYIGWRKGHSICEKEMCCPTNMTYQIKGQACPPTCQYPKGDPNCPTK
FVEGCQCPVGMVQKVDRPNGNVFCVPPSECVYCEMNGHRYIDGERVPHKP
GTAEGDETEITNPDCQECVCNKGKVVCKEIPGCTSTTVAPSTTTITTISS
TTQPQSTTESPTTESSTTPSQSTTESSTTPSQSTTEASTTKSQTTKASTT
KSQTTAKSSTTQPQTTEEASTTPSKTESSTTPSQSTTEASTTKSQTTAKS
STTQPQTTEEASTTPSKTESSTTPSQSTTEASTTKSQTTTEASTTKSQTT
KASTTKSQTTESSTTKSQTTAKSSTTQPQTTEEASTTPSKTESSTTPSQS
TTEASTTKSQTTAKSSTTQPQTTEEASTTPSKTESSTTPSQSTTEASTTK
SQTTENWSTTPSQTTESSTTKSQSTTEASTTPSQTTTEAPITQPQSTTEA
STTSSKSTTESSTTSKATEVSTKPKETGTTVIATRTFPTFPTFPPMTTKS
STTKSSTTSSRSTTEALTTPSQSTESQTTEASTTPSQTTESQTTEQSTTV
SPTTVSPTTAPICDCKNPLPVGVSNEMIPDSQMKTNSIRSPEPGPANTGP
AQARLNNRQSIHGSGAWEPKDKEAHLDIIFNKMEDVREIRTQGSPRDERF
ARRYFVFYSLDGESFENEPLQSADGSYIFEGNTDNNGIKVNKLNIKAKAI
RIVPANDESIGEPQVAMRVELYVCLPCSTTPAPPVTTTQSTTVRKSTNIY
TEPPQSTEPQVKETTTEASTTKSSTTEASTTQSQSTTEAATTPSRTTEAS
TTQSESTTEAATTKSQTTTEASTTRSQSTTEASTTQSESTTEAATTKSQT
TTEASTTRSQSTTEASTTKSQTTESSTTRSQTITEASTTAASTTKFQTTE
SSTTKSSTTKSQSTTEASTTPSKSTESQTTESQATPSQTTESQTTEQSTT
VSPTTAPICDCKTPLPVGVSDRRISDSQMKSNSVRVTSPGPFNTGPAQAR
LNNKVTSSGSGAWEPKDKEAHLDIIFNKMEDVREIRTQGSPRVNRFAKRY
FVFYSLDGEKFENEPLKSADGSYVFEGNTNNNGIKVNKLNIKAKAIRIVP
ANDESIGEPQVAMRVELYVCPPCSTTPAPPVTTTQSTTVRKSTNIYTEPP
QSTEPQVKETTTEASTTKSSTTEASTTPSQSTESSTTQSQTTESSTKSST
TQSQTTPSQSTESSTTQSQTTPSQSSTTASQSTTESPTTLSRTTEASTTQ
SQSTTKASTTESSTATTQTTTTRFSTNIYTEPTLSTEPQVKEYCVYFNGK
ENITLSVGETVEVDECNQKICVVDDVTGKPEIELIQHECDDENCNTGCEV
PLNKRQVLVPIPGKCCRKCVSSDTACSQPGFLTCGGKDAKCIPKDWFCDG
RKDCPNNRDEECCPATTTAPPTTTRPATTTAQQSSTTPTTVTTTPIFSTX
QQQHLNNLQQHPQRSPQLQSFPQQFLRHDHSLHVNHQKYLQHVLPSVKKS
VIS
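The FASTA header above packs its metadata into `key=value` fields separated by `|`. A minimal Python sketch of how such a header could be split into fields is given here; the function names are hypothetical and not part of the database's own tooling. Note that the header writes the length with a `bp` suffix although the record is a polypeptide, so the sketch reads only the leading integer, which agrees with the 2603 given in the Overview.

```python
import re

def parse_fasta_header(header):
    """Split a '>id key=value|key=value|...' header line into a dict."""
    record_id, _, attr_text = header.lstrip(">").strip().partition(" ")
    attrs = {"record_id": record_id}
    for field in attr_text.split("|"):
        key, sep, value = field.partition("=")
        if sep:  # skip fields that carry no '='
            attrs[key] = value
    return attrs

def declared_length(attrs):
    """Return the leading integer of the length field, ignoring any unit suffix."""
    match = re.match(r"\d+", attrs.get("length", ""))
    return int(match.group()) if match else None

def read_single_fasta(text):
    """Return (header, sequence) for a FASTA string holding exactly one record."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    return lines[0], "".join(lines[1:])

# Header copied from the record above.
header = (">TCONS_00002711-protein ID=TCONS_00002711-protein|"
          "Name=TCONS_00002711-protein|organism=Clytia hemisphaerica|"
          "type=polypeptide|length=2603bp")
attrs = parse_fasta_header(header)
print(attrs["organism"], declared_length(attrs))
```

When the full record is passed to `read_single_fasta`, comparing `len(sequence)` with `declared_length(attrs)` is a quick check that no sequence lines were lost in copying.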
Gene-mRNA-Prot
This polypeptide comes from the following gene feature:
Feature Name | Unique Name | Species | Type
XLOC_001463 | XLOC_001463 | Clytia hemisphaerica | gene
This polypeptide derives from the following transcript feature(s):
Feature Name | Unique Name | Species | Type
TCONS_00002711 | TCONS_00002711 | Clytia hemisphaerica | transcript
Annotated Terms
The following terms have been associated with this polypeptide:
Vocabulary: INTERPRO
Term | Definition
IPR001846 | VWF_type-D
IPR002172 | LDrepeatLR_classA_rpt
IPR014853 | Unchr_dom_Cys-rich
IPR000421 | FA58C
IPR008979 | Galactose-bd-like
IPR002919 | TIL_dom
IPR023415 | LDLR_class-A_CS
Vocabulary: Molecular Function
Term | Definition
GO:0005515 | protein binding
GO Annotation
GO Assignments
This polypeptide is annotated with the following GO terms.
Category | Term Accession | Term Name
molecular_function | GO:0005515 | protein binding
InterPro
Analysis Name: InterPro Annotations of C. hemisphaerica v1.0
Date Performed: 2017-06-01
IPR Term | IPR Description | Source | Source Term | Source Description | Alignment
IPR001846 | von Willebrand factor, type D domain | SMART | SM00216 | VWD_2 | coord: 20..180; e-value: 6.1E-35; score: 132.1
IPR001846 | von Willebrand factor, type D domain | SMART | SM00216 | VWD_2 | coord: 841..1005; e-value: 2.6E-39; score: 146.6
IPR001846 | von Willebrand factor, type D domain | SMART | SM00216 | VWD_2 | coord: 380..540; e-value: 9.5E-35; score: 131.4
IPR001846 | von Willebrand factor, type D domain | PFAM | PF00094 | VWD | coord: 852..1005; e-value: 5.8E-30; score: 104.5
IPR001846 | von Willebrand factor, type D domain | PFAM | PF00094 | VWD | coord: 30..180; e-value: 5.8E-27; score: 94.8
IPR001846 | von Willebrand factor, type D domain | PFAM | PF00094 | VWD | coord: 392..540; e-value: 7.3E-24; score: 84.7
IPR001846 | von Willebrand factor, type D domain | PROSITE | PS51233 | VWFD | coord: 851..1062; score: 41.776
IPR001846 | von Willebrand factor, type D domain | PROSITE | PS51233 | VWFD | coord: 390..600; score: 38.576
IPR001846 | von Willebrand factor, type D domain | PROSITE | PS51233 | VWFD | coord: 29..242; score: 35.257
IPR002172 | Low-density lipoprotein (LDL) receptor class A repeat | SMART | SM00192 | LDLa_2 | coord: 2475..2515; e-value: 8.3E-8; score: 41.9
IPR002172 | Low-density lipoprotein (LDL) receptor class A repeat | PROSITE | PS50068 | LDLRA_2 | coord: 2475..2514; score: 11.637
IPR002172 | Low-density lipoprotein (LDL) receptor class A repeat | SUPERFAMILY | 57424 | LDL receptor-like module | coord: 2479..2517
IPR014853 | Uncharacterised domain, cysteine-rich | SMART | SM00832 | c8_a | coord: 1041..1116; e-value: 1.4E-25; score: 101.0
IPR014853 | Uncharacterised domain, cysteine-rich | SMART | SM00832 | c8_a | coord: 220..293; e-value: 1.3E-16; score: 71.2
IPR014853 | Uncharacterised domain, cysteine-rich | SMART | SM00832 | c8_a | coord: 579..653; e-value: 9.3E-25; score: 98.2
IPR014853 | Uncharacterised domain, cysteine-rich | PFAM | PF08742 | C8 | coord: 1047..1113; e-value: 4.2E-17; score: 62.2
IPR014853 | Uncharacterised domain, cysteine-rich | PFAM | PF08742 | C8 | coord: 585..652; e-value: 4.4E-18; score: 65.4
IPR014853 | Uncharacterised domain, cysteine-rich | PFAM | PF08742 | C8 | coord: 226..292; e-value: 9.0E-16; score: 58.0
IPR000421 | Coagulation factor 5/8 C-terminal domain | SMART | SM00231 | disc_4 | coord: 1664..1824; e-value: 0.02; score: 16.9
IPR000421 | Coagulation factor 5/8 C-terminal domain | SMART | SM00231 | disc_4 | coord: 2060..2220; e-value: 0.011; score: 20.2
IPR000421 | Coagulation factor 5/8 C-terminal domain | PROSITE | PS50022 | FA58C_3 | coord: 2061..2220; score: 18.879
IPR000421 | Coagulation factor 5/8 C-terminal domain | PROSITE | PS50022 | FA58C_3 | coord: 1665..1824; score: 18.415
None | No IPR available | GENE3D | 4.10.1220.10 | - | coord: 2488..2522; e-value: 1.7E-8; score: 33.8
None | No IPR available | GENE3D | 2.10.70.10 | - | coord: 356..385; e-value: 7.8E-6; score: 25.5
None | No IPR available | GENE3D | 2.10.70.10 | - | coord: 1190..1230; e-value: 4.9E-5; score: 22.9
None | No IPR available | GENE3D | 2.10.70.10 | - | coord: 819..845; e-value: 0.0051; score: 16.4
None | No IPR available | GENE3D | 2.10.70.10 | - | coord: 721..754; e-value: 0.28; score: 10.9
None | No IPR available | GENE3D | 2.10.25.10 | - | coord: 289..355; e-value: 4.9E-16; score: 58.2
None | No IPR available | GENE3D | 2.10.25.10 | - | coord: 651..720; e-value: 1.3E-13; score: 50.4
None | No IPR available | GENE3D | 2.10.25.10 | - | coord: 1116..1189; e-value: 5.2E-11; score: 42.1
None | No IPR available | GENE3D | 2.10.25.10 | - | coord: 755..818; e-value: 4.5E-17; score: 61.5
None | No IPR available | CDD | cd00112 | LDLa | coord: 2476..2513; e-value: 7.75775E-5; score: 41.8086
IPR008979 | Galactose-binding domain-like | GENE3D | 2.60.120.260 | - | coord: 2057..2223; e-value: 8.3E-17; score: 60.9
IPR008979 | Galactose-binding domain-like | GENE3D | 2.60.120.260 | - | coord: 1661..1826; e-value: 3.8E-16; score: 58.7
IPR008979 | Galactose-binding domain-like | SUPERFAMILY | 49785 | Galactose-binding domain-like | coord: 2061..2222
IPR008979 | Galactose-binding domain-like | SUPERFAMILY | 49785 | Galactose-binding domain-like | coord: 1666..1824
IPR002919 | Trypsin Inhibitor-like, cysteine rich domain | PFAM | PF01826 | TIL | coord: 1121..1180; e-value: 1.4E-5; score: 25.2
IPR002919 | Trypsin Inhibitor-like, cysteine rich domain | PFAM | PF01826 | TIL | coord: 656..713; e-value: 1.4E-8; score: 34.8
IPR002919 | Trypsin Inhibitor-like, cysteine rich domain | PFAM | PF01826 | TIL | coord: 756..812; e-value: 1.6E-8; score: 34.6
IPR002919 | Trypsin Inhibitor-like, cysteine rich domain | PFAM | PF01826 | TIL | coord: 296..349; e-value: 1.5E-8; score: 34.7
IPR002919 | Trypsin Inhibitor-like, cysteine rich domain | SUPERFAMILY | 57567 | Serine protease inhibitors | coord: 652..715
IPR002919 | Trypsin Inhibitor-like, cysteine rich domain | SUPERFAMILY | 57567 | Serine protease inhibitors | coord: 754..814
IPR002919 | Trypsin Inhibitor-like, cysteine rich domain | SUPERFAMILY | 57567 | Serine protease inhibitors | coord: 292..351
IPR002919 | Trypsin Inhibitor-like, cysteine rich domain | SUPERFAMILY | 57567 | Serine protease inhibitors | coord: 1119..1180
IPR023415 | Low-density lipoprotein (LDL) receptor class A, conserved site | PROSITE | PS01209 | LDLRA_1 | coord: 2491..2513
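
The InterPro hits above are listed per source rather than by position, so the order of domains along the protein is not obvious at a glance. As a rough illustration, the Pfam coordinates from the table (VWD, C8 and TIL hits only) can be sorted by start position to read off the N-terminal domain order. The sketch below is an assumption about how one might do this in Python; the `PFAM_HITS` list and `architecture` helper are hypothetical names, and only the coordinates are taken from the record.

```python
# Pfam hits copied from the InterPro table: (Pfam name, start, end).
PFAM_HITS = [
    ("VWD", 852, 1005), ("VWD", 30, 180), ("VWD", 392, 540),
    ("C8", 1047, 1113), ("C8", 585, 652), ("C8", 226, 292),
    ("TIL", 1121, 1180), ("TIL", 656, 713), ("TIL", 756, 812),
    ("TIL", 296, 349),
]

def architecture(hits):
    """Return domain names joined in order of their start coordinate."""
    ordered = sorted(hits, key=lambda hit: hit[1])
    return "-".join(name for name, _start, _end in ordered)

def overlapping_pairs(hits):
    """Return adjacent hits (by start) whose coordinate ranges overlap."""
    ordered = sorted(hits, key=lambda hit: hit[1])
    return [(a, b) for a, b in zip(ordered, ordered[1:]) if b[1] <= a[2]]

print(architecture(PFAM_HITS))
# VWD-C8-TIL-VWD-C8-TIL-TIL-VWD-C8-TIL
print(overlapping_pairs(PFAM_HITS))
# []
```

This covers only the Pfam rows for the first ~1180 residues; the FA58C and LDLa hits further downstream come from other sources in the table and would need to be added to the list to describe the whole protein.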

Blast
BLAST of TCONS_00002711-protein vs. Swiss-Prot (Human)
Match: VWF (von Willebrand factor OS=Homo sapiens GN=VWF PE=1 SV=4)

HSP 1 Score: 619.772 bits (1597), Expect = 1.333e-180
Identity = 397/1302 (30.49%), Positives = 614/1302 (47.16%), Query Frame = 0
Query:    3 TITALLFLLAAQANAQTEINLRREQNMCSVWGSGHYQTFDGTTFNYPGQCTYALAIDCRPGKDDFAVHIENGVNCTAGKSCSRAVVVYLNDIPH-RISFDSTFTKDGKKVTLPYADQDVAVSRHAGYTYLDAWNGKLLVKYDGNNGVYVQVGDEFHNGAVCGLCGNNN-GPTDDFLKPGGQQAKNPTEYGNSWAKPIFGQTCKQVIGEVDYCNGTKDLVRLLSETKCKELKRSEVFTKCRQMVNPEPYYKACVQEACKCQGDHKCTCGAFEQYSRECLRHGITA-NWRSEDQCPVQCTDGKVFMECGPSCSAECGQKGVISNCKTGCIDGCHCPKGTVLSKGVCIPETQCPCLHNGNSYNHGVTVKMPGGCKNCTCNGGEWDCNNMECDGTASLYGNGHYNTFDGKSFSYRNTCPSILVSHRGKAPFSVITDRQSCASK---TCYMTIIIVY---KTNQIKLSTELGKFLASVNDLGTKLPIVSNKGFRVEAVSSMIRIETTEGLRVTWDGKSRVNIKIPPSFKNQVSGLLGNFNGKTIDDFTEVSGDLAHSEIE-FGNSWLLPSKCKPLSSTDFSLTPCEANHQNAQVAETKCDILNSGPFKKCHAMVDTVKYFDNCKQDVCGCGNHGSECFCAVLSAYAAECAMAGVEMKGWRKTSGCEVTCPVGQSYQECSSSCTHSCSDVITPDAKCNEDCIEGCACPKGMVKKDDISNVCVAKTACGCQVEDKLIANGGRVQKGCNTCVCREGSLVCTTLKCKSDIV-----------------KCKPDMVYTTCLPT---------CPNTCATKQLGQTCLADKQKCVSGCTCPDGTIEHDGKCITPEQCPCFHDGKTYEENSKMSRGCNDCICKSGKWKCTEDNCPGVCSVFGDPHYKTFDGKIFDYQGRCKHTMVSDTCAGQPSKYKKEQIHVEVNTQACGSQEVTCAKEITAIIHGSTFILKKGDKKAVIKPALEDKPTFKVYDYAGSYVHIVTDHGISLMWDNGTRLYITVQPALAGKLCGLCGNYDGSEANDFVTIQGDTTASATIFGDSWADDDSCPKAKEI-----EDTCKARPHRLDPSKKECSIIKSDVFKQCHHAVDPSIYYDRCVFDVCGCDQLGDCDCICDAVGAYQKACQDEGIYIGWRKGHSICEKEMCCPTNM---------TYQIKGQACPPTCQYPKGDPNCPTKFVEGC--QCPVGMVQKVDRPNGNVFCVPPSECVYCEMNGHRYIDGERV---PHKPGTAEGDETEITNPDCQECVCNKGKVVCKEIPGCTSTTVAPSTTTITTIS 1249
             + AL  +L     A+     R     CS++GS    TFDG+ +++ G C+Y LA  C+  K  F++ I +  N   GK  S +V  YL +     +  + T T+  ++V++PYA + + +   AGY  L       + + DG+    V + D + N   CGLCGN N    DDF+   G    +P ++ NSWA     Q C++       CN +   ++     +C+ LK + VF +C  +V+PEP+   C +  C+C G  +C C A  +Y+R C + G+    W     C   C  G  + +C   C+  C    +   C+  C+DGC CP+G +L +G+C+  T+CPC+H+G  Y  G ++     C  C C   +W C+N EC G   + G  H+ +FD + F++   C  +L        FS++ +   CA      C  ++ +       + +KL    G    +++    +LP++         V++ +R+   E L++ WDG+ R+ +K+ P +  +  GL GN+NG   DDF   SG LA   +E FGN+W L   C+ L        PC  N +  + +E  C +L S  F+ CH  V  + Y  NC+ DVC C + G EC C  L++YAA CA  GV +  WR+   CE+ CP GQ Y +C + C  +C  +  PD +CNE C+EGC CP G+    D    CV K  C C  + ++             C C +G + CT       ++                  C+P MV   C P          C  TC    L   C++    CVSGC CP G + H+ +C+  E+CPCFH GK Y     +  GCN C+C+  KW CT+  C   CS  G  HY TFDG  + + G C++ +V D C   P  ++     + V  + C    V C K +T ++ G    L  G+    +K  ++D+  F+V + +G Y+ ++    +S++WD    + + ++     K+CGLCGN+DG + ND  +           FG+SW     C   +++       TC     +       C I+ SDVF+ C+  VDP  Y D C++D C C+ +GDC C CD + AY   C   G  + WR   ++C +  C   N+          Y     AC  TCQ+P+    CP + VEGC   CP G +  +D       CV P +C  CE+ G R+  G++V   P  P   +    ++ N  C+ C    G VV    P  T   V+P+T  +  IS
Sbjct:    9 VLLALALILPGTLCAEGTRG-RSSTARCSLFGSDFVNTFDGSMYSFAGYCSYLLAGGCQ--KRSFSI-IGDFQN---GKRVSLSV--YLGEFFDIHLFVNGTVTQGDQRVSMPYASKGLYLETEAGYYKLSGEAYGFVARIDGSGNFQVLLSDRYFN-KTCGLCGNFNIFAEDDFMTQEGTLTSDPYDFANSWALSSGEQWCERASPPSSSCNISSGEMQKGLWEQCQLLKSTSVFARCHPLVDPEPFVALCEKTLCECAGGLECACPALLEYARTCAQEGMVLYGWTDHSACSPVCPAGMEYRQCVSPCARTCQSLHINEMCQERCVDGCSCPEGQLLDEGLCVESTECPCVHSGKRYPPGTSLSR--DCNTCICRNSQWICSNEECPGECLVTGQSHFKSFDNRYFTFSGICQYLLARDCQDHSFSIVIETVQCADDRDAVCTRSVTVRLPGLHNSLVKLKHGAG---VAMDGQDVQLPLLKGDLRIQHTVTASVRLSYGEDLQMDWDGRGRLLVKLSPVYAGKTCGLCGNYNGNQGDDFLTPSG-LAEPRVEDFGNAWKLHGDCQDLQKQHSD--PCALNPRMTRFSEEACAVLTSPTFEACHRAVSPLPYLRNCRYDVCSCSD-GRECLCGALASYAAACAGRGVRV-AWREPGRCELNCPKGQVYLQCGTPCNLTCRSLSYPDEECNEACLEGCFCPPGLYM--DERGDCVPKAQCPCYYDGEIFQPEDIFSDHHTMCYCEDGFMHCTMSGVPGSLLPDAVLSSPLSHRSKRSLSCRPPMVKLVC-PADNLRAEGLECTKTCQNYDL--ECMS--MGCVSGCLCPPGMVRHENRCVALERCPCFHQGKEYAPGETVKIGCNTCVCQDRKWNCTDHVCDATCSTIGMAHYLTFDGLKYLFPGECQYVLVQDYCGSNPGTFR-----ILVGNKGCSHPSVKCKKRVTILVEGGEIELFDGEVN--VKRPMKDETHFEVVE-SGRYIILLLGKALSVVWDRHLSISVVLKQTYQEKVCGLCGNFDGIQNNDLTSSNLQVEEDPVDFGNSWKVSSQCADTRKVPLDSSPATCHNNIMKQTMVDSSCRILTSDVFQDCNKLVDPEPYLDVCIYDTCSCESIGDCACFCDTIAAYAHVCAQHGKVVTWRTA-TLCPQS-CEERNLRENGYECEWRYNSCAPACQVTCQHPE-PLACPVQCVEGCHAHCPPGKI--LDELLQT--CVDPEDCPVCEVAGRRFASGKKVTLNPSDPEHCQICHCDVVNLTCEACQEPGGLVV----PP-TDAPVSPTTLYVEDIS 1263          

HSP 2 Score: 102.834 bits (255), Expect = 2.040e-21
Identity = 87/352 (24.72%), Positives = 152/352 (43.18%), Query Frame = 0
Query:   37 HYQTFDGTTFNYPGQCTYALAIDCRPGKDDFAVHIENGVNCTAG--KSCSRAVVVYLNDIPHRISFDSTFTKDGKKVTLPYADQDVAVSRHAGYTYLDAWN--GKLLVKYDGNNGVYVQVGDEFHNGAVCGLCG--NNNGPTDDFLKPGGQQAKNPTEYGNSWAKPIFGQTCKQVIGEVDYCNGTKDLVRLLSETKCKELKRSEVFTKCRQMVNPEPYYKACVQEACKCQGDHKCTCGAFEQYSRECLRHGITANWRSEDQCPVQCTDGKVFMECGPSCSAECGQKGVISNCKTGCIDGCHCPKGTVLSKGVCIPETQC-PCLHNGNSYNHGVTVKMPGG--CKNCTCNGGE 379
            H  TFDG  F   G C+Y L    +  + D  V + NG  C+ G  + C +++ V  + +   +  D   T +G+ V++PY   ++ V+ +    +   +N  G +      NN   +Q+  +       GLCG  + NG  +DF+   G    +       W     GQTC+ ++ E   C        L+ ++   ++    +F +C +++ P  +Y  C Q++C      +  C     Y+  C  +G+  +WR+ D C + C    V+  C   C   C   G +S+C     +GC CP   V+ +G C+PE  C  C+      +  +   +P    C+ CTC  G 
Sbjct: 1957 HIVTFDGQNFKLTGSCSYVLF---QNKEQDLEVILHNGA-CSPGARQGCMKSIEVKHSALSVELHSDMEVTVNGRLVSVPYVGGNMEVNVYGAIMHEVRFNHLGHIFTFTPQNNEFQLQLSPKTFASKTYGLCGICDENG-ANDFMLRDGTVTTDWKTLVQEWTVQRPGQTCQPILEE--QC--------LVPDSSHCQVLLLPLFAECHKVLAPATFYAICQQDSCH----QEQVCEVIASYAHLCRTNGVCVDWRTPDFCAMSCPPSLVYNHCEHGCPRHC--DGNVSSCGDHPSEGCFCPPDKVMLEGSCVPEEACTQCIGEDGVQHQFLEAWVPDHQPCQICTCLSGR 2287          
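
The percentages in each HSP header are the match counts divided by the alignment length, which includes gap columns (1302 columns for HSP 1, 352 for HSP 2). A small Python sketch of that arithmetic follows; the regular expression and the `hsp_percentages` name are illustrative assumptions, and the pattern accepts the label with or without its second "i" so it works on raw exports that misspell it.

```python
import re

# Matches e.g. "Identity = 397/1302 (30.49%), Positives = 614/1302 (47.16%)".
HSP_STATS = re.compile(r"Identity = (\d+)/(\d+).*?Posi?tives = (\d+)/(\d+)")

def hsp_percentages(stats_line):
    """Return (percent identity, percent positives) recomputed from the counts."""
    match = HSP_STATS.search(stats_line)
    if match is None:
        raise ValueError("no Identity/Positives fields found")
    ident, ident_len, pos, pos_len = map(int, match.groups())
    return round(100 * ident / ident_len, 2), round(100 * pos / pos_len, 2)

hsp1 = "Identity = 397/1302 (30.49%), Positives = 614/1302 (47.16%), Query Frame = 0"
hsp2 = "Identity = 87/352 (24.72%), Positives = 152/352 (43.18%), Query Frame = 0"
print(hsp_percentages(hsp1))  # (30.49, 47.16)
print(hsp_percentages(hsp2))  # (24.72, 43.18)
```

Recomputing the values this way reproduces the percentages printed in both HSP headers.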
The following BLAST results are available for this feature:
BLAST of TCONS_00002711-protein vs. Swiss-Prot (Human)
Analysis Date: 2018-01-31 (Blastp Clytia hemisphaerica v1.0 proteins vs SwissProt (Homo sapiens))
Total hits: 1
Match Name | E-value | Identity | Description
VWF | 1.333e-180 | 30.49 | von Willebrand factor OS=Homo sapiens GN=VWF PE=1 ...