prot_H-canaliculatus_F_contig1439.3431.1 (polypeptide) Hapterophycus canaliculatus Oshoro5f female

You are viewing a polypeptide; more information is available on the corresponding mRNA page.

Overview
Name: mRNA_H-canaliculatus_F_contig1439.3431.1
Unique Name: prot_H-canaliculatus_F_contig1439.3431.1
Type: polypeptide
Organism: Hapterophycus canaliculatus Oshoro5f female
Sequence length: 3098
Homology
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: D7G824_ECTSI (PDZ domain-containing protein n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D7G824_ECTSI)

HSP 1 Score: 2827 bits (7328), Expect = 0.000e+0
Identity = 2099/3342 (62.81%), Positives = 2275/3342 (68.07%), Query Frame = 0
Query:    1 MEWSHTHFDGYELLAVPSDSPASIVLLDSSALVLVQRVLLPQGDAPPRALTWQPGGASTGCSSGALACIVSTRGRGPELAVFTPNGVYLSSRVRYSWSCAARVPLSPLHKGA-CDRILLSWSSRGPLVVADQAISVWSNNAAVPSANANTFSSDDSXXXXXXXXXXXSAGEAPAGKRASYS---STFGPRSVYRNSLRLSRKWVRKGGXXXXXXXXXXXXXNGKAFHGTPTRQQRYPNFVALATSPCGCMFAAACEGSEDVTVWLRRRNVPPEDKPWETYGGGGXXXXXXSEGMVAXXXXXXXSEGDARGAGAWNFQPATVLNNDGPLVQVVWGRGPGAQQDYLLTLGGDGSAHVWLEQQWEDRFSMPTESGTVKFVKTATIKSWYGDNLRLKSVGFVKWAYFGSHSRDLVRDGGAASG-SAYHLLHQQEQAADGGARPYRSDNWIVGCGDDGRRFLWSLKDMPLPDGRTIPEPVEVASLPFSWLVGVCGGGAPXXXXXXXXXXXXRSSAMAPLISAVGYYDAQGADGAPTEVQVVAAAAGGTVAVMTARVHRVKARNRYVFRNTRSVVLPLGRTSAAGVSSGRQLNGDGGSNGSAAGSARSRKRVKP-SLPEGFPHPNLPLLARVERGEGDGGGEGGWTVEVHSWVNGGSHLQCIPCKPRGNATG--QPGHTAGAEPGQVDEARPQGMVLLSSSDSVVDXXXXGKAAAGSDGVAPVFGELTRKVRSGDELAFVPYALPSLTLECRSRHERIADGEDGPCLDAAVCYCGAXXXXXXXXXXRVRHAMWMSDWVGAEALPPALAVVDTLSRFHVFELDDGASSFSSXXXXXXXXXXXXXXXXGVDVGSALPAESGGRFNTG----GLLG-LGKRTPTKSSTGSLASLSNTSGHLVMKDLPQQAGWRDGRNSSPERTSRRARAEFAGRSFKGDRPPVEKTVVLPLDSKYGLGLTLAFQGNKVEVSTFVKHPDTTEMLPAEASGRMRPGDQLVDINGHSVEGLNAEEATSLIRQARRESMEGGAVSLELTFREAAPAPGAATPVRSGGSRSGSPRPRTGWYSRKAKLRGSARGSAKGLTGAGWASTDADQHALRWNLIGR-DEVPAFETCTLLPWLAQNVAAGGGEGAAGSAENEVSSSPASVSRWAEIGVYPPGDDRRRREALLLGFQPAKASSKSGDGEGYVVALLVEAA--AGGVSVAEVCRSPLEAGQQPVSFNREKTPPGAPANGRATVEAVCEDGWVRRWCVARDLPSGDAEG-AGFALRKSDICEPFREPGAGEDEK---GAEKARVVTPAALVAVASPTLLAVSRASEP--RPRQVLTPAQSFRDGERTRAPLEPEAATGAAGAPEMQSVLEVWSCAKTPYPRCPFKKDGTVLLPGLRPGQVVEGMCWVSPEAGDERGAAVSGHCLCVSVGGSVTVLARERLRHDSVSSGPEPAPGSGRSRWSPVFRVANPSSLLTCRTAGLRDFCQGLMAKYQRNLALSLVRTRGTKRLPATAAGAGMETVEEASKE---APATVVAPSHSPPAPLPLLRDEIAGLRG-TTQEGRRLEDWHPESLVALWCTSLMADGG-AKRGPRWEKGRERTLAVLHWVSKDA-GLECGGGVVAGTVVPLLAHGSGALDAPR-----AMEGD--LGRLKACFASYLKYRLEDEKIPEAGAEEGVPTGSGHGSFTSASSARRPPHASTRHFVKSAVSRDAT-PASARHVADTAAAAGVPRRLLNLSGAELSALSALLDCALGH-DVGGSAAVPAPSARPTGAGAGDTAAALFSKAPLSPPAPASSSALPGLRGGDKMP-TAVIATTNHMDDYASVFLLARGLRARLVPVGGAGGGGSKG------VTAEIASSAALAMLLAPNDTQSEVLEMLCPKSGGGTVSP---SPGLTWGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAAAMFFAALGKETKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGER
LEKLLAHDFSTPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKDLQLALVIARLVEIRDGGGISVPTKAVGSLXXXXXXXXXXXXXXXXXGRVRGSFGA-----GFHGGVXXXXXFGAGDDEVEVTAAAAHKSIGGASRKLLRDELLPVFDGESDVSSASTSSARATPLQRNPFLECVTLFWLGQPDRALRALCRGPATEHDALNDAAASAPGAKAVSTVVTDKASGLVLCNRAIDIATRPFLVSKLQQRASGSEAGAGRAGRPARSFGGGGGGFAALTNA----------------LVNAERAAALRLTTVEYLARNGMEIPALEVLGSNAGGEWAAELATRQRADETEADVPKGVAAGKVGDPDGDDFLGGVFSGFGSSAPGALAKPKAGRAAAAA--ASGELSGDMFSAFDD-PPRRTVGKKQVTXXXXXXXXXXXXXXXXXXXXTRTAPPVALANSTAXXXXXXXXXXXXXXXXXXRKAKQ----------------------------------ASAAVN----------------------------------------------------------AAAAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-------------------------ASPPPG---------------------------AAAPXXXXXXXXXXXXXXXV------PRRSPAALPAKTLATTARPRLSSPIALXXXXXXXXXXXXXXGPL-VALSSPGSGTRTPGS--AQMSVSSPGGSSIGG-----GPTSPRASSYGYFEGEEPSWERYDKHVSGRDGGDXXXXXXXVDDDANGMARKREGPGKARNSDDADPPQRWDVWAEDLAYEAATKRLLRELRHLFAGLDALAPTAPNMAFALESLGT--TTAVPSGTTDGAAK----PGLLEGIRAVAGGLCEAHGLSARALTASAAGALSHASAGGSRRLGALCLLLNALDRPVAVDAAVCAAAGRLVQCCSVHGQAHLEDLSLRAAHARALGTEAEAAKEEAKAKAVLEAEGTDGDTVGAGGEGWNAAPRTESKRPA-GSFITAAARVAEAEAAEAAGGLGLENLRPHLSELRAAAFQLELCLALHARGCLRLSSRARAEAAVGFRAGLLVEALGRRYDVTAALVACPPSSVEADEPRLSDAQAVGEAFSAGGDE--------NGRGQGT-AWLDTPNADLLDALSR 3097
            MEWSHTHFDGYELLAVPS+SPASIV+LDSSALVLVQRVLLP+GDA PRALTWQPGGASTGCSSGAL+CIVSTRGRGPELAVFTP+GVYLSSRVRYSWSCAARVPLSPLHKG   DRILLSWS RGPLVVADQAISVWSNN AVPSANANTFSSD+S XXXXXXXXXX                 S FGPRSVYRNSLRLSRKWVRKGG                    +  R QR P FVALATSPCGCMFAAA +G+++VTVWLRRR++P   KPW   G  G                    E      GAWNFQPATVLNNDGPLVQVVWGRGPGAQQDYLLTLG +GSAHVW+EQQWEDR+SMPT SGT+KF +TATIK+WYGD+LRLKSVGFVKWAYFGSHSR LVRD GAASG SAYHLLHQQ+QAA+GGARPYRSDNWIVGCGDDGRRFLW LKDMPLPDGRT+PEPVEVASLPFSWLVGV  G A  XXXXXXXX    S+A APL+SAVGYYDAQGADGAP EVQVVA AAGGT+AV+TARVHR  +R+RYVFRNTRSV LP+G+   AGV    Q          A G A  R RV   +  E   HPNLPLLAR+E G        G  VEVHSWVNGGSHLQCIPCKPRG   G  QPGHTAGAEPG  D+     +V++S SDS       G + A     APV GEL +KVRSGDEL+FVPYALPS+TL CRS  E    G+  P  DA    CG            + HAMWMSDWVGAEALPPALAV+D   R HVFELDDGASS                       G  L A  GG  +TG     L+G LGKRTPTKSSTGSLASLSN S  L MKDL  Q G ++   S  ERTSRRAR +F G + + DRPPVEKTVVLPLDSKYGLGLTLAF+G++V VSTFV+HPDT+EMLPAEASGRMRPGD+LVDINGHSVEGLNAEEAT+LIRQARRE+MEGGAVSLELTFREA PA      VR   S+SGSP+PRTGWYS+K++ RGS RGSAKG T AGWASTDADQHALRW+ IG  DEVPAFETC LLPWL   VAA GG G  G+    +   PA  +RW EIGVYPPGD+RR REALLLGFQPA++        G VVALLV+AA   GGVSV E+CR+PL AGQQP+SFN EKTPPG    GRA  EAVCEDG VRRWC+AR LP GD E    F L   D+C+PF     G +E    G   +   TPA L+AVASP LLAV+R++E   +P++ LTPA+SFR+       L       AA   E  + +EVWSC+ TPYP   FKKDGTV LPG+   Q VEGMCWVSPEAGD+RGAAVSGHCLCVSV GSVT+LARER  H +V  G     G     WSPVFRVA+PSSLL CRTAGLRDFCQGLMAKYQRNLAL+LVRTRG K LPA                   A  +   PS  PP PLPLLRDEIAGLRG  T++GR LEDWHPESL ALWCTSL+A GG AK G RWEKGRERTL+VL WVSKDA G E G  VVAGTVVPLL HG             A +GD  LG LKAC ASYL  RLE E     G    VP  +G G FT+A   RRP     RH VKS  + D   PASAR VADTA AAGVPRRLL+LSGAELSALSALLD ALG  D  G +   APSA    AG  +TAAALFSKA  SPPA +SS+ +PG+R G  MP TAVIATTNHMDDYA+VFLL RGLR RL   G + GGGS+       V A IASSAALAMLLAP+ TQ EVLE+LCPK+GGG V+     PGL W DASAMLLPLWVRDA ELQRV E+VASATFLQDRDLMAAAMFFAALGKE KLLALAKADRGFVGQRN+ GGEGD+NS SGR QGAT 
G+RLEKLLAHDFS+PRGRS AEKNAY+LLRKRKFKSAAAVFLLPRPAM+KEALQVILMHVKDLQLAL+IARLVEIRDGGG     +  GSL      XXXXXXXXXXX              G +G  XXXXX             AA+KSIGGASRKLLRDELLPVFDGE+           ++PLQRNPFLEC TLFWLGQPDRALRALCRG           AA A G    STV  D  SGL LCNRAID+A RPFLVSKL+Q  S         G+PAR            T A                L+NA RAAA+RL T E+LARNGMEI ALEVLGS+AGG WAAELA  Q A       P+  AAG  G     D L G   G G  APG L KPKAG  AAAA  ASGELSGDMF  FD  PPRR        XXXXXXXXXXXXXXXXXXXX         A +  XXXXXXXXXXXXXXXXXX                                       ++AA                                                            AAAA XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX      XXXXXXXXXXXXXXXXXXXXXXXXXX      XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                         A+ P                             AAA  XXXXXXXXXXXXXX       P+R PAA       T  +PRLSSP A XXXXXXXXXXXXXX P  +  S PG   RTPG   +Q  V +P  +  GG     G TSPR+SSYGYFEGEEPSWE YD+HV+ + G             A G   +   P +   S  A  PQ+WDVWAEDLAYEAATKRLLRELRHL  GLDALAPTAPNMAFALESLG   T A  SG   GAA     P LLE +RAVA  LCEAH LSA AL A+AAG LSHASAGGSRRLGALCLLL AL RPVAVDAAVCAAAGRL+QCCS HGQAHLEDL LRAAH RALG EA         K +   E  +     A   G++A   +ES  PA GSF+TAA        AEAAGGLGLENLRPHLSELRAAAFQLELCLALHARGCLRLSSRARAEAA+GFRAGLLVE LGRRYDVTAALV CPP+S EADEPRLSD+Q   EA S GG          + R +G  AW+ TPNA+LLDALSR
Sbjct:    1 MEWSHTHFDGYELLAVPSESPASIVVLDSSALVLVQRVLLPEGDAAPRALTWQPGGASTGCSSGALSCIVSTRGRGPELAVFTPDGVYLSSRVRYSWSCAARVPLSPLHKGGGSDRILLSWSCRGPLVVADQAISVWSNNPAVPSANANTFSSDESDXXXXXXXXXXXXXXXXXXXXXXXXXXXSPFGPRSVYRNSLRLSRKWVRKGGVS------------------SDRRLQRPPGFVALATSPCGCMFAAAYKGTDEVTVWLRRRSMPGYGKPW---GANGRGDARNGSAAPVAGLGAEGGEDATDDDGAWNFQPATVLNNDGPLVQVVWGRGPGAQQDYLLTLGENGSAHVWVEQQWEDRYSMPTASGTIKFAQTATIKAWYGDSLRLKSVGFVKWAYFGSHSRHLVRDKGAASGGSAYHLLHQQDQAAEGGARPYRSDNWIVGCGDDGRRFLWRLKDMPLPDGRTVPEPVEVASLPFSWLVGVSRGAATAXXXXXXXXKEW-SAAAAPLVSAVGYYDAQGADGAPAEVQVVAPAAGGTIAVITARVHRADSRSRYVFRNTRSVALPMGQV--AGVPDPPQ---------EAGGGAGKRARVAGRAATEAVCHPNLPLLARLEGG--------GCAVEVHSWVNGGSHLQCIPCKPRGGGGGRLQPGHTAGAEPGNADKR----LVIVSRSDSGCCDSSDGSSDA-KGRFAPVLGELAQKVRSGDELSFVPYALPSVTLACRSTDENTGGGDSSP--DA----CGGRGGAA------IEHAMWMSDWVGAEALPPALAVIDGEKRLHVFELDDGASSS----------------------GENLDANGGGGVSTGDGLSSLVGMLGKRTPTKSSTGSLASLSNASRDLHMKDLVGQDGKKERSPSPQERTSRRARVDFGGPALRDDRPPVEKTVVLPLDSKYGLGLTLAFEGSRVNVSTFVRHPDTSEMLPAEASGRMRPGDELVDINGHSVEGLNAEEATALIRQARRENMEGGAVSLELTFREATPAANLKR-VRRPSSKSGSPKPRTGWYSKKSRARGSGRGSAKGSTAAGWASTDADQHALRWHPIGSSDEVPAFETCVLLPWLGSGVAALGGVGP-GAPSGAL---PAGGNRWTEIGVYPPGDERRVREALLLGFQPAESG-------GSVVALLVKAAPGVGGVSVVELCRAPLGAGQQPLSFNLEKTPPGV--RGRALAEAVCEDGRVRRWCIARSLPRGDDERETSFTLSTVDLCDPFGASSTGGEEDPDDGVGVSNRATPAPLIAVASPNLLAVARSTERPWKPQRALTPARSFRESSVRSEEL-------AAETTEPPASIEVWSCSDTPYPLRRFKKDGTVALPGI---QAVEGMCWVSPEAGDDRGAAVSGHCLCVSVAGSVTILARERWSH-AVLPGLWGGGGEEGWVWSPVFRVASPSSLLACRTAGLRDFCQGLMAKYQRNLALALVRTRGVKPLPAXXXXXXXXXXXXXXXXXXMARTSSSGPSR-PPFPLPLLRDEIAGLRGGATRDGRHLEDWHPESLAALWCTSLVAGGGGAKEGHRWEKGRERTLSVLRWVSKDAEGSESG--VVAGTVVPLLEHGGXXXXXXXXXXDGASKGDDHLGSLKACLASYLSCRLEAENAATDGRGSPVPGSNGSG-FTAA--VRRP-----RHSVKSPAAGDGPIPASARDVADTATAAGVPRRLLSLSGAELSALSALLDVALGQADASGPSPAVAPSAARPAAG-DNTAAALFSKA-SSPPA-SSSTDIPGVRSGSTMPPTAVIATTNHMDDYANVFLLTRGLRLRL---GDSAGGGSENASAAAAVEAGIASSAALAMLLAPSGTQKEVLEVLCPKAGGGAVAAVATGPGLAWSDASAMLLPLWVRDAAELQRVAESVASATFLQDRDLMAAAMFFAALGKEKKLLALAKADRGFVGQRNLGGGEGDINSTSGRSQGATQGQR
LEKLLAHDFSSPRGRSAAEKNAYVLLRKRKFKSAAAVFLLPRPAMVKEALQVILMHVKDLQLALLIARLVEIRDGGGAVTSPRTTGSLGGFSGVXXXXXXXXXXXXXXXXXXXXXXXXXGLNGXXXXXXXX---XXXXXXXXXAAYKSIGGASRKLLRDELLPVFDGETGDGPTQGQQRSSSPLQRNPFLECATLFWLGQPDRALRALCRGLVASGHQGGSYAAPAGGIS--STVAADNLSGLGLCNRAIDVAMRPFLVSKLRQIES--------RGKPARDR---------STVAXXXXXXXXXXXXGDPLLINARRAAAVRLATAEHLARNGMEISALEVLGSDAGGGWAAELAEWQLARRAS---PENGAAGGDG-----DVLSGGGGGAGFKAPGTLMKPKAGTIAAAADSASGELSGDMFGGFDAAPPRRKASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTAAAATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTAAATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTAAAATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTAAATXXXXXXXXXXXXXXXXXXXXXXXXXXTAAAATXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXATAPXXXXXXXXXXXXXXXXXXXXXXXXXXRRTAAASTXXXXXXXXXXXXXXXXXXXXGPKRGPAA------TTPPKPRLSSPTAXXXXXXXXXXXXXXXXPPPITASPPGGHDRTPGGGGSQAGVGNPAAAGEGGVAGGLGRTSPRSSSYGYFEGEEPSWEEYDRHVAWQPG------DVESGPSAKGGGEQDTSPLEPTPSSSATFPQQWDVWAEDLAYEAATKRLLRELRHLLGGLDALAPTAPNMAFALESLGAEGTGAGSSGDAAGAAAGKPPPALLESVRAVAESLCEAHSLSAGALAAAAAGTLSHASAGGSRRLGALCLLLGALGRPVAVDAAVCAAAGRLLQCCSAHGQAHLEDLGLRAAHGRALGAEA---------KLLAREEAREAQDAAAAAAGFDATNLSESAGPAAGSFVTAATGAE----AEAAGGLGLENLRPHLSELRAAAFQLELCLALHARGCLRLSSRARAEAAMGFRAGLLVETLGRRYDVTAALVGCPPASPEADEPRLSDSQ-ESEALSPGGXXXXXXXXXXSDRDEGRRAWVGTPNAELLDALSR 3164          
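The percentages in the HSP header above follow directly from the residue counts: each is the number of matching columns divided by the alignment length, which includes gap columns and can therefore exceed the query length. A minimal sketch of that arithmetic, using the counts reported for D7G824_ECTSI (the helper name `percent` is illustrative and not part of any BLAST tool):

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100.0 * matches / alignment_length, 2)


# Counts copied from the HSP header for D7G824_ECTSI.
identities, positives, aligned_columns = 2099, 2275, 3342

print(percent(identities, aligned_columns))  # 62.81
print(percent(positives, aligned_columns))   # 68.07
```

The same function reproduces the figures for the other HSPs on this page, for example `percent(70, 202)` for K0SS84_THAOC.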
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: A0A6H5K928_9PHAE (PDZ domain-containing protein n=1 Tax=Ectocarpus sp. CCAP 1310/34 TaxID=867726 RepID=A0A6H5K928_9PHAE)

HSP 1 Score: 2567 bits (6653), Expect = 0.000e+0
Identity = 2002/3376 (59.30%), Positives = 2191/3376 (64.90%), Query Frame = 0
Query:    1 MEWSHTHFDGYELLAVPSDSPASIVLLDSSALVLVQRVLLPQGDAPPRALTWQPGGASTGCSSGALACIVSTR----------------------GRGPELAVFTPNGVYLSSRVRYSWSCAARVPLSPLHKGA-CDRILLSWSSRGPLVVADQAISVWSNNAAVPSANANTFSSDDSXXXXXXXXXXXSAGEAPAGKRASYSST--FGPRSVYRNSLRLSRKWVRKGGXXXXXXXXXXXXXNGKAFHGTPTRQQRYPNFVALATSPCGCMFAAACEGSEDVTVWLRRRNVPPEDKPWETYGGGGXXXXXXSEGMVAXXXXXXX--SEGDARGAGAWNFQPATVLNNDGPLVQVVWGRGPGAQQDYLLTLGGDGSAHVWLEQQWEDRFSMPTESGTVKFVKTATIKSWYGDNLRLKSVGFVKWAYFGSHSRDLVRD-GGAASGSAYHLLHQQEQAADGGARPYRSDNWIVGCGDDGR-----------------RFLWSLKDMPLPDGRTIPEPVEVASLPFSWLVGVCGGGAPXXXXXXXXXXXXRSSAMAPLISAVGYYDAQGADGAPTEVQVVAAAAGGTVAVMTARVHRVKARNRYVFRNTRSVVLPLGRTSAAGVSSGRQLNGDGGSNGSAAGSARSRKRVKPSLPEGFPHPNLPLLARVERGEGDGGGEGGWTVEVHSWVNGGSHLQCIPCKPRGNATG--QPGHTAGAEPGQVDEARPQGMVLLSSSDSVVDXXXXGKAAAGSDGVAPVFGELTRKVRSGDELAFVPYALPSLTLECRSRHERIADGEDGPCLDAAVCYCGAXXXXXXXXXXRVRHAMWMSDWVGAEALPPALAVVDTLSRFHVFELDDGASSFSSXXXXXXXXXXXXXXXXGVDVGSALPAESGGRF--------NTGGLLGL-GKRTPTKSSTGSLASLSNTSGHLVMKDLPQQAGWRDGRNSSPERTSRRARAEFAGRSFKGDRPPVEKTVVLPLDSKYGLGLTLAFQGN---------------KVEVSTFVKHPDTTEMLPAEASGRMRPGDQLVDINGHSVEGLNAEEATSLIRQARRESMEGGAVSLELTFREAAPAPGAATPVRSGGSRSGSPRPRTGWYSRKAKLRGS--------------------------------------------------------ARGSAKGL----TGAGWASTDADQHALRWNLIGRDEVPAFETCTLLPWLAQNVAAGGGEGAAGSAENEVSSSPASVSRWAEIGVYPPGDDRRRREALLLGFQPAKASSKSGDGEGYVVALLVEAA--AGGVSVAEVCRSPLEAGQQPVSFNREKTPPGAPANGRATVEAVCEDGWVRRWCVARD-LPSGDAEGAGFALRKSDICEPFREPGAG---EDEKGAEKARVVTPAALVAVASPTLLAVSRASEP--RPRQVLTPAQSFRDGERTRAPLEPEAATGAAGAPEMQSVLEVWSCAKTPYPRCPFKKDGTVLLPGLRPGQVVEGMCWVSPEAGDERGAAVSGHCLCVSVGGSVTVLARERLRHDSVSSGPEPAPGSGRSRWSPVFRVANPSSLLTCRTAGLRDFCQGLMAKYQRNLALSLVRTRGTKRLPATAAGAGMETVEEASKEAPATVVAPSHS-----PPAPLPLLRDEIAGLRG-TTQEGRRLEDWHPESLVALWCTSLMADGG-AKRGPRWEKGRERTLAVLHWVSKDA-GLECGGGVVAGTVVPLLAHGSGALDAPR-----AMEGD--LGRLKACFASYLKYRLEDEKIPEAGAEEGVPTGSGHGSFTSASSARRPPHASTRHFVKS-AVSRDATPASARHVADTAAAAGVPRRLLNLSGAELSALSALLDCALGH-DVGGSAAVPAPSARPTGAGAGDTAAALFSKAPLSPPAPASSSALPGLRGGDKMP-TAVIATTNHMDDYASVFLLARGLRARLVPVGGAGGGGSK-----GVTAEIASSAALAMLLAPNDT
QSEVLEMLCPKSGGGTVSP---SPGLTWGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAA--------AMFFAALGKETKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGERLEKLLAHDFSTPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKDLQLALVIARLVEIRDGGGISVPT-KAVGSLXXXXXXXXXXXXXXXXXGRVRGSFGAGFHGGVXXXXXFGAGDDEVEV--TAAAAHKSIGGASRKLLRDELLPVFDGESDVSSASTSSARATPLQRNPFLECVTLFWLGQPDRALRALCRG-PATEHDALNDAAASAPGAKAVSTVVTDKASGLVLCNRAIDIATRPFLVSKLQQRASGSEAGAGRAGRPARSFGGGGGGFAALTNALVNAERAAALRLTTVEYLARNGMEIPALEVLGSNAGGEWAAELATRQRADETEADVPKGVAAGKVGDPDGDDFLGGVFSGFGSSAPGALAKPKAGRAAAAAASGELSGDMFSAFDDPPRRTVGKKQVTXXXXXXXXXXXXXXXXXXXXTRTAPPVALANSTAXXXXXXXXXXXXXXXXXX--------------------------------------------------------------------RKAKQASAAVNAAAAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXASPPP-------------------------------------------------GAAAPXXXXXXXXXXXXXXXVPRRSPAALPAKTLATTARPRLSSPIALXXXXXXXXXXXXXXG-PLVALSSPGSGTRTPGSAQMSVSSPGGSSIGGGPTSPRASSYGYFEGEEPSWERYDKHVSGRDGGDXXXXXXXVDDDANGMARKREGPGKARNSDDADPPQRWDVWAEDLAYEAATKRLLRELRHLFAGLDALAPTAPNMAFALESLGTTTAVPSGTTDGAA-------KPGLLEGIRAVAGGLCEAHGLSARALTASAAGALSHASAGGSRRLGALCLLLNALDRPVAVDAAVCAAAGRLVQCCSVHGQAHLEDLSLRAAHARALGTEAEA-AKEEAKAKAVLEAEGTDGDTVGAGGEGWNAAPRTESKRPA-GSFITAAARVAEAEAAEAAGGLGLENLRPHLSELRAAAFQLELCLALHARGCLRLSSRARAEAAVGFRAGLLVEALGRRYDVTAALVACPPSSVEADEPRLSDAQAVGEAFSAGG 3072
            MEWSHTHFDGYELLAVPS+SPASIV+LDSSALVLVQRVLLP+GDA PRALTWQPGGASTGCSSGAL+CIVSTR                      GRGPELAVFTP+GVYLSSRVRYSWSCAARVPLSPLHKG   DRILLSWS RGPLVVADQAISVWSNN AVPSANANTFSSD+S      XXXXX     P  +  S SS+  FGPRSVYRNSLRLSRKWVRKGG                    +  R    P FVALATSPCGCMFAAA +G+++VT+WLRRR++P   KPW     G         G  A         +E      GAWNFQPATVLNNDGPLVQVVWGRGPGAQQDYLLTLG +G                                       RLKSVGFVKWAYFGSHSR LVRD G AA GSAYHLLHQQ+QAA+ G RPYRSDNWIVGCGDDGR                 RFLW LKDMPLPDGRT+PEPVEVASLPFSWLVGV    A XXXXXXXXXX   S+A APL+SAVGYYDAQGADGAP EVQVVA AAGGT+AV+TARVHR  +R+RYVFRNTRSV LP+G+   AGV    Q  G        +G+ +  K V  +  E   HPNLPLLAR+E G        G  VEVHSWVNGGSHLQCIPCKPRG   G  QPGHTAGAEPG  D+     +V++S SDS       G   A     A V GEL  KVRSGDEL+FVPYALP++TL CRS       G+  P  DA    CG            + HAMWMSDWVGAEALPPALAV+D   R HVFELDDG SS                       G  L A  GG              L G+ GKRTPTKSSTGSLASLSN S  L MKDL  Q G ++   S  ERTSRRAR +F G + + DRPPVEKTVVLPLDSKYGLGLTLAF+G+               KV+VSTFV+HPDT+EMLPAEASGRMRPGD+LVDINGHSVEGLNAEEAT+LIRQARRESMEGGAVSLELTFREAAPA  AA  VR   S+SGSP+PRTGWYS+K++ RGS                                                        ARGS        T AGWASTDADQHALRW+ IG DEVPAFETC LLPWL   VAA GG G  G+    +   P   +RW EIGVYPPGD+RR REALLLGFQPA++        GYVVALLV+AA   GGVSV E+CR+PL AGQQPVS N EKTPPG    GRA+ EA+CEDG VRRWC+AR     GD +   FAL   D+C+PF     G   + + G   ++  +PA LVAVASP LLAV+R+ E   +P++ LTPA+SF +       L       AA   E  + +EVWSC+ TPYP   FKKDGTV+LPG+   Q VEGMCWVSPEAGD+RGAAVSGHCLCVSV GSVTVLARER  H ++  G   A G     WSPVFRVA+PSSLL CRTAGLRDFCQGLMAKYQRNLAL+LVRTRG K LPA          +  ++  P   +A + S     PP+PLPLLRDE+AGLRG  T+EGR LEDWHPESL  LWCTSL+A GG AK G RWEKGRERTL+VL WVSKDA G E G  VVAG VVPLLAHG             A +GD  L RLKAC  SYL  RLE+E     G   G P    +G                RH VKS A +    PASAR VA     AGVPRRLL+LSGAELSAL ALLD ALG  D  G +   APSA    AG  +TAAALFSK  +S P+P+SS+ +PG+R G  MP TAVIATTNHMDDYA+VFLLARGLR RL   G + GGGS+     G+ A IASSAALAMLLAP+ 
TQ EVLE+LCPK+GGG V+     PGL W DASAMLLPLWVRDA ELQRV E+VASATFLQDRDLMAA        AMFFAALGKE KLLALAKADRGFVGQRN+SGGEGD+NS SGR QGAT G+RLEKLLAHDFS+PRGRS AEKNAY+LLRKRKFKSA AVFLLPRPAM+KEALQVILMHVKDLQLAL+IARLVEIRDGGG +V + +A GSL       XXXXXXXXXX               XXXXX             AAAA+KSIGGASRKLLRDELLPVFDGE++          +  LQRNPFLEC TLFWLGQPDRALRALCRG  A+ H    D + +AP     STV  D  SGL LCNRAIDIA RPFLVSKL+Q  S  +  AG     A +  GGGG +AA  + L+NA RAAA+RL T E+LARNGMEI ALEVLGS+AGG WAAELA RQ A  T    P+  AAG  G     D + G   G G  APG L KP AG +AAA                            XXXXXXXXXXXXXXXXXXXX         + + AXXXXXXXXXXXXXXXXXX                                                                    RKA            XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX      XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                                                       AAA XXXXXXXXXXXXXXX P+R  AA       T  +PRLSSP   XXXXXXXXXXXXX   P    S P +G              GG + G   TSPR+SSYGYFEGEEPSWE YD+HV GR  GD        +  A G   +++ P +A  S     PQ+WDVWAEDLAYEAATKRLLRELRHL  GLDALAPTAPNMA ALESLG      +G+  GAA        P LLE +RAVA  LCEAHGLSA AL A+AAGALSHASAGGSRRLGALCLLL AL RPVAVDAAVCAAAGRL+QCC+ HGQAHLEDL LRAAH RALG EA+  A+EEA+      A G D   +             ES RPA GSF+TAA     A  AEAAGGLGLENLRPHLSELRAAAFQLELCLALH RGCLRLSSRARAEAA+GFRAGLLVE LGRRYDVT ALV CPP+S EADEP L D+    EA S GG
Sbjct:    1 MEWSHTHFDGYELLAVPSESPASIVVLDSSALVLVQRVLLPEGDAAPRALTWQPGGASTGCSSGALSCIVSTRACSCLCFVAIVRLPPPSPSNPAGRGPELAVFTPDGVYLSSRVRYSWSCAARVPLSPLHKGGGSDRILLSWSCRGPLVVADQAISVWSNNPAVPSANANTFSSDESDSSDCGXXXXXXXXXXPKKQAPSSSSSSPFGPRSVYRNSLRLSRKWVRKGGVS------------------SDRRLHPPPGFVALATSPCGCMFAAAYKGTDEVTIWLRRRSMPGYGKPW-----GADDRGDARNGSAAPVAGLRAEGAEDATEDDGAWNFQPATVLNNDGPLVQVVWGRGPGAQQDYLLTLGENG---------------------------------------RLKSVGFVKWAYFGSHSRHLVRDKGAAAGGSAYHLLHQQDQAAEEGVRPYRSDNWIVGCGDDGRWESGPGVSLDSRRVWLARFLWRLKDMPLPDGRTVPEPVEVASLPFSWLVGVSHSAATXXXXXXXXXXKEWSAAAAPLVSAVGYYDAQGADGAPAEVQVVAPAAGGTIAVITARVHRADSRSRYVFRNTRSVALPMGQV--AGVPDPPQEAG--------SGAGKRPKVVGLAGTEAVCHPNLPLLARLEGG--------GCAVEVHSWVNGGSHLQCIPCKPRGGDGGRPQPGHTAGAEPGNADKK----VVIVSRSDSGCCDSSDGSGDA-KGRFALVLGELAHKVRSGDELSFVPYALPTVTLACRSMDGSTGGGDSSP--DA----CGGSGRAA------IEHAMWMSDWVGAEALPPALAVIDGEKRLHVFELDDGVSSS----------------------GEILDANGGGXXXXXXXXGDGLSSLAGMPGKRTPTKSSTGSLASLSNASRDLHMKDLVGQDG-KERPPSPQERTSRRARVDFGGPAVRDDRPPVEKTVVLPLDSKYGLGLTLAFEGSQGKNYSVSGSTPPPIKVKVSTFVRHPDTSEMLPAEASGRMRPGDELVDINGHSVEGLNAEEATALIRQARRESMEGGAVSLELTFREAAPAANAAR-VRRHSSKSGSPKPRTGWYSKKSRARGSGXXXXXXXTAAGWASTDADQHALRWHPIGSDEVPAFETSKSGSPKPRTGWYSKKARARGSGXXXXXXSTAAGWASTDADQHALRWHPIGSDEVPAFETCVLLPWLGSGVAALGGVGP-GAPSGAL---PGGGNRWTEIGVYPPGDERRVREALLLGFQPAESG-------GYVVALLVKAAPGVGGVSVVELCRAPLGAGQQPVSLNLEKTPPGV--QGRASAEALCEDGRVRRWCIARSRYCGGDEQEKSFALSTVDLCDPFGASSTGGVEDPDDGVGVSKTSSPARLVAVASPNLLAVARSVERPWKPQRALTPARSFPESSVRSEEL-------AAETTEPPASVEVWSCSDTPYPLRRFKKDGTVVLPGI---QAVEGMCWVSPEAGDDRGAAVSGHCLCVSVAGSVTVLARERPSH-AILPGLGGAGGEEGLVWSPVFRVASPSSLLACRTAGLRDFCQGLMAKYQRNLALTLVRTRGVKPLPAXXXXXXXXXXDTVAQGGPYPAMARTSSSGPSRPPSPLPLLRDEMAGLRGGATREGRHLEDWHPESLATLWCTSLVAGGGCAKEGRRWEKGRERTLSVLRWVSKDAQGSESG--VVAGAVVPLLAHGXXXXXXXXXXXXXASKGDDHLARLKACLVSYLSCRLEEENAVTDG--RGSPVAGSNGXXXXXXXXXXXXX--LRHSVKSPAAAGGPIPASARDVAXXXXXAGVPRRLLSLSGAELSALFALLDVALGQADASGPSPAVAPSAARPAAG-DNTAAALFSK--VSSPSPSSSTDIPGVRSGSTMPPTAVIATTNHMDDYANVFLLARGLRLRL---GESAGGGSEDAAAAGLEAGIASSAALAMLLAPSGT
QKEVLEVLCPKAGGGVVAAVATGPGLAWSDASAMLLPLWVRDATELQRVAESVASATFLQDRDLMAARTPAGCWAAMFFAALGKEKKLLALAKADRGFVGQRNLSGGEGDINSTSGRSQGATQGQRLEKLLAHDFSSPRGRSAAEKNAYVLLRKRKFKSAVAVFLLPRPAMVKEALQVILMHVKDLQLALLIARLVEIRDGGGGAVASPRASGSLGGFSGVSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGGNAAAAYKSIGGASRKLLRDELLPVFDGETEHGPTQGQQRSSFRLQRNPFLECATLFWLGQPDRALRALCRGLVASGHQ---DGSYAAPAGGFSSTVAADNLSGLGLCNRAIDIAMRPFLVSKLRQIESRGKP-AGDRSTVAAAVAGGGGDWAA-GDPLINARRAAAVRLATAEHLARNGMEISALEVLGSDAGGGWAAELAERQLARRT---GPENGAAGGDG-----DVVSGGGRGAGFKAPGTLMKPLAGASAAAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTAAAXXXXXXXXXXXXXXXXXXXXXXXXXXXXATTAXXXXXXXXXXXXXXXXXXXXXPKAATXXXXXXXXXXXXXXXXXXXXXXXXXXXRRKASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPPSAASXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSAAAAAXXXXXXXXXXXXXXXXGPKRGAAA------TTPPKPRLSSPTXXXXXXXXXXXXXXXDASPPTTASPPAAG-----------GGEGGVAGGLARTSPRSSSYGYFEGEEPSWEEYDRHV-GRQPGDVES-----EPSAKGGG-EQDTPLEATPSSWGAHPQQWDVWAEDLAYEAATKRLLRELRHLLGGLDALAPTAPNMAVALESLGAE-GTGAGSNGGAAGADAGKPPPALLESVRAVAESLCEAHGLSAGALAAAAAGALSHASAGGSRRLGALCLLLGALGRPVAVDAAVCAAAGRLLQCCNAHGQAHLEDLGLRAAHGRALGAEAKLLAREEAREAQDAAASGLDATNL------------PESARPAAGSFVTAA----PAAEAEAAGGLGLENLRPHLSELRAAAFQLELCLALHERGCLRLSSRARAEAAMGFRAGLLVETLGRRYDVTVALVGCPPASPEADEPSLFDSH-ESEALSPGG 3164          
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: K0SS84_THAOC (Rav1p_C domain-containing protein (Fragment) n=1 Tax=Thalassiosira oceanica TaxID=159749 RepID=K0SS84_THAOC)

HSP 1 Score: 97.1 bits (240), Expect = 3.040e-16
Identity = 70/202 (34.65%), Positives = 105/202 (51.98%), Query Frame = 0
Query: 1811 IASSAALAMLLAPNDTQSEVLEMLCPKSGGGTVSPSPGLTWGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAAAMFFAALGKETKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGERLEKLLAHDFSTPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKDLQLALVIARLVE 2012
            +AS+A L+ L+  +D+Q ++++   P              W  A A+ LP WVR    L  V E +A   +   +++M  A+++ A+    KL A+A  DR   G++                        L+ ++ HDFS+ RGR  AEKNAY LLRKRK+  AA+ FLL  P M+K A+ VI M ++D  LA ++ARLVE
Sbjct:   51 VASAAILSALM--SDSQPKLIDACRPAD--------EKYDWASARAIGLPYWVRSDKALASVAEEIAQTIYKSTKNVMDCALYYIAMRNMKKLRAIAATDRSLSGKKF-----------------------LKFIMDHDFSSDRGRKAAEKNAYSLLRKRKYAHAASFFLLAEPPMIKTAIDVITMQLQDPSLAFMVARLVE 219          
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: A0A7S2EUC3_9STRA (Hypothetical protein (Fragment) n=1 Tax=Ditylum brightwellii TaxID=49249 RepID=A0A7S2EUC3_9STRA)

HSP 1 Score: 97.8 bits (242), Expect = 3.540e-16
Identity = 74/221 (33.48%), Positives = 110/221 (49.77%), Query Frame = 0
Query: 1811 IASSAALAMLLAPNDTQSEVLEMLCPKSGGGTVSPSPGLTWGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAAAMFFAALGKETKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGERLEKLLAHDFSTPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKDLQLALVIARLVE----IRDGGGISVPTKAVG 2027
            IAS+  L+ LL  +++Q+ +L   C         P   + W  A  +LLP W+R    L++ +E +A + F Q RD++  A+F+  +     +  +A  D                NS SGR       +  +    +DF++ RG+  AEKNAY LLRKRK+ +AA  FLLP P ML  AL VI+  +KDL LA  +ARL+        G G+   T  +G
Sbjct: 1075 IASAGCLSALL--SNSQTNLLSACC--------KPGEKMDWSTARGILLPFWLRSDECLRKTSEEIAQSIFKQKRDILECALFYVIIRNTRSMRNMAATD----------------NSESGR-------KFFKFTTNYDFTSERGKKAAEKNAYSLLRKRKYLAAATFFLLPEPPMLNTALDVIVTKMKDLSLAFFVARLINNPICASSGNGLGGNTLTIG 1262          
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: A0A4D9D482_9STRA (Uncharacterized protein n=1 Tax=Nannochloropsis salina CCMP1776 TaxID=1027361 RepID=A0A4D9D482_9STRA)

HSP 1 Score: 97.1 bits (240), Expect = 7.420e-16
Identity = 74/194 (38.14%), Positives = 101/194 (52.06%), Query Frame = 0
Query: 1820 LLAPNDTQSEVLEMLCPKSGGGTVSPSPGL-TWGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAAAMFFAALGKETKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGERLEKLLAHDFSTPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKDLQLALVIARLVE 2012
            LLA N+T+  ++E       GG+     G  TW     + LPLW R  + L+ + E +A+A F + +D M AA+ + A G+ T L  LAKA R                            + L +LL  D ++ RGR V EKNAY LLR R + SAAA+FLLP P  L   L V++ H+KD  LAL++ARLVE
Sbjct: 1687 LLARNNTR--LVETAVEGREGGSGGRQGGAWTWEAVRDLYLPLWARSVVPLRHLAETMAAAAFRRSQDPMEAALLYLAAGRRTTLSNLAKAAR-------------------------EDKKALARLLQFDLNSARGRQVVEKNAYHLLRHRAYLSAAALFLLPDPPQLTLCLDVLVRHMKDPLLALLVARLVE 1853          
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: A0A7S2KBJ0_9STRA (Hypothetical protein n=2 Tax=Skeletonema marinoi TaxID=267567 RepID=A0A7S2KBJ0_9STRA)

HSP 1 Score: 95.1 bits (235), Expect = 2.540e-15
Identity = 60/164 (36.59%), Positives = 91/164 (55.49%), Query Frame = 0
Query: 1850 TWGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAAAMFFAALGKETKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGERLEKLLA-HDFSTPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKDLQLALVIARLVE 2012
            +W  A ++ LP W+R   EL  + E +A   +   R++M  A+++ A+    KL A+A  DR                        + +G++  K ++ HDFS+ RGR+ AEKNAY LLRKRK+ SAA+ FLL  P M+K A+ VI + ++D  LA  +ARL+E
Sbjct:  545 SWDSARSIGLPFWLRSQKELVSIAEEIAQTIYKDTRNVMDCALYYIAMRNMKKLKAIAATDR------------------------SESGKKFFKFISDHDFSSDRGRNSAEKNAYSLLRKRKYASAASFFLLAEPPMIKSAVDVIRLQLQDTSLAFFVARLIE 684          
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: A0A7S3V901_9STRA (Hypothetical protein (Fragment) n=2 Tax=Chaetoceros debilis TaxID=122233 RepID=A0A7S3V901_9STRA)

HSP 1 Score: 93.2 bits (230), Expect = 9.750e-15
Identity = 72/202 (35.64%), Positives = 107/202 (52.97%), Query Frame = 0
Query: 1811 IASSAALAMLLAPNDTQSEVLEMLCPKSGGGTVSPSPGLTWGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAAAMFFAALGKETKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGERLEKLLAHDFSTPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKDLQLALVIARLVE 2012
            IAS A LA LL  +D+QS +L+  C K    T        W  A ++ +P W+R    L+ ++E +A   + +  D++  A+F+   G    L  +A ADR                  SGR         L+ +  ++FS+ RGR  AEKNA+ LLRKRK+ +AAA FLL  P M+K AL +I+  ++DL LAL +ARL++
Sbjct:  511 IASGACLAALL--SDSQSHLLKA-CKKESHWT--------WDIARSLRIPFWLRSDDALKSISEEIAQTKYKESMDVVECALFYVITGNMRMLKTVAAADR----------------KTSGRTF-------LQFVTKYNFSSQRGRMAAEKNAFSLLRKRKYGAAAAFFLLAEPPMMKSALNIIVTKMEDLSLALFVARLIK 678          
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: B8CG37_THAPS (Uncharacterized protein n=1 Tax=Thalassiosira pseudonana TaxID=35128 RepID=B8CG37_THAPS)

HSP 1 Score: 93.2 bits (230), Expect = 1.060e-14
Identity = 101/363 (27.82%), Positives = 150/363 (41.32%), Query Frame = 0
Query: 1851 WGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAAAMFFAALGKETKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGERLEKLLAHDFSTPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKDLQLALVIARLVEIRDGGGISVPTKAVGSLXXXXXXXXXXXXXXXXXGRVRGSF-------GAGFHGGVXXXXXFGAGDDEVEVTAAAAHKSIGGASRKLLRDEL-LPVFDGESDVSSASTSSARAT-PLQRNPFLECVTLFWLGQPDRALRALCRGPAT---EHDALNDAAASAPGAKAVSTVVTDKASGLVLCNRAIDIATRPFLVSKLQQR 2201
            W  A A+ +P WVR    L  + E +A A +   + +M  A+++ A+     L A+A  DR   G +                        L+ ++ HDFS+ RGR  AEKNAY LLRKRK+ +AA+ FLL  P M+K AL VI   +KD+ LA ++ARL+E       ++P  A+                      + G F       G GF GG                        IGG S  L  D      +      S+ S    + T P   +  +E V L WL +P+ A+  L   P     +  ++ND A  +    A S+V   K   L   N  I+  + PFL+  ++ +
Sbjct: 1924 WVMARAVGIPFWVRSKKTLVSIAEEIAQAIYKSTKSVMDCALYYVAMRNMKTLRAIAATDRSDSGTKF-----------------------LKFIIDHDFSSERGRKAAEKNAYSLLRKRKYATAASFFLLAEPPMIKTALDVIKSQMKDITLAFMVARLMENAPKSS-AMPDDAL---------------------TIGGGFNLSSMGGGGGFAGG----------------------GPIGGTSLDLEEDGAKFSEWSPSLGPSARSVLQTKGTSPAVEDNCMESVKLLWLNRPNEAMLRLAHMPTNSVADASSINDVAVPSISGDA-SSVTNGKV--LQKTNEVINFCSGPFLLKAMEPK 2216          
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: D7G803_ECTSI (Uncharacterized protein n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D7G803_ECTSI)

HSP 1 Score: 86.3 bits (212), Expect = 1.460e-14
Identity = 52/88 (59.09%), Positives = 59/88 (67.05%), Query Frame = 0
Query: 2117 ATPLQRNPFLECVTLFWLGQPDRALRALCRG-PATEHDALNDAAASAP---GAKAVSTVVTDKASGLVLCNRAIDIATRPFLVSKLQQ 2200
            ++PLQRNPF E  TLFWLGQPDRAL  LC G  A+ H     AA +AP   G    STV     SGL LCNRAI++A RPFLVSKL+Q
Sbjct:   89 SSPLQRNPFQEFATLFWLGQPDRALWVLCYGLVASGHHDDPYAAPAAPTVEGKGVSSTVAAANLSGLDLCNRAIEVAMRPFLVSKLRQ 176          
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Match: A0A5D6XJG0_9STRA (Rav1p_C domain-containing protein n=1 Tax=Pythium brassicum TaxID=1485010 RepID=A0A5D6XJG0_9STRA)

HSP 1 Score: 90.5 bits (223), Expect = 6.810e-14
Identity = 60/168 (35.71%), Positives = 91/168 (54.17%), Query Frame = 0
Query: 1847 PGLTWGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAAAMFFAALGKETKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGERLEKLLAHDFSTPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKDLQLALVIARLVEIR 2014
            P  +W D   + L +WV++  +L+   E +A +TF + +D M+  +F+ ALGK+  L ALAK  +                        +   ++L   L HDF+  R  + A KNAY LL K++++SAAA FL+  P  L+EAL+V+   + D  LALV+ARLVE R
Sbjct: 1436 PQASWDDLRPLWLGIWVKNTKDLRAAVERLAKSTFARTKDAMSVCLFYIALGKKNVLNALAKMSK------------------------SEQSKKLAVFLDHDFAQERWCNAAVKNAYSLLSKKQYESAAAFFLVCEPPRLQEALRVLTARMADPSLALVVARLVEYR 1579          
The following BLAST results are available for this feature:
BLAST of mRNA_H-canaliculatus_F_contig1439.3431.1 vs. uniprot
Analysis Date: 2022-09-16 (Diamond blastp: OGS1.0 of Hapterophycus canaliculatus Oshoro5f female vs UniRef90)
Total hits: 25
| Match Name | E-value | Identity (%) | Description |
|---|---|---|---|
| D7G824_ECTSI | 0.000e+0 | 62.81 | PDZ domain-containing protein n=1 Tax=Ectocarpus s... |
| A0A6H5K928_9PHAE | 0.000e+0 | 59.30 | PDZ domain-containing protein n=1 Tax=Ectocarpus s... |
| K0SS84_THAOC | 3.040e-16 | 34.65 | Rav1p_C domain-containing protein (Fragment) n=1 T... |
| A0A7S2EUC3_9STRA | 3.540e-16 | 33.48 | Hypothetical protein (Fragment) n=1 Tax=Ditylum br... |
| A0A4D9D482_9STRA | 7.420e-16 | 38.14 | Uncharacterized protein n=1 Tax=Nannochloropsis sa... |
| A0A7S2KBJ0_9STRA | 2.540e-15 | 36.59 | Hypothetical protein n=2 Tax=Skeletonema marinoi T... |
| A0A7S3V901_9STRA | 9.750e-15 | 35.64 | Hypothetical protein (Fragment) n=2 Tax=Chaetocero... |
| B8CG37_THAPS | 1.060e-14 | 27.82 | Uncharacterized protein n=1 Tax=Thalassiosira pseu... |
| D7G803_ECTSI | 1.460e-14 | 59.09 | Uncharacterized protein n=1 Tax=Ectocarpus silicul... |
| A0A5D6XJG0_9STRA | 6.810e-14 | 35.71 | Rav1p_C domain-containing protein n=1 Tax=Pythium ... |
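When working with a hit summary like the one above, it is often convenient to filter by E-value or identity. The sketch below is one way to do that in plain Python; the `Hit` type, the `filter_hits` helper, and the threshold values are illustrative assumptions, not settings used by the analysis on this page. The rows are the ten hits listed above (of 25 total).

```python
from typing import List, NamedTuple


class Hit(NamedTuple):
    name: str
    evalue: float
    identity: float  # percent identity as reported in the table


# Rows copied from the summary table (the 10 hits shown, out of 25).
HITS = [
    Hit("D7G824_ECTSI", 0.0, 62.81),
    Hit("A0A6H5K928_9PHAE", 0.0, 59.30),
    Hit("K0SS84_THAOC", 3.040e-16, 34.65),
    Hit("A0A7S2EUC3_9STRA", 3.540e-16, 33.48),
    Hit("A0A4D9D482_9STRA", 7.420e-16, 38.14),
    Hit("A0A7S2KBJ0_9STRA", 2.540e-15, 36.59),
    Hit("A0A7S3V901_9STRA", 9.750e-15, 35.64),
    Hit("B8CG37_THAPS", 1.060e-14, 27.82),
    Hit("D7G803_ECTSI", 1.460e-14, 59.09),
    Hit("A0A5D6XJG0_9STRA", 6.810e-14, 35.71),
]


def filter_hits(hits: List[Hit], max_evalue: float = 1.0,
                min_identity: float = 0.0) -> List[Hit]:
    """Keep hits at or below max_evalue and at or above min_identity (percent)."""
    return [h for h in hits
            if h.evalue <= max_evalue and h.identity >= min_identity]


# Hypothetical thresholds, chosen only for illustration.
strong = filter_hits(HITS, max_evalue=1e-15)
similar = filter_hits(HITS, min_identity=50.0)
print([h.name for h in strong])
print([h.name for h in similar])
```

Note that a low E-value and a high identity are different criteria: D7G803_ECTSI has 59.09% identity but over a short 88-column alignment, so it ranks below several lower-identity hits by E-value.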

InterPro
Analysis Name: InterProScan on OGS1.0 of Hapterophycus canaliculatus Oshoro5f female
Date Performed: 2022-09-29
| IPR Term | IPR Description | Source | Source Term | Source Description | Alignment |
|---|---|---|---|---|---|
| IPR001478 | PDZ domain | SMART | SM00228 | pdz_new | coord: 933..1016; e-value: 4.6E-4; score: 29.5 |
| IPR001478 | PDZ domain | PROSITE | PS50106 | PDZ | coord: 924..998; score: 12.828 |
| IPR022033 | RAVE complex protein Rav1 C-terminal | PFAM | PF12234 | Rav1p_C | coord: 1941..2013; e-value: 3.3E-17; score: 62.2 |
| None | No IPR available | GENE3D | 2.30.42.10 | | coord: 921..1013; e-value: 2.5E-11; score: 45.6 |
| None | No IPR available | PANTHER | PTHR13950:SF9 | RABCONNECTIN-3A | coord: 1771..2271 |
| None | No IPR available | PANTHER | PTHR13950 | RABCONNECTIN-RELATED | coord: 1771..2271 |
| IPR041489 | PDZ domain 6 | PFAM | PF17820 | PDZ_6 | coord: 962..1003; e-value: 2.0E-6; score: 27.5 |
| IPR036034 | PDZ superfamily | SUPERFAMILY | 50156 | PDZ domain-like | coord: 916..1010 |
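Several of the InterPro entries above describe the same stretch of the protein from different source databases. One way to see this is to merge the overlapping coordinate ranges into distinct regions. The sketch below does that with a standard interval-merge; the function name `merge_intervals` is illustrative, and the coordinates are the `coord:` values listed above, treated as 1-based inclusive ranges (an assumption about the coordinate convention).

```python
from typing import List, Tuple

Interval = Tuple[int, int]


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping (start, end) ranges; ends are treated as inclusive."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this range reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# "coord:" values copied from the InterPro results above.
COORDS = [
    (933, 1016),   # SMART SM00228
    (924, 998),    # PROSITE PS50106
    (1941, 2013),  # PFAM PF12234 (Rav1p_C)
    (921, 1013),   # GENE3D 2.30.42.10
    (1771, 2271),  # PANTHER PTHR13950:SF9
    (1771, 2271),  # PANTHER PTHR13950
    (962, 1003),   # PFAM PF17820 (PDZ_6)
    (916, 1010),   # SUPERFAMILY 50156
]

print(merge_intervals(COORDS))  # [(916, 1016), (1771, 2271)]
```

Under these assumptions the eight entries collapse into two regions: a PDZ-related region around 916..1016 and a rabconnectin-related region at 1771..2271 that contains the Rav1p_C hit.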

Alignments
The following features are aligned
| Aligned Feature | Feature Type | Alignment Location |
|---|---|---|
| H-canaliculatus_F_contig1439 | contig | H-canaliculatus_F_contig1439:333..15709 - |
Analyses
This polypeptide is derived from or has results from the following analyses
| Analysis Name | Date Performed |
|---|---|
| InterProScan on OGS1.0 of Hapterophycus canaliculatus Oshoro5f female | 2022-09-29 |
| Diamond blastp: OGS1.0 of Hapterophycus canaliculatus Oshoro5f female vs UniRef90 | 2022-09-16 |
| OGS1.0 of Hapterophycus canaliculatus Oshoro5f female | 2021-02-24 |
Relationships

This polypeptide derives from the following mRNA feature(s):

| Feature Name | Unique Name | Species | Type | Position |
|---|---|---|---|---|
| mRNA_H-canaliculatus_F_contig1439.3431.1 | mRNA_H-canaliculatus_F_contig1439.3431.1 | Hapterophycus canaliculatus Oshoro5f female | mRNA | H-canaliculatus_F_contig1439 333..15709 - |


Sequences
The following sequences are available for this feature:

polypeptide sequence

>prot_H-canaliculatus_F_contig1439.3431.1 ID=prot_H-canaliculatus_F_contig1439.3431.1|Name=mRNA_H-canaliculatus_F_contig1439.3431.1|organism=Hapterophycus canaliculatus Oshoro5f female|type=polypeptide|length=3098bp
MEWSHTHFDGYELLAVPSDSPASIVLLDSSALVLVQRVLLPQGDAPPRAL
TWQPGGASTGCSSGALACIVSTRGRGPELAVFTPNGVYLSSRVRYSWSCA
ARVPLSPLHKGACDRILLSWSSRGPLVVADQAISVWSNNAAVPSANANTF
SSDDSGSSSGGSDDESSAGEAPAGKRASYSSTFGPRSVYRNSLRLSRKWV
RKGGGGGGGGGGGGGGGNGKAFHGTPTRQQRYPNFVALATSPCGCMFAAA
CEGSEDVTVWLRRRNVPPEDKPWETYGGGGGPGDGPSEGMVAAAGGGGGS
EGDARGAGAWNFQPATVLNNDGPLVQVVWGRGPGAQQDYLLTLGGDGSAH
VWLEQQWEDRFSMPTESGTVKFVKTATIKSWYGDNLRLKSVGFVKWAYFG
SHSRDLVRDGGAASGSAYHLLHQQEQAADGGARPYRSDNWIVGCGDDGRR
FLWSLKDMPLPDGRTIPEPVEVASLPFSWLVGVCGGGAPAAAAAAAAAAA
SRSSAMAPLISAVGYYDAQGADGAPTEVQVVAAAAGGTVAVMTARVHRVK
ARNRYVFRNTRSVVLPLGRTSAAGVSSGRQLNGDGGSNGSAAGSARSRKR
VKPSLPEGFPHPNLPLLARVERGEGDGGGEGGWTVEVHSWVNGGSHLQCI
PCKPRGNATGQPGHTAGAEPGQVDEARPQGMVLLSSSDSVVDDSSDGKAA
AGSDGVAPVFGELTRKVRSGDELAFVPYALPSLTLECRSRHERIADGEDG
PCLDAAVCYCGAGAACAGQGRGRVRHAMWMSDWVGAEALPPALAVVDTLS
RFHVFELDDGASSFSSSSKESLSDAAADGGGGGVDVGSALPAESGGRFNT
GGLLGLGKRTPTKSSTGSLASLSNTSGHLVMKDLPQQAGWRDGRNSSPER
TSRRARAEFAGRSFKGDRPPVEKTVVLPLDSKYGLGLTLAFQGNKVEVST
FVKHPDTTEMLPAEASGRMRPGDQLVDINGHSVEGLNAEEATSLIRQARR
ESMEGGAVSLELTFREAAPAPGAATPVRSGGSRSGSPRPRTGWYSRKAKL
RGSARGSAKGLTGAGWASTDADQHALRWNLIGRDEVPAFETCTLLPWLAQ
NVAAGGGEGAAGSAENEVSSSPASVSRWAEIGVYPPGDDRRRREALLLGF
QPAKASSKSGDGEGYVVALLVEAAAGGVSVAEVCRSPLEAGQQPVSFNRE
KTPPGAPANGRATVEAVCEDGWVRRWCVARDLPSGDAEGAGFALRKSDIC
EPFREPGAGEDEKGAEKARVVTPAALVAVASPTLLAVSRASEPRPRQVLT
PAQSFRDGERTRAPLEPEAATGAAGAPEMQSVLEVWSCAKTPYPRCPFKK
DGTVLLPGLRPGQVVEGMCWVSPEAGDERGAAVSGHCLCVSVGGSVTVLA
RERLRHDSVSSGPEPAPGSGRSRWSPVFRVANPSSLLTCRTAGLRDFCQG
LMAKYQRNLALSLVRTRGTKRLPATAAGAGMETVEEASKEAPATVVAPSH
SPPAPLPLLRDEIAGLRGTTQEGRRLEDWHPESLVALWCTSLMADGGAKR
GPRWEKGRERTLAVLHWVSKDAGLECGGGVVAGTVVPLLAHGSGALDAPR
AMEGDLGRLKACFASYLKYRLEDEKIPEAGAEEGVPTGSGHGSFTSASSA
RRPPHASTRHFVKSAVSRDATPASARHVADTAAAAGVPRRLLNLSGAELS
ALSALLDCALGHDVGGSAAVPAPSARPTGAGAGDTAAALFSKAPLSPPAP
ASSSALPGLRGGDKMPTAVIATTNHMDDYASVFLLARGLRARLVPVGGAG
GGGSKGVTAEIASSAALAMLLAPNDTQSEVLEMLCPKSGGGTVSPSPGLT
WGDASAMLLPLWVRDALELQRVTEAVASATFLQDRDLMAAAMFFAALGKE
TKLLALAKADRGFVGQRNISGGEGDLNSASGRPQGATAGERLEKLLAHDF
STPRGRSVAEKNAYILLRKRKFKSAAAVFLLPRPAMLKEALQVILMHVKD
LQLALVIARLVEIRDGGGISVPTKAVGSLGGMGGMGGYGGGVGGFGGRVR
GSFGAGFHGGVGGGGGFGAGDDEVEVTAAAAHKSIGGASRKLLRDELLPV
FDGESDVSSASTSSARATPLQRNPFLECVTLFWLGQPDRALRALCRGPAT
EHDALNDAAASAPGAKAVSTVVTDKASGLVLCNRAIDIATRPFLVSKLQQ
RASGSEAGAGRAGRPARSFGGGGGGFAALTNALVNAERAAALRLTTVEYL
ARNGMEIPALEVLGSNAGGEWAAELATRQRADETEADVPKGVAAGKVGDP
DGDDFLGGVFSGFGSSAPGALAKPKAGRAAAAAASGELSGDMFSAFDDPP
RRTVGKKQVTAATSGELSGDMFGGFDVPPRTRTAPPVALANSTASGQLSG
DMFSGFDVAPPQRKAKQASAAVNAAAAASGELSGDMFGGFDAAPTQQKVK
QASASTNAAAAASGELSGDMFGGFDATPPQRKTKQTSAAVNAAAAASGEL
SGDMLGGFDAAPPRREAKQASASTNAAAAASGELSGDMFGGFDAAPPQRK
AKQLSASTNAAAAASGELSGDMFGGFDATPPRRKASPPPGAAAPVASGEL
SGDMFGGFDVPRRSPAALPAKTLATTARPRLSSPIALATGELSSDLFSGF
DGPLVALSSPGSGTRTPGSAQMSVSSPGGSSIGGGPTSPRASSYGYFEGE
EPSWERYDKHVSGRDGGDGEDGGDGVDDDANGMARKREGPGKARNSDDAD
PPQRWDVWAEDLAYEAATKRLLRELRHLFAGLDALAPTAPNMAFALESLG
TTTAVPSGTTDGAAKPGLLEGIRAVAGGLCEAHGLSARALTASAAGALSH
ASAGGSRRLGALCLLLNALDRPVAVDAAVCAAAGRLVQCCSVHGQAHLED
LSLRAAHARALGTEAEAAKEEAKAKAVLEAEGTDGDTVGAGGEGWNAAPR
TESKRPAGSFITAAARVAEAEAAEAAGGLGLENLRPHLSELRAAAFQLEL
CLALHARGCLRLSSRARAEAAVGFRAGLLVEALGRRYDVTAALVACPPSS
VEADEPRLSDAQAVGEAFSAGGDENGRGQGTAWLDTPNADLLDALSR*
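The record above is in FASTA format: a `>` header line followed by wrapped sequence lines, ending here with a `*` stop symbol. The sketch below shows one way to read such a record and count residues with and without the trailing `*`. The helper names `read_fasta` and `residue_count` are illustrative, and the toy record is made up for the example. The header reports `length=3098` while the BLAST alignments on this page end at query position 3097; one possible reading, not verified against the database, is that the reported length includes the `*`.

```python
from typing import Dict


def read_fasta(text: str) -> Dict[str, str]:
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records: Dict[str, str] = {}
    header = None
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def residue_count(sequence: str) -> int:
    """Count residues, ignoring a trailing '*' stop symbol if present."""
    return len(sequence.rstrip("*"))


# Toy record (hypothetical), wrapped over two lines like the record above.
toy = ">toy_protein type=polypeptide\nMEWSH\nTHF*\n"
records = read_fasta(toy)
sequence = records["toy_protein type=polypeptide"]
print(len(sequence), residue_count(sequence))  # 9 8
```

Applying `residue_count` to the full sequence above would show whether the reported length and the alignment end position differ only by the stop symbol.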
Annotated Terms
The following terms have been associated with this polypeptide:
Vocabulary: INTERPRO
| Term | Definition |
|---|---|
| IPR036034 | PDZ_sf |
| IPR041489 | PDZ_6 |
| IPR022033 | Rav1p_C |
| IPR001478 | PDZ |