prot_C-linearis_contig9.16510.1 (polypeptide) Chordaria linearis ClinC8C monoicous

You are viewing a polypeptide; more information is available on the corresponding mRNA page.

Overview
Name: mRNA_C-linearis_contig9.16510.1
Unique Name: prot_C-linearis_contig9.16510.1
Type: polypeptide
Organism: Chordaria linearis ClinC8C monoicous (Chordaria linearis ClinC8C monoicous)
Sequence length: 1029
Homology
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: D7FQZ3_ECTSI (Aminopeptidase n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D7FQZ3_ECTSI)

HSP 1 Score: 1690 bits (4377), Expect = 0.000e+0
Identity = 873/1052 (82.98%), Positives = 931/1052 (88.50%), Query Frame = 0
Query:    1 MESGRPRSATALFALIGALSPQPMHAFAARMGSPALLPAYAKNVPSLLMRKP-RVP-TTGMAALASSSFSGSAANTMRRAGFWTTPRAASASGGLGLNMSS---LSPQTRWVSAGGGLR---------------VRGXXXXXXXXXXXXXXXXXXXXT----AEAVTAPVAKYRKDYKAPGHWTRHVTLDINLGDGASVVTAELDLERNADSPGEDLFLDGEELKLFRVYTRSANGELAQLKEGDDYTLSDEGMLIKASSLPPSNKYTVGTVVQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFSVRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVDKSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSAAGEYDAEAKTYKLTLKQSSKKEGTLPFHIPVVVGLLLKEDGSEA-VASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKLRYKYADEELAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVDSFRATLTQKDIPDKSVQAYALRLPDSSTLGEEMDVIEPDALFKALGHVRKSLAEALAPELRAVYDSMAPTGPYKKNAEEVGRRRLRNTVLSYLHEPRDAAAAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYEDAGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQNKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQALMRKQLIRIRDSSPSKDTYEVAARCL 1027
            M    P+SATALFALIGALSPQPMHAFAARMGS ALLP   +  PS L RKP R+P   G++ALASS+ S     TMRRA FWTTPRAA+AS G  ++ S+   LSP+TRWVSAGGGLR               VRG                    +    AEA TAP  KYRKDYKAPGHWTRHVTLDINLGDGA+VVT+ELD+ERNADSPG DLFLDGEELKLFRVY R  NGE+ QL+E DDY+L+D+GMLIK SSLP  NKYTVGTVVQI+PK NTRLSGLYTSGG FCTQCEAEGFRRITFAQDRPDVMSTFSVRLNA PSGDFPVMLSNGNNPVPPTL+PSPPGGEKVDKSY VWEDPIPKPSYLFALVAG+LGSIKS FVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDE+KFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTA+VLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTC+DWFQLTLKEGLTVFRDQLFSGDMGSHA KRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMY+TL+GEAGFR+GMDLYFKRHDGQAVSCDDFRAAMAD+N+ NLDQFERWYLQ GTPEVSAAGEYDAEAKTYKLTLKQSSKKE TLPFHIPVV GLLLKEDGSE  VAS+ LELKEEEQTFTFE+VASEPIPSLLRD SAPVKLRYKY+DE+LAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAG EP  LP +L+DSFRATL +KDIPDKS+QAYALRLPD +TL EEMDVI+PDAL+KAL HVR+SLA+ALA ELR+VYDS AP+GPYKKN+ EVGRRRLRNT+LSYLHEPRDAAAAKLC +QF++ADCMTDKLAAV CLA+FDD GGAFPERAEAL KFY DA GD LVLNKWFGIQA+ADLPDLL RVK LMQH+DFTLKNPNRLRSVVSSFA  Q+KFHAKDGSGYEF+GDMVLEVDQLNPQVASRLAT FSSYRR+DEKRQ LMR+QL RIRDSSPSKDTYEVA RCL
Sbjct:    1 MNHDTPQSATALFALIGALSPQPMHAFAARMGSSALLPTLVRTAPSSLTRKPTRMPCAAGISALASSTSS----LTMRRAAFWTTPRAAAASRGPAMSTSAAARLSPKTRWVSAGGGLRSTGEAALLCTTVIWRVRGGAAARAAAGTGMVVATSLSASTASMAEAPTAPAPKYRKDYKAPGHWTRHVTLDINLGDGATVVTSELDMERNADSPGSDLFLDGEELKLFRVYIRDDNGEVTQLQEEDDYSLNDDGMLIKNSSLPSGNKYTVGTVVQIAPKENTRLSGLYTSGGNFCTQCEAEGFRRITFAQDRPDVMSTFSVRLNAWPSGDFPVMLSNGNNPVPPTLIPSPPGGEKVDKSYAVWEDPIPKPSYLFALVAGDLGSIKSTFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEEKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAYVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCRDWFQLTLKEGLTVFRDQLFSGDMGSHAAKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYRTLVGEAGFRKGMDLYFKRHDGQAVSCDDFRAAMADANDVNLDQFERWYLQPGTPEVSAAGEYDAEAKTYKLTLKQSSKKEDTLPFHIPVVTGLLLKEDGSEGCVASRGLELKEEEQTFTFEDVASEPIPSLLRDLSAPVKLRYKYSDEDLAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGGEPGALPASLIDSFRATLLEKDIPDKSLQAYALRLPDPATLAEEMDVIDPDALYKALKHVRRSLADALASELRSVYDSAAPSGPYKKNSAEVGRRRLRNTILSYLHEPRDAAAAKLCFDQFEAADCMTDKLAAVACLANFDDAGGAFPERAEALSKFYNDADGDALVLNKWFGIQAAADLPDLLSRVKDLMQHEDFTLKNPNRLRSVVSSFAGSQHKFHAKDGSGYEFLGDMVLEVDQLNPQVASRLATMFSSYRRFDEKRQGLMREQLARIRDSSPSKDTYEVATRCL 1048          
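The Identity and Positives figures in each HSP header are simple ratios over the alignment length, which counts gap columns and can therefore exceed the 1029-residue query (1052 columns in the hit above). The following is a minimal Python sketch for reading such a line and recomputing the percentages; the function names `parse_hsp_stats` and `percent` are illustrative and not part of any BLAST or Diamond tool.

```python
import re

# Matches the per-HSP statistics line. "Posi?tives" also tolerates the
# "Postives" spelling that some report generators emit.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Return counts and reported percentages from one HSP statistics line."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, length, ident_pct, positive, length2, pos_pct = m.groups()
    if length != length2:
        raise ValueError("identity and positives use different alignment lengths")
    return {
        "identical": int(identical),
        "positive": int(positive),
        "alignment_length": int(length),
        "identity_pct": float(ident_pct),
        "positives_pct": float(pos_pct),
    }


def percent(count, total):
    """Percentage rounded to two decimals, the precision used in the report."""
    return round(100.0 * count / total, 2)


stats = parse_hsp_stats(
    "Identity = 873/1052 (82.98%), Positives = 931/1052 (88.50%), Query Frame = 0"
)
# Recompute both ratios from the raw counts and compare with the reported values.
print(percent(stats["identical"], stats["alignment_length"]), stats["identity_pct"])
print(percent(stats["positive"], stats["alignment_length"]), stats["positives_pct"])
```

Recomputing from the counts is a cheap consistency check when these lines are scraped from a page rather than read from the original tabular output.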
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: A0A6H5KYN1_9PHAE (Uncharacterized protein n=1 Tax=Ectocarpus sp. CCAP 1310/34 TaxID=867726 RepID=A0A6H5KYN1_9PHAE)

HSP 1 Score: 1431 bits (3703), Expect = 0.000e+0
Identity = 709/801 (88.51%), Positives = 754/801 (94.13%), Query Frame = 0
Query:  182 LERNADSPGEDLFLDGEELKLFRVYTRSANGELAQLKEGDDYTLSDEGMLIKASSLPPSNKYTVGTVVQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFSVRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVDKSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSAAGEYDAEAKTYKLTLKQSSKKEGTLPFHIPVVVGLLLKEDGSEAVASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKLRYKYADEELAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVDSFRATLTQKDIPDKSVQAYALRLPDSSTLGEEMDVIEPDALFKALGHVRKSLAEALAPELRAVYDSMAPTGPYKKNAEEVGRRRLRNTVLSYLHEPRDAAAAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYEDAGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQNKFHAKDGSGYEFMGDMVLEVDQLNPQVA 982
            +ERNADSPG DLFLDGEELKLFRVY R+ NGE+ QL+EGDDY+L+D+GMLIK SSLP  NKYTVGTVVQI+PK NTRLSGLYTSGG FCTQCEAEGFRRITFAQDRPD      VRLNA PSGDFPVMLSNGNNPVPPTL+PSPPGGEKVDKSY +WEDPIPKPSYLFALVAG+LGSIKS FVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDE+KFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTA+VLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTC+DWFQLTLKEGLTVFRDQLFSGDMGSHA KRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMY+TL+GEAGFR+GMDLYFKRHDGQAVSCDDFRAAMAD+N+ NLDQFERWYLQ+GTPEVSAAGEYDAEAKTYKLTLKQSSKKE TLPFHIPVV GLLLKEDGSEAVAS+VLELKEEEQTFTFE+V SEPIPSLLRD SAPVKLRYKY+DE+LAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAG EP  LP +LVDSFRATL +KDIPDKS+QAYALRLPD +TL EEMDVI+PDAL+KAL HVR+SLA+ALA ELR+VYDS AP+GPYKKN+ EVGRRRLRNT+LSYLHEPRDAAAAKLC +QF++ADCMTDKLAAV CLA+FDD GGAFPERAEAL KFY DA GD LVLNKWFGIQA+ADLPDLL RVK LMQH+DFTLKNPNRLRSVVSSFA  Q+KFHAKDGSGYEF+GDMVLEVDQLNPQ +
Sbjct:    1 MERNADSPGSDLFLDGEELKLFRVYVRNDNGEVTQLQEGDDYSLNDDGMLIKNSSLPSGNKYTVGTVVQIAPKENTRLSGLYTSGGNFCTQCEAEGFRRITFAQDRPD------VRLNAWPSGDFPVMLSNGNNPVPPTLIPSPPGGEKVDKSYAIWEDPIPKPSYLFALVAGDLGSIKSTFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEEKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAYVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCRDWFQLTLKEGLTVFRDQLFSGDMGSHAAKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYRTLVGEAGFRKGMDLYFKRHDGQAVSCDDFRAAMADANDINLDQFERWYLQAGTPEVSAAGEYDAEAKTYKLTLKQSSKKEDTLPFHIPVVTGLLLKEDGSEAVASRVLELKEEEQTFTFEDVPSEPIPSLLRDLSAPVKLRYKYSDEDLAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGGEPGALPASLVDSFRATLLEKDIPDKSLQAYALRLPDPATLAEEMDVIDPDALYKALKHVRRSLADALASELRSVYDSAAPSGPYKKNSAEVGRRRLRNTILSYLHEPRDAAAAKLCFDQFEAADCMTDKLAAVACLANFDDAGGAFPERAEALSKFYNDADGDALVLNKWFGIQAAADLPDLLSRVKDLMQHEDFTLKNPNRLRSVVSSFAGSQHKFHAKDGSGYEFLGDMVLEVDQLNPQAS 795          
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: A0A835ZE57_9STRA (Aminopeptidase n=1 Tax=Tribonema minus TaxID=303371 RepID=A0A835ZE57_9STRA)

HSP 1 Score: 1120 bits (2897), Expect = 0.000e+0
Identity = 587/910 (64.51%), Positives = 699/910 (76.81%), Query Frame = 0
Query:  139 AEAVTAPVAKYRKDYKAPGHWTRHVTLDINLGD-GASVVTAELDLERNADSPGEDLFLDGEELKLFRVYTRSANGELAQLKEGDDYTLSDEGMLIKASSLPPSNKYTVGTVVQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFS-VRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVD-KSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSAAGEYDAEAKTYKLTLKQSSKK-----------------EGTLPFHIPVVVGLLLKEDGSEAVASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKLRYKYADEELAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVDSFRATLTQKDIPDKSVQAYALRLPDSSTLGEEMDVIEPDALFKALGHVRKSLAEALAPELRAVYDSMAPTGPYKKNAEEVGRRRLRNTVLSYLHEPRDAAAAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYEDAGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQNKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQALMRKQLIRIRDSSP-SKDTYEVAARCL 1027
            AEA  AP  K+RKDY       + V LD NL + G ++VTA+L +   +   G D+FLDGEEL+L  V   + +G+   L    DY  + EG+ + +S+LP    + + T V++ P+ NT+LSGLY SGGMFCTQCEAEGFRRIT+ QDRPDVM+ F+ VR+  +     PV+LSNGN             G+  D + Y VWEDP PKPSYLFALVAG+LGSIKS + T SGR VKLEIFSEKENVD+LDWAM+SLKA+MKWDED +GLEYDLD+FN+VAVNDFNMGAMENKGLNVFNTA+VLAKP+TATDLDYER+EGVIGHEYFHNWTGNRVTC+DWFQLTLKEGLTVFRDQ FS  MGS AVKRIEDVR LR+RQFPED+ PL+HPIRPE Y  MDNFYT+TVY KGAEVIRMY TLLG  GFR+GMDLYFKRHDGQAV+CDDFRAAMAD+NN +L QFERWYLQ+GTP + A G YDA+AKT+ L LKQ+S K                 E   PFHIPV VGLLL+ DGSE V SKVLELKE EQTFTF+N+ SEP+PS+LR FSAPVKLRY+Y+DE+LAFLM +DTDSFNRWEAGQQL+ R++L + ++ + G+    LP   +DSF+ATL Q D  DKS+QAYAL LP SSTL EEMDVI+PDAL  A  HVR S+A AL PEL+A Y+S+AP+GPY+K  EEVGRRRLRNT+L Y+H  +D AAA LC +Q++SADCMTDK+AA+ CLAD        P+R  AL  FY+ A  D LVLNKWF +QAS+DLPD+L RVK L+ H DFTLKNPNRLRS+VS FA+   KFHAK G+ Y F+GDM L+VDQ+N QVASRL  + S +RR+D +RQALM++QL+RIRD+   SKDTYEVAARCL
Sbjct:  119 AEAQAAPKEKFRKDYMPLERTIQEVQLDFNLQEEGGTIVTAKLTMSAPS-GRGGDVFLDGEELELVSV---AVDGK--PLHADVDYKATSEGLTLMSSALPADRAFELSTAVRVKPELNTKLSGLYKSGGMFCTQCEAEGFRRITYYQDRPDVMARFTRVRVEGN-KNSVPVLLSNGNKVEE---------GDLGDGRHYAVWEDPFPKPSYLFALVAGDLGSIKSQYKTMSGRDVKLEIFSEKENVDKLDWAMDSLKASMKWDEDTYGLEYDLDLFNIVAVNDFNMGAMENKGLNVFNTAYVLAKPDTATDLDYERVEGVIGHEYFHNWTGNRVTCRDWFQLTLKEGLTVFRDQQFSAYMGSDAVKRIEDVRGLRARQFPEDSSPLAHPIRPEQYQVMDNFYTSTVYQKGAEVIRMYHTLLGTDGFRKGMDLYFKRHDGQAVTCDDFRAAMADANNVDLSQFERWYLQAGTPTLEATGTYDADAKTFSLKLKQASHKLLMSTLCPECTKPTPNQEHKDPFHIPVKVGLLLR-DGSEVVPSKVLELKEAEQTFTFDNIPSEPVPSILRGFSAPVKLRYEYSDEDLAFLMTYDTDSFNRWEAGQQLAQRLLLSLTKKAQTGESLPELPKVFLDSFKATLLQ-DTGDKSLQAYALSLPSSSTLAEEMDVIDPDALGAAAKHVRTSIATALMPELKAKYESLAPSGPYQKTGEEVGRRRLRNTILDYIHSLKDDAAADLCFKQYESADCMTDKVAALSCLADIPG-----PKREAALESFYQFAKSDALVLNKWFSLQASSDLPDVLDRVKALVSHPDFTLKNPNRLRSLVSVFAANMTKFHAKSGAAYAFLGDMCLQVDQINAQVASRLVGSLSQWRRFDAERQALMKEQLLRIRDAEGISKDTYEVAARCL 1005          
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: A0A7S2QAG2_9DINO (Hypothetical protein n=2 Tax=Brandtodinium nutricula TaxID=1333877 RepID=A0A7S2QAG2_9DINO)

HSP 1 Score: 931 bits (2405), Expect = 0.000e+0
Identity = 504/903 (55.81%), Positives = 627/903 (69.44%), Query Frame = 0
Query:  144 APVAKYRKDYKAPGHWTRHVTLDINLGDGASVVTAELDLERNAD--SPGEDLFLDGEELKLFRVYTRSANGELAQLKEGDD-YTLSDEGMLIKASSLPPSNKYTVGTVVQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFS-VRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVDKSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSAAGEYDAEAKTYKLTLKQS---------SKKEGTLPFHIPVVVGLLLKEDGSEAVASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKLRYKYADEELAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVDSFRATLTQKDIPDKSVQAYALRLPDSSTLGEEMDV-IEPDALFKALGHVRKSLAEALAPELRAVYDSMAPTGPYKK--NAEEVGRRRLRNTVLSYLHEPRDAAAAKLCLEQFKSA---DCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYEDAGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQNKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQALMRKQL--IRIRDSSPSKDTYEVAAR 1025
            A + K+RKDY+ P +W + V LD+ + +G + V A L  + N     PG +L LDGE+L+L  V    A+G    L EG + Y LS +GM+++ +  P ++ + + T V + P+ NT+LSGLY SG M+CTQ EAEGFRR T+  DRPDVM+ ++ VR+ A      PV+L NGN       +      +   + + V++DP  KPSYLFA+VAG+LG I+  F T SG +VKL +FSE+ENVDQL  AM SLK AMKWDED+FGLEYDLD++NVVAVNDFNMGAMENKGLN+FNTA  LA+P+TATD DYERIEGVIGHEYFHNWTGNRVTC+DWFQLTLKEGLTVFRDQ FSGDMGS AVKRI +VR LR+RQFPED GP++HP+RPESYIAMDNFYTATVY KGAEVIRMY+TLLG  GFR+GMDLYF+RHDG AVSCDDFRAAMA++N  +L QFE WYLQ GTP+V     +DA AKTY LTL Q            K    P  IPVVVGLL +  G E V SKVLEL E E++FTFE + +EP+PSLLRDFSAPVK+ Y Y DE+LAFL  +DTDSFNRWEA Q L  + I       + G E + L     ++ R  +  ++  D S+ AYAL LP  S + E M   I+P  L KA G VR ++A AL  + +  Y+ + P    +   +    G+R LRN  L YL  P+DAAA  LC  QF+ A    CMTDKLAA+G L +  DG    PE  +AL  FY+DA GDFLV+NKWF +QA+AD+PD+L RV+KL+ H DFT  NPNRLRSVVS FA     FH  DGSGY FM + VL+VD+LNPQVASRLA  FS++ + D  RQ L+++QL  +R +++  SKDTYEV ++
Sbjct:   12 ATIEKFRKDYRQPNYWIQEVQLDVRIFEGETKVEALLRCKLNDSVAEPGAELVLDGEDLRLESVEVVDADGTARGLAEGPEGYELSADGMVVRGT--PKASSFDLRTRVVVEPEKNTQLSGLYFSGCMYCTQMEAEGFRRFTYFPDRPDVMAKYTRVRVEADKK-KCPVLLGNGNE------IDKGDCTDDASRHFAVFQDPFAKPSYLFAIVAGDLGKIEDSFTTASGNEVKLAVFSEQENVDQLGHAMVSLKKAMKWDEDRFGLEYDLDVYNVVAVNDFNMGAMENKGLNIFNTALTLARPDTATDADYERIEGVIGHEYFHNWTGNRVTCRDWFQLTLKEGLTVFRDQEFSGDMGSPAVKRIGEVRVLRARQFPEDGGPMAHPVRPESYIAMDNFYTATVYCKGAEVIRMYQTLLGREGFRKGMDLYFERHDGGAVSCDDFRAAMAEANGRDLSQFEEWYLQPGTPQVQVRSAWDAAAKTYTLTLSQKVGAGQAHLPEAKRREKPMLIPVVVGLLDRATGKELVPSKVLELTEAERSFTFEGMDAEPVPSLLRDFSAPVKMDYPYTDEDLAFLAAYDTDSFNRWEAAQSLGAKAITDQYAA-EPGSEHA-LSPGFAEALRRIVNDRETQDLSLLAYALILPAESAILETMTPPIDPVRLHKARGAVRAAIAAALREDFQRRYEELTPAQGEELVIDGPNAGKRALRNVCLGYLAAPKDAAAVSLCAAQFEEARTRGCMTDKLAALGHLVEMPDG----PEVTQALQAFYDDAKGDFLVINKWFTMQAAADVPDILARVEKLIAHPDFTFTNPNRLRSVVSVFAGNVTGFHNADGSGYAFMREQVLKVDKLNPQVASRLALAFSTWAKLDAGRQGLIKEQLTVLRAKEAELSKDTYEVVSK 899          
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: K8YSL1_NANGC (Aminopeptidase N n=3 Tax=Monodopsidaceae TaxID=425072 RepID=K8YSL1_NANGC)

HSP 1 Score: 929 bits (2401), Expect = 0.000e+0
Identity = 521/925 (56.32%), Positives = 635/925 (68.65%), Query Frame = 0
Query:  140 EAVTAPVAK------YRKDYKAPGHWTRHVTL--------DINLGDGASV--VTAELDLERNAD---SPG-EDLFLDGEELKLFRVYTRSANGELAQLKEGDDYTLSDE------GMLIKASSLPPSNKYTVGTVVQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFSVRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVDKSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSAAGEYDAEAKTYKLTLKQSSK----KEGTLPFHIPVVVGLLLKEDGSEAVASKVLELKEEEQTFTFENV-ASEPIPSLLRDFSAPVKLRYKYADEELAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVDSFRATLT---QKDIPDKSVQAYALRLPDSSTLGEEMDVIEPDALFKALGHVRKSLAEALAPELRAVYDSMAPTGPYKKNAEEVGRRRLRNTVLSYLHEPRDAAAAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYEDAGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQ-NKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQALMRKQLIRIR--DSSPSKDTYEVAARCL 1027
            EAV++P  K      YR  Y+ P +W RH  L        D     GA+V  VT+ L +ERNAD   SPG  +L LDGEEL L  +     NG+     +G  + L+ E      G  I+A+S    + +T+ T V I P++N  LSG Y SG  +C+QCEAEGFRRIT+  DRPDVM+ F VR+ A  S   P++LSNGN      L     GG    + +  WEDP PKPSYLFA+VAG+LGSI   + T SGR V+LEIFSE ENVD+L+ AM SLK +M+WDE+ +GLEYDLD++N+VAVNDFNMGAMENKGLNVFNTAFVLAKPETATD DYERIEGVI HEYFHNW+GNRVTC+DWFQLTLKEGLTVFRDQ FS DMGS AVKRIEDVR LR+RQF ED+GP +H IRPESY+ MDNFYT+TVY KGAEVIRMY TLLG AGFR+GMDLYF RHDG AV+CDDFRAAMAD+N  +  QFERWYLQ+GTP V+A G YDA  K Y LTL Q +     +   LPFHIPV VGLL K DG E V S VLELKE  QTF FENV    P+ SLLR FSAPV+L+ + +DE+LAFLM HDTDSFNRWEA Q+L+++ IL   E +  G  P+PL    VD+FRA L         D+S+ AYAL LPD  TL  EMDV+ P AL +   HV+KSLA AL PE   VY ++    PY    +E+GRRR++N  LSYL   +DA A  L    F+ A CMTD +AA+ CL+         PE+ EAL  FY +A GD LVLNKWF IQA ADLPD + RV  L +H DF++KNPNR R+++ +FA+    +FHA+DG GY  + DMVL VD+LNPQVA+RLA  FS +R+++  R+ +M+ QL R+       S+DT+E+A + L
Sbjct:  144 EAVSSPTEKKKLQPKYRSSYRQPDYWIRHTDLLFQIVPDPDPEAPAGATVTYVTSTLTVERNADGGPSPGIPNLELDGEELTLMEI---KVNGQPLPW-DGSAFLLTAESDLLVLGKTIEAASAGKGD-FTLQTRVLIRPESNFELSGFYKSGSAYCSQCEAEGFRRITYYLDRPDVMARFKVRIEAEKS-KLPLLLSNGNKMATGELD----GG----RHFAEWEDPFPKPSYLFAVVAGDLGSIVDSYKTMSGRDVRLEIFSEHENVDKLEHAMTSLKKSMRWDEEVYGLEYDLDVYNIVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDTDYERIEGVIAHEYFHNWSGNRVTCQDWFQLTLKEGLTVFRDQQFSADMGSAAVKRIEDVRVLRARQFAEDSGPRAHAIRPESYMKMDNFYTSTVYEKGAEVIRMYHTLLGAAGFRKGMDLYFARHDGSAVTCDDFRAAMADANGKDFSQFERWYLQAGTPVVTAKGAYDAAQKRYSLTLSQHTAGTVGQPSKLPFHIPVRVGLLGK-DGRELVPSTVLELKEASQTFEFENVEGGTPVASLLRGFSAPVRLKVEQSDEDLAFLMAHDTDSFNRWEASQKLASQAILAATEALAGGGTPAPLAPTTVDAFRAVLKAGLDDTGVDRSLLAYALSLPDELTLLGEMDVMRPVALHEGREHVKKSLAAALKPEFMDVYQALDKDVPYLVTPQEIGRRRMKNVCLSYLCTEKDAGAVALAASAFQRATCMTDSIAALACLSSLPG-----PEKDEALEIFYTNAKGDPLVLNKWFSIQALADLPDTIDRVHALTRHPDFSMKNPNRFRALIGAFANSNLARFHAEDGRGYVLVADMVLAVDKLNPQVAARLAGAFSLWRKFENTRRNMMKAQLDRLMAVGDGLSRDTFEIAIQGL 1048          
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: L1IQH4_GUITC (Uncharacterized protein n=1 Tax=Guillardia theta (strain CCMP2712) TaxID=905079 RepID=L1IQH4_GUITC)

HSP 1 Score: 928 bits (2399), Expect = 0.000e+0
Identity = 507/899 (56.40%), Positives = 634/899 (70.52%), Query Frame = 0
Query:  138 TAEAVTAPVAKYRKDYKAPGHWTRHVTLDINLGDGASVVTAELDLERNADSP-GEDLFLDGEELKLFRVYTRSANGELAQLKEGDDYTLSDEGMLIKASSLPPSNKYTVGTVVQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFSVRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVDKSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSAAGEYDAEAKTYKLTLKQS---SKKEGTL-PFHIPVVVGLLLKEDGSEAVASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKLRYKYADEELAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVDSFRATLTQKDIPDKSVQAYALRLPDSSTLGEEMDVIEPDALFKALGHVRKSLAEALAPELRAVYDS-MAPTGPYKKNAEEVGRRRLRNTVLSYLHEPRDAAAAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYED-AGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQNKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQALMRKQLIR-IRDSSPSKDTYEVAARCLK 1028
            +A A   PV K+RKDYK P +W R+V L I + DG + V  +L  ER   +     L LD E++++  V       EL+      DY   ++ +L     LP  +K+ + TVV+I P+ NT+LSGLY S  M+CTQCEAEGFRRIT   DRPDVM+ + VR+ A      PV+LSNGN      L+     GE   + +  WEDP PKPSYLFA+VAG+LGSIK  F T+SGRKV LEIFSE  NVDQLDWAM+SLK +MKWDE++FGLEYDLDI+N+VAVNDFNMGAMENKGLNVFNTA VLAKP TATD DYER++GVI HEYFHNWTGNRVTC+DWFQLTLKEGLTVFRDQ FS DM S AVKRIEDVR LR+ QFP+D  P++HPIRPESYIAMDNFYTATVY KGAEVI MY+TLLG+ GFR+GMDLYFKRHDG AV+CDDFRAAMAD+N  +L+QFERWY Q+GTP V A   Y A+ + ++L L QS   S  + T  P+HIPV VGL+ K DG + V  +VLELKE  QTF FE V  EP+ SLLR FSAPV ++   +DEELAFLM +D DSFNRWEAGQ+L TR IL  V+  + GK+   LP  +VD+ + TL  +DI DKS+QAYAL LP   TLG +MDVI+PDAL  A   V++S+A+ L  E    Y S   P+ P++ +A+ VGRRR++N  L YL   ++    KLCL+Q   +  MTD +AA   LA   D       R +AL  FYE  A G+ L+L KWF +QA AD+   L  V  L+QH DF+LKNPN+ RSV+ +FA     FHA DGSGY ++ D +L++D++NPQ+++RL ++FS++RRYD+KRQAL++ +L R I  S  S+D YE+A++ LK
Sbjct:    5 SAVATEQPVEKFRKDYKEPDYWVRNVDLLIQIHDGQTTVRGKLSAERRKGAQESATLRLDAEDVEVVSVLLNGK--ELSS----SDYHFPEKDVLEIKCGLP--DKFELETVVKIKPEDNTQLSGLYKSSSMYCTQCEAEGFRRITPMLDRPDVMAKYKVRIEADQKS-CPVLLSNGN------LVSKGEMGE--GRHFAEWEDPFPKPSYLFAVVAGDLGSIKDTFTTRSGRKVALEIFSEHANVDQLDWAMQSLKDSMKWDEERFGLEYDLDIYNIVAVNDFNMGAMENKGLNVFNTACVLAKPSTATDSDYERVQGVIAHEYFHNWTGNRVTCRDWFQLTLKEGLTVFRDQQFSADMTSEAVKRIEDVRILRAAQFPQDDSPMAHPIRPESYIAMDNFYTATVYNKGAEVIGMYQTLLGKEGFRKGMDLYFKRHDGTAVTCDDFRAAMADANGVDLEQFERWYTQAGTPTVEAKASYSADKQRFELVLSQSCGPSPGQPTKQPYHIPVRVGLIGK-DGKDLVPERVLELKEASQTFVFEGVKEEPVVSLLRGFSAPVNVKLPRSDEELAFLMANDQDSFNRWEAGQELFTRSILANVKAFQDGKDME-LPQVIVDAAKRTLLLEDI-DKSLQAYALTLPSLLTLGAKMDVIDPDALVAACKFVKESMAKKLRAEFETAYKSNQLPSEPFRNDADAVGRRRIKNVCLDYLMALKEDKYTKLCLDQALQSTAMTDLVAATSLLAGSSDEAA----RKQALENFYEKHAKGNDLILCKWFTMQAMADVTTSLSDVGALLQHPDFSLKNPNKCRSVIGAFAGNMKHFHAADGSGYRWLTDRILDIDKMNPQMSARLVSSFSTFRRYDQKRQALIKAELERLIATSGLSRDAYEIASKSLK 879          
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: A0A7S4UZK6_9STRA (Hypothetical protein n=1 Tax=Ditylum brightwellii TaxID=49249 RepID=A0A7S4UZK6_9STRA)

HSP 1 Score: 924 bits (2388), Expect = 0.000e+0
Identity = 503/907 (55.46%), Positives = 641/907 (70.67%), Query Frame = 0
Query:  140 EAVTAPVAKYRKDYKAPGHWTRHVTLDINLGDGASVVTAELDLERNADSPG---EDLFLDGEELKLFRVYTRSANGELAQLKEGDDYTLSDEGMLIKASSLPPSNKYTVGTVVQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFS-VRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVDKSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSAAGEYDAEAKTYKLTLKQSSKKEGTLPFHIPVVVGLLLKEDGSEAVASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKL--RYKYADEE-LAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVDSFRATLTQKDIPDKSVQAYALRLPDSSTLGEEMDVIEPDALFKALGHVRKSLAEALAPELRAVYDSM-----APTGPYKKNAEEVGRRRLRNTVLSYLHEPRDAA-----AAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYEDAGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQNKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQALMRKQLIRIRDSSPSKDT-YEVAARCLK 1028
            ++ + PV  YRKDY+A   +   + +D  + DG + VT EL ++ N DS     +DL LDGEE    ++ +   NG  A +K+ +DY +    +++K+S+L  ++   V T V++ P+ NT+LSGLY SG M+CTQCEA GFRRIT+  DRPD M+ F  VR+ A    ++PV+L NGN      L+ S  G  +  + Y+VW DP PKPSYLF +VAGNLGSI+  + T SGR V+LEIFSEKENV +LD+AMESLK +MKWDEDKFGLEYDL I+N+VAVNDFNMGAMENKGLNVFNTA+VLA P TATD DYER+EGVIGHEYFHNWTGNRVTC+DWFQLTLKEGLTVFRDQ FSGDMGS+AVKRIEDVR LR RQF EDAGP+SHPIRP+SYI MDNFYTATVY KGAEVIRMY TLL  AGFR+GMDLYF+RHDG  V+CDDFR+AMAD+N  +LDQF  WY  SGTP V+ +  YDAE+ T+ LTL Q S  +   P HIPV VGL+ KE G E V +KVLELKE  QTF F ++  + +PS+LR FSAPVK+       DE  LAFL   DTD FN+WE+GQ+L T +I   +++    K+     + + ++F  TLT +++ D S+QAYAL  P  STL EE+DV++P AL +A G V+K++A     E+RA YD +     A  G +K +A  VGRRRLRN +L YL   ++ A     AA+L   Q+ +A  MTDK+AA+  L+  D G GA   R  A+ KFY+DA GD LVL+KWF +QA+ADLPD+L RVK L +H DFTL NPNR RS+VS FA+    FHA++G GY F+G +V E+D++NPQ+ASR+A +   ++RYDEKR  LM+ +L ++    P  D  +EV +R LK
Sbjct:  116 QSTSEPVVNYRKDYEALPFFVNKINMDFKITDGKTTVTTELFIDANPDSASLKHKDLNLDGEE-DAVKLLSLQINGVDA-IKD-EDYEIKPGKLVLKSSALSSTSTTKVTTTVEVIPEENTQLSGLYKSGPMYCTQCEATGFRRITYYPDRPDNMAVFERVRIEADKE-NYPVLLGNGN------LMES--GDLEDGRHYSVWSDPFPKPSYLFCIVAGNLGSIRDTYQTTSGRNVQLEIFSEKENVGKLDYAMESLKRSMKWDEDKFGLEYDLGIYNIVAVNDFNMGAMENKGLNVFNTAYVLADPATATDSDYERVEGVIGHEYFHNWTGNRVTCRDWFQLTLKEGLTVFRDQEFSGDMGSNAVKRIEDVRGLRGRQFNEDAGPMSHPIRPDSYINMDNFYTATVYSKGAEVIRMYNTLLTPAGFRKGMDLYFERHDGSGVTCDDFRSAMADANGVDLDQFGLWYSTSGTPTVTYSSSYDAESGTFSLTLSQKSNSDD--PLHIPVSVGLIDKESGEEVVPTKVLELKEATQTFEFTDIKGDVVPSILRGFSAPVKVVPESGVVDESSLAFLAAKDTDGFNKWESGQRLFTSLIFQTMDD----KQSEQTLSFVNEAFIQTLTSENMSDYSIQAYALTPPTESTLAEELDVVDPVALRQARGSVKKAIARKFQTEIRAKYDELTAAMEAERGEFKVDATSVGRRRLRNVLLDYLCSIKETAEEQKAAAELATAQYNAATGMTDKIAALAALSSMD-GEGA-DARDAAIQKFYDDANGDALVLDKWFAVQATADLPDVLDRVKALTKHPDFTLSNPNRCRSLVSVFATNAAPFHAENGDGYSFVGGIVAELDKINPQIASRVAGSLIQWKRYDEKRGQLMKGELEKLVSMKPISDNLFEVVSRGLK 1002          
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: A0A7S4B6E4_CHRCT (Hypothetical protein n=1 Tax=Chrysotila carterae TaxID=13221 RepID=A0A7S4B6E4_CHRCT)

HSP 1 Score: 912 bits (2356), Expect = 0.000e+0
Identity = 497/911 (54.56%), Positives = 628/911 (68.94%), Query Frame = 0
Query:  140 EAVTAPVAK-YRKDYKAPGHWTRHVTLDINLGDGASVVTAELDL----------ERNADSPGEDLFLDGEELKLFRVYTRSANGELAQLKEGDDYTLSDEGMLIKASSLPPSNKYTVGTVVQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFSVRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVDKSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSAAGEYDAEAKTYKLTLKQSSK----KEGTLPFHIPVVVGLLLKEDGSEAVASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKLRYKYADEELAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPL--PTNLVDSFRATLTQKDIPDKSVQAYALRLPDSSTLGEEMDVIEPDALFKALGHVRKSLAEALAPELRAVYDSMAPTG--PYKKNAEEVGRRRLRNTVLSYLHEPRDAAAAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYE--DAGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQ-NKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQALMRKQLIRIRDSSP-SKDTYEVAARCL 1027
            +A  AP  + +RKDY+   +    V LD N+    ++V + L L          +R  D  G  L L+GE+L L     +S   +   L EG DYT+  E + +     PP++ +T+ +VV I P+ NT+LSGLY S G  CTQCEAEGFRRIT+  DRPDVMS ++VR+ A     +PV+LSNGN            G  +  + +  +EDP  KPSYLFALVAG+L  I+  FVT SG KV+L I+SE EN+DQLDWAM+SLK +MKWDE+ +G EYDLD++++VAVNDFNMGAMENKGLNVFNTA VLAKP TATD DYER++GV+ HEYFHNW+GNRVTC+DWFQLTLKEGLTVFRDQ FS DM S AVKRIEDVR +RS QF +D GP++HPIRPESYIAMDNFYT TVY KGAEVIRMY+TLLG  GFR+GMDLYF+RHDG+AV+CDDFRAAMAD+N  +L QFERWY Q+GTP VSA GEYDA AK Y LTL Q +     +    PF IPV   LLL EDG      KVL+L + EQTF FENV S P+PSLLR FSAPV+L+ + +D  LAFL  ++ D FNRW+A QQL+TRV+L +  +I +G  P  L  P + VD+FRATLT   + D S+++Y+L LPD +TL +EM  ++PDAL  AL   RK LA  L  +L A+YD +AP     ++ + E VGRRRLRN  L YL +  D  A  LC+ QF +A CMT+ +AA   L+          ER + LG FYE   A  + LV+NKWF +QASA+ PD L  VK LM+H+ +   NPNR+R+VV +FAS     FHA DGSGY+F+ DMV+++D+ NPQVA+RL   F  +R+Y E R+ LM+K+L RI+DS   SKDT+E+A+R L
Sbjct:   91 DAPAAPPKEIFRKDYRPYPYIVESVHLDFNVQAEETIVKSSLRLVPRSEGKSGTKRERDEEGTKLELNGEDLTL-----KSLELDGTALVEGTDYTVDAEFLTVLK---PPNSPFTLTSVVSIKPQLNTQLSGLYASSGNLCTQCEAEGFRRITYFPDRPDVMSKYTVRVEAD-KAKYPVLLSNGNEIAK--------GEAEGGRHWAEFEDPFRKPSYLFALVAGDLAGIEDSFVTMSGNKVRLAIWSEHENIDQLDWAMQSLKDSMKWDEETYGREYDLDVYHIVAVNDFNMGAMENKGLNVFNTACVLAKPSTATDADYERVQGVVAHEYFHNWSGNRVTCRDWFQLTLKEGLTVFRDQHFSADMTSEAVKRIEDVRIMRSAQFLQDGGPMAHPIRPESYIAMDNFYTVTVYNKGAEVIRMYRTLLGADGFRKGMDLYFERHDGEAVTCDDFRAAMADANGVDLTQFERWYTQAGTPTVSAKGEYDATAKKYTLTLSQKTAATPGQPTKEPFFIPVQTALLL-EDGKLHEPPKVLQLTQAEQTFVFENVPSAPVPSLLRGFSAPVRLQIERSDATLAFLASNEDDPFNRWDASQQLATRVLLDLASKISSGTSPDDLTLPPSFVDAFRATLTDGGL-DPSLKSYSLTLPDYTTLSQEMSPVDPDALCGALKTARKQLASTLRADLAALYDKLAPPAGQKFEVSPESVGRRRLRNCCLGYLAKLADDEAKALCVAQFDAATCMTESIAAAVALSSLPG-----KERDQVLGTFYERAKANKEALVINKWFALQASAESPDALQVVKGLMEHEAYDATNPNRVRAVVQTFASANPGAFHAADGSGYKFIADMVIDIDKKNPQVAARLCNAFGQWRKYKEDRKVLMQKELERIKDSPKLSKDTFEIASRSL 977          
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: A0A0G4EQU1_VITBC (Uncharacterized protein n=2 Tax=Vitrella brassicaformis TaxID=1169539 RepID=A0A0G4EQU1_VITBC)

HSP 1 Score: 910 bits (2351), Expect = 2.580e-316
Identity = 497/900 (55.22%), Positives = 619/900 (68.78%), Query Frame = 0
Query:  145 PVAKYRKDYKAPGHWTRHVTLDINLGDGASVVTAELDLERNADSP-GEDLFLDGEELKLFRVYTRSANGE-LAQLKEGDDYTLSDEGMLIKASSLP--PSNKYTVGTVVQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFSVRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVD-KSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSA-AGEYDAEAKTYKLTLKQSSK----KEGTLPFHIPVVVGLLLKEDGSEAVASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKLRYKYADEELAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVDSFRATLTQKDIPDKSVQAYALRLPDSSTL-GEEMDVIEPDALFKALGHVRKSLAEALAPELRAVYDSM-----APTGPYKKNAEEVGRRRLRNTVLSYLHEPRDAAAAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYEDAGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQNKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQALMRKQLIRIRDSSP-SKDTYEVAARCL 1027
            P  K R DY+ P +W + V LD ++    + V + L   RN  SP G DL L GE+L L  V     NG+ L++  EG  +   D+ ++I+   LP  P   + V T V I P+ N +LSGLY SGGM+CTQCEAEGFRRIT+  DRPDVMS ++VR+ AS S ++PV+LSNGN             G+  D + Y  +EDP PKP YLFALV GNL SI S F T SGRKV+LEIFSE  NV +LDWAMESLK AMKWDE+KFG EYDLD+FN+VAV+DFNMGAMENKGLNVFNTA +LA P+T+TD DYERI GVI HEYFHNWTGNRVTC+DWFQLTLKEGLTVFRDQ FS DM S AVKRIED+  LRSRQFPEDAGP++HPIRPESYIAMDNFYTATVY KGAEVIRMY+TLL + GFR+GMDLYF RHDGQAV+CDDFRAAMAD+N  +L QFERWYLQ+GTP V+  +  YD   K + + + QS+     +    P HIP+  GLL K  G E   S VLELK E++TF F+NV  EPI S+LRDFSAP +L+++ + ++LAFLM HDTD FNRWEAGQ L+T VI     ++ AG+ P  LP   V+++R  LT  DI D+S+Q Y+LRLPD  TL GE    I PD L  A  HVRK L EA   ELR VYD +     A  G +K +   + RRRLRN +L  +   +D    KL +E F++  CM+D+++A+  LAD         ER   + KFY D+  D L + KWF +QA +DLPD + RVK+LM+H DF LKNPN+LR+VV +FA+    FH KDG+GY  + D++ EVD+LNPQ+ASR A   + ++  +  R A+M+ QL R+RD+   S DT E+  + L
Sbjct:  103 PKEKKRLDYRPPEYWIKKVDLDFDVHKDTTTVRSVLTCYRNRGSPSGADLVLHGEDLTLKEV---KLNGKVLSEGAEGYGHD-EDKQLVIRGKLLPADPDELFKVETTVAICPEKNFQLSGLYKSGGMYCTQCEAEGFRRITYFLDRPDVMSLYTVRVEASKS-EYPVLLSNGNQIS---------SGDAADGRHYASFEDPHPKPCYLFALVVGNLKSIHSDFTTTSGRKVRLEIFSEPHNVAKLDWAMESLKKAMKWDEEKFGREYDLDVFNIVAVDDFNMGAMENKGLNVFNTALILASPDTSTDADYERIMGVIAHEYFHNWTGNRVTCRDWFQLTLKEGLTVFRDQEFSRDMASAAVKRIEDIIMLRSRQFPEDAGPMAHPIRPESYIAMDNFYTATVYEKGAEVIRMYQTLLSKHGFRKGMDLYFTRHDGQAVTCDDFRAAMADANTSDLKQFERWYLQAGTPVVTVESASYDPSLKQFTIVVSQSTPATPGQAEKHPLHIPIKTGLLSKATGKELQPSMVLELKGEKETFVFDNVPEEPIASILRDFSAPCRLKFERSPDDLAFLMAHDTDDFNRWEAGQTLATIVIKDTYNKLSAGESPPALPGVFVEAWRKVLTATDI-DRSLQTYSLRLPDEKTLIGEIAAPIAPDHLHNARQHVRKGLVEACKKELREVYDKLTQEVAAEGGVFKVDEPSIARRRLRNALLISMAVLQDPETVKLAVEHFETGLCMSDRISALYALADI-----PVAEREAVIQKFYGDSKDDKLKMCKWFAVQAQSDLPDTVERVKELMKHPDFELKNPNKLRAVVGAFANNNFHFHRKDGAGYSLVCDVIKEVDKLNPQMASRFAVFLAGWKNVEPVRSAMMKAQLERLRDTENLSNDTMEIVIKGL 982          
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Match: B8LCH3_THAPS (Aminopeptidase aminopeptidase-like protein (Fragment) n=1 Tax=Thalassiosira pseudonana TaxID=35128 RepID=B8LCH3_THAPS)

HSP 1 Score: 895 bits (2313), Expect = 3.140e-312
Identity = 502/909 (55.23%), Positives = 632/909 (69.53%), Query Frame = 0
Query:  145 PVAKYRKDYKAPGHWTRHVTLDINLGDGASVVTAELDLERNA-DSPGEDLFLDGEE--LKLFRVYTRSANGELAQLKEGDDYTLSDEGMLIKASSLPP--SNKYTVGTV--VQISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTF-SVRLNASPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVDKSYTVWEDPIPKPSYLFALVAGNLGSIKSFFVTK-SGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDEDKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDYERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDM-GSHAVKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVIRMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFERWYLQSGTPEVSAAGEYDAEAKTYKLTLKQSSKKEGTLPFHIPVVVGLLLKEDGSEAVASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKLRYKY---ADEE--LAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVDSFRATLTQKDIPDKSVQAYALRLPDSSTLGEEMDVIEPDALFKALGHVRKSLAEALAPELRAVYD----SMAPTGP-YKKNAEEVGRRRLRNTVLSYL------HEPRDAAAAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYEDAGGDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFASCQNKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQALMRKQLIRIRDSSPSKDTYEVAARCL 1027
            P   +R DY+   +   +V ++ ++ DG ++V + L L+ N  +    DL LDGE   L L  +   S       L EG DYT+S + + I +S LPP  SN+ T   V  V+I P+ NT+LSGLY SG M+CTQCEA GFRRIT+  DRPD M+ F SVR+ A     +PV+L NGN              ++  + Y VWEDP PKPSYLF +VAGNLGSI S + T+ SGRKV LEIFSE ENV +LD AMESLK +MKWDED FGLEYDLDI+NVVAVNDFNMGAMENKGLNVFNTA+VLA  ++A+D DYERIE VIGHEYFHNWTGNRVTC+DWFQLTLKEGLTV+RDQ FSGDM  SHAVKRIEDV  LR+RQF EDAGP+SHPIRPESYI+MDNFYTATVY KGAEVIRMY+TLLG+ GFR+GMDLYFKRHDG AV+CDDF +AMAD+N+ +L QF RWY  +GTP V    +YDA+AKT+ LTL Q S  +   P HIPV VGLL KE G E VA+KVL+LKE+EQTF F  +  + +PSLLR FSAPVKL       ADEE  LAFL   DTD FNRWEAGQ+L T +I    + ++  +  S     ++++F+  L   D  D S+QAYAL +P  STL EE+DV++P AL +A G+V+K++A     E+++ YD    SM   G  ++ +A  +G+RRLRN +L YL       E R+ A+ K  ++QF+S+  MTD+ +A+  L   D       ER  AL KFY+DA GD LVLNKWF +QA ADLPD+L RVK L+ H +FTL NPNR RS++S+F+     FHA +G GY+F+GDMV +VD+LNPQ++SR+  +   +RRYDEKR +LM+ +L ++     S D +EV +R L
Sbjct:    1 PTEIFRSDYQPLPYLISNVQMNFDIRDGETIVESTLTLKGNVKNGNNNDLVLDGEADALTLLSITLNSK-----PLIEGSDYTISGDTLTISSSILPPIDSNETTATLVTKVKIHPEENTQLSGLYKSGTMYCTQCEAMGFRRITYYTDRPDNMAVFDSVRIEADKEL-YPVLLGNGNKLEEGE-------SDEEGRHYAVWEDPFPKPSYLFCIVAGNLGSIASSYTTRPSGRKVHLEIFSEPENVGKLDHAMESLKKSMKWDEDTFGLEYDLDIYNVVAVNDFNMGAMENKGLNVFNTAYVLADAKSASDTDYERIESVIGHEYFHNWTGNRVTCRDWFQLTLKEGLTVYRDQEFSGDMMNSHAVKRIEDVNALRARQFAEDAGPMSHPIRPESYISMDNFYTATVYSKGAEVIRMYRTLLGKDGFRKGMDLYFKRHDGNAVTCDDFLSAMADANDVDLSQFSRWYSTNGTPTVKYETKYDADAKTFYLTLSQESNIDE--PLHIPVAVGLLDKESGDEVVATKVLDLKEKEQTFEFSGLEGDVLPSLLRGFSAPVKLVRSSGNDADEEKALAFLAARDTDGFNRWEAGQKLYTSLIF---QTMRGAQAESKTMDYVLEAFQRALAL-DTKDYSIQAYALIMPSESTLSEELDVVDPVALHEARGNVKKAIARKFYNEIKSKYDELTKSMENNGDNFQVDATSIGQRRLRNVLLDYLCCIKETPEEREIAS-KFAMDQFESSYGMTDRYSALSSLVSMDG-----EERETALQKFYDDANGDALVLNKWFTVQALADLPDVLDRVKALVDHPEFTLSNPNRCRSLISAFSMNAAHFHAINGDGYKFIGDMVAQVDKLNPQMSSRMGGSLIQWRRYDEKRSSLMKAELEKLAGGKLSNDLFEVVSRGL 884          
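Each Match line above appears to follow the UniRef header layout: an entry name, then in parentheses a description, the cluster member count (`n=`), the taxon (`Tax=`), its NCBI taxonomy identifier (`TaxID=`), and the representative entry (`RepID=`). As a sketch under that assumption, the fields can be split with a regular expression in Python; `parse_match_line` and the group names are hypothetical names chosen for this example.

```python
import re

# Lazy groups stop at the next " n=", " TaxID=" marker, so descriptions and
# taxon names may themselves contain spaces or parentheses.
MATCH_LINE = re.compile(
    r"^Match: (?P<entry>\S+) "
    r"\((?P<description>.+?) n=(?P<members>\d+) "
    r"Tax=(?P<taxon>.+?) TaxID=(?P<taxid>\d+) "
    r"RepID=(?P<rep_id>\S+)\)$"
)


def parse_match_line(line):
    """Split one 'Match: ...' line into its labelled fields."""
    m = MATCH_LINE.match(line.strip())
    if m is None:
        raise ValueError(f"not a Match line: {line!r}")
    fields = m.groupdict()
    fields["members"] = int(fields["members"])
    fields["taxid"] = int(fields["taxid"])
    return fields


hit = parse_match_line(
    "Match: L1IQH4_GUITC (Uncharacterized protein n=1 "
    "Tax=Guillardia theta (strain CCMP2712) TaxID=905079 RepID=L1IQH4_GUITC)"
)
print(hit["entry"], hit["taxon"], hit["taxid"])
```

Anchoring on the `n=`, `Tax=`, `TaxID=`, and `RepID=` markers rather than on parentheses keeps names such as "Guillardia theta (strain CCMP2712)" intact.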
The following BLAST results are available for this feature:
BLAST of mRNA_C-linearis_contig9.16510.1 vs. uniprot
Analysis Date: 2022-09-16 (Diamond blastp: OGS1.0 vs UniRef90)
Total hits: 25
Match Name | E-value | Identity | Description
D7FQZ3_ECTSI | 0.000e+0 | 82.98 | Aminopeptidase n=1 Tax=Ectocarpus siliculosus TaxI...
A0A6H5KYN1_9PHAE | 0.000e+0 | 88.51 | Uncharacterized protein n=1 Tax=Ectocarpus sp. CCA...
A0A835ZE57_9STRA | 0.000e+0 | 64.51 | Aminopeptidase n=1 Tax=Tribonema minus TaxID=30337...
A0A7S2QAG2_9DINO | 0.000e+0 | 55.81 | Hypothetical protein n=2 Tax=Brandtodinium nutricu...
K8YSL1_NANGC | 0.000e+0 | 56.32 | Aminopeptidase N n=3 Tax=Monodopsidaceae TaxID=425...
L1IQH4_GUITC | 0.000e+0 | 56.40 | Uncharacterized protein n=1 Tax=Guillardia theta (...
A0A7S4UZK6_9STRA | 0.000e+0 | 55.46 | Hypothetical protein n=1 Tax=Ditylum brightwellii ...
A0A7S4B6E4_CHRCT | 0.000e+0 | 54.56 | Hypothetical protein n=1 Tax=Chrysotila carterae T...
A0A0G4EQU1_VITBC | 2.580e-316 | 55.22 | Uncharacterized protein n=2 Tax=Vitrella brassicaf...
B8LCH3_THAPS | 3.140e-312 | 55.23 | Aminopeptidase aminopeptidase-like protein (Fragme...

InterPro
Analysis Name: InterProScan on OGS1.0
Date Performed: 2022-09-29
IPR Term	IPR Description	Source	Source Term	Source Description	Alignment
IPR001930	Peptidase M1, alanine aminopeptidase/leukotriene A4 hydrolase	PRINTS	PR00756	ALADIPTASE	coord: 279..294, score: 32.81; coord: 420..430, score: 69.64; coord: 475..487, score: 41.76; coord: 456..471, score: 57.59
None	No IPR available	GENE3D	1.10.1740.60	-	coord: 358..441, e-value: 1.6E-39, score: 135.5
None	No IPR available	PFAM	PF17900	Peptidase_M1_N	coord: 163..348, e-value: 1.2E-7, score: 32.1
None	No IPR available	PANTHER	PTHR46322	FAMILY NOT NAMED	coord: 144..1027
None	No IPR available	PHOBIUS	SIGNAL_PEPTIDE_C_REGION	Signal peptide C-region	coord: 20..28
None	No IPR available	PHOBIUS	SIGNAL_PEPTIDE	Signal Peptide	coord: 1..28
None	No IPR available	PHOBIUS	NON_CYTOPLASMIC_DOMAIN	Non cytoplasmic domain	coord: 29..1028
None	No IPR available	PHOBIUS	SIGNAL_PEPTIDE_N_REGION	Signal peptide N-region	coord: 1..7
None	No IPR available	PHOBIUS	SIGNAL_PEPTIDE_H_REGION	Signal peptide H-region	coord: 8..19
None	No IPR available	SUPERFAMILY	55486	Metalloproteases ("zincins"), catalytic domain	coord: 366..611
None	No IPR available	SUPERFAMILY	63737	Leukotriene A4 hydrolase N-terminal domain	coord: 148..355
IPR012779	Peptidase M1, alanyl aminopeptidase	TIGRFAM	TIGR02414	TIGR02414	coord: 152..1025, e-value: 0.0, score: 1116.1
IPR014782	Peptidase M1, membrane alanine aminopeptidase	PFAM	PF01433	Peptidase_M1	coord: 387..601, e-value: 6.0E-52, score: 176.4
IPR024601	Peptidase M1, alanyl aminopeptidase, C-terminal	PFAM	PF17432	DUF3458_C	coord: 700..1028, e-value: 3.7E-112, score: 374.8
IPR042097	Aminopeptidase N-like, N-terminal	GENE3D	2.60.40.1730	-	coord: 156..356, e-value: 5.8E-57, score: 194.5
IPR035414	Peptidase M1, alanyl aminopeptidase, Ig-like fold	PFAM	PF11940	DUF3458	coord: 606..696, e-value: 2.6E-28, score: 98.3
IPR038438	Alanyl aminopeptidase, Ig-like domain superfamily	GENE3D	2.60.40.1840	-	coord: 606..697, e-value: 7.8E-30, score: 105.0
IPR027268	Peptidase M4/M1, CTD superfamily	GENE3D	1.10.390.10	-	coord: 442..605, e-value: 3.7E-65, score: 220.7
IPR037144	Peptidase M1, alanyl aminopeptidase, C-terminal domain superfamily	GENE3D	1.25.50.10	-	coord: 698..1028, e-value: 1.2E-114, score: 384.9
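The coordinates reported above can be used to lay out the domain order along the polypeptide. The following is a minimal sketch, assuming the four PFAM hits and their `coord:` strings are copied by hand from this record; the names `parse_coord`, `pfam_hits` and `ordered` are illustrative and not part of InterProScan or of this database.

```python
import re

# Matches the "start..end" part of strings such as "coord: 163..348".
COORD = re.compile(r"(\d+)\.\.(\d+)")

def parse_coord(text):
    """Return (start, end) as integers from a 'coord: start..end' string."""
    match = COORD.search(text)
    if match is None:
        raise ValueError(f"no coordinates found in {text!r}")
    return int(match.group(1)), int(match.group(2))

# PFAM hits transcribed from the InterPro results of this record.
pfam_hits = {
    "DUF3458_C": "coord: 700..1028",
    "Peptidase_M1": "coord: 387..601",
    "DUF3458": "coord: 606..696",
    "Peptidase_M1_N": "coord: 163..348",
}

# Sort hits by start position to obtain the N- to C-terminal order.
ordered = sorted(
    ((name, *parse_coord(coord)) for name, coord in pfam_hits.items()),
    key=lambda hit: hit[1],
)

for name, start, end in ordered:
    # Coordinates are inclusive, so the span is end - start + 1 residues.
    print(f"{name}\t{start}..{end}\t{end - start + 1} aa")
```

Sorting by start position places Peptidase_M1_N first and DUF3458_C last, which agrees with the N-terminal and C-terminal labels in the IPR descriptions.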

Alignments
The following features are aligned
Aligned Feature	Feature Type	Alignment Location
C-linearis_contig9	contig	C-linearis_contig9:928197..941284 +
Analyses
This polypeptide is derived from or has results from the following analyses
Analysis Name	Date Performed
InterProScan on OGS1.0	2022-09-29
Diamond blastp: OGS1.0 vs UniRef90	2022-09-16
OGS1.0 of Chordaria linearis ClinC8C monoicous	2021-02-24
Relationships

This polypeptide derives from the following mRNA feature(s):

Feature Name	Unique Name	Species	Type	Position
mRNA_C-linearis_contig9.16510.1	mRNA_C-linearis_contig9.16510.1	Chordaria linearis ClinC8C monoicous	mRNA	C-linearis_contig9 928160..941912 +


Sequences
The following sequences are available for this feature:

polypeptide sequence

>prot_C-linearis_contig9.16510.1 ID=prot_C-linearis_contig9.16510.1|Name=mRNA_C-linearis_contig9.16510.1|organism=Chordaria linearis ClinC8C monoicous|type=polypeptide|length=1029bp
MESGRPRSATALFALIGALSPQPMHAFAARMGSPALLPAYAKNVPSLLMR
KPRVPTTGMAALASSSFSGSAANTMRRAGFWTTPRAASASGGLGLNMSSL
SPQTRWVSAGGGLRVRGGGGARTGTTTAGAGASLSASTAEAVTAPVAKYR
KDYKAPGHWTRHVTLDINLGDGASVVTAELDLERNADSPGEDLFLDGEEL
KLFRVYTRSANGELAQLKEGDDYTLSDEGMLIKASSLPPSNKYTVGTVVQ
ISPKANTRLSGLYTSGGMFCTQCEAEGFRRITFAQDRPDVMSTFSVRLNA
SPSGDFPVMLSNGNNPVPPTLLPSPPGGEKVDKSYTVWEDPIPKPSYLFA
LVAGNLGSIKSFFVTKSGRKVKLEIFSEKENVDQLDWAMESLKAAMKWDE
DKFGLEYDLDIFNVVAVNDFNMGAMENKGLNVFNTAFVLAKPETATDLDY
ERIEGVIGHEYFHNWTGNRVTCKDWFQLTLKEGLTVFRDQLFSGDMGSHA
VKRIEDVRTLRSRQFPEDAGPLSHPIRPESYIAMDNFYTATVYVKGAEVI
RMYKTLLGEAGFRRGMDLYFKRHDGQAVSCDDFRAAMADSNNFNLDQFER
WYLQSGTPEVSAAGEYDAEAKTYKLTLKQSSKKEGTLPFHIPVVVGLLLK
EDGSEAVASKVLELKEEEQTFTFENVASEPIPSLLRDFSAPVKLRYKYAD
EELAFLMKHDTDSFNRWEAGQQLSTRVILGMVEEIKAGKEPSPLPTNLVD
SFRATLTQKDIPDKSVQAYALRLPDSSTLGEEMDVIEPDALFKALGHVRK
SLAEALAPELRAVYDSMAPTGPYKKNAEEVGRRRLRNTVLSYLHEPRDAA
AAKLCLEQFKSADCMTDKLAAVGCLADFDDGGGAFPERAEALGKFYEDAG
GDFLVLNKWFGIQASADLPDLLPRVKKLMQHKDFTLKNPNRLRSVVSSFA
SCQNKFHAKDGSGYEFMGDMVLEVDQLNPQVASRLATTFSSYRRYDEKRQ
ALMRKQLIRIRDSSPSKDTYEVAARCLK*
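
The record ends with a `*` stop marker, which affects how the sequence length is counted. The sketch below parses a single-record FASTA string and reports the length with and without that marker. The helper names `parse_fasta` and `residue_count` and the shortened `toy_record` are illustrative assumptions, not part of this database.

```python
def parse_fasta(text):
    """Return (header, sequence) for a single-record FASTA string."""
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("not a FASTA record")
    header = lines[0][1:]          # drop the leading '>'
    sequence = "".join(lines[1:])  # join the wrapped sequence lines
    return header, sequence

def residue_count(sequence):
    """Count residues, ignoring a trailing '*' stop marker."""
    return len(sequence.rstrip("*"))

# Shortened toy record; the real sequence is listed in full above.
example = """>toy_record
MESGRPRSAT
ALFK*
"""

header, seq = parse_fasta(example)
print(header, len(seq), residue_count(seq))
```

For the full record above, 20 lines of 50 residues plus the final 28-residue line give 1028 residues; counting the trailing `*` as well gives 1029, the value in the header's `length` field.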
Annotated Terms
The following terms have been associated with this polypeptide:
Vocabulary: INTERPRO
Term	Definition
IPR001930	Peptidase_M1
IPR012779	Peptidase_M1_pepN
IPR014782	Peptidase_M1_dom
IPR024601	Peptidase_M1_pepN_C
IPR042097	Aminopeptidase_N-like_N
IPR035414	Peptidase_M1_pepN_Ig-like
IPR038438	PepN_Ig-like_sf
IPR027268	Peptidase_M4/M1_CTD_sf
IPR037144	Peptidase_M1_pepN_C_sf