prot_C-linearis_contig103.825.1 (polypeptide) Chordaria linearis ClinC8C monoicous

You are viewing a polypeptide, more information available on the corresponding mRNA page

Overview
NamemRNA_C-linearis_contig103.825.1
Unique Nameprot_C-linearis_contig103.825.1
Typepolypeptide
OrganismChordaria linearis ClinC8C monoicous (Chordaria linearis ClinC8C monoicous)
Sequence length2244
Homology
BLAST of mRNA_C-linearis_contig103.825.1 vs. uniprot
Match: D8LKD1_ECTSI (Aardvark n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D8LKD1_ECTSI)

HSP 1 Score: 463 bits (1191), Expect = 1.490e-134
Identity = 659/1175 (56.09%), Postives = 744/1175 (63.32%), Query Frame = 0
Query:    1 MFGRKRGAKGXXXXXXXXXXXXXXXXXXXXKKAVRRLLDKPGPDRMADVASMAALENVKPFDRSALGEADACDAVAIVMAMFPDHRLIQVEACKAVEALADGNGENVQLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNLFHGEANIEKARNQDKDNPARCKDSKIPRNNCSRNLGRGRACELVTGALRQFPMDQDVQIEACGAVANLANNSKSNRAKLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXEIRTRGKQALLWLTTKSNSKKPQSTNKLTGPKTRRENARSKEKFRAKSFSTSSVGGPGG--GNRGYRGGEEQEDYGSSDGEMEDVIFETEDD------------YSDSVTSFGDDEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKQEGWEDDDVASMASSVDYGGLASTGPVPPAAEVAG----GRHSXXXXXXXXXXXXXXXXXXXXXXXXXXXAAIGVV---PTPPNNRPRSKAEQPIGALLEDFDSDGEGSYGPGSG----VESVEVAAQDHLQAVAESEPVAGGGKVAVGPSPPKVQAPSPDMIIGSGGDPAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXARESASRAGGDPATRNSTAAHVSAATTGVEEWDADGFEANDTDSAVPPAREAMQQLERAAARVVRGERAAAAPGSGYHRGTESRGGSRSSNAGVSRRSSAAEGSAMSSGAVPLPTDFPGDVRDSGGHGSRSEVSFGSARSSSHRRRQSAAELLLDPAMASAPESAKLIHEDEAVARSLQTITGRDPVWFANMLPNVERLVQRSREMEAATWEVRDNLLRSKDSPLKGSARRREYFDVLYLGLCGAHIAAAVVARGGIVADSKDGWLRKASALVRIVCLACPCAKSGVELLGQALQLADRVKMADYLEQVAKSSKTPRDVSYLVEHATLQLL-REGRAETDRRSLDAMKAFDNPTDEWLPAWLLEEAAVEGDQHDGSVFVVPNRASGPPETEYARRAAMVDGCTIIKAIGKRSLRKMEREDDKVAVLVEAVARSIAKTTGG 1149
            MFGRKRGAK                     KKAV+RLL+KPGP+RMA +A++AA EN+ PF+ SALGEADAC+AVA VM+MFP+HR +QVEAC+AVEALADG  ENVQLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX NLF GE                                                               LANNSKSNR   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    RG+QAL WL++K+      S  KLTGP++RR+ AR + +  + SF  SS G      GNRG RGG +QE  GS D      + +TEDD              DS TS+G D+ XXXXXXXXXXXXXXXXXXXXXXXX    XXXXXXXXXX    W+DD + S+ SSV   G A    +   A V G    G                                +G+V   P PP NRP S+AEQP+GALLEDFDSDGEG+YGP                        +E  +GGGK AVGPSPP+ Q               XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                              +        +D     PP R AM Q                  GS  HR   SRG   S +AG SR+SS  + S  S+GAV +P++FP   + S    SRSE SFGS  SS+ RRR SA E+L+ PAMA A   + L+ +DEAVA +LQTITG++PVWF  M+ NVERLVQRS EME+        L R KDSP   +  R EYFDVLY GLCGAHIAAAVVARGGIVA+SKDGWLRKASALVRIVCLACP  K GVELLG+ALQLA+R KMAD++E VAKSS++PR VS L E   L+L+  EGRAE D +SL+AM+  ++PTDEWLP WLLEEAAVEGDQ DG V V   RA G P TE ARRAA+VDGCT+IK IGKRS+ K+  +++K       V +  A+   G
Sbjct:    1 MFGRKRGAKASRGASKGNAKA---------KKAVKRLLEKPGPERMAVLANLAAAENLHPFNGSALGEADACNAVATVMSMFPEHRQVQVEACRAVEALADGTEENVQLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXTNLFSGEXXXX-------------------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLANNSKSNRIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRGEQALEWLSSKNRVGTTLSMKKLTGPESRRQQARVRART-SSSFGGSSFGAGDNTRGNRG-RGGRDQERKGSFDS-----VDDTEDDDXXXXXXXXXXXXYDSATSYGGDDXXXXXXXXXXXXXXXXXXXXXXXXXGPLEXXXXXXXXXXXXXXWDDDGM-SVVSSVVSTGKAGKKMMMTGATVLGSDGNGEEEEAHSRQSSAELTKVVTPVAVTAPIPSTVPLGIVGLAPVPPPNRPLSRAEQPMGALLEDFDSDGEGNYGPXXXEXXXXXXXXXXXXXXXXXXXXNEQASGGGKDAVGPSPPRAQPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXD--------DDVQQVAPPTRSAMAQR-----------------GSSGHRKEGSRGSQSSRSAGGSRKSSMTDMSLRSAGAVSVPSEFPSKQKFS----SRSERSFGSGLSSA-RRRMSAEEVLMTPAMAMASAVSGLVRKDEAVAYTLQTITGKNPVWFPGMVTNVERLVQRSIEMESVMVAALSELRRFKDSPYDDTLARGEYFDVLYRGLCGAHIAAAVVARGGIVAESKDGWLRKASALVRIVCLACPSIKPGVELLGKALQLAERDKMADHVEHVAKSSRSPRHVSSLAEQTALRLVFEEGRAEADEKSLEAMRRLEHPTDEWLPVWLLEEAAVEGDQGDGGVVVAGLRARGAPGTETARRAAVVDGCTMIKEIGKRSVYKIVMQENKEGRFGAEVGKKAARAGRG 1103          
BLAST of mRNA_C-linearis_contig103.825.1 vs. uniprot
Match: A0A6H5L1U6_9PHAE (Uncharacterized protein (Fragment) n=1 Tax=Ectocarpus sp. CCAP 1310/34 TaxID=867726 RepID=A0A6H5L1U6_9PHAE)

HSP 1 Score: 67.4 bits (163), Expect = 3.570e-7
Identity = 94/250 (37.60%), Postives = 111/250 (44.40%), Query Frame = 0
Query: 1869 AGSSVASDANGLPEENMFFNXXXXXXXXXXXXXXXXXHSRGSSCSKDSMSEALARHTRPTSRGSGSIRGGNRQETP----AELAEAEAEANSRNSYNRSVAAAVSA--NPGYADSACGSVAVLDR----------PFDAHHPDYDDDDLGKGK------AAKKMAVSGRGDGHSHRTSHGGGGHSRKDGSGSPERSSSXXXXXXXXXXXXXXXXXXXXXXXXGRRPQRDAPSSPAGYIHNAEKHARRRQE 2096
            +G+S  + +NGLPEENMFF          XXXXXXXX                 +H R +SR     R   +QE P    AE A     +  R SY  S+AAA  A  NPGYADSACGSV +LDR          P+D    D      G G       AA   A SGRGDG         G   R   SGS ERS+S                         R+ ++  PSSPAG+IH AE+ ARRRQE
Sbjct:  675 SGNSDKNASNGLPEENMFFMRKQKTKWRGXXXXXXXXXXXXXXXXXXXX-RLREKHQRSSSRTRLHDRITEQQEPPGVGAAEAARLREASEERKSYGGSIAAAAKAVANPGYADSACGSVVILDRESGMGLSVLNPYDEVVVDRTGSGSGGGNKVAIFAAAPSAASSGRGDGRPSAVVAATGATGR---SGSRERSNSSGL----------------------RQLEQRRPSSPAGHIHTAEQKARRRQE 898          
The following BLAST results are available for this feature:
BLAST of mRNA_C-linearis_contig103.825.1 vs. uniprot
Analysis Date: 2022-09-16 (Diamond blastp: OGS1.0 vs UniRef90)
Total hits: 2
Match NameE-valueIdentityDescription
D8LKD1_ECTSI1.490e-13456.09Aardvark n=1 Tax=Ectocarpus siliculosus TaxID=2880... [more]
A0A6H5L1U6_9PHAE3.570e-737.60Uncharacterized protein (Fragment) n=1 Tax=Ectocar... [more]
back to top
InterPro
Analysis Name: InterProScan on OGS1.0
Date Performed: 2022-09-29
IPR TermIPR DescriptionSourceSource TermSource DescriptionAlignment
IPR000225ArmadilloSMARTSM00185arm_5coord: 259..301
e-value: 10.0
score: 13.2
coord: 103..145
e-value: 28.0
score: 9.7
coord: 214..257
e-value: 0.16
score: 21.1
coord: 176..213
e-value: 310.0
score: 1.6
IPR011989Armadillo-like helicalGENE3D1.25.10.10coord: 29..157
e-value: 2.1E-6
score: 28.9
IPR011989Armadillo-like helicalGENE3D1.25.10.10coord: 165..391
e-value: 3.8E-26
score: 93.8
NoneNo IPR availablePANTHERPTHR22895UNCHARACTERIZEDcoord: 260..387
coord: 183..266
coord: 68..149
IPR016024Armadillo-type foldSUPERFAMILY48371ARM repeatcoord: 85..388

Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
C-linearis_contig103contigC-linearis_contig103:143973..166651 +
Analyses
This polypeptide is derived from or has results from the following analyses
Analysis NameDate Performed
InterProScan on OGS1.02022-09-29
Diamond blastp: OGS1.0 vs UniRef902022-09-16
OGS1.0 of Chordaria linearis ClinC8C monoicous2021-02-24
Relationships

This polypeptide derives from the following mRNA feature(s):

Feature NameUnique NameSpeciesTypePosition
mRNA_C-linearis_contig103.825.1mRNA_C-linearis_contig103.825.1Chordaria linearis ClinC8C monoicousmRNAC-linearis_contig103 138239..169183 +


Sequences
The following sequences are available for this feature:

polypeptide sequence

>prot_C-linearis_contig103.825.1 ID=prot_C-linearis_contig103.825.1|Name=mRNA_C-linearis_contig103.825.1|organism=Chordaria linearis ClinC8C monoicous|type=polypeptide|length=2244bp
MFGRKRGAKGSKGGGSRGAKSRKAKKNTKAKKAVRRLLDKPGPDRMADVA
SMAALENVKPFDRSALGEADACDAVAIVMAMFPDHRLIQVEACKAVEALA
DGNGENVQLLGEADCCQLVEAALERFPRDANVQTQGCRAVTNLFHGEANI
EKARNQDKDNPARCKDSKIPRNNCSRNLGRGRACELVTGALRQFPMDQDV
QIEACGAVANLANNSKSNRAKLGKAGACSLVVKCMATFPDNIDVQHAACV
AVGNLANRHTENKKLLFAAGACNKVCAALASFQGDVGVQYMGCGAVGNLS
NSNAQNCARLGEAGACLLVAGGMRAFPDDRNLQHVGCAAVANLAIGNQPN
SGRLVKAGAATAVRQALDLFVEDAEIRTRGKQALLWLTTKSNSKKPQSTN
KLTGPKTRRENARSKEKFRAKSFSTSSVGGPGGGNRGYRGGEEQEDYGSS
DGEMEDVIFETEDDYSDSVTSFGDDEEEEEEGQERDISDISEEGEDWEGW
GGEEGKRDVSDISEEKQEGWEDDDVASMASSVDYGGLASTGPVPPAAEVA
GGRHSRSSSAKKITAPPAAPIIPPPAAATKKTAAIGVVPTPPNNRPRSKA
EQPIGALLEDFDSDGEGSYGPGSGVESVEVAAQDHLQAVAESEPVAGGGK
VAVGPSPPKVQAPSPDMIIGSGGDPAAPPPQQRQQQVSTRGGSSAPSSHK
RGKGSSSSNTARESASRAGGDPATRNSTAAHVSAATTGVEEWDADGFEAN
DTDSAVPPAREAMQQLERAAARVVRGERAAAAPGSGYHRGTESRGGSRSS
NAGVSRRSSAAEGSAMSSGAVPLPTDFPGDVRDSGGHGSRSEVSFGSARS
SSHRRRQSAAELLLDPAMASAPESAKLIHEDEAVARSLQTITGRDPVWFA
NMLPNVERLVQRSREMEAATWEVRDNLLRSKDSPLKGSARRREYFDVLYL
GLCGAHIAAAVVARGGIVADSKDGWLRKASALVRIVCLACPCAKSGVELL
GQALQLADRVKMADYLEQVAKSSKTPRDVSYLVEHATLQLLREGRAETDR
RSLDAMKAFDNPTDEWLPAWLLEEAAVEGDQHDGSVFVVPNRASGPPETE
YARRAAMVDGCTIIKAIGKRSLRKMEREDDKVAVLVEAVARSIAKTTGGG
GGGGSRNTAVATGSPSSKHSRGRSRDMQQPQHRRGREGEEARPQAPPTAA
STIHYPPISPSTLPPGQGTVSSPARSALGAARQAQEEGARAADADAAKVK
KAERTAARRAEAAVGAPSLVTSDTNSNVEPSHPPKPYGSRGGLSRGRIGS
GSSTGSTATAATATAAAAVAAVRARSATSATIPASPSSEEEPPGKVSRSI
RRPSAGAASAAEDAAAAAAAAAAASSAAPAERVRKTPGKITVPDILKQST
KPPKQASIADVSGSIVKKPRGGGGAGAGAAGSASVSASSSAGKPAAVKPK
SYASTAAAAAASSAATTAVLKKSVVNARPPPTVTLVKGHVRRGSASSTSP
EGTTRASHGPTTGGLSTFAARSIVDLVSGSGKSSRTGMATAEKPADDAPK
GDSAIGEAREPSRGAIQESALSGASATARGDSNRAARTERRSTSSVERKQ
GPRKLRLAGKGCSERRRRALGPDKGKDEAAAAVAAEGGTGTPRGRSPQKR
TPSGVSGGASAAGARGGGRRSPRGRQGSSPKQAAAVATAQRSWGRSGSWY
EPRRRGISPALVAAPALPGQEEGDEDEDDDDDDDDDDDDEAAIGGGGGNS
VSGSDGDSYSGSVAALPTPFPPPRPMGVIPLQEKARDAQGRRVSRSRSRS
RSREMVIPEGEPSVVTTTTRGTHGRRGSRGMMPESESQPIVGITTITTSN
EADGPNREAPEPENQSSVAGSSVASDANGLPEENMFFNRRKKVGGSRNSG
GGGGGHSRGSSCSKDSMSEALARHTRPTSRGSGSIRGGNRQETPAELAEA
EAEANSRNSYNRSVAAAVSANPGYADSACGSVAVLDRPFDAHHPDYDDDD
LGKGKAAKKMAVSGRGDGHSHRTSHGGGGHSRKDGSGSPERSSSRSGSKR
HAQQQRQQQRQQQRQQQRGRRPQRDAPSSPAGYIHNAEKHARRRQEKRRG
SDAPHSPASSRKSSGSSKFFSNFLSNSKKTAIAEAAARDSFGSSGDSGPE
GVSGTKGSFGGTRPTHKRRGSGSSIGLSPLSKDKYSVPPAANGAYVKSTS
VSQLIARNEKRISGGVGGGDGRGHVKSGSGPVVSSRWPFSGAR*
back to top
Annotated Terms
The following terms have been associated with this polypeptide:
Vocabulary: INTERPRO
TermDefinition
IPR000225Armadillo
IPR011989ARM-like
IPR016024ARM-type_fold