prot_C-linearis_contig105.960.1 (polypeptide) Chordaria linearis ClinC8C monoicous

You are viewing a polypeptide; more information is available on the corresponding mRNA page.

Overview
Name: mRNA_C-linearis_contig105.960.1
Unique Name: prot_C-linearis_contig105.960.1
Type: polypeptide
Organism: Chordaria linearis ClinC8C monoicous
Sequence length: 1875
Homology
BLAST of mRNA_C-linearis_contig105.960.1 vs. uniprot
Match: A0A6H5JT89_9PHAE (Uncharacterized protein n=1 Tax=Ectocarpus sp. CCAP 1310/34 TaxID=867726 RepID=A0A6H5JT89_9PHAE)

HSP 1 Score: 357 bits (915), Expect = 6.960e-96
Identity = 220/379 (58.05%), Positives = 250/379 (65.96%), Query Frame = 0
Query:  519 QAYEAELAERYRQGGRKARFGNGGRTGATGAPASAAAARGGR---PSSGGGATWEGAGA---PPSHGGPGWREPAEYWGGG-------PGYHPWGGP-MMP-EQHHLGSPP-VSKRHLLPWPIEKIEEMEAMGAGVRAGEDDETWVRIAAAMSSSVSEAKDQWASYCEMMRHKRWMSGSGAGDGSAGSSRAMMPVVPAGGPPAPPPWAAGEHPSWRYRTE---------AAAGYPS----RGGVPPPEAGAYRGALAYS-APPTGPMPGPPGRPASVVGRCAFCGEMWNTNSPTAAAAIQAARWCCPACESGAQPGPMPRHWEDEEGYIAMPRIAPGHHHPYYSAXXXXXXXXXXXXVAAGEHPFHGRSPRGGWPERSR 867
            QAY+AELAERYRQGGRKARFG  GR G +      +A+ GGR   PSSGGG  W+ + A   P + GG  W E  +YWGG        PGY  WGG  M+P EQH L  PP VS+RHLLPWP+EKI+EMEAMGAG R GEDD+ WVRIAA MSS+++EAKDQWA+YCEMMRH+RWMSGS +                 G PP P  WAA +HPSWRYR+E         AAAGYP     R  +PPPEAG YR  + YS AP  GP P P  RP+SVVGRC+FCGE W+T+SP AAAAI+A RWCCPACESG QPGPMPRHWEDE+GYIAMPR+ PG H PYYS       XXXX     G     GR P   W ER R
Sbjct:  575 QAYDAELAERYRQGGRKARFGMAGRVGVS------SASHGGRGRPPSSGGGPPWDASSAVERPLAGGGGRWSE--QYWGGHEPRGAMPPGYQRWGGTTMVPREQHDLAGPPSVSRRHLLPWPLEKIKEMEAMGAG-RGGEDDDAWVRIAARMSSTIAEAKDQWAAYCEMMRHERWMSGSSSAXXXXXXXXXXXXXXXXGAPPGPARWAAEQHPSWRYRSEPTGVSAAAAAAAGYPGGMPPRSRLPPPEAGPYRQPVGYSVAPGAGPSP-PSARPSSVVGRCSFCGEAWSTDSPAAAAAIKAGRWCCPACESGTQPGPMPRHWEDEDGYIAMPRVMPGQH-PYYSVEAAAASXXXXEAAIRG-----GRHPPAAWVERGR 937          
BLAST of mRNA_C-linearis_contig105.960.1 vs. uniprot
Match: D7G3G2_ECTSI (Uncharacterized protein n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D7G3G2_ECTSI)

HSP 1 Score: 313 bits (801), Expect = 3.370e-82
Identity = 211/406 (51.97%), Positives = 232/406 (57.14%), Query Frame = 0
Query:  519 QAYEAELAERYRQGGRKARFGNGGRTGATGAPASAAAARGGRPSSGGGATWEGAGAPPSHGGPG---------------------------WREPAEYWGGG-------PGYHPWGGPMMP-EQHHLGSPP-VSKRHLLPWPIEKIEEMEAMGAGVRAGEDDETWVRIAAAMSSSVSEAKDQWASYCEMMRHKRWMSGSGAGDGSAGSSRAMMPVVPAGGPPAPPPWAAGEHPS-WRYRTEAAAGYPSRGGV-------------PPPEAGAYRGAL-AYS-APPTGPMPGPPGRPASVVGRCAFCGEMWNTNSPTAAAAIQAARWCCPACESGAQPGPMPRHWEDEEGYIAMPRIAPGHHHPYYSAXXXXXXXXXXXXVAAGEHPFHGRSPRGGWPERSRPHGVG 872
            QAY+AELAERYRQGGRKARFG  GR G + A                           SHG                              W E  EYWGG        PGY  WGG M+P EQH L  PP VS+RHLLPWP+EKIEEMEAMGAG R GEDDE WVRIAA MSS+++EAKDQWA+YCEMMRH+RWMSGS                   G PP P  WAA +HPS WRYR+E                        PPPEAG YR ++  YS AP  GP P PP R +SVVGRC+FCGE W+T+SP AAAAI+A RWCCPACESG QPGPMPRHWEDE+GYIAMPR+ PG H PYYS    XXXXXXXXX          R P   W ER RP  VG
Sbjct:  571 QAYDAELAERYRQGGRKARFGMAGRVGVSSA---------------------------SHGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRWSE-HEYWGGHEPGGAVPPGYQRWGGMMVPPEQHDLAGPPSVSRRHLLPWPLEKIEEMEAMGAG-RGGEDDEDWVRIAARMSSTIAEAKDQWAAYCEMMRHQRWMSGSSXXXXXXXXXXXXXXXXXXGAPPGPARWAAEQHPSSWRYRSEPTGASAXXXXXXXXXXXXXXXXXRPPPEAGPYRQSVQGYSVAPGAGPSP-PPARSSSVVGRCSFCGEAWSTDSPAAAAAIKAGRWCCPACESGTQPGPMPRHWEDEDGYIAMPRVMPGQH-PYYSVEAAXXXXXXXXXXX-----XXXRHPPAAWAERGRPPTVG 940          
BLAST of mRNA_C-linearis_contig105.960.1 vs. uniprot
Match: D7G8Z3_ECTSI (Zn(2)-C6 fungal-type domain-containing protein n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D7G8Z3_ECTSI)

HSP 1 Score: 70.5 bits (171), Expect = 2.520e-8
Identity = 32/49 (65.31%), Positives = 40/49 (81.63%), Query Frame = 0
Query:   23 TTPRLRKSCDPCSLAKRRCDGQPQCSLCRKKGLPCVYGERQKSGPKGRK 71
            +T ++R+SCD C+LAKRRCDG+ +CSLC KK + CVY  RQKSGPKG K
Sbjct:   20 STLKMRRSCDACALAKRRCDGELRCSLCCKKSIRCVYSTRQKSGPKGHK 68          
BLAST of mRNA_C-linearis_contig105.960.1 vs. uniprot
Match: A0A6H5JWH9_9PHAE (Zn(2)-C6 fungal-type domain-containing protein n=1 Tax=Ectocarpus sp. CCAP 1310/34 TaxID=867726 RepID=A0A6H5JWH9_9PHAE)

HSP 1 Score: 68.2 bits (165), Expect = 1.390e-7
Identity = 31/49 (63.27%), Positives = 39/49 (79.59%), Query Frame = 0
Query:   23 TTPRLRKSCDPCSLAKRRCDGQPQCSLCRKKGLPCVYGERQKSGPKGRK 71
            +T ++R+SCD C+LAKRRCDG+ +C LC KK + CVY  RQKSGPKG K
Sbjct:   20 STLKMRRSCDACALAKRRCDGELRCLLCCKKSIRCVYSTRQKSGPKGHK 68          
The following BLAST results are available for this feature:
BLAST of mRNA_C-linearis_contig105.960.1 vs. uniprot
Analysis Date: 2022-09-16 (Diamond blastp: OGS1.0 vs UniRef90)
Total hits: 4
Match Name         E-value    Identity  Description
A0A6H5JT89_9PHAE   6.960e-96  58.05     Uncharacterized protein n=1 Tax=Ectocarpus sp. CCA...
D7G3G2_ECTSI       3.370e-82  51.97     Uncharacterized protein n=1 Tax=Ectocarpus silicul...
D7G8Z3_ECTSI       2.520e-8   65.31     Zn(2)-C6 fungal-type domain-containing protein n=1...
A0A6H5JWH9_9PHAE   1.390e-7   63.27     Zn(2)-C6 fungal-type domain-containing protein n=1...
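
The Identity and Positives percentages reported for each HSP are the matched count divided by the alignment length. The following is a minimal Python sketch that recomputes them from the counts listed in the HSP lines above. The function name `percent` and the `hsps` dictionary are illustrative only and are not part of BLAST, Diamond, or this database.

```python
def percent(matches: int, alignment_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    return round(100.0 * matches / alignment_length, 2)


# (identical positions, positive positions, alignment length),
# copied from the "Identity = .../..." lines of the four HSPs above.
hsps = {
    "A0A6H5JT89_9PHAE": (220, 250, 379),
    "D7G3G2_ECTSI": (211, 232, 406),
    "D7G8Z3_ECTSI": (32, 40, 49),
    "A0A6H5JWH9_9PHAE": (31, 39, 49),
}

for name, (identical, positive, length) in hsps.items():
    print(
        f"{name}: identity {percent(identical, length):.2f}%, "
        f"positives {percent(positive, length):.2f}%"
    )
```

Note that the two Zn(2)-C6 hits have higher percent identity than the two top hits but cover only 49 aligned positions, which is why their bit scores and E-values are much weaker.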
InterPro
Analysis Name: InterProScan on OGS1.0
Date Performed: 2022-09-29
IPR Term   IPR Description                                      Source       Source Term  Source Description           Alignment
IPR001138  Zn(2)-C6 fungal-type DNA-binding domain              SMART        SM00066      gal4_2                       coord: 25..68; e-value: 5.8E-7; score: 39.1
IPR001138  Zn(2)-C6 fungal-type DNA-binding domain              PFAM         PF00172      Zn_clus                      coord: 29..64; e-value: 1.3E-6; score: 28.4
IPR001138  Zn(2)-C6 fungal-type DNA-binding domain              PROSITE      PS00463      ZN2_CY6_FUNGAL_1             coord: 30..57
IPR001138  Zn(2)-C6 fungal-type DNA-binding domain              PROSITE      PS50048      ZN2_CY6_FUNGAL_2             coord: 30..59; score: 8.764
None       No IPR available                                     PFAM         PF13921      Myb_DNA-bind_6               coord: 1211..1274; e-value: 4.6E-6; score: 26.8
None       No IPR available                                     GENE3D       1.10.10.60   -                            coord: 1207..1272; e-value: 6.2E-11; score: 44.1
IPR036864  Zn(2)-C6 fungal-type DNA-binding domain superfamily  GENE3D       4.10.240.10  -                            coord: 22..82; e-value: 1.2E-7; score: 33.6
IPR036864  Zn(2)-C6 fungal-type DNA-binding domain superfamily  SUPERFAMILY  57701        Zn2/Cys6 DNA-binding domain  coord: 24..63
IPR017877  Myb-like domain                                      PROSITE      PS50090      MYB_LIKE                     coord: 1203..1267; score: 9.256
IPR009057  Homeobox-like domain superfamily                     SUPERFAMILY  46689        Homeodomain-like             coord: 1205..1271

Alignments
The following features are aligned
Aligned Feature       Feature Type  Alignment Location
C-linearis_contig105  contig        C-linearis_contig105:476449..489310 +
Analyses
This polypeptide is derived from or has results from the following analyses
Analysis Name                                   Date Performed
InterProScan on OGS1.0                          2022-09-29
Diamond blastp: OGS1.0 vs UniRef90              2022-09-16
OGS1.0 of Chordaria linearis ClinC8C monoicous  2021-02-24
Relationships

This polypeptide derives from the following mRNA feature(s):

Feature Name                     Unique Name                      Species                               Type  Position
mRNA_C-linearis_contig105.960.1  mRNA_C-linearis_contig105.960.1  Chordaria linearis ClinC8C monoicous  mRNA  C-linearis_contig105 476400..491569 +


Sequences
The following sequences are available for this feature:

polypeptide sequence

>prot_C-linearis_contig105.960.1 ID=prot_C-linearis_contig105.960.1|Name=mRNA_C-linearis_contig105.960.1|organism=Chordaria linearis ClinC8C monoicous|type=polypeptide|length=1875bp
MEPGPTAGPRRAAGGRRERNVSTTPRLRKSCDPCSLAKRRCDGQPQCSLC
RKKGLPCVYGERQKSGPKGRKGGLPAPQGPSRHQQQHRPGPPVEPARHPA
MYPPHAALGGLGPGPPPGPRPRDYEDDDGAEGFDVRPLSSRVVHASSSVR
MADGGPGVSSSGESMPGAWPGHDGGGPGGYYNHYGFRRSAGVAGGGGTGG
LGGHVPDAADDDRFHAYPSKYRRTAGAGWATRHPHGGGGGRGGGGGGGGG
GRPIGGETPRYLEGPPPMANQTGGDVSERQSISKSNAPAGGRPRSPLSSQ
WRGSGGGGGQGQWRAVSPSNVSRADVNAAPGSAAPAPGAENPQAKPYGGG
DGRDGGHQNNPAWKPAVCWQARRATPPRMGEPAADEDTGPKEEGPSGDGG
GGGGGRAAAAGHSSSSSRPSPSYPSRNGGGEGERRDDHAAASRPPSGSPS
GGSQRRQQQQPGDAEAGGKSGGGGHHGEGGRPGAAIAVLPPPPSSGRGGP
PPPSQYAEDPADHHHYHQQAYEAELAERYRQGGRKARFGNGGRTGATGAP
ASAAAARGGRPSSGGGATWEGAGAPPSHGGPGWREPAEYWGGGPGYHPWG
GPMMPEQHHLGSPPVSKRHLLPWPIEKIEEMEAMGAGVRAGEDDETWVRI
AAAMSSSVSEAKDQWASYCEMMRHKRWMSGSGAGDGSAGSSRAMMPVVPA
GGPPAPPPWAAGEHPSWRYRTEAAAGYPSRGGVPPPEAGAYRGALAYSAP
PTGPMPGPPGRPASVVGRCAFCGEMWNTNSPTAAAAIQAARWCCPACESG
AQPGPMPRHWEDEEGYIAMPRIAPGHHHPYYSAEAAAASAAASAAVAAGE
HPFHGRSPRGGWPERSRPHGVGRPHGSGSPSERNPTWREEEADIQAEAAA
RQAASRRSHGSKSPATVMGKPQQQQQQQQSPATTRLEHDNAPPFASAAAS
SPPSPGNGSKNNGDLRRSSPPQTSSSSGGGGGDQQQHQPQQNRRSSSPVL
AAGASPREDNNSGDSNGSGGGGGRAPGDRGGSGRGSASPAPDAARGSAGA
SGSGKPLAAHGSSSGDRGEGGSDAPPLSSSSRGFGPPAEEGKALGVPASV
PASFPACCCAETRGLFPAAALLLDEFWPRRVENSGSGSVLMGSQRVRTAG
FTTDGYHDRWQATSNVNNTVNTRASCPSVAPRSGERGAVDPLEESGYTTD
QAGTPTRDSQWPREDDEKLVELVEARANANGNRGERLRESEINWQRIASN
FDGRTAAQCEARWSEHVNPALQEQQASSRRPEGGHATANASSAGNARQEA
SAAAAAAAAATGGSEASKETNDASADDRCQNSQGGAPRHWRQGGAGRNAS
PGPGPRHTEHAAAPHHSDGGGYPPYVPKHNGADNGGKPDPRGDAGWRHHH
HQPRQPLQYRSHPRPPPQHPHHRSAGSGSGVSITQGMPQHSRMVPVPPPE
SSSRAVASWGPHAPPPPGSSRRPPPHHWMVNGGGAEGRGEAHRLDEYGRI
MHETAPPRAPAPHGMALHPGGAVVMMEVAGGGADFTDRGRGGGRGVSQSG
AAAEEEPYGWYVGGNNKGRGAPPPAGPPADSRRGYPGEEETGSRRGEAGQ
RSPPRPPQHQQGSGGGTGATPAGRPAPPTNGGSRGGSAGTSTSRSPSRES
PKATLMARVAEGARSGGGEGERGGRPDNGSGGSSSPGNNNRPPPKQQKYA
DDDEQQRGSKRGGGTATVPSPPQQQKKKLQRTEERTTGGGGDGGDGCSFV
RKKSAVDSLTTEGWTFRTSTGGAGRGGISTPPKAEKTSPSCSSSTTSKQQ
HQSAASDGDGGRVKLCSSATPAAAAAIAKNAASSAVDAKGGERTAARAAT
AKQEEEDGRHRGQRMNAWEGGRNK*
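
The FASTA header above packs its metadata as `key=value` fields separated by `|`, after the record identifier. The following is a minimal Python sketch, under stated assumptions, for splitting such a record into its identifier, header fields, and sequence. The function name `parse_fasta_record` is hypothetical, and the record used here is a toy example rather than the sequence above. Whether the reported `length` counts the terminal `*` stop symbol is an assumption to check against the real record, not something this page states.

```python
def parse_fasta_record(text: str):
    """Split one FASTA record into (record_id, header_fields, sequence).

    Assumes the header has the form '>ID key=value|key=value|...',
    as in the record shown above.
    """
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")

    header = lines[0][1:]
    # The identifier is everything up to the first space.
    record_id, _, rest = header.partition(" ")

    fields = {}
    for part in rest.split("|"):
        key, sep, value = part.partition("=")
        if sep:  # keep only well-formed key=value pairs
            fields[key] = value

    # Sequence lines are concatenated; a trailing '*' (stop) is kept as-is.
    sequence = "".join(lines[1:])
    return record_id, fields, sequence


# Toy record in the same header style (not the real sequence).
toy = """>toy_prot ID=toy_prot|organism=Toy organism|type=polypeptide|length=6bp
MEPGP
*"""

record_id, fields, sequence = parse_fasta_record(toy)
residues = sequence.rstrip("*")  # residues without the stop symbol
print(record_id, fields["type"], fields["length"], len(sequence), len(residues))
```

Comparing `len(sequence)` and `len(residues)` with the header's `length` field is a quick way to see which convention a given export uses.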
Annotated Terms
The following terms have been associated with this polypeptide:
Vocabulary: INTERPRO
Term       Definition
IPR001138  Zn2-C6_fun-type_DNA-bd
IPR036864  Zn2-C6_fun-type_DNA-bd_sf
IPR017877  Myb-like_dom
IPR009057  Homeobox-like_sf