mRNA_S-dermatodea_contig11749.1105.1 (mRNA) Saccorhiza dermatodea SderLu1190fm monoicous

You are viewing an mRNA, more information available on the corresponding polypeptide page

Overview
NamemRNA_S-dermatodea_contig11749.1105.1
Unique NamemRNA_S-dermatodea_contig11749.1105.1
TypemRNA
OrganismSaccorhiza dermatodea SderLu1190fm monoicous (Saccorhiza dermatodea SderLu1190fm monoicous)
Homology
BLAST of mRNA_S-dermatodea_contig11749.1105.1 vs. uniprot
Match: D8LG47_ECTSI (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D8LG47_ECTSI)

HSP 1 Score: 219 bits (558), Expect = 1.200e-59
Identity = 212/499 (42.48%), Postives = 239/499 (47.90%), Query Frame = 1
Query:    4 AIWRETFGEEMPAEVAQGSWIDMDTIRDKELREEFASSIAPFALLWSALSEWVTPDTRAVCRAGHAAAEAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVIGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXMVSRA------PFPSADFRVKRAGAVAGAVEAELRQRRAVRQRGGSAHDTGSDFQGXXXXXXXXXXXXXXXDGPTVTSALRHSGVSTMVNMHLAQAERALSVAGXXXXXXXXXXXXD---------------NAGGARKKWRWSKTAPDERPGTR-RGGWLEETARDSGAAT---------ATXXXXXXEVVATETMTXXXXXXXXEARGMTAVVGQRGXXXXXXXXXXXXXXXXXXNPIPLSEEKETLVPEN---GGGAECRRAVAAVIDTLDADSASA----AANLSITQWKVVSLVLLTA--LXXXXXXXXXXXXXQVAAAAGV---------EACAWLCGPRASVVSAREFEMLVEVLLEE 1353
            AIWRETFGEEMPAEV+ GSWIDMD +RDKE+REEFASSIAPFALLWSALSEWV+P+TRAVC+AG AA   XXXXXXXXXXX                     +  V                              S A      P P ADF+ ++AGAVA A+EAE+  R A R R     +                      +GPT+T ALRH+GVSTMVNMHLAQAERAL  A         XXXX                N  G+RKKW WS  A   R G R  GGW+EE    + AA          A       +     T             G TA                           P +E            GGGAECRRAVAAVIDTL+A   SA    AA+LSI QWKVVS+VLLTA  L      XXXXXXX    A GV         EACA LCG    V SAREF +LV+V+LEE
Sbjct:  386 AIWRETFGEEMPAEVSSGSWIDMDAVRDKEMREEFASSIAPFALLWSALSEWVSPETRAVCQAGRAAXXXXXXXXXXXXXXSPLPSAGKAKTGGVNGRSLSEIGDVAAEPAGSPAITGASCCDVSEQGPRIDSSHSSSASPGHLGPVP-ADFQARKAGAVAEAIEAEM-SRHAKRNRDRRKANANDSPNKGTGDSCARGGGATVGEGPTLTGALRHTGVSTMVNMHLAQAERALIAAAPTLHVRGGXXXXXXXXGRSGILQTSGQHNNVGSRKKWIWSDKATGARQGARVPGGWVEEEGATAAAAAVAAEAVPLKAPPAGGVCKANGETTGDDVPAVATASGTGTTARGENADAETNQRESSAVALEGRQTREAPAAEPAAAAAAGEHVAGGGAECRRAVAAVIDTLNAAVGSASSATAADLSIAQWKVVSVVLLTAVGLGAGAIGXXXXXXXXXXXAPGVTGEEAEVAAEACARLCGAEMGV-SAREFRILVDVMLEE 881          
BLAST of mRNA_S-dermatodea_contig11749.1105.1 vs. uniprot
Match: A0A6H5K2G0_9PHAE (RNA polymerase II subunit B1 CTD phosphatase RPAP2 homolog n=1 Tax=Ectocarpus sp. CCAP 1310/34 TaxID=867726 RepID=A0A6H5K2G0_9PHAE)

HSP 1 Score: 208 bits (530), Expect = 2.820e-55
Identity = 220/499 (44.09%), Postives = 249/499 (49.90%), Query Frame = 1
Query:    1 RAIWRETFGEEMPAEVAQGSWIDMDTIRDKELREEFASSIAPFALLWSALSEWVTPDTRAVCRAGHAAAEAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX-VIGVXXXXXXXXXXXXXXXXXXXXXXXXXXXXMVSRAPFPS--ADFRVKRAGAVAGAVEAELRQRRAVRQRGG-SAHDTGSDFQGXXXXXXXXXXXXXXXDGPTVTSALRHSGVSTMVNMHLAQAERALSVAGXXXXXXXXXXXXDNAGG---------------ARKKWRWSKTAPDERPGTR-RGGWLEETARDSGAATATXXXXXXEVV-------ATETMTXXXXXXXXEARGM-TAVVGQRGXXXXXXXXXXXXXXXXXXN-PIPLSEEKETLVPEN---GGGAECRRAVAAVIDTLDADSASA----AANLSITQWKVVSLVLLTA--LXXXXXXXXXXXXXQVAAAA----------GVEACAWLCGPRASVVSAREFEMLVEVLLEE 1353
            RAIWRETFGEEMPAEV+ GSWIDMD +RDKE+REEFASSIAPFALLWSALSEWV+P+TRAVCRAG AA   XXXXXXXXXX                       V                                 S   F S  ADF  ++AGAVA A+EAE+  R A R R    A+   S  +G               +GPT+T ALRH+GVSTMVNMHLA AERAL  A         XXXX    G               +RKKW WS TA   R G R  GGW+E+    +      XXXXXX          A E  T         A G  T V G+                        P +E            GGGAECRRAV+AVIDTL+  + SA    AA+LSI QWKVVS+VLLTA  L   XXXXXXXXXX   AAA            EAC+ LCG    V SAREF +LV+V+LEE
Sbjct:  654 RAIWRETFGEEMPAEVSSGSWIDMDAVRDKEMREEFASSIAPFALLWSALSEWVSPETRAVCRAGRAAXXXXXXXXXXXXXPLSSAGKAKTSGVNGRSSSEIGDVAAEPASSPAITAASCCDVSEQSPRIDSSHSSSASPGHFGSVPADFLARKAGAVAEAIEAEM-SRHAKRNRDRRKANPNDSPMKGTGDSSARGGGTTVG-EGPTLTGALRHTGVSTMVNMHLALAERALIAAATTLHVRGGXXXXXXXXGCSGILQTSGQHNNMRSRKKWVWSDTAAGARQGARVPGGWVEKEGPTAAXXXXXXXXXXXXXXXXXXXXKAKEGTTGDDVPAVATASGTGTTVSGKHADAETNQRESPAISVAGRQTGKAPAAEPAAAAAAGEHVAGGGAECRRAVSAVIDTLNVAAGSASSATAADLSIAQWKVVSVVLLTAVGLGTGXXXXXXXXXXXXGAAAPRVTGEEAEVAAEACSRLCGEEMGV-SAREFRILVDVMLEE 1149          
The following BLAST results are available for this feature:
BLAST of mRNA_S-dermatodea_contig11749.1105.1 vs. uniprot
Analysis Date: 2022-09-19 (Diamond blastx: OGS1.0 vs UniRef90)
Total hits: 2
Match NameE-valueIdentityDescription
D8LG47_ECTSI1.200e-5942.48RNA polymerase II subunit B1 CTD phosphatase RPAP2... [more]
A0A6H5K2G0_9PHAE2.820e-5544.09RNA polymerase II subunit B1 CTD phosphatase RPAP2... [more]
back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
S-dermatodea_contig11749contigS-dermatodea_contig11749:4829..6642 +
Analyses
This mRNA is derived from or has results from the following analyses
Analysis NameDate Performed
Diamond blastx: OGS1.0 vs UniRef902022-09-19
OGS1.0 of Saccorhiza dermatodea SderLu1190fm monoicous2021-02-24
Properties
Property NameValue
Ec32 ortholog descriptionProtein of unknown function DUF408
Ec32 orthologEc-11_001970.1
Hectar predicted targeting categoryother localisation
Exons2
Model size1359
Cds size1359
Stop1
Start0
Relationships

The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesTypePosition
1622924215.9660244-CDS-S-dermatodea_contig11749:4828..49361622924215.9660244-CDS-S-dermatodea_contig11749:4828..4936Saccorhiza dermatodea SderLu1190fm monoicousCDSS-dermatodea_contig11749 4829..4936 +
1694079099.2715528-CDS-S-dermatodea_contig11749:4828..49361694079099.2715528-CDS-S-dermatodea_contig11749:4828..4936Saccorhiza dermatodea SderLu1190fm monoicousCDSS-dermatodea_contig11749 4829..4936 +
1622924215.974162-CDS-S-dermatodea_contig11749:5391..66421622924215.974162-CDS-S-dermatodea_contig11749:5391..6642Saccorhiza dermatodea SderLu1190fm monoicousCDSS-dermatodea_contig11749 5392..6642 +
1694079099.2819927-CDS-S-dermatodea_contig11749:5391..66421694079099.2819927-CDS-S-dermatodea_contig11749:5391..6642Saccorhiza dermatodea SderLu1190fm monoicousCDSS-dermatodea_contig11749 5392..6642 +


The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesTypePosition
mRNA_S-dermatodea_contig11749.1105.1prot_S-dermatodea_contig11749.1105.1Saccorhiza dermatodea SderLu1190fm monoicouspolypeptideS-dermatodea_contig11749 4829..6642 +


Sequences
The following sequences are available for this feature:

protein sequence of mRNA_S-dermatodea_contig11749.1105.1

>prot_S-dermatodea_contig11749.1105.1 ID=prot_S-dermatodea_contig11749.1105.1|Name=mRNA_S-dermatodea_contig11749.1105.1|organism=Saccorhiza dermatodea SderLu1190fm monoicous|type=polypeptide|length=453bp
RAIWRETFGEEMPAEVAQGSWIDMDTIRDKELREEFASSIAPFALLWSAL
SEWVTPDTRAVCRAGHAAAEAAAATAVTTTVTTTGTGGDGDGNTASTETQ
ENEVIGVVAASQSKSPPNTESNAATAAGAAAAAAAMVSRAPFPSADFRVK
RAGAVAGAVEAELRQRRAVRQRGGSAHDTGSDFQGGGGGGGGRAGGGGGG
DGPTVTSALRHSGVSTMVNMHLAQAERALSVAGGGGVRGRKGGGDDNAGG
ARKKWRWSKTAPDERPGTRRGGWLEETARDSGAATATAAAATGEVVATET
MTPPEEAKKREARGMTAVVGQRGGVVGDDGGLGGSGGGGGGNPIPLSEEK
ETLVPENGGGAECRRAVAAVIDTLDADSASAAANLSITQWKVVSLVLLTA
LGLGGGKGEGRGGGQVAAAAGVEACAWLCGPRASVVSAREFEMLVEVLLE
EE*
back to top

mRNA from alignment at S-dermatodea_contig11749:4829..6642+

Legend: CDSpolypeptide
Hold the cursor over a type above to highlight its positions in the sequence below.
>mRNA_S-dermatodea_contig11749.1105.1 ID=mRNA_S-dermatodea_contig11749.1105.1|Name=mRNA_S-dermatodea_contig11749.1105.1|organism=Saccorhiza dermatodea SderLu1190fm monoicous|type=mRNA|length=1814bp|location=Sequence derived from alignment at S-dermatodea_contig11749:4829..6642+ (Saccorhiza dermatodea SderLu1190fm monoicous)
AGGGCAATTTGGCGAGAAACTTTCGGCGAAGAGATGCCCGCAGAGGTTGC GCAAGGGTCGTGGATCGACATGGACACCATCCGGGACAAGGAGCTGCGAG AGGAATTTGTGAGTCTCACTTATGTTCCGTCGTATGGCCGTTTTTATTAG CTTTTTATTGGACGAGGGTGAGAAAGACCGGTTGGTCTACATGTAGCATT TCAGATCCCGTGAGCGGTAGGTACCAAACTAGGGATGGTCATACCTCTCC CCGATACCAACTGTGCTGTACGCGTCCTCGGGATTTTATCGGCGGAAGCA ACCAGTCACGTTCGTTCAGCAACGGATTATATTTTGGAGTGTTTCATAGA AAAACAAGCCATGTTTGGGGGTGGCACGCGCGAAGCGAGTCGGGGAAGGC TCCCACGGACTGTCGCACAAGGCGAGGCACACACACATAAACACGCACAA ACGTGAGGGCACGTTCGTCGAGGACGCCTCCCCGAGTGACGCCATGCCCG CACACCCTCACACACCCGCCACCCCCTCTCGCCCCCCCGCCATATCTTAA AAACACGTCACAGGCATCGTCTATCGCACCCTTCGCCCTCTTGTGGAGCG CCCTCTCGGAGTGGGTTACCCCGGACACCCGGGCGGTGTGCCGCGCCGGC CATGCGGCAGCAGAAGCAGCAGCAGCGACAGCGGTAACCACAACGGTAAC GACAACTGGCACCGGGGGCGACGGCGACGGCAACACCGCTTCAACAGAAA CTCAAGAAAACGAGGTCATCGGGGTCGTCGCCGCATCACAATCAAAATCA CCTCCGAACACGGAGAGCAACGCAGCAACGGCGGCGGGGGCGGCGGCGGC GGCTGCGGCGATGGTCTCGCGAGCCCCATTCCCGTCCGCTGATTTCCGGG TGAAGAGGGCCGGCGCGGTTGCGGGCGCGGTGGAAGCGGAGCTCCGGCAG CGCCGCGCGGTGCGTCAAAGAGGCGGTTCTGCTCACGATACTGGTAGCGA TTTCCAAGGTGGCGGTGGTGGCGGCGGCGGCAGGGCAGGGGGGGGAGGGG GAGGGGATGGACCCACCGTCACAAGCGCTCTGCGTCACAGCGGGGTGAGC ACCATGGTGAACATGCACCTGGCGCAGGCGGAGCGCGCGCTTAGCGTTGC CGGCGGCGGCGGCGTGAGAGGTCGAAAGGGTGGTGGTGATGACAACGCGG GGGGGGCTCGGAAAAAATGGAGGTGGTCTAAAACGGCCCCCGACGAGCGG CCGGGGACTAGACGAGGCGGCTGGTTGGAGGAAACTGCGCGGGATTCCGG GGCCGCTACGGCGACGGCGGCGGCGGCGACGGGGGAGGTGGTGGCAACGG AAACGATGACGCCACCAGAAGAAGCGAAGAAACGCGAAGCCCGGGGGATG ACCGCTGTCGTCGGTCAGCGCGGCGGCGTAGTTGGTGACGATGGTGGCCT TGGTGGTAGTGGTGGTGGTGGTGGTGGTAATCCGATCCCGCTCTCGGAGG AAAAGGAGACCCTCGTGCCCGAGAACGGAGGGGGGGCGGAGTGCCGCCGC GCCGTAGCGGCGGTGATCGACACCCTGGACGCCGACAGCGCGAGCGCGGC AGCGAACCTGTCCATCACTCAGTGGAAGGTGGTTTCCTTGGTGCTGCTCA CGGCGCTCGGGCTCGGCGGGGGTAAAGGCGAGGGGCGGGGCGGGGGACAA GTGGCGGCCGCGGCAGGGGTGGAGGCGTGCGCGTGGTTGTGCGGGCCCCG GGCGAGCGTTGTCTCCGCGAGAGAGTTCGAGATGCTTGTGGAGGTGTTAT TGGAGGAGGAGTGA
back to top

Coding sequence (CDS) from alignment at S-dermatodea_contig11749:4829..6642+

>mRNA_S-dermatodea_contig11749.1105.1 ID=mRNA_S-dermatodea_contig11749.1105.1|Name=mRNA_S-dermatodea_contig11749.1105.1|organism=Saccorhiza dermatodea SderLu1190fm monoicous|type=CDS|length=2718bp|location=Sequence derived from alignment at S-dermatodea_contig11749:4829..6642+ (Saccorhiza dermatodea SderLu1190fm monoicous)
AGGGCAATTTGGCGAGAAACTTTCGGCGAAGAGATGCCCGCAGAGGTTGC
GCAAGGGTCGTGGATCGACATGGACACCATCCGGGACAAGGAGCTGCGAG
AGGAATTTAGGGCAATTTGGCGAGAAACTTTCGGCGAAGAGATGCCCGCA
GAGGTTGCGCAAGGGTCGTGGATCGACATGGACACCATCCGGGACAAGGA
GCTGCGAGAGGAATTTGCATCGTCTATCGCACCCTTCGCCCTCTTGTGGA
GCGCCCTCTCGGAGTGGGTTACCCCGGACACCCGGGCGGTGTGCCGCGCC
GGCCATGCGGCAGCAGAAGCAGCAGCAGCGACAGCGGTAACCACAACGGT
AACGACAACTGGCACCGGGGGCGACGGCGACGGCAACACCGCTTCAACAG
AAACTCAAGAAAACGAGGTCATCGGGGTCGTCGCCGCATCACAATCAAAA
TCACCTCCGAACACGGAGAGCAACGCAGCAACGGCGGCGGGGGCGGCGGC
GGCGGCTGCGGCGATGGTCTCGCGAGCCCCATTCCCGTCCGCTGATTTCC
GGGTGAAGAGGGCCGGCGCGGTTGCGGGCGCGGTGGAAGCGGAGCTCCGG
CAGCGCCGCGCGGTGCGTCAAAGAGGCGGTTCTGCTCACGATACTGGTAG
CGATTTCCAAGGTGGCGGTGGTGGCGGCGGCGGCAGGGCAGGGGGGGGAG
GGGGAGGGGATGGACCCACCGTCACAAGCGCTCTGCGTCACAGCGGGGTG
AGCACCATGGTGAACATGCACCTGGCGCAGGCGGAGCGCGCGCTTAGCGT
TGCCGGCGGCGGCGGCGTGAGAGGTCGAAAGGGTGGTGGTGATGACAACG
CGGGGGGGGCTCGGAAAAAATGGAGGTGGTCTAAAACGGCCCCCGACGAG
CGGCCGGGGACTAGACGAGGCGGCTGGTTGGAGGAAACTGCGCGGGATTC
CGGGGCCGCTACGGCGACGGCGGCGGCGGCGACGGGGGAGGTGGTGGCAA
CGGAAACGATGACGCCACCAGAAGAAGCGAAGAAACGCGAAGCCCGGGGG
ATGACCGCTGTCGTCGGTCAGCGCGGCGGCGTAGTTGGTGACGATGGTGG
CCTTGGTGGTAGTGGTGGTGGTGGTGGTGGTAATCCGATCCCGCTCTCGG
AGGAAAAGGAGACCCTCGTGCCCGAGAACGGAGGGGGGGCGGAGTGCCGC
CGCGCCGTAGCGGCGGTGATCGACACCCTGGACGCCGACAGCGCGAGCGC
GGCAGCGAACCTGTCCATCACTCAGTGGAAGGTGGTTTCCTTGGTGCTGC
TCACGGCGCTCGGGCTCGGCGGGGGTAAAGGCGAGGGGCGGGGCGGGGGA
CAAGTGGCGGCCGCGGCAGGGGTGGAGGCGTGCGCGTGGTTGTGCGGGCC
CCGGGCGAGCGTTGTCTCCGCGAGAGAGTTCGAGATGCTTGTGGAGGTGT
TATTGGAGGAGGAGTGAGCATCGTCTATCGCACCCTTCGCCCTCTTGTGG
AGCGCCCTCTCGGAGTGGGTTACCCCGGACACCCGGGCGGTGTGCCGCGC
CGGCCATGCGGCAGCAGAAGCAGCAGCAGCGACAGCGGTAACCACAACGG
TAACGACAACTGGCACCGGGGGCGACGGCGACGGCAACACCGCTTCAACA
GAAACTCAAGAAAACGAGGTCATCGGGGTCGTCGCCGCATCACAATCAAA
ATCACCTCCGAACACGGAGAGCAACGCAGCAACGGCGGCGGGGGCGGCGG
CGGCGGCTGCGGCGATGGTCTCGCGAGCCCCATTCCCGTCCGCTGATTTC
CGGGTGAAGAGGGCCGGCGCGGTTGCGGGCGCGGTGGAAGCGGAGCTCCG
GCAGCGCCGCGCGGTGCGTCAAAGAGGCGGTTCTGCTCACGATACTGGTA
GCGATTTCCAAGGTGGCGGTGGTGGCGGCGGCGGCAGGGCAGGGGGGGGA
GGGGGAGGGGATGGACCCACCGTCACAAGCGCTCTGCGTCACAGCGGGGT
GAGCACCATGGTGAACATGCACCTGGCGCAGGCGGAGCGCGCGCTTAGCG
TTGCCGGCGGCGGCGGCGTGAGAGGTCGAAAGGGTGGTGGTGATGACAAC
GCGGGGGGGGCTCGGAAAAAATGGAGGTGGTCTAAAACGGCCCCCGACGA
GCGGCCGGGGACTAGACGAGGCGGCTGGTTGGAGGAAACTGCGCGGGATT
CCGGGGCCGCTACGGCGACGGCGGCGGCGGCGACGGGGGAGGTGGTGGCA
ACGGAAACGATGACGCCACCAGAAGAAGCGAAGAAACGCGAAGCCCGGGG
GATGACCGCTGTCGTCGGTCAGCGCGGCGGCGTAGTTGGTGACGATGGTG
GCCTTGGTGGTAGTGGTGGTGGTGGTGGTGGTAATCCGATCCCGCTCTCG
GAGGAAAAGGAGACCCTCGTGCCCGAGAACGGAGGGGGGGCGGAGTGCCG
CCGCGCCGTAGCGGCGGTGATCGACACCCTGGACGCCGACAGCGCGAGCG
CGGCAGCGAACCTGTCCATCACTCAGTGGAAGGTGGTTTCCTTGGTGCTG
CTCACGGCGCTCGGGCTCGGCGGGGGTAAAGGCGAGGGGCGGGGCGGGGG
ACAAGTGGCGGCCGCGGCAGGGGTGGAGGCGTGCGCGTGGTTGTGCGGGC
CCCGGGCGAGCGTTGTCTCCGCGAGAGAGTTCGAGATGCTTGTGGAGGTG
TTATTGGAGGAGGAGTGA
back to top