mRNA_S-firma_F_contig1037.528.1 (mRNA) Sphaerotrichia firma ET2_F female


Overview
Name: mRNA_S-firma_F_contig1037.528.1
Unique Name: mRNA_S-firma_F_contig1037.528.1
Type: mRNA
Organism: Sphaerotrichia firma ET2_F female
Homology
BLAST of mRNA_S-firma_F_contig1037.528.1 vs. uniprot
Match: D8LRG1_ECTSI (HNH endonuclease family protein n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D8LRG1_ECTSI)

HSP 1 Score: 458 bits (1178), Expect = 1.640e-159
Identity = 239/333 (71.77%), Positives = 263/333 (78.98%), Query Frame = 1
Query:   34 MAHRKRMRT------TWLLGACVVNSL-QGGAAFALPPQQDLLRHSSRHCQVDASSFGIGRAGRAWCGASGPAAVARRGDISMSAKQKRGRKTAGVRSRRGGPRTDVEGYSHDNFIKLRKEVSGQVETSKQGNRRSKKINFSGGIDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYLYFESEILEED 1011
            MAHRKR R       TW LGACV+N+  QG +AFA           S   Q D +S G   A +A        A A R +  +S K K+  K   VRSRR GP  +VEGYSH  F+K++ EV+ QV T+K+G+RR++ +NF+GGIDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRS+RSPSVVVQLPSVIALKEFVNNHR+HPPFTRRNLFLRDSHQCQYC KYF PHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDL+SIGMSL KYP TPTFSQLQ  ARRYPP EIHETW DYLYFESEILEE+
Sbjct:    1 MAHRKRARNNGARAKTWFLGACVMNAAAQGASAFAFSSYS--ASSFSLRGQADPASRGTVLAKQA-------VAPAGRAENLLSMKIKKAGKKP-VRSRRSGPPANVEGYSHAAFVKMKNEVALQVATNKKGSRRTRTVNFNGGIDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSVRSPSVVVQLPSVIALKEFVNNHRTHPPFTRRNLFLRDSHQCQYCQKYFAPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLKSIGMSLAKYPVTPTFSQLQAKARRYPPLEIHETWGDYLYFESEILEEE 323          
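The percentages in each HSP header follow from the residue counts: matches divided by alignment columns. A minimal Python sketch, assuming the two-line header layout shown above; the function name `parse_hsp_stats` and the returned dictionary keys are illustrative and not part of any BLAST tool.

```python
import re

# Statistics lines copied from the first HSP (hit D8LRG1_ECTSI).
hsp_text = (
    "HSP 1 Score: 458 bits (1178), Expect = 1.640e-159\n"
    "Identity = 239/333 (71.77%), Positives = 263/333 (78.98%), Query Frame = 1"
)

def parse_hsp_stats(text):
    """Extract score, E-value and identity/positive counts from an HSP header."""
    score = re.search(r"Score:\s*([\d.]+) bits \((\d+)\)", text)
    expect = re.search(r"Expect = ([\d.eE+-]+)", text)
    ident = re.search(r"Identity = (\d+)/(\d+)", text)
    # "Posi?tives" also accepts the misspelling "Postives" seen in some exports.
    posit = re.search(r"Posi?tives = (\d+)/(\d+)", text)
    matches, columns = int(ident.group(1)), int(ident.group(2))
    positives = int(posit.group(1))
    return {
        "bit_score": float(score.group(1)),
        "raw_score": int(score.group(2)),
        "evalue": float(expect.group(1)),
        "identity_pct": round(100 * matches / columns, 2),
        "positives_pct": round(100 * positives / columns, 2),
    }

stats = parse_hsp_stats(hsp_text)
print(stats["identity_pct"], stats["positives_pct"])
```

The recomputed values can then be compared with the percentages printed in the header as a quick consistency check.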
Match: A0A836CK35_9STRA (HNH endonuclease family protein n=1 Tax=Tribonema minus TaxID=303371 RepID=A0A836CK35_9STRA)

HSP 1 Score: 259 bits (663), Expect = 2.630e-82
Identity = 121/185 (65.41%), Positives = 145/185 (78.38%), Query Frame = 1
Query:  445 IDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYLYFESEI 999
            ++SCP LVLNADYQPLSYLPLSLW WQ+ IKAV+ ++VVVLAT+ DR +RS +V ++LPSVIALK +VN H   P FTRRNL+LRD +QC YC K F   +LSFDHV P+KLGG  +W NVVT+C RCNN K DCHP+ L  IGM LNK PH PTF +LQN ARRYPP  IH+TW +YLY++S I
Sbjct:   78 LESCPALVLNADYQPLSYLPLSLWPWQEAIKAVWMDRVVVLATH-DRYVRSANVELKLPSVIALKGYVNQHAKVPSFTRRNLYLRDGYQCAYCSKQFQAGDLSFDHVTPRKLGGPTSWTNVVTSCHRCNNAKGDCHPKSLSRIGMRLNKAPHIPTFYELQNQARRYPPKLIHKTWAEYLYYDSAI 261          
Match: A0A7S2S5X4_9STRA (Hypothetical protein n=2 Tax=Rhizochromulina marina TaxID=1034831 RepID=A0A7S2S5X4_9STRA)

HSP 1 Score: 228 bits (581), Expect = 3.270e-68
Identity = 106/200 (53.00%), Positives = 141/200 (70.50%), Query Frame = 1
Query:  382 GQVETSKQGNRRSKKINFSGGIDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYL 981
            G V T  +     K +N+   ++ CP LVLNAD+QPLSY+PLS+W WQD IKAVF+E+VVV+A Y +  +RS SV VQ+PSVIALK++V   R+ P FTRRN+FLRDS+ CQYC       +L+FDHV+P+  GG  +W+NVVTAC RCNN+K D  P+ L  +G+SL + P  P+  +LQ  AR++PP  +H TW DYL
Sbjct:  188 GSVSTILEECMVKKSVNYFNNLEQCPVLVLNADFQPLSYIPLSIWSWQDAIKAVFAERVVVVANY-EHEVRSASVAVQVPSVIALKQYVPQARATPTFTRRNVFLRDSYCCQYCGTCAQTQHLTFDHVLPRSKGGDTSWNNVVTACHRCNNKKKDMMPQQLHHLGLSLLREPRVPSHYELQAAARKFPPKVMHHTWRDYL 386          
Match: A0A2E4F2L9_9PROT (HNH endonuclease n=1 Tax=Rhodospirillaceae bacterium TaxID=1898112 RepID=A0A2E4F2L9_9PROT)

HSP 1 Score: 217 bits (553), Expect = 6.560e-67
Identity = 101/185 (54.59%), Positives = 135/185 (72.97%), Query Frame = 1
Query:  445 IDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYLYFESEI 999
            +D+CP LVLNADY+PLSY PLSLW WQ+ +KAVF ++V ++A Y ++ +RSPS  ++LPSVI+LK +V ++R  P FTR N+FLRD   CQYC KYF  H+L+FDHV+PK  GG   W NV+TAC  CN RK +   R +  +GM L   P  PT  +LQN+ RRYPPN +HE+W D+LY++SE+
Sbjct:    3 VDTCPALVLNADYRPLSYFPLSLWSWQETVKAVFLDRVHIVAEY-EQEVRSPSWQMRLPSVISLKRYVQSNR-RPAFTRFNVFLRDGFTCQYCRKYFDTHDLTFDHVLPKSRGGFTNWTNVITACASCNLRKGN---RLMAEVGMHLVYMPRVPTVYELQNVGRRYPPNYLHESWSDFLYWDSEL 182          
Match: A0A7W6WA90_9PROT (5-methylcytosine-specific restriction endonuclease McrA n=2 Tax=Roseospira visakhapatnamensis TaxID=390880 RepID=A0A7W6WA90_9PROT)

HSP 1 Score: 213 bits (542), Expect = 3.190e-65
Identity = 105/189 (55.56%), Positives = 133/189 (70.37%), Query Frame = 1
Query:  433 FSGGIDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYLYFESEI 999
              GG    P LVLNADY+PLSY PLSLW WQD +K+VF ++V V+  Y D  IRSPS+ V+LPSVI+LKEFV+  R  P FTR N+FLRD   CQYC + FP H L+FDHVVP+  GG+ TW+NVVTAC  CN RKS+  P+   +  M L + P  PT   LQ++ R +PPN +HE+W DYLY++SE+
Sbjct:    1 MEGGPPHAPALVLNADYRPLSYFPLSLWPWQDAVKSVFLDRVHVVCEY-DTVIRSPSLEVRLPSVISLKEFVSTARM-PAFTRFNVFLRDRFTCQYCGRRFPTHELTFDHVVPRSWGGRTTWENVVTACAACNLRKSNRTPK---TASMPLMREPRVPTHHALQDVGRAFPPNFLHESWRDYLYWDSEL 184          
Match: UPI001B298D4B (HNH endonuclease n=1 Tax=Rhodospirillales bacterium TMPK1 TaxID=2812848 RepID=UPI001B298D4B)

HSP 1 Score: 213 bits (542), Expect = 4.710e-65
Identity = 102/191 (53.40%), Positives = 135/191 (70.68%), Query Frame = 1
Query:  427 INFSGGIDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYLYFESEI 999
            +  S  +DSCP LVLNAD++PLSY PLSLW WQD +KAVF ++V +L+ Y D+++RSP+  ++LPSVI+LKE+V   R  P FTR N+FLRDS  CQYC   FP H L+FDHV+P+  GG+ TWDNV+TAC  CN RK D   +  R  GM     P  P+  +LQ   R++PPN +HE+W DYLY++SE+
Sbjct:   12 LRLSPPLDSCPALVLNADFRPLSYFPLSLWSWQDTVKAVFLDRVNILSHY-DQTVRSPNFEMRLPSVISLKEYVQATR-RPAFTRFNVFLRDSFTCQYCGTGFPTHELTFDHVIPRSRGGRTTWDNVLTACSACNLRKGD---KLCRECGMHPRLEPFQPSAFELQENGRQFPPNFLHESWRDYLYWDSEL 197          
Match: A0A2E3IPI6_9PROT (HNH endonuclease n=5 Tax=Alphaproteobacteria TaxID=28211 RepID=A0A2E3IPI6_9PROT)

HSP 1 Score: 212 bits (540), Expect = 6.600e-65
Identity = 108/192 (56.25%), Positives = 141/192 (73.44%), Query Frame = 1
Query:  436 SGGIDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYLYFESEILEED 1011
            +G +++ P LVLNAD++PLSYLPLSLWGWQD +KA F  +V V+A Y ++ IRSPS  +QLPSVIALK++V  +R+ P FTR N+FLRD   CQYC   FP  +L+FDHV+P+  GG  +W+NVVTAC +CN RK +   R LR  GM L K P  PT  QLQ++ R +PPN +HE+WEDYLY++S +LE D
Sbjct:    3 TGSLENYPALVLNADFRPLSYLPLSLWGWQDSVKACFQGRVNVVAEY-EQVIRSPSFEMQLPSVIALKDYVPMNRA-PAFTRFNVFLRDHFNCQYCSTRFPVQSLTFDHVIPRSKGGGTSWENVVTACQKCNLRKGN---RYLRDSGMRLIKEPLRPTAYQLQDVGRSFPPNFLHESWEDYLYWDS-VLERD 188          
Match: A0A6I7P8K3_9PROT (HNH endonuclease n=1 Tax=Geminicoccaceae bacterium TaxID=2448052 RepID=A0A6I7P8K3_9PROT)

HSP 1 Score: 211 bits (536), Expect = 3.110e-64
Identity = 101/185 (54.59%), Positives = 132/185 (71.35%), Query Frame = 1
Query:  445 IDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYLYFESEI 999
            +D+CP LVLNADYQPLSY PLSLWGWQD +KA+F ++V V++ Y DR IRSPS  ++LPSV+ALKE+V     +PPFTR NLFLRD   CQYC   F   +L+FDHVVP+  GG+ +WDN+VTAC RCN +K +  P +    GM     P  PT  +LQ   R +PPN +HE+W D+LY+++E+
Sbjct:   12 LDACPALVLNADYQPLSYFPLSLWGWQDAVKAMFLDRVAVVSYY-DREIRSPSFRLRLPSVVALKEYVQQSH-YPPFTRFNLFLRDRFGCQYCGTRFRAEDLTFDHVVPRSRGGQTSWDNIVTACTRCNLQKGNKLPAEC---GMPPRTKPRRPTNYELQRAGRAFPPNYLHESWRDFLYWDTEL 191          
Match: A0A1G7CT82_9PROT (5-methylcytosine-specific restriction endonuclease McrA n=2 Tax=Rhodospirillaceae TaxID=41295 RepID=A0A1G7CT82_9PROT)

HSP 1 Score: 210 bits (535), Expect = 3.620e-64
Identity = 104/183 (56.83%), Positives = 132/183 (72.13%), Query Frame = 1
Query:  451 SCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYLYFESEI 999
            S P LVLNADY+PLSY PLSL  WQD +KAVF ++V V+  Y DR I SPS+ ++LPSVI+LK+FV   R+ P FTR N+FLRD   CQYC + FP H+L+FDHVVP+  GG+ TW NVVTACG CN RKS+  PR   +  M L + PH P+   LQ + R +PPN +HE+W DYLY++SE+
Sbjct:    7 SSPALVLNADYRPLSYFPLSLVPWQDAVKAVFLDRVNVVCEY-DRVIHSPSLEIRLPSVISLKQFVRTART-PAFTRFNVFLRDRFTCQYCGQRFPTHDLTFDHVVPRSWGGRTTWANVVTACGACNLRKSNRTPR---AASMPLRQPPHVPSHHMLQEIGRAFPPNYLHESWRDYLYWDSEL 184          
Match: A0A1G8D236_9PROT (5-methylcytosine-specific restriction endonuclease McrA n=1 Tax=Roseospirillum parvum TaxID=83401 RepID=A0A1G8D236_9PROT)

HSP 1 Score: 210 bits (535), Expect = 3.620e-64
Identity = 104/185 (56.22%), Positives = 129/185 (69.73%), Query Frame = 1
Query:  445 IDSCPCLVLNADYQPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIALKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLGGKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLARRYPPNEIHETWEDYLYFESEI 999
            +++CP LVLNAD++PLSY PLSLW WQD IKAVF E+V V++ Y +R IRSP   V+LPSVI+LKEF    R  P FTR N+FLRD   CQYC + FP H L+FDHVVP+  GG+ TW NVVTACG CN +K    PR     GM L + P  PT   LQ   R +PPN +HE+W D+LY++SE+
Sbjct:    5 LENCPALVLNADFRPLSYFPLSLWSWQDTIKAVFLERVNVVSEY-EREIRSPRQAVRLPSVISLKEFAPVAR-RPAFTRFNVFLRDRFTCQYCGQRFPTHELTFDHVVPRSKGGRTTWHNVVTACGACNLKKGSRLPRQA---GMPLIREPMEPTSPMLQEFGRAFPPNYLHESWRDFLYWDSEL 184          
The following BLAST results are available for this feature:
BLAST of mRNA_S-firma_F_contig1037.528.1 vs. uniprot
Analysis Date: 2022-09-19 (Diamond blastx: OGS1.0 of Sphaerotrichia firma female vs UniRef90)
Total hits: 25
Match Name        E-value     Identity (%)  Description
D8LRG1_ECTSI      1.640e-159  71.77         HNH endonuclease family protein n=1 Tax=Ectocarpus...
A0A836CK35_9STRA  2.630e-82   65.41         HNH endonuclease family protein n=1 Tax=Tribonema ...
A0A7S2S5X4_9STRA  3.270e-68   53.00         Hypothetical protein n=2 Tax=Rhizochromulina marin...
A0A2E4F2L9_9PROT  6.560e-67   54.59         HNH endonuclease n=1 Tax=Rhodospirillaceae bacteri...
A0A7W6WA90_9PROT  3.190e-65   55.56         5-methylcytosine-specific restriction endonuclease...
UPI001B298D4B     4.710e-65   53.40         HNH endonuclease n=1 Tax=Rhodospirillales bacteriu...
A0A2E3IPI6_9PROT  6.600e-65   56.25         HNH endonuclease n=5 Tax=Alphaproteobacteria TaxID...
A0A6I7P8K3_9PROT  3.110e-64   54.59         HNH endonuclease n=1 Tax=Geminicoccaceae bacterium...
A0A1G7CT82_9PROT  3.620e-64   56.83         5-methylcytosine-specific restriction endonuclease...
A0A1G8D236_9PROT  3.620e-64   56.22         5-methylcytosine-specific restriction endonuclease...
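The ten listed hits can be tallied by the suffix of each match name to see how they are distributed. The sketch below is illustrative only: the values are copied from the summary table, and treating the text after the underscore as an organism-group label (for example `9PROT` or `9STRA`) is an assumption about the naming scheme, not something this page states. The helper name `name_suffix` is hypothetical.

```python
from collections import Counter

# (match name, E-value, percent identity) copied from the summary table.
hits = [
    ("D8LRG1_ECTSI", 1.640e-159, 71.77),
    ("A0A836CK35_9STRA", 2.630e-82, 65.41),
    ("A0A7S2S5X4_9STRA", 3.270e-68, 53.00),
    ("A0A2E4F2L9_9PROT", 6.560e-67, 54.59),
    ("A0A7W6WA90_9PROT", 3.190e-65, 55.56),
    ("UPI001B298D4B", 4.710e-65, 53.40),
    ("A0A2E3IPI6_9PROT", 6.600e-65, 56.25),
    ("A0A6I7P8K3_9PROT", 3.110e-64, 54.59),
    ("A0A1G7CT82_9PROT", 3.620e-64, 56.83),
    ("A0A1G8D236_9PROT", 3.620e-64, 56.22),
]

def name_suffix(match_name):
    """Return the text after the last underscore, or "(none)" if there is none."""
    return match_name.rsplit("_", 1)[1] if "_" in match_name else "(none)"

suffix_counts = Counter(name_suffix(name) for name, _, _ in hits)
best_by_identity = max(hits, key=lambda hit: hit[2])
best_by_evalue = min(hits, key=lambda hit: hit[1])

for suffix, count in suffix_counts.most_common():
    print(suffix, count)
print("highest identity:", best_by_identity[0])
```

Only ten of the 25 reported hits appear on this page, so the tally describes the listed rows rather than the full result set.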

Alignments
The following features are aligned
Aligned Feature       Feature Type  Alignment Location
S-firma_F_contig1037  contig        S-firma_F_contig1037:17028..23770 -
Analyses
This mRNA is derived from or has results from the following analyses
Analysis Name                                                      Date Performed
Diamond blastx: OGS1.0 of Sphaerotrichia firma female vs UniRef90  2022-09-19
OGS1.0 of Sphaerotrichia firma ET2_F female                        2021-02-24
Properties
Property Name                        Value
Seed ortholog                        2880.D8LRG1
PFAMs                                HNH_5
Max annot lvl                        2759|Eukaryota
Evalue                               1.76e-160
EggNOG OGs                           COG1403@1|root,2QQI3@2759|Eukaryota
Description                          HNH endonuclease
COG category                         V
Hectar predicted targeting category  chloroplast
Ec32 ortholog description            HNH endonuclease family protein
Ec32 ortholog                        Ec-18_004400.1
Exons                                5
Model size                           1014
Cds size                             981
Stop                                 0
Start                                1
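Two of the property values pack several fields into one string. The sketch below splits them, assuming the layouts `OG@taxid|level name` (comma-separated) for EggNOG OGs and `taxid.protein` for the seed ortholog; both layouts are inferred from the values on this page, and the function names are hypothetical.

```python
def parse_eggnog_ogs(value):
    """Split a comma-separated list of 'OG@taxid|level name' items."""
    groups = []
    for item in value.split(","):
        og_and_taxid, level_name = item.split("|", 1)
        og_id, taxid = og_and_taxid.split("@", 1)
        groups.append((og_id, int(taxid), level_name))
    return groups

def parse_seed_ortholog(value):
    """Split 'taxid.protein' into an integer taxon ID and a protein ID."""
    taxid, protein = value.split(".", 1)
    return int(taxid), protein

ogs = parse_eggnog_ogs("COG1403@1|root,2QQI3@2759|Eukaryota")
seed = parse_seed_ortholog("2880.D8LRG1")
print(ogs)
print(seed)
```

Under that reading, the seed ortholog's taxon ID 2880 is the same TaxID reported for the top BLAST hit, D8LRG1_ECTSI (Ectocarpus siliculosus).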
Relationships

The following polypeptide feature(s) derive from this mRNA:

Feature Name                     Unique Name                      Species                            Type         Position
mRNA_S-firma_F_contig1037.528.1  prot_S-firma_F_contig1037.528.1  Sphaerotrichia firma ET2_F female  polypeptide  S-firma_F_contig1037 17028..23737 -


The following CDS feature(s) are a part of this mRNA:

Feature Name                                              Unique Name                                               Species                            Type  Position
1622815569.301942-CDS-S-firma_F_contig1037:17027..17217   1622815569.301942-CDS-S-firma_F_contig1037:17027..17217   Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 17028..17217 -
1696949422.7978814-CDS-S-firma_F_contig1037:17027..17217  1696949422.7978814-CDS-S-firma_F_contig1037:17027..17217  Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 17028..17217 -
1622815569.3164825-CDS-S-firma_F_contig1037:17728..17913  1622815569.3164825-CDS-S-firma_F_contig1037:17728..17913  Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 17729..17913 -
1696949422.8144484-CDS-S-firma_F_contig1037:17728..17913  1696949422.8144484-CDS-S-firma_F_contig1037:17728..17913  Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 17729..17913 -
1622815569.3298237-CDS-S-firma_F_contig1037:18366..18496  1622815569.3298237-CDS-S-firma_F_contig1037:18366..18496  Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 18367..18496 -
1696949422.8228214-CDS-S-firma_F_contig1037:18366..18496  1696949422.8228214-CDS-S-firma_F_contig1037:18366..18496  Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 18367..18496 -
1622815569.339069-CDS-S-firma_F_contig1037:18815..18966   1622815569.339069-CDS-S-firma_F_contig1037:18815..18966   Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 18816..18966 -
1696949422.8310103-CDS-S-firma_F_contig1037:18815..18966  1696949422.8310103-CDS-S-firma_F_contig1037:18815..18966  Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 18816..18966 -
1622815569.3480766-CDS-S-firma_F_contig1037:23412..23737  1622815569.3480766-CDS-S-firma_F_contig1037:23412..23737  Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 23413..23737 -
1696949422.8396606-CDS-S-firma_F_contig1037:23412..23737  1696949422.8396606-CDS-S-firma_F_contig1037:23412..23737  Sphaerotrichia firma ET2_F female  CDS   S-firma_F_contig1037 23413..23737 -


The following UTR feature(s) are a part of this mRNA:

Feature Name                                              Unique Name                                               Species                            Type  Position
1622815569.3572505-UTR-S-firma_F_contig1037:23737..23770  1622815569.3572505-UTR-S-firma_F_contig1037:23737..23770  Sphaerotrichia firma ET2_F female  UTR   S-firma_F_contig1037 23738..23770 -
1696949422.8497207-UTR-S-firma_F_contig1037:23737..23770  1696949422.8497207-UTR-S-firma_F_contig1037:23737..23770  Sphaerotrichia firma ET2_F female  UTR   S-firma_F_contig1037 23738..23770 -
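The Position columns of the CDS and UTR tables can be cross-checked against the Exons, Model size and Cds size properties. The sketch below copies the positions from those tables and treats them as 1-based inclusive coordinates, which is an assumption. Each segment is listed twice under different unique names, so the code keeps one copy per position. The variable and function names are illustrative.

```python
# Positions copied from the CDS table, including the duplicated rows.
cds_rows = [
    (17028, 17217), (17028, 17217),
    (17729, 17913), (17729, 17913),
    (18367, 18496), (18367, 18496),
    (18816, 18966), (18816, 18966),
    (23413, 23737), (23413, 23737),
]
# Positions copied from the UTR table, also listed twice.
utr_rows = [(23738, 23770), (23738, 23770)]

def total_length(segments):
    """Sum the lengths of 1-based inclusive (start, end) segments."""
    return sum(end - start + 1 for start, end in segments)

cds_segments = sorted(set(cds_rows))   # one copy per distinct position
utr_segments = sorted(set(utr_rows))

cds_size = total_length(cds_segments)
utr_size = total_length(utr_segments)
model_size = cds_size + utr_size
codons = cds_size // 3

print(len(cds_segments), cds_size, utr_size, model_size, codons)
# prints: 5 981 33 1014 327
```

These totals agree with Exons (5), Cds size (981), Model size (1014) and the 327-residue polypeptide. The CDS FASTA header further down reports length=1962bp, which equals 2 x 981; that would be consistent with the duplicated rows being counted twice, although the page does not state the cause.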


Sequences
The following sequences are available for this feature:

protein sequence of mRNA_S-firma_F_contig1037.528.1

>prot_S-firma_F_contig1037.528.1 ID=prot_S-firma_F_contig1037.528.1|Name=mRNA_S-firma_F_contig1037.528.1|organism=Sphaerotrichia firma ET2_F female|type=polypeptide|length=327bp
MAHRKRMRTTWLLGACVVNSLQGGAAFALPPQQDLLRHSSRHCQVDASSF
GIGRAGRAWCGASGPAAVARRGDISMSAKQKRGRKTAGVRSRRGGPRTDV
EGYSHDNFIKLRKEVSGQVETSKQGNRRSKKINFSGGIDSCPCLVLNADY
QPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIA
LKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLG
GKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLA
RRYPPNEIHETWEDYLYFESEILEEDE
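The FASTA header above carries its metadata as `key=value` pairs separated by `|`. A minimal sketch for reading it, assuming that layout holds for every record exported this way; `parse_header` is a hypothetical helper, not part of the database software. The header and sequence are copied from this record.

```python
import re

header = (
    ">prot_S-firma_F_contig1037.528.1 ID=prot_S-firma_F_contig1037.528.1"
    "|Name=mRNA_S-firma_F_contig1037.528.1"
    "|organism=Sphaerotrichia firma ET2_F female|type=polypeptide|length=327bp"
)
sequence_lines = [
    "MAHRKRMRTTWLLGACVVNSLQGGAAFALPPQQDLLRHSSRHCQVDASSF",
    "GIGRAGRAWCGASGPAAVARRGDISMSAKQKRGRKTAGVRSRRGGPRTDV",
    "EGYSHDNFIKLRKEVSGQVETSKQGNRRSKKINFSGGIDSCPCLVLNADY",
    "QPLSYLPLSLWGWQDVIKAVFSEKVVVLATYGDRSIRSPSVVVQLPSVIA",
    "LKEFVNNHRSHPPFTRRNLFLRDSHQCQYCMKYFPPHNLSFDHVVPKKLG",
    "GKGTWDNVVTACGRCNNRKSDCHPRDLRSIGMSLNKYPHTPTFSQLQNLA",
    "RRYPPNEIHETWEDYLYFESEILEEDE",
]

def parse_header(line):
    """Return (sequence id, dict of key=value fields) from a header line."""
    seq_id, _, rest = line.lstrip(">").partition(" ")
    fields = dict(part.split("=", 1) for part in rest.split("|"))
    return seq_id, fields

seq_id, fields = parse_header(header)
sequence = "".join(sequence_lines)
# The length field carries a unit suffix; keep only the leading digits.
declared_length = int(re.match(r"\d+", fields["length"]).group())

print(seq_id, fields["type"], declared_length, len(sequence))
```

The header labels the length in "bp", but for this polypeptide the number matches the residue count of the sequence.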

mRNA from alignment at S-firma_F_contig1037:17028..23770-

>mRNA_S-firma_F_contig1037.528.1 ID=mRNA_S-firma_F_contig1037.528.1|Name=mRNA_S-firma_F_contig1037.528.1|organism=Sphaerotrichia firma ET2_F female|type=mRNA|length=6743bp|location=Sequence derived from alignment at S-firma_F_contig1037:17028..23770- (Sphaerotrichia firma ET2_F female)
CAGACTCTAGAGCAGCAGGCATTTTCGTAGCTAATGGCGCACCGTAAGAG AATGCGAACCACGTGGCTCCTCGGAGCGTGTGTAGTGAACTCGTTGCAAG GCGGGGCAGCGTTTGCTCTGCCTCCTCAGCAGGATTTACTCCGCCACAGC AGTCGCCATTGCCAAGTTGACGCATCGTCGTTCGGAATAGGGCGTGCGGG GCGAGCTTGGTGCGGAGCTTCAGGGCCCGCTGCCGTGGCTCGTAGGGGGG ATATCTCGATGTCGGCGAAGCAGAAGAGGGGACGGAAGACCGCTGGCGTT CGATCGCGGCGGGGCGGGCCTAGGACAGACGTGGAAGGGTACAGCCACGA CAATTTCAGTGAGTGTTTTTTTCGTCTGCCAGCCAGCTGCGATCACTATC TCTGTTGTACAGCAGGAGGTACGCTCGACCAGCGACGAGGTGTGACGTCG GAATCGGCGGCGGGATTGTTCCTTTTGGTTGTGCTGCTGTCCTGTGCACT CGTGGTGCATTCCAAGTGCTGTGAAGGCTGTAGCGGCCGGTTTATTTTCA AACTTCCTGTTGTTACTTCAAATACGCCAGACAGACACCGCCGAGGTTTT TTTTTCCTCGCCGCCGGTTGCTGATTTTTTTCAAAGTTGAAAAGTTGAGG CCTGGATTTTCGATTTTCGAAACAAAAATGATGACACTTTGGAACCAGCG GCACGCGCCCGACACCCGACGCCCCCGACCCATCCCAGAAGCGCTGCGCT GCTGCCTGCCTGCCTGCCTGCCTGGTGTGTTCTCTTCTCTACGTTCTTCT TTCCCCGTCTCTTCAGCACGTGGTTACCCTCCACGTCCTGGCCGTTGGAT TCAGTATCAGTACTAGATCTTGGGTTATCTTCGGAGCCGCTGAATCGGGG TAGGCGGCTCCTCGTAACTGCATTATTAAGCTGTGTGCGGATGCAACAAA CGTGGTGAACTGTGTGGATGAAAAGCGTACCTCAGCCACAGACGATGGTT CAAAGTCTCTGATGCATCTATCGTGCCGGTCGAAGGAACAAAGAAATGCA GACTCGGTAAGAACATTAATTCCACGAGTATACGAATGAACACACCGAAA GTGCATGAGTACCCAACATAAGAACCACGCTGCCCTATATCCATACGACC ACACACCCTACAGAAGATACTGCTCCACATCATCCTCAGCCCCAATGTTC GGCGCACTCCCTTTCTAACTGCGCTTCGACCATTTCATCGACCCAGTCGG AGTTGTGCACCAACTGGAGCTGCTCGTCCCCGTCAAGCTGGTGCTGCTCC TTCATTAGCCACCTGGCATATTCCCTGTACTCGACACTCGGGCACAACGC CCGGCAACACGCCGTCCTCAAGGCTGAACTTCGCCAAACGAGGGGCCGCG GGCTCATCTCTCGCCCTTTCCCGAAGGTCCTTCACGCATCTGCGTTTCGT TCGGCGAAAGGGACGTCAACAATGGAGACGGGAATTTCAGAACGGTGCTT TCATGACAACGGGAACCGGAGGGGCCTCAACAATCAAGACGGGAAATGTT GAAACGACGGCGGACAAAATTGTGCTATCCCGCTTGTGCTATCCCGCTTG TTGACAACAGAGTAGCCTGCGGTTGGTTTCAAACTCCTGGAACTCACACG ACATACATACCTCGCGCACTGGAAGAGCTGTCCCGGCGGGGGCGCGGGCT CTGTTGTTGGCCACCTCTCCTGGCACGTGCGGCAATGCCGAGGGTGGTAC TCGTAGAGGCGTTCGTAGAACTGGGAGCACTTCTGCTGCATCTGCACCAA GAGCAAGCAAGAGCAAACAAAACAACATAAACGAACGGGTTTAATCTACT GCTTCTTCACTTCACCAGGGAAGAACGGGACCAACAGGCAACCGACAAAA TTCCCCCTACGGTACAGTATGTAGAAATAACTCACATGCGTTCCCCTGAT 
GTCGATGTCGAAAATTGGCCTCAGCCCCGAGCGATCTGCGCCAGCGCCGC GACCATCACCGCCGCCGCCACCATCACCATCACCACCGCCGCCCCCACCA CCATCGGCGCCACCATCACCGCCGCCGCCACCATCACCGGCGCCACAATC ACCGCCGCCGCCACAATCACCGCCGCCGCCACCATCACCGCCGCCGCCAC CATCGCCGCCGCCGCCGCCACCAACATCGCCATCACCACCATCATCGCCT CCGCCCGCTCCCTCATCGTAATCGAGATCGTCTAAGAAACGACCACGACG CCTGTACACCCTGTTCCTATTCCGCACATCAGACTCGTACGGGGAGAGAG AGGCCCGCGTTCTCGCCCTGCTCGCCGCAGCAGCCCCCCGACGTTCTTCC CGCGCAGGAGTAGACTCGGTGGCCCGTGTCGACGCCGTTTGCGCTGCATC ACGCTGCCGGCGTGCCTGCCGCTCAGGAGTAGACTGGTCTCTGAGGCGTG CCTGCTCCCTCGCCCTACTCCTCGCGCGGGTAGCCTCTTGTCGCGCCACC TCGTCTGGTTGCCGAGGTGGCCGCCCACGCCCCCTGCCTCGACCACGCCC CCGCCCCCGGCCCCGATGCGCACGAAGCTCGTTGCCTGTCTGATCTAAAT TGCGAACATTGGCCTGACGTTCTCGTGAGCCCCGCCGCACGACTGCTGTT GCCGCAGCAGCATCGCTGAAAGAAGGGAGGGGGAAGGGACGACGGGAATA AACCATAAATTGAACGGGGGCAGTCTACTTCAACAGTCCACAACTCCCTC CGAAAAAATAACCTGCGAATTATGCATACTATACAGCAGTACATCCATAC AGCAGTACATGCCTACAGCAGTACATCCGAACAAATGCGCATCATGCCCA AAGTAAGATGCGCATCATGTGTAGCGCATCATGTGTTGCATCATGTGTAC CTAAAGATCGTGCGTACAGCACAGCAGCAGTGTAGTTATGGGTGGTCAAC CCGCCTTTTTGCGCACCAATACTCGCTCCAAATAGACCATCCATCTCTCT TTATATATATCGTTCTCTCTCGCCGCAACAGCAGTAGTAACATGCTAGCA GCAGCAACAAATTAGTCCTCGAGGCCACGCAGCAGCAGCAAATTAGTCCT CGTAGGCCACGCAACACACACACACATGAATCGGCCCACCTTTTTCCGCC ATGGACCCCGAGGAGGGTGTCTCTGTGCTTCCACCACCACGCCCACGCCA GGCCGGGCCGCCGAGCGGGCCGCCGCCGAGAGCCGCCGAGCCGCCGACCT CCTGCCCGGCGGACCCCTGCTCCTCTCAGCTGACCCCCCCCCCCCTCCCC CCCGAGACAAATATGTGGGAACCATCCTCCCCACACAACACAGACGAGTC TCTTTGAATTTTGAATTTTCTCGCGTGCTCTCCACCCTGCAAGTGCTCTC CAAACCTGCGCGCTCTCGGCGTAGAAGCAAGCTACAGCTCTACGCACTTG CACCGACACCAGGAGTGACATACACACCAGCATGCATCTGCTGGCTGTTA GCCATGGTGCCTGTGCGTCCGTGTTCGTTGGTGTTTGCGGTCGATGGGGG GGGGTGCTATTTTCCGCAGCTTGGGAGGGAGAGTACAGCAGCAGAAAGCA GGCCTTGACGCTGTTGACGGCGCGAGCCCGCGGCCGCGTGCAAAACCACA GGTACCAGCGAGTAGATGCTATTTTTGGTGTTCGTTCAAGCCCTTGACTC CCTCGATCCGTGCCGAAATGAGCGCAGTCGAGGTCAAAAAGCACGTTCTA AAGCGCATAGACAAACACCAACAACATCAAAAATGTTTTGGTGGCACCCT ACATGGTATTTTTTCGGATCTTTTTTCACGTCTCACACACCTAACCCCGA CCCACTATATTCTAGGTGCCTGACGGCGGCTAGCGCCGCCTACGGCTCCT 
CATTACAATGCATATGCAGGTTGTGCTTCAGCTCTTCCAATACGACCTTC ACACCAGTTCTGGCAAACCCGTTTGACCCTGCAATCAACATGTCCAGCAG TAGGTAATTCCCTTGGGTATGCTCTCACAATTTGCTGTTGCCGCAGCACC CGTAACCTGGAGGGATATCAATCGATACGTACAAACAGGGCATCTATTTT CAGCTGCAGCCCAAACAGATCGTGAAGGGCAGTACGTGTAATTTAGGAGG GTTGTATCTTATAGCCCCGTGTATCAAGGCTTCAATCCTTCACACAGCAG CACCTTCACACACCTCTTTTGTTTCTCCTGCACACAGAGTAGCACCTCCA CACAGCTCTTGGTTTCGATCCCTCGCCAACGCTCCTCGTAGGGACGCAAG GGAGAACTGCTGTTGACATGTACATTTGAGGTGGTCAGGCATGCACACAA GTCACCGTGTTTGGCCCATGGCCGCTGCTGCTGCTGCTGCTGTCATGGAA GTGGCGAAACTCTGCGAAACGCCAAGGTCGTCCGAAGGCTTCATCCTGAT GCCACGAACTACACACGCAAGAAGTGCCGAACAAGCACGATACGATTCCT GAGGTGCAGCTGCAGCTCCCCCCTCATGCGGCTCTCCCAAAACACCCCAA GGCGCCTCGCGCGTGGTGGGGCTTTGCCTGTCCTGGCCCTCACCGCGACA ACCGGTATTTCGTGCTCCGTCCCCCCAAGGCTTCCTAGGGATGCGACGTC CATTATCGAACGAATAGCAACGTATGATCAACGAGTGAACCCACCCCTGC CTCCCACCCCTAATCTTGCAATCCCCGCTGCCCCGCTTGCCCTCTGTGTC GCCCTGCTTTGTCGCGCTTTTGCCGCCCCAACTCTGTGTTTTTCTACATA CCAGTCAAGTTGAGGAAAGAGGTGTCCGGGCAGGTGGAAACCTCGAAGCA GGGCAACCGCAGATCTAAAAAGATCAACTTCAGTGGGGGCATCGACTCGT GCCCGTGCCTCGTCCTCAACGCGGACTACCAGCCTTTGTCCTACCTGCCC CTCAGGCAAGTGGGTTGTTGCGCCGCTGCGAAGGAGCGCCATGACAGCTA GCCTAGCCTACTGTGCGCAACTTTTACCACCAGAGGAGAAGAGGAGCGGT CCTGTGGTGGCAGGATATTTTATCTCGTGTCACCGCGAAAGGGTCCCCCG GGACAGTATCTAGGCTTTTTCGTTGCTGCCTTGCGTCCAGAACCTCTTTT CTAGTGACTGCTGCATCCGAGCGTTTTTCGTTAAGCGCCGCCTCCCGTGG CTTGGTTTGGCTGCTTGGCTTACCTCTCCCCGCCGCCGCCGACCTCTGTG CCTAACCTTGCCTCCCTGGCACAGTCTGTGGGGGTGGCAGGATGTGATCA AGGCGGTGTTCAGCGAAAAGGTTGTCGTCTTGGCCACGTACGGGGACCGA TCTATTCGTTCTCCGAGCGTTGTGGTTCAGCTCCCAAGCGTGATCGCTTT GAAGGTACGATTATGGGGCTAACCTGGTGTTCTGCAGTCACGTCGCCCCC GACCAAGCGCGTGGGTCTTCGTAGGGCACGTGCCTGGATTGCTCACCTTT GTGGGGTTTCGGCGCGGCCAGAAGATATGGACCACCGCGCCAGTGAGCAG GGCGGTAGCAGTTAATAGCGCATGCCGCAATGTCATACGCTCGGTCCTTC CTATCAACGGCCGTGGAGGTCGTGCTCCGTCGCCTCTTCTGTTGTTCCGC CCTACACTACTGCTGTATGTGTATGATCACGTGGACGTCCTGCTCGGATT CGGGGCCGTTTTCCAGCACCGGTGTTAGTCACGGTAACGAAGCAGCTTCC CTCTACATGTCTCAGCTCACCCGTCCAACTTTAACCACGCCCAGCGCGGT CCTCCCCCGCCCACTGCCTCCCCTCACCCTAACGCTCCCGCCGCGTCCCT 
CCCGCAGGAGTTCGTCAACAACCACCGGAGTCACCCGCCTTTCACGCGGC GCAACCTGTTCCTGCGCGACTCGCATCAGTGTCAATATTGTATGAAGTAC TTCCCCCCGCACAACCTGAGCTTCGACCACGTGGTGCCGAAGAAGCTCGG CGGGAAGGGCACGTGGGACAACGTCGTCACCGCCTGCGGGAGGTAGTATG TAGTATACGATCCGGGCTACTCCTTTCCTGTTGACCTGTGTTTTCTTGCT TGTTTCGGTGTCTCGCGTAGCGCTGTGTATGTAGCCATTCCCGTCCAAGC TGCGAGAACTCGTGTTGTGGGGTAGTTTTTACTCGGCCGGGGGGCACGCG GACGAAGAAGGGACGTACCCGCCCCGCGGGGCCGACAATGGCCACAGAAG CCGCTGAAACCGCTCAATCGCGGGGCGGGGCTTGGCAGCTCCCGCGTGAC GCACGTGCATTCGTCCGTCATGAGCTTTGTTGGGGTGAGATTGAGATGCA GTAAGCTTGGAGGTGTCGTACGTCGCGTGAAATGTGGGGAGAAACCTCGG CATCCATTGAAACGACTGGAGAGGGGGTGGTGGTATTGTAGACGCGGGTT GGGGTTAGCCGTGTTATTATTGTTGTATGCACATCAACAGTTCCACGCTT GGGACTAACCACCGTAGAAAATATTGCCCTCCTCCTCGTGTGCGCGCGCG CAGATGCAACAACCGCAAGAGCGACTGCCACCCGCGAGACCTCAGGTCCA TCGGGATGAGCCTCAACAAGTACCCCCACACTCCCACGTTCAGCCAGCTG CAGAACCTGGCCAGAAGGTACCCCCCCAACGAGATCCACGAGACGTGGGA AGACTACCTCTACTTCGAAAGCGAGATCCTGGAGGAGGACGAG
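Because the feature lies on the minus strand, the sequence above starts at contig position 23770 and runs down to 17028. The sketch below converts contig coordinates from the feature tables into offsets within this displayed sequence. It assumes 1-based inclusive contig coordinates and a reverse-complemented display; `to_offsets` is a hypothetical helper. Only the first 50 bases of the sequence are reproduced.

```python
# Span of the mRNA on the contig (from the Alignments table), minus strand.
SPAN_START, SPAN_END = 17028, 23770

def to_offsets(start, end):
    """Map 1-based inclusive contig coordinates on the minus strand to a
    0-based half-open (begin, stop) slice of the displayed sequence."""
    return SPAN_END - end, SPAN_END - start + 1

# First 50 bases of the displayed mRNA sequence.
prefix = "CAGACTCTAGAGCAGCAGGCATTTTCGTAGCTAATGGCGCACCGTAAGAG"

utr_begin, utr_stop = to_offsets(23738, 23770)    # UTR row
cds_begin, cds_stop = to_offsets(23413, 23737)    # CDS segment next to the UTR
last_begin, last_stop = to_offsets(17028, 17217)  # CDS segment at the other end

print(utr_begin, utr_stop)    # 0 33
print(cds_begin, cds_stop)    # 33 358
print(last_begin, last_stop)  # 6553 6743
print(prefix[cds_begin:cds_begin + 3])  # ATG
```

Under these assumptions the 33-base UTR occupies the start of the displayed sequence, the coding region begins immediately after it with ATG, and the last CDS segment ends at offset 6743, the length given in the header.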

Coding sequence (CDS) from alignment at S-firma_F_contig1037:17028..23770-

>mRNA_S-firma_F_contig1037.528.1 ID=mRNA_S-firma_F_contig1037.528.1|Name=mRNA_S-firma_F_contig1037.528.1|organism=Sphaerotrichia firma ET2_F female|type=CDS|length=1962bp|location=Sequence derived from alignment at S-firma_F_contig1037:17028..23770- (Sphaerotrichia firma ET2_F female)
ATGGCGCACCGTAAGAGAATGCGAACCACGTGGCTCCTCGGAGCGTGTGT
AGTGAACTCGTTGCAAGGCGGGGCAGCGTTTGCTCTGCCTCCTCAGCAGG
ATTTACTCCGCCACAGCAGTCGCCATTGCCAAGTTGACGCATCGTCGTTC
GGAATAGGGCGTGCGGGGCGAGCTTGGTGCGGAGCTTCAGGGCCCGCTGC
CGTGGCTCGTAGGGGGGATATCTCGATGTCGGCGAAGCAGAAGAGGGGAC
GGAAGACCGCTGGCGTTCGATCGCGGCGGGGCGGGCCTAGGACAGACGTG
GAAGGGTACAGCCACGACAATTTCAATGGCGCACCGTAAGAGAATGCGAA
CCACGTGGCTCCTCGGAGCGTGTGTAGTGAACTCGTTGCAAGGCGGGGCA
GCGTTTGCTCTGCCTCCTCAGCAGGATTTACTCCGCCACAGCAGTCGCCA
TTGCCAAGTTGACGCATCGTCGTTCGGAATAGGGCGTGCGGGGCGAGCTT
GGTGCGGAGCTTCAGGGCCCGCTGCCGTGGCTCGTAGGGGGGATATCTCG
ATGTCGGCGAAGCAGAAGAGGGGACGGAAGACCGCTGGCGTTCGATCGCG
GCGGGGCGGGCCTAGGACAGACGTGGAAGGGTACAGCCACGACAATTTCA
TCAAGTTGAGGAAAGAGGTGTCCGGGCAGGTGGAAACCTCGAAGCAGGGC
AACCGCAGATCTAAAAAGATCAACTTCAGTGGGGGCATCGACTCGTGCCC
GTGCCTCGTCCTCAACGCGGACTACCAGCCTTTGTCCTACCTGCCCCTCA
GTCAAGTTGAGGAAAGAGGTGTCCGGGCAGGTGGAAACCTCGAAGCAGGG
CAACCGCAGATCTAAAAAGATCAACTTCAGTGGGGGCATCGACTCGTGCC
CGTGCCTCGTCCTCAACGCGGACTACCAGCCTTTGTCCTACCTGCCCCTC
AGTCTGTGGGGGTGGCAGGATGTGATCAAGGCGGTGTTCAGCGAAAAGGT
TGTCGTCTTGGCCACGTACGGGGACCGATCTATTCGTTCTCCGAGCGTTG
TGGTTCAGCTCCCAAGCGTGATCGCTTTGAAGTCTGTGGGGGTGGCAGGA
TGTGATCAAGGCGGTGTTCAGCGAAAAGGTTGTCGTCTTGGCCACGTACG
GGGACCGATCTATTCGTTCTCCGAGCGTTGTGGTTCAGCTCCCAAGCGTG
ATCGCTTTGAAGGAGTTCGTCAACAACCACCGGAGTCACCCGCCTTTCAC
GCGGCGCAACCTGTTCCTGCGCGACTCGCATCAGTGTCAATATTGTATGA
AGTACTTCCCCCCGCACAACCTGAGCTTCGACCACGTGGTGCCGAAGAAG
CTCGGCGGGAAGGGCACGTGGGACAACGTCGTCACCGCCTGCGGGAGGAG
TTCGTCAACAACCACCGGAGTCACCCGCCTTTCACGCGGCGCAACCTGTT
CCTGCGCGACTCGCATCAGTGTCAATATTGTATGAAGTACTTCCCCCCGC
ACAACCTGAGCTTCGACCACGTGGTGCCGAAGAAGCTCGGCGGGAAGGGC
ACGTGGGACAACGTCGTCACCGCCTGCGGGAGATGCAACAACCGCAAGAG
CGACTGCCACCCGCGAGACCTCAGGTCCATCGGGATGAGCCTCAACAAGT
ACCCCCACACTCCCACGTTCAGCCAGCTGCAGAACCTGGCCAGAAGGTAC
CCCCCCAACGAGATCCACGAGACGTGGGAAGACTACCTCTACTTCGAAAG
CGAGATCCTGGAGGAGGACGAGATGCAACAACCGCAAGAGCGACTGCCAC
CCGCGAGACCTCAGGTCCATCGGGATGAGCCTCAACAAGTACCCCCACAC
TCCCACGTTCAGCCAGCTGCAGAACCTGGCCAGAAGGTACCCCCCCAACG
AGATCCACGAGACGTGGGAAGACTACCTCTACTTCGAAAGCGAGATCCTG
GAGGAGGACGAG