mRNA_C-tenellus_contig10031.1.1 (mRNA) Choristocarpus tenellus KU2346

You are viewing an mRNA; more information is available on the corresponding polypeptide page.

Overview
Name: mRNA_C-tenellus_contig10031.1.1
Unique Name: mRNA_C-tenellus_contig10031.1.1
Type: mRNA
Organism: Choristocarpus tenellus KU2346 (Choristocarpus tenellus KU2346)
Homology
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: A0A4C1WP54_EUMVA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 n=6 Tax=Eumeta variegata TaxID=151549 RepID=A0A4C1WP54_EUMVA)

HSP 1 Score: 89.7 bits (221), Expect = 7.490e-18
Identity = 58/169 (34.32%), Positives = 82/169 (48.52%), Query Frame = 1
Query:   10 CETYKECKSEALSYQRSGRTPKQPGEVVHTDLVGPFKAD-ITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMAR-EGIPVKCISGDGAGELGRSVKFQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQLVKAGLGDEYWFFAITDLAFKTGVC 510
            CE     K   L +  S R  ++  E+VHTD+VGPFK   +   +YF  F+D+ SR   VY LK K+ A +A   Y  Q  R  G  +K +  D   E   + +    L N G   R S  RTPQ NG+AER  + ++  AR  L+++ L D +W  A+         C
Sbjct:   78 CEVCLRGKMTRLPFLASERQSEETLEIVHTDIVGPFKTQSVNGARYFITFIDDRSRWCEVYFLKQKSGALEAFKMYQTQAERVTGKKIKYLQSDNGKEY-CNAEMDNFLRNQGIQRRLSVVRTPQQNGVAERFNRTIVEMARCLLLQSSLSDMFWADAVATACHLRNRC 245          
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: W8ADM5_CERCA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 (Fragment) n=1 Tax=Ceratitis capitata TaxID=7213 RepID=W8ADM5_CERCA)

HSP 1 Score: 89.4 bits (220), Expect = 1.770e-17
Identity = 53/150 (35.33%), Positives = 80/150 (53.33%), Query Frame = 1
Query:   52 QRSGRTPKQPGEVVHTDLVGPF-KADITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMARE-GIPVKCISGDGAGELGRSVKFQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQLVKAGLGDEYWFFAITDLAF 495
            Q S    K+  E++HTD+ GP  K  I   +YF  F+D+ SR K VY LK+K+    A  ++     ++    +K +  D   E   S  F+  L  NG   + + P TPQ NG+AERA + L+  ARS +V +GLG+ +W  A+   A+
Sbjct:  449 QESKNRAKKLCEIIHTDICGPINKKSIGGSRYFATFIDDMSRYKCVYFLKSKDEIFSAFKSFKAMAEKQTNCKIKILRSDNGREY-LSKNFENFLKENGIVRQLTVPYTPQQNGVAERANRTLVEMARSMIVHSGLGECFWAEAVATAAY 597          
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: A0A4C1V7J1_EUMVA (Copia protein n=2 Tax=Eumeta variegata TaxID=151549 RepID=A0A4C1V7J1_EUMVA)

HSP 1 Score: 88.2 bits (217), Expect = 3.660e-17
Identity = 55/158 (34.81%), Positives = 79/158 (50.00%), Query Frame = 1
Query:   43 LSYQRSGRTPKQPGEVVHTDLVGPFKAD-ITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMAR-EGIPVKCISGDGAGELGRSVKFQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQLVKAGLGDEYWFFAITDLAFKTGVC 510
            L +  S R  ++  E+VHTD+VGPFK   +   +YF  F+D+ SR   VY LK K+ A +A   Y  Q  R  G  +K +  D   E   + +    L N G   R S  RTPQ NG+AER  + ++  AR  L+++ L D +W  A+         C
Sbjct:    4 LPFLASERQSEETLEIVHTDIVGPFKTQSVNGARYFITFIDDRSRWCEVYFLKQKSGALEAFKMYQTQAERVTGKKIKYLQSDNGKEY-CNAEMDNFLRNQGIQRRLSVVRTPQQNGVAERFNRTIVEMARCLLLQSSLSDMFWADAVATACHLRNRC 160          
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: A0A4C1XAK2_EUMVA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 n=1 Tax=Eumeta variegata TaxID=151549 RepID=A0A4C1XAK2_EUMVA)

HSP 1 Score: 87.0 bits (214), Expect = 1.020e-16
Identity = 54/153 (35.29%), Positives = 77/153 (50.33%), Query Frame = 1
Query:   58 SGRTPKQPGEVVHTDLVGPFKAD-ITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMAR-EGIPVKCISGDGAGELGRSVKFQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQLVKAGLGDEYWFFAITDLAFKTGVC 510
            S R  ++  E+VHTD+VGPFK   +   +YF  F+D+ SR   VY LK K+ A +A   Y  Q  R  G  +K +  D   E   + +    L N G   R S  RTPQ NG+AER  + ++  AR  L+++ L D +W  A+         C
Sbjct:  260 SERQSEETLEIVHTDIVGPFKTQSVNGARYFITFIDDRSRWCEVYFLKQKSGALEAFKMYQTQAERVTGKKIKYLQSDNGKEY-CNAEMDNFLRNQGIQRRLSVVRTPQQNGVAERFNRTIVEMARCLLLQSSLSDMFWADAVATACHLRNRC 411          
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: A0A4C1YKI8_EUMVA (Retrovirus-related Pol polyprotein from transposon TNT 1-94 n=1 Tax=Eumeta variegata TaxID=151549 RepID=A0A4C1YKI8_EUMVA)

HSP 1 Score: 87.0 bits (214), Expect = 1.060e-16
Identity = 54/153 (35.29%), Positives = 77/153 (50.33%), Query Frame = 1
Query:   58 SGRTPKQPGEVVHTDLVGPFKAD-ITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMAR-EGIPVKCISGDGAGELGRSVKFQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQLVKAGLGDEYWFFAITDLAFKTGVC 510
            S R  ++  E+VHTD+VGPFK   +   +YF  F+D+ SR   VY LK K+ A +A   Y  Q  R  G  +K +  D   E   + +    L N G   R S  RTPQ NG+AER  + ++  AR  L+++ L D +W  A+         C
Sbjct:  454 SERQSEETLEIVHTDIVGPFKTQSVNGARYFITFIDDRSRWCEVYFLKQKSGALEAFKMYQTQAERVTGKKIKYLQSDNGKEY-CNAEMDNFLRNQGIQRRLSVVRTPQQNGVAERFNRTIVEMARCLLLQSSLSDMFWADAVATACHLRNRC 605          
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: A0A2D5QAQ1_9EURY (Uncharacterized protein (Fragment) n=1 Tax=Euryarchaeota archaeon TaxID=2026739 RepID=A0A2D5QAQ1_9EURY)

HSP 1 Score: 86.7 bits (213), Expect = 1.620e-16
Identity = 55/158 (34.81%), Positives = 87/158 (55.06%), Query Frame = 1
Query:   55 RSGRTPKQPGE--------VVHTDLVGPFKAD-ITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMAREGIPVKCISGDGAGELGRSVK--FQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQLVKAGLGDEYWFFAITDLAF 495
            ++ RTP +  E        +VHTDL GPF+ +    + Y Q+FVD+++R K  Y L+TKN AT     ++  +   G+PV CI  DG GE   + +  +       G   +K+ P +P+SNG+AERA + L+  AR+ ++ A +  E W +A+   AF
Sbjct: 1789 KATRTPHKRNEFETHYALQLVHTDLTGPFEVEGHGGFFYAQIFVDDNTRRKFPYFLRTKNQATTMLRRFVRDV---GLPV-CIRLDGGGEFEGAFEEGWIDTCIELGIKLQKTDPESPESNGVAERANRTLITIARTMMLAAAVPKELWPYAVQHAAF 1942          
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: UPI001004C95A (uncharacterized protein LOC114075238 n=1 Tax=Solanum pennellii TaxID=28526 RepID=UPI001004C95A)

HSP 1 Score: 85.1 bits (209), Expect = 5.510e-16
Identity = 51/139 (36.69%), Positives = 72/139 (51.80%), Query Frame = 1
Query:   85 EVVHTDLVGPFKA-DITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMARE-GIPVKCISGDGAGELGRSVKFQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQLVKAGLGDEYWFFAITDLAF 495
            E+VH+DL GPFK        YF  F+D+ SR   V+ LK+K+   D   ++   + R+ G  +KCI  D  GE      F R     G   +K+PP+TPQ NG+AER  + L+   R  L  A L D +W  A+   A+
Sbjct:  648 ELVHSDLCGPFKVRSHGGALYFVTFIDDHSRKLWVFPLKSKDQVLDVFKSFQALVERQTGKTLKCIRSDNGGEY--IGPFDRYCREQGIRHQKTPPKTPQLNGLAERMNRTLVERVRCMLSDAKLSDSFWAEALNTAAY 784          
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: B3Y003_BOMMO (Polyprotein n=1 Tax=Bombyx mori TaxID=7091 RepID=B3Y003_BOMMO)

HSP 1 Score: 84.7 bits (208), Expect = 7.180e-16
Identity = 41/120 (34.17%), Positives = 66/120 (55.00%), Query Frame = 1
Query:   79 PGEVVHTDLVGPFKADITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMAREGIPVKCISGDGAGELGRSVKFQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQL 438
            PGE+VHTD+ GPF+   ++Y+Y+ +F D+ +  +MVY ++ K+   D     I Q    G  +KC+  D  GE   +     +L  +G   R   P TP+ NG++ER  + L+  ARS +
Sbjct:  491 PGEIVHTDVCGPFQPSFSNYKYYVLFKDDFTGYRMVYFIRKKSEVKDKLVLMIAQTKTVGYTIKCLLSDNGGEFD-NASINNVLDTHGIAQRLVTPYTPEQNGVSERENRTLVETARSMM 609          
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: UPI00067B46BF (retrovirus-related Pol polyprotein from transposon TNT 1-94 n=1 Tax=Amyelois transitella TaxID=680683 RepID=UPI00067B46BF)

HSP 1 Score: 83.2 bits (204), Expect = 2.480e-15
Identity = 52/161 (32.30%), Positives = 80/161 (49.69%), Query Frame = 1
Query:   37 EALSYQRSGRTP-------KQPGEVVHTDLVGPFKADITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMAREGIPVKCISGDGAGELGRSVKFQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQLVKAG-LGDEYWFFAITDLAF 495
            E   Y +S R P       K  GE++HTD+ GPF   I+ YQY+ +F D+ S  +MVY ++ K+   D     + ++   G  VK +  D  GE   +   +++L   G   R + P TP+ NG +ER  + L+ AARS +   G L    W   I  +A+
Sbjct:  476 EGCIYGKSHRKPFGTRERAKAVGELIHTDVCGPFAKSISKYQYYVLFKDDYSSYRMVYFIRHKSEVKDKLLLMLQEVKNAGHTVKTLLSDNGGEF-NNENVRKILQRYGIQQRLTMPYTPEQNGCSERENRTLVEAARSIMHARGELPQVLWAELINTVAY 635          
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Match: A0A8I6TLX8_CIMLE (Integrase catalytic domain-containing protein n=1 Tax=Cimex lectularius TaxID=79782 RepID=A0A8I6TLX8_CIMLE)

HSP 1 Score: 82.8 bits (203), Expect = 3.180e-15
Identity = 54/165 (32.73%), Positives = 86/165 (52.12%), Query Frame = 1
Query:   10 CETYKECKSEALSYQRSGRTPKQPGEVVHTDLVGPFKA-DITDYQYFQVFVDESSRGKMVYGLKTKNAATDATAAYIDQMARE-GIPVKCISGDGAGE-LGRSVKFQRMLANNGT*WRKSPPRTPQSNGIAERAIK*LMGAARSQLVKAGLGDEYWFFAITDLAF 495
            CE   + K   L ++RS    ++P E+VHTDL GP +   I   +YF + +D+ SR  M+Y LK+K+  T     Y   +  E  + +K +  D   E + R++  +  L   G   +++   TPQ NG+AERA + ++  AR  L  AGL  EYW  A++   +
Sbjct:   21 CEVCIKGKHVRLPFKRSKFRAQKPLELVHTDLCGPMETKSIGGNRYFFILIDDYSRYTMIYFLKSKDEVTQVFKEYCALVENEMNLQIKTLRSDNGMEYINRNM--ENFLREKGIKHQRTVRYTPQQNGLAERANRAIVDKARCLLQDAGLPKEYWAEAVSTAVY 183          
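Each HSP header above reports identity and positives as a fraction of the aligned length together with a percentage. As a minimal sketch, the percentages can be recomputed from the fractions to confirm that they are self-consistent. The helper name `parse_hsp_stats` and the regular expression are illustrative assumptions based on the header layout shown on this page, not part of the database's own tooling. The pattern accepts both the `Postives` and `Positives` spellings.

```python
import re

# Assumed pattern, written to match the HSP header lines as they appear on this page.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Recompute identity/positives percentages from an HSP header line."""
    match = HSP_STATS.search(line)
    if match is None:
        raise ValueError("line does not look like an HSP statistics header")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        # Percentages recomputed from the fractions, rounded to two decimals.
        "identity": round(100 * int(ident) / int(ident_len), 2),
        "positives": round(100 * int(pos) / int(pos_len), 2),
        # Percentages as printed in the header, for comparison.
        "identity_reported": float(ident_pct),
        "positives_reported": float(pos_pct),
    }


# Header of the first hit (A0A4C1WP54_EUMVA), copied from the page.
header = "Identity = 58/169 (34.32%), Postives = 82/169 (48.52%), Query Frame = 1"
stats = parse_hsp_stats(header)
print(stats)
```

If a recomputed value differs from the reported one by more than rounding, the header was probably damaged during extraction.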
The following BLAST results are available for this feature:
BLAST of mRNA_C-tenellus_contig10031.1.1 vs. uniprot
Analysis Date: 2022-09-19 (Diamond blastx: OGS1.0 vs UniRef90)
Total hits: 25
Match Name        E-value    Identity  Description
A0A4C1WP54_EUMVA  7.490e-18  34.32     Retrovirus-related Pol polyprotein from transposon...
W8ADM5_CERCA      1.770e-17  35.33     Retrovirus-related Pol polyprotein from transposon...
A0A4C1V7J1_EUMVA  3.660e-17  34.81     Copia protein n=2 Tax=Eumeta variegata TaxID=15154...
A0A4C1XAK2_EUMVA  1.020e-16  35.29     Retrovirus-related Pol polyprotein from transposon...
A0A4C1YKI8_EUMVA  1.060e-16  35.29     Retrovirus-related Pol polyprotein from transposon...
A0A2D5QAQ1_9EURY  1.620e-16  34.81     Uncharacterized protein (Fragment) n=1 Tax=Euryarc...
UPI001004C95A     5.510e-16  36.69     uncharacterized protein LOC114075238 n=1 Tax=Solan...
B3Y003_BOMMO      7.180e-16  34.17     Polyprotein n=1 Tax=Bombyx mori TaxID=7091 RepID=B...
UPI00067B46BF     2.480e-15  32.30     retrovirus-related Pol polyprotein from transposon...
A0A8I6TLX8_CIMLE  3.180e-15  32.73     Integrase catalytic domain-containing protein n=1 ...

Alignments
The following features are aligned
Aligned Feature         Feature Type  Alignment Location
C-tenellus_contig10031  contig        C-tenellus_contig10031:4366..4895 +
Analyses
This mRNA is derived from or has results from the following analyses
Analysis Name                          Date Performed
Diamond blastx: OGS1.0 vs UniRef90     2022-09-19
Choristocarpus tenellus KU2346 OGS1.0  2022-07-08
Properties
Property Name                        Value
Stop                                 1
Start                                0
Seed ortholog                        6334.EFV48956
PFAMs                                RVT_2,Retrotran_gag_2,gag_pre-integrs,rve
Model size                           532
Max annot lvl                        33208|Metazoa
Hectar predicted targeting category  other localisation
Exons                                1
Evalue                               1.22e-06
EggNOG OGs                           COG2801@1|root,KOG0017@2759|Eukaryota,38DPC@33154|Opisthokonta,3BGB6@33208|Metazoa,3D5G4@33213|Bilateria
Description                          Encoded by
Cds size                             348
COG category                         L
Relationships

The following CDS feature(s) are a part of this mRNA:

Feature Name                                              Unique Name                                               Species                         Type  Position
1680789111.4872868-CDS-C-tenellus_contig10031:4365..4713  1680789111.4872868-CDS-C-tenellus_contig10031:4365..4713  Choristocarpus tenellus KU2346  CDS   C-tenellus_contig10031 4366..4713 +


The following UTR feature(s) are a part of this mRNA:

Feature Name                                             Unique Name                                              Species                         Type  Position
1680789111.501292-UTR-C-tenellus_contig10031:4713..4895  1680789111.501292-UTR-C-tenellus_contig10031:4713..4895  Choristocarpus tenellus KU2346  UTR   C-tenellus_contig10031 4714..4895 +


The following polypeptide feature(s) derive from this mRNA:

Feature Name                     Unique Name                      Species                         Type         Position
mRNA_C-tenellus_contig10031.1.1  prot_C-tenellus_contig10031.1.1  Choristocarpus tenellus KU2346  polypeptide  C-tenellus_contig10031 4366..4713 +
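The positions listed above can be cross-checked against the sizes reported elsewhere on this page (Cds size 348, mRNA length 530bp). The sketch below assumes the Position column uses 1-based, inclusive coordinates; the function name `span_length` is illustrative. The feature names carry a start that is one lower (for example 4365..4713), which would be consistent with zero-based interbase coordinates, but that reading is an assumption.

```python
def span_length(start, end):
    """Length of a 1-based, inclusive interval start..end."""
    return end - start + 1


# Coordinates copied from the Position and Alignment Location columns.
cds = (4366, 4713)
utr = (4714, 4895)
mrna = (4366, 4895)

cds_len = span_length(*cds)
utr_len = span_length(*utr)
mrna_len = span_length(*mrna)

print(cds_len, utr_len, mrna_len)

# The CDS and UTR are adjacent and together should cover the whole mRNA.
assert cds_len + utr_len == mrna_len
```

Under the interbase reading, the same CDS length follows from a plain difference, 4713 - 4365.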


Sequences
The following sequences are available for this feature:

protein sequence of mRNA_C-tenellus_contig10031.1.1

>prot_C-tenellus_contig10031.1.1 ID=prot_C-tenellus_contig10031.1.1|Name=mRNA_C-tenellus_contig10031.1.1|organism=Choristocarpus tenellus KU2346|type=polypeptide|length=116bp
ELQCETYKECKSEALSYQRSGRTPKQPGEVVHTDLVGPFKADITDYQYFQ
VFVDESSRGKMVYGLKTKNAATDATAAYIDQMAREGIPVKCISGDGAGEL
GRSVKFQRMLANNGT*

mRNA from alignment at C-tenellus_contig10031:4366..4895+

>mRNA_C-tenellus_contig10031.1.1 ID=mRNA_C-tenellus_contig10031.1.1|Name=mRNA_C-tenellus_contig10031.1.1|organism=Choristocarpus tenellus KU2346|type=mRNA|length=530bp|location=Sequence derived from alignment at C-tenellus_contig10031:4366..4895+ (Choristocarpus tenellus KU2346)
GAACTGCAATGCGAGACCTACAAAGAATGCAAGTCCGAGGCCCTAAGCTA
TCAACGGAGTGGCAGGACCCCCAAGCAGCCCGGCGAGGTTGTCCACACAG
ATTTGGTGGGACCTTTTAAAGCAGACATCACTGACTATCAATATTTTCAG
GTGTTCGTCGATGAGAGCAGCAGAGGCAAGATGGTTTATGGATTGAAGAC
AAAAAACGCGGCGACGGACGCAACCGCTGCTTATATCGACCAGATGGCCA
GGGAAGGTATACCGGTCAAATGCATCAGTGGGGATGGTGCTGGCGAGCTT
GGGAGATCTGTGAAGTTCCAAAGGATGCTGGCGAACAACGGTACCTGATG
GAGGAAATCTCCACCGAGAACACCACAGAGCAATGGAATCGCCGAAAGGG
CGATAAAGTAACTCATGGGGGCAGCAAGGAGTCAACTGGTAAAAGCTGGG
CTGGGCGACGAGTATTGGTTCTTTGCCATCACGGACCTTGCCTTCAAGAC
TGGAGTTTGCCACACGAATACCTGGGAGGA

Coding sequence (CDS) from alignment at C-tenellus_contig10031:4366..4895+

>mRNA_C-tenellus_contig10031.1.1 ID=mRNA_C-tenellus_contig10031.1.1|Name=mRNA_C-tenellus_contig10031.1.1|organism=Choristocarpus tenellus KU2346|type=CDS|length=348bp|location=Sequence derived from alignment at C-tenellus_contig10031:4366..4895+ (Choristocarpus tenellus KU2346)
GAACTGCAATGCGAGACCTACAAAGAATGCAAGTCCGAGGCCCTAAGCTA
TCAACGGAGTGGCAGGACCCCCAAGCAGCCCGGCGAGGTTGTCCACACAG
ATTTGGTGGGACCTTTTAAAGCAGACATCACTGACTATCAATATTTTCAG
GTGTTCGTCGATGAGAGCAGCAGAGGCAAGATGGTTTATGGATTGAAGAC
AAAAAACGCGGCGACGGACGCAACCGCTGCTTATATCGACCAGATGGCCA
GGGAAGGTATACCGGTCAAATGCATCAGTGGGGATGGTGCTGGCGAGCTT
GGGAGATCTGTGAAGTTCCAAAGGATGCTGGCGAACAACGGTACCTGA
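
One way to check that the coding sequence and the protein sequence on this page agree is to translate the CDS in frame 1. The sketch below assumes the standard genetic code (NCBI translation table 1); the page does not state which table the annotation used, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1; '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()  # drop line breaks and spaces
    usable = len(dna) - len(dna) % 3    # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# CDS and protein sequences copied from this page.
CDS = """
GAACTGCAATGCGAGACCTACAAAGAATGCAAGTCCGAGGCCCTAAGCTA
TCAACGGAGTGGCAGGACCCCCAAGCAGCCCGGCGAGGTTGTCCACACAG
ATTTGGTGGGACCTTTTAAAGCAGACATCACTGACTATCAATATTTTCAG
GTGTTCGTCGATGAGAGCAGCAGAGGCAAGATGGTTTATGGATTGAAGAC
AAAAAACGCGGCGACGGACGCAACCGCTGCTTATATCGACCAGATGGCCA
GGGAAGGTATACCGGTCAAATGCATCAGTGGGGATGGTGCTGGCGAGCTT
GGGAGATCTGTGAAGTTCCAAAGGATGCTGGCGAACAACGGTACCTGA
"""
PROTEIN = (
    "ELQCETYKECKSEALSYQRSGRTPKQPGEVVHTDLVGPFKADITDYQYFQ"
    "VFVDESSRGKMVYGLKTKNAATDATAAYIDQMAREGIPVKCISGDGAGEL"
    "GRSVKFQRMLANNGT*"
)

translated = translate(CDS)
print(translated == PROTEIN)
```

The 348 nt CDS corresponds to 115 codons plus a terminal stop codon, which accounts for the 116 characters in the protein record when the `*` is counted.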