mRNA_D-mesarthrocarpus_Contig1041.5.2 (mRNA) Discosporangium mesarthrocarpum MT17_79

You are viewing an mRNA, more information available on the corresponding polypeptide page

Overview
NamemRNA_D-mesarthrocarpus_Contig1041.5.2
Unique NamemRNA_D-mesarthrocarpus_Contig1041.5.2
TypemRNA
OrganismDiscosporangium mesarthrocarpum MT17_79 (Discosporangium mesarthrocarpum MT17_79)
Homology
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: D8LB20_ECTSI (Calcium-dependent cytoplasmic cysteine proteinase, papain-like protein n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D8LB20_ECTSI)

HSP 1 Score: 600 bits (1546), Expect = 2.430e-207
Identity = 283/459 (61.66%), Postives = 347/459 (75.60%), Query Frame = 3
Query:   75 MGTVGQSCPLPVRILISPITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTYRDKNFPPGPQSLGSELVAKFQ--GDSVQWKRADELVSATD--KHFHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIPTQQGSPIFAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRR-SSATASSDWSWRRYEFRFKPSETDIRAASTLKTNEVHDSKKFWSILLAYAAGQSGAMCCSIIGKTVEGQRRDGLIELHAYSILRCVDLHGQKLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHRES-NDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGCPGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAETRAGRSCVGCVCCV 1433
            MG      PL +RI++SP+ L+++  RVY   CIG+ ++R+ R   R   R VCCG GLT+ D +FP   +S+G EL  KF+  G  V+WKRAD L+ A D  + FHLF+KGV A+DVKQGA+GDCWLVAAM+TLA ++PGAI KLF NSE S RGKYKVRLFDI +  W T+ +DD IPT+ G+PIFAKPNG ELWV+LLEKA+AK+CGSYS I+GGFEAWGLKVLTGN+V+ FRR  SA       WRRYEF FKPS  + R AS++ T EVH+S KFW +LL YA G+ GAMCCSI G   E  R+DGLIE HAYSI+R +D+HG++LIQIRNPWG +GEWKG W+DG+S+W++SPLI  A+ H  + NDGTFWMCW+DF +VW EITIC+RS DASDL LDV+EDLGCPGP VGC+GGC  YW  CRGC+HV+CP  S   TR GR C G +CCV
Sbjct:    2 MGESVAEWPLCMRIVLSPLLLLWFGTRVYVLPCIGVAMQRMLRCCCRGFLRHVCCGCGLTFTDTSFPATDKSIGKELTHKFKAAGGGVEWKRAD-LIKARDDDEPFHLFNKGVKAEDVKQGALGDCWLVAAMATLAGTMPGAIKKLFGNSERSYRGKYKVRLFDITQDRWRTITVDDNIPTRHGNPIFAKPNGKELWVVLLEKAVAKFCGSYSNIAGGFEAWGLKVLTGNHVWTFRRLKSARPGKPGQWRRYEFLFKPSADNKRDASSVGTEEVHESDKFWGVLLTYAQGRKGAMCCSITGPVSERVRKDGLIENHAYSIMRAIDMHGEQLIQIRNPWGKKGEWKGRWADGTSEWKTSPLISVAVGHENNDNDGTFWMCWEDFREVWMEITICSRSTDASDLALDVHEDLGCPGPCVGCLGGCFGYWCLCRGCKHVICPTKSKETTRKGRKCTGLICCV 459          
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: A0A7S4G5T7_9EUGL (Hypothetical protein n=1 Tax=Eutreptiella gymnastica TaxID=73025 RepID=A0A7S4G5T7_9EUGL)

HSP 1 Score: 343 bits (881), Expect = 1.510e-107
Identity = 174/443 (39.28%), Postives = 248/443 (55.98%), Query Frame = 3
Query:  123 SPITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTYRDKNFPPGPQSLGSELVAKFQGD--SVQWKRADELVSATDKHFHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIPTQQ--GSPIFAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRRYEFRFKPSET----DIRAASTLKTNEVHDSKKFWSILLAYAAGQSGAMCCSIIGKTVEGQRRDGLIELHAYSILRCVDLHGQKLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHRESNDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGCPGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAETRAGRSCVGCVC 1427
            SP  L++Y+L++YC  C+ IY  R   T     S + CC +G  ++D  F P P SLG E    F GD   ++WKRA +L  A      LF + +   D++QG VG+CWL++A + LA   PGAI + FI  EYS RGKYKV ++D +KG W TMV+DD  P  +  G P+FAKP   ELWV++ EKA AKYCGSY  +SGG   W L  +TG+ VF   + +     D +WRR++  +  ++T      R+    +T EV++  + ++ILL Y   ++     S  GK      + G+++ HAYSIL+   +   +LI++RNPWG   EW G WSD    W   P +K   +H  ++DGTFWM   DF K +  + +  R+    DL L++ E+ G  G   GC  GC R+W CC+G   ++C       TRAGR C  C C
Sbjct:   17 SPFILLWYSLKIYCVPCLSIYFSRACSTFCCCKSWVSCC-LGCRFKDSEFRPAPSSLGEE----FTGDISDIEWKRAHQLKQAGQTSMKLFDRDIEPSDIEQGGVGNCWLMSAFACLA-EYPGAIQRCFITKEYSVRGKYKVSIYDKLKGAWETMVVDDHFPCSKKSGQPLFAKPKDQELWVMIYEKAFAKYCGSYHGLSGGHSVWALNAMTGDPVFKLMKEA-----DGTWRRFDLTYHSADTGHTKQRRSIGLKQTQEVYNQDRLFNILLEYNVSEACLAASSTAGKDTATNAKGGIVQGHAYSILKLKAIGDIRLIKLRNPWGGF-EWGGNWSDKCPLWNQHPAVKRECEHVVADDGTFWMSLDDFVKHFRNVDVLDRTTGMQDLALNLYEEKGPCGVCYGCCTGCCRFWYCCQGFYRLMCGRVGTGNTRAGRKCCCCFC 447          
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: A0A7S0N3J2_9CHLO (Hypothetical protein n=1 Tax=Pyramimonas obovata TaxID=1411642 RepID=A0A7S0N3J2_9CHLO)

HSP 1 Score: 340 bits (873), Expect = 4.880e-106
Identity = 185/450 (41.11%), Postives = 266/450 (59.11%), Query Frame = 3
Query:   87 GQSCPLPVRILISPITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTYRDKNFPPGPQSLGSELVAKFQGDSVQWKRADELVSATDKHFHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIPTQQGS--PIFAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRRYEFRFKPSETDIRAASTLKTNEVHDSKKFWSILLAYAAGQSGAMCCSIIGKTVEGQRRD-GLIELHAYSILRCVDLHGQKLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHRESNDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGCPGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAETRAG-RSCVGCV 1424
            G++ P P   L +P  LVY ++ +YC  C+GI+  R  R++   L  +  CG+   Y+DK F  GP ++G     K      +W++ +EL         LF   +   DV QGAVGDCWL+AA + LA   PGAI K+F+  E+++R KY+VRL+D     W T+ IDD+IP ++G+    F+KPNGNE+WVILLEKA AK+CGSY+ + GG   W  + +TG++VFFF+    T      W R +  +     + R    ++T E +D KK W ++  Y A +S  +  SI G   E +  + GL+  HAYSIL+  ++   KLI++RNPWG   EWKG WSD SS W+  P +  A+  ++ +DG+FWM + DF K ++ + +C+R+ D  DL LDV ED+G  GP VGCV GCL +W CCRG R + C   SD +T +G R    CV
Sbjct:   32 GRAWP-PFVCLCAPFILVYNSITIYCLPCVGIFFTRFFRSLFYCLCSL--CGL-YEYKDKKFT-GPAAIGEMADLK----DCEWRKPEELERTGGDIMKLFEGEIEPQDVAQGAVGDCWLIAAFACLA-EFPGAIQKVFVTREWNTRRKYQVRLYDGFHQRWETVTIDDRIPVEKGTNKAAFSKPNGNEMWVILLEKAFAKFCGSYANLDGGHTVWAWQAMTGDHVFFFKFLPDTKK----WARMDISYPTKRHNKREVGFVQTEEQYDEKKIWEMIKRYDARKS-VLAASITGAGGEVKHDNVGLVSGHAYSILKVREVKDFKLIKLRNPWGTF-EWKGKWSDRSSDWKQYPDVAKAVDFKDEDDGSFWMEYGDFVKHFNRVQVCSRTTD-EDLCLDVREDIGFCGPAVGCVSGCLGFWCCCRGARVIFCGKKSDDKTLSGHRGLCSCV 464          
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: A0A0M0JWJ2_9EUKA (Calpain catalytic domain-containing protein n=1 Tax=Chrysochromulina tobinii TaxID=1460289 RepID=A0A0M0JWJ2_9EUKA)

HSP 1 Score: 315 bits (807), Expect = 2.600e-96
Identity = 171/447 (38.26%), Postives = 251/447 (56.15%), Query Frame = 3
Query:  108 VRILISPITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTYRDKNFPPGPQSLGS-ELVAKFQGDS-VQWKRADELVSAT---------DKHFHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIPTQQ--GSPIFAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRRYEFRFKPSETDIRAASTLKTNEVHDSKKFWSILLAYAAGQSGAMCCSIIGKTVEGQRRDGLIELHAYSILRCVDLHGQ-KLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHRESNDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGC----PGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAET 1394
            V +L  P  +VY +  +YCF C   YL R A T+   + ++ CC     + DK FP    S+G  +  +K Q +  ++WKRA E+V A               LF  GV   D+ QG +GDCWL++A+  +A   PG + K+F+ + YS RGKY +R+FD   G W T+ IDD +P ++  G  ++A+P G ELWV+LLEKA AK+CGSY+ + GG E W  + LTG+ VF   R   + S+D  W R+E   KP  T+ R     K++E    ++ + ++  Y   +  A+  + I    E +R DGL+  HAYS+L      G   L+++RNPWG   EWKGAWSD +++W++ P I+  +K    NDG+FWM W DF   ++ + +C RSR   DL+LD++ED GC     GP  GC  GC  YW CC G + + C   +  +T
Sbjct:   10 VNLLCCPFVMVYNSFWIYCFGCCLEYLIRFANTVGCFVFKL-CCWWCCEHVDKAFPANASSIGPWKAKSKEQIEQEIEWKRATEVVDALGPQPVAGKPQPRVKLFEGGVEVTDIAQGGLGDCWLMSALCCMAER-PGQLYKIFVQNAYSDRGKYSIRIFDGRAGKWVTVTIDDLLPVEKATGQLLYAQPKGRELWVLLLEKAFAKFCGSYADLDGGHEIWAFEALTGDPVFSLMREGGS-SNDGHWVRHELVHKPG-TEKRKIGLRKSDEKFSDEQTFQLVRTYIRAE--ALMTASISNHGEAKRSDGLVAGHAYSLLDAKSFSGGIHLVRLRNPWGTF-EWKGAWSDDAAEWKTHPKIQRLIKPSADNDGSFWMNWDDFIAHFNGLDVCNRSRGVRDLYLDLHEDDGCRRHNAGPAKGCAYGCFLYWFCCEGAKALYCGKVAGKQT 449          
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: A0A0M0JH90_9EUKA (Calpain catalytic domain-containing protein n=1 Tax=Chrysochromulina tobinii TaxID=1460289 RepID=A0A0M0JH90_9EUKA)

HSP 1 Score: 314 bits (805), Expect = 8.740e-96
Identity = 176/448 (39.29%), Postives = 245/448 (54.69%), Query Frame = 3
Query:  108 VRILISPITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTYRDKNFPPGPQSLG--SELVAKFQGDSVQWKRADELVSAT---------DKHFHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIPTQQ--GSPIFAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRRYEFRFKPSETDIRAASTLK-TNEVHDSKKFWSILLAYAAGQSGAMCCSIIGKTVEGQRRDGLIELHAYSILRCVDLHGQ-KLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHR-ESNDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGC---PGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAET 1394
            V +   PI LVY +  +YCF C+  Y+ R+A ++   + R+ CC     Y DK+FP    S+G   E   +     ++WKRA E+V                 LF  GV   D+ QGAVGDCWL++A+  +A   PG + K+F+ + YS RGKY +RLFD   G W T+ IDD +P ++  G  +FA+P G ELWV+LLEKA AK+CGSY  ++GG E W  + LTG+ VF   R   T      W R+E    PS    + A  L+ T E +     + ++  Y   +  A+  + I    E +R  GL+  HAYS+L      G   L+++RNPWG   EWKGAWSDG+ +W   P I+  ++   + NDG+FWM W+DF   +  I IC RSR   DL+LD++ED GC    GP VGC  GC  YW CC G R + C   +  +T
Sbjct:   30 VNLFCCPIVLVYKSCAIYCFGCMFEYISRLANSVGCFVFRL-CCWWCCEYVDKSFPANASSIGPWKEKSLEQIAREIEWKRATEVVDELGPRPVAGQPQPRVKLFEDGVSVSDIAQGAVGDCWLMSALCCMAEH-PGQLYKIFVQNAYSDRGKYSIRLFDGRAGMWVTVTIDDLLPVEKATGRLLFAQPKGRELWVLLLEKAFAKFCGSYEGLNGGNEIWAFEALTGDPVFSLLRKHGT------WVRHELAHMPSRAGKKRAIGLRETKEKYADDVTFHLVRTYLRAE--ALMTASISSKGEEKRATGLVAGHAYSLLDAKAFAGGINLVRLRNPWGDF-EWKGAWSDGAPEWTRHPKIRRCIRPTFDENDGSFWMLWEDFVSNFDGIDICNRSRGVRDLYLDLHEDDGCRRHAGPAVGCAYGCFLYWCCCEGVRALYCGKVATKKT 466          
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: A0A7S0X2U4_9CHLO (Hypothetical protein n=1 Tax=Mantoniella antarctica TaxID=81844 RepID=A0A7S0X2U4_9CHLO)

HSP 1 Score: 309 bits (792), Expect = 5.370e-94
Identity = 178/449 (39.64%), Postives = 251/449 (55.90%), Query Frame = 3
Query:  117 LISPITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTY-RDKNFPPGPQSLGS--ELVAKFQGDSVQWKRADELVSATD-KHFHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIPTQQGSPI--FAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRRYEFRF---KPSETDIRAASTLKTNEVHDSKKFWSILLAYAAGQSGAMCCSIIGKTVEGQRR--DGLIELHAYSILRC------VDLHGQKLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHRESNDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGCPGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAETRAGRSC 1412
            L+ P  LVY ++ VY   C G+Y  R+  +I+  L   +C  +G  Y +DK +  G Q+LG   +  A      V+W RAD+L       H HLF   +   D+ QGAVGDCWLVAAM+ +A    GAI   F N+EY+ RGKY VRL+D   G W  + +DD  P  +G+    + KPNGNELW IL+EKA AK+CGSY ++ GG+  W    +TG+NV  F+ S  T      W+R    F    P     R      T++      F++ILL Y++  S  M  S+I K    + +  DGL+  H YS+L        +   G KL+++RNPWG   EW GAW+DGS +W ++P IK  LK+ +++DGTFWM ++DF   ++ I IC R+   +DL LDVNE+ GC GP VGC+ GC  +W CC+G R +     +  +T +   C
Sbjct:   34 LLLPFVLVYNSIVVYLLPCFGVYFMRLFGSIIGTLCCCLCKAMGWYYFQDKEWG-GQQALGDTEDCTAAEMAAKVEWVRADKLDCVPKGMHIHLFEGDIEPADLCQGAVGDCWLVAAMAGMAEH-EGAIRNCFENTEYNDRGKYTVRLWDGRAGVWVRVTVDDYFPVNKGTKTATYMKPNGNELWAILMEKAFAKFCGSYGSLDGGWAVWAWHAMTGDNVLQFKVSDGTT-----WKRRNMVFIGDDPGHAGRRRIGFKSTDDEIPEDAFFNILLKYSSKDS-VMGASMILKEAASEEKMNDGLVAGHMYSLLEVRRAGAMLGQGGTKLLKLRNPWGTF-EWNGAWADGSKEWDANPGIKRELKYVDTDDGTFWMEYKDFVSRFNTIDICDRTTK-NDLRLDVNEEAGCTGPLVGCLVGCGSFWCCCQGVRTIYFGNQTSEDTESAAGC 472          
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: C1MT99_MICPC (Calcium-dependent cytoplasmic cysteine proteinase, papain-like protein n=1 Tax=Micromonas pusilla (strain CCMP1545) TaxID=564608 RepID=C1MT99_MICPC)

HSP 1 Score: 308 bits (789), Expect = 1.320e-93
Identity = 187/467 (40.04%), Postives = 264/467 (56.53%), Query Frame = 3
Query:   66 IVRMGTVGQSCPLPVRI-LISPITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTY-RDKNFPPGPQSLGSELVAKFQGDSVQWKRADEL-VSATD---KHFHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIPTQQGSP--IFAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRRYEFRFKPSETDIRA---ASTLKTNEVHDSKKFWSILLAYAA-GQSGAMCCSIIGKTVEGQRRDGLIELHAYSIL-------RCVDLHGQKLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHRESNDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGCPGPTVGCVGGCLRYWVCCRGCRHVVC--PVASDAETRAG 1403
            + + G + Q    PV + L+ P  L++ A+RVY   CIGIY+ R AR  L  +   +C   G  Y  DK F  G ++LG + +       V W RADEL + A D   KH  LF  G+  +D+ QGA+GDCWLVA ++ LA   PGAI K+F   EY+  GKY VRL+D   G W T+ +DD+IP + G+   +F  PNG ELW ILLEKA AK+ GSY A+ GG   W    +TG+NV+ F++       + SW R +  F   +  + A      L  ++  D + F+ +LL Y++ G        + G   E +  +GL+  H YSIL         V   G+K+I++RNPWG   EWKGAWSDGS +W   P IK  L++ +S+DG+FWM ++DF+  ++ + +C R+   +DL LDV E  GC GPT  CVGGC  +W  C GC  + C    +S+ ET+AG
Sbjct:   17 LTQCGKICQGIAHPVWLCLLWPFILLWNAIRVYLLPCIGIYVMRGARCALGPIFCCLCRACGCYYYEDKEFT-GQRALGEKRI------DVDWVRADELDIVAKDDEVKHIALFH-GIEPEDLCQGALGDCWLVAGLACLA-EYPGAIRKVFRECEYNDVGKYHVRLWDGRVGKWVTVTVDDRIPVKAGTKETVFMHPNGCELWAILLEKAFAKFVGSYGALDGGLAVWAWHAMTGDNVYDFKKRP-----NGSWIRRDLVFLNKKKGLEARCDVGYLPNDDEIDEEDFFGVLLKYSSKGAVLGAARMVSGAEREEKNSEGLVAGHQYSILDVRRVGTSMVRTGGRKMIKLRNPWGTF-EWKGAWSDGSREWDDHPKIKKELEYEDSDDGSFWMEYKDFADRFNSVDVCDRTT-TTDLVLDVREADGCLGPTKACVGGCGFFWCMCGGCSTIFCGNKTSSETETKAG 467          
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: A0A8J4CF99_9CHLO (Calpain catalytic domain-containing protein n=2 Tax=Volvox TaxID=3066 RepID=A0A8J4CF99_9CHLO)

HSP 1 Score: 306 bits (784), Expect = 4.490e-93
Identity = 170/450 (37.78%), Postives = 256/450 (56.89%), Query Frame = 3
Query:  126 PITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTYRDKNFPPGPQSLGS---ELVAKFQGDSVQWKRADELVSATDKH-FHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIPT-QQGSPIFAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRR-------------YEFRFKPSETDIRAASTLKTNEVHDSKKFWSILLAYAAGQSGAMCCSIIGKTVEGQRRDGLIELHAYSILRCVDLHGQKLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHRESNDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGCPGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAETRAGRSCVGC 1421
            P+ L+  A+R+Y   CI +Y+ R+ R IL  +   VCC    T++DK+FP    S+G    +  A+   + VQW+R  ++V+A+ +    LF+  +   D+ QG +GDCWL++A++ LA +  GA+ ++F+  EY++ GKYKVRL+D  K  W T+ +DD IP    G PIFAKPNG+E WV+LLEKA+AK+ GSY+ + GG   W L+ LTG+ VF F   S T      W+R             Y+ R KPS+ ++            DS++ ++ +L Y   +S     S  G   + Q  +G+++ HAY+I     +   +L+Q+RNPWG   EWKG WSD SS W  +P +K AL  + ++DGTFWM W+DF + +  +  CAR+    D+ LD++E+    GP +GC+ GC RYWVCC G R +     S    +    C  C
Sbjct:   32 PLILLIQAIRIYFLGCILVYISRLGRGILCGMC--VCCHC--TFKDKSFPHNHISIGEWKDKTPAQIDSE-VQWRRIADIVAASSRTGAKLFAGRIEPSDICQGQLGDCWLMSALACLA-NQDGAVQQIFVTKEYNAYGKYKVRLYDAPKEAWVTIAVDDWIPCGSNGLPIFAKPNGDEAWVLLLEKAMAKFKGSYARLDGGITMWALECLTGDFVFKFNMDSKTGK----WKRFDIVHVPKAEAGGYDIRLKPSDDEL------------DSEEMFNSMLFYNRKRSFISASS--GSGTDTQDVNGIVQGHAYAICNVKRVDRFQLVQLRNPWGTF-EWKGNWSDLSSLWEENPKVKRALDFKPADDGTFWMEWKDFCEHYKSLDFCARTTGFEDISLDIHEERPYCGPVLGCLEGCFRYWVCCLGVRALFFSRKSRHFEKPPTGCCAC 456          
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: A0A836C4C1_9CHLO (Calpain catalytic domain-containing protein n=1 Tax=Edaphochlamys debaryana TaxID=47281 RepID=A0A836C4C1_9CHLO)

HSP 1 Score: 302 bits (773), Expect = 1.710e-91
Identity = 174/443 (39.28%), Postives = 260/443 (58.69%), Query Frame = 3
Query:  126 PITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTYRDKNFPPGPQSLGS-ELVAKFQGDS-VQWKRADELVSATDKH-FHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIP-TQQGSPIFAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRRYEFRFKPSET----DIRAASTLKTNEVHDSKKFWSILLAYAAGQSGAMCCSIIGKTVEGQRRDGLIELHAYSILRCVDLHGQKLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHRESNDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGCPGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAETRAGRSCVGCVCC 1430
            P+ L+  A+R+Y   C+  Y +R+AR +L  L   VCC    TY+DK FPP  +S+G+ E  +  Q D+ V+W+R  ++V+A+ K    LF+  +   D+ QG +GDCWL++A++ LA +  GA+ ++F+  EY+  GKY++RL+D  K  W T+VIDD IP  QQG  IFAKPNG+E WV+LLEKA+AK+ GSY+ + GG   W L+ LTG+ VF F+  + T      WRRYE   +  +     D+    T   ++   S++ +S LL Y   ++  +  S  G   + Q  +G+++ HAY++     +   +L+Q+RNPWG   EW GAWSD S  W   P +K AL    ++DGTFWM W+DF   ++ +  C+ S    D+ LDV+E+    GPT GCV GC +YW+CC G R +     S    R  ++  GC  C
Sbjct:   27 PVVLLVQAIRIYFLGCLFTYGKRMARGVLCGLC--VCCSC--TYKDKAFPPTARSIGAWEGKSPEQIDAEVKWRRIGDIVAASSKTGAKLFAGKIEPADIAQGGLGDCWLMSALACLA-NREGAVQQVFLTKEYTHYGKYRIRLYDAPKDKWVTVVIDDWIPCNQQGQSIFAKPNGDEAWVLLLEKAVAKFKGSYANLDGGHTMWALECLTGDYVFKFKADAKTGR----WRRYEMVHQAKQGGGGYDVMVRPT---DDDLGSEEMFSTLLWYNRKRA-FLAASNAGGGNDTQNVNGIVQGHAYAVCNAKLVDRFQLVQLRNPWGTF-EWAGAWSDNSPLWEQHPKVKRALDFMPADDGTFWMEWKDFCAHYNCLEFCSWSTGFDDIALDVHEEHLICGPTWGCVEGCFKYWLCCLGIRALFFSRQSRNFERPEKA--GCCAC 453          
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Match: A0A7S1SMU3_9CHLO (Hypothetical protein n=1 Tax=Tetraselmis chuii TaxID=63592 RepID=A0A7S1SMU3_9CHLO)

HSP 1 Score: 302 bits (774), Expect = 4.520e-91
Identity = 166/449 (36.97%), Postives = 248/449 (55.23%), Query Frame = 3
Query:   72 RMGTVGQSCPLPVRILISPITLVYYALRVYCFSCIGIYLRRIARTILRALSRIVCCGVGLTYRDKNFPPGPQSLGSELVAKFQGDSVQWKRADELVSATDKHFHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSSRGKYKVRLFDIVKGTWCTMVIDDQIPTQQGS--PIFAKPNGNELWVILLEKAIAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRRYEFRFKPSETDIRAASTLKTNEVH--DSKKFWSILLAYAAGQSGAMCCSIIGKTVEGQRRDGLIELHAYSILRCVDLHGQKLIQIRNPWGARGEWKGAWSDGSSQWRSSPLIKAALKHRES-----NDGTFWMCWQDFSKVWSEITICARSRDASDLFLDVNEDLGCPGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAE 1391
            R   V  + P  V +   P  L   + R+YC  C+ I+ ++    +L A     C   G  Y+DK FPP  +S+G   ++  +   ++W+   E V       +LF   ++  D++QG++GDCWL++A + LA + PGAI ++F N   +  GKYK +LF      W  + IDD IP   G+  P+FA+P+G+E WV+LLEKA AKYCGSYSA+ GG   W L+ LTG++VF F +  +++     W+++    KP E   RAA    +  V   D++ F++ L+     +   + CS+ G   +    +G++  HAYS+L  V+  G +L+Q+RNPWG+  EWKG WSD  S W  +P +  A K   S     NDG FWM WQDF   +S +  C R+    DL LD NE+ GC GP  GC  GC ++W CC G + + C  A+ A+
Sbjct:   54 RAARVSVAMPKGVHVACCPCILCVQSTRIYCCGCMHIFGQKFVSRVLCA----PCIACGRRYKDKRFPPCRESIG---MSDGELSKIEWRSCAE-VCGKGGGMNLFYDDINPSDIRQGSLGDCWLLSAFACLA-NYPGAIQRVFHNKTINQYGKYKFKLFSRPLEKWIVVKIDDMIPCDAGTGQPLFARPSGDEAWVMLLEKAFAKYCGSYSALKGGDTLWALEALTGDHVFKFIKEDSSSG----WKKFTLVHKPEEGK-RAAYLSSSTAVQPLDNEAFFA-LIKEQCKKGSVLECSM-GSGNDSADTEGIVHGHAYSLLNIVESKGLRLLQLRNPWGSF-EWKGKWSDNDSSWSQNPKVAKACKWHASGKGQTNDGLFWMDWQDFMSYFSYVGFCFRTTGIDDLSLDSNEEKGCCGPVGGCFTGCFKFWCCCHGVKALCCAQATSAD 485          
The following BLAST results are available for this feature:
BLAST of mRNA_D-mesarthrocarpus_Contig1041.5.2 vs. uniprot
Analysis Date: 2022-09-19 (Diamond blastx: OGS1.0 vs UniRef90)
Total hits: 25
Match NameE-valueIdentityDescription
D8LB20_ECTSI2.430e-20761.66Calcium-dependent cytoplasmic cysteine proteinase,... [more]
A0A7S4G5T7_9EUGL1.510e-10739.28Hypothetical protein n=1 Tax=Eutreptiella gymnasti... [more]
A0A7S0N3J2_9CHLO4.880e-10641.11Hypothetical protein n=1 Tax=Pyramimonas obovata T... [more]
A0A0M0JWJ2_9EUKA2.600e-9638.26Calpain catalytic domain-containing protein n=1 Ta... [more]
A0A0M0JH90_9EUKA8.740e-9639.29Calpain catalytic domain-containing protein n=1 Ta... [more]
A0A7S0X2U4_9CHLO5.370e-9439.64Hypothetical protein n=1 Tax=Mantoniella antarctic... [more]
C1MT99_MICPC1.320e-9340.04Calcium-dependent cytoplasmic cysteine proteinase,... [more]
A0A8J4CF99_9CHLO4.490e-9337.78Calpain catalytic domain-containing protein n=2 Ta... [more]
A0A836C4C1_9CHLO1.710e-9139.28Calpain catalytic domain-containing protein n=1 Ta... [more]
A0A7S1SMU3_9CHLO4.520e-9136.97Hypothetical protein n=1 Tax=Tetraselmis chuii Tax... [more]

Pages

back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
D-mesarthrocarpus_Contig1041contigD-mesarthrocarpus_Contig1041:2584..14213 +
Analyses
This mRNA is derived from or has results from the following analyses
Analysis NameDate Performed
Diamond blastx: OGS1.0 vs UniRef902022-09-19
Discosporangium mesarthrocarpum MT17_79 OGS1.02022-07-08
Properties
Property NameValue
Stop1
Start1
Seed ortholog2880.D8LB20
PFAMsCalpain_III,Peptidase_C2
Model size2168
Max annot lvl2759|Eukaryota
KEGG koko:K08582
Hectar predicted targeting categoryother localisation
Exons11
Evalue4.93e-212
EggNOG OGsKOG0045@1|root,KOG0045@2759|Eukaryota
Descriptioncalcium-dependent cysteine-type endopeptidase activity
Cds size1362
COG categoryO
BRITEko00000,ko01000,ko01002
Relationships

The following polypeptide feature(s) derives from this mRNA:

Feature NameUnique NameSpeciesTypePosition
mRNA_D-mesarthrocarpus_Contig1041.5.2prot_D-mesarthrocarpus_Contig1041.5.2Discosporangium mesarthrocarpum MT17_79polypeptideD-mesarthrocarpus_Contig1041 2658..13481 +


The following UTR feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesTypePosition
1680789179.752036-UTR-D-mesarthrocarpus_Contig1041:2583..26571680789179.752036-UTR-D-mesarthrocarpus_Contig1041:2583..2657Discosporangium mesarthrocarpum MT17_79UTRD-mesarthrocarpus_Contig1041 2584..2657 +
1680789179.915167-UTR-D-mesarthrocarpus_Contig1041:13481..142131680789179.915167-UTR-D-mesarthrocarpus_Contig1041:13481..14213Discosporangium mesarthrocarpum MT17_79UTRD-mesarthrocarpus_Contig1041 13482..14213 +


The following CDS feature(s) are a part of this mRNA:

Feature NameUnique NameSpeciesTypePosition
1680789179.7694218-CDS-D-mesarthrocarpus_Contig1041:2657..28491680789179.7694218-CDS-D-mesarthrocarpus_Contig1041:2657..2849Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 2658..2849 +
1680789179.7798178-CDS-D-mesarthrocarpus_Contig1041:3111..32401680789179.7798178-CDS-D-mesarthrocarpus_Contig1041:3111..3240Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 3112..3240 +
1680789179.79774-CDS-D-mesarthrocarpus_Contig1041:5214..53451680789179.79774-CDS-D-mesarthrocarpus_Contig1041:5214..5345Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 5215..5345 +
1680789179.812676-CDS-D-mesarthrocarpus_Contig1041:6776..69321680789179.812676-CDS-D-mesarthrocarpus_Contig1041:6776..6932Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 6777..6932 +
1680789179.8246667-CDS-D-mesarthrocarpus_Contig1041:7766..78531680789179.8246667-CDS-D-mesarthrocarpus_Contig1041:7766..7853Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 7767..7853 +
1680789179.8408911-CDS-D-mesarthrocarpus_Contig1041:8957..90771680789179.8408911-CDS-D-mesarthrocarpus_Contig1041:8957..9077Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 8958..9077 +
1680789179.8515468-CDS-D-mesarthrocarpus_Contig1041:11083..111621680789179.8515468-CDS-D-mesarthrocarpus_Contig1041:11083..11162Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 11084..11162 +
1680789179.8597891-CDS-D-mesarthrocarpus_Contig1041:11433..115381680789179.8597891-CDS-D-mesarthrocarpus_Contig1041:11433..11538Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 11434..11538 +
1680789179.8679645-CDS-D-mesarthrocarpus_Contig1041:12346..124181680789179.8679645-CDS-D-mesarthrocarpus_Contig1041:12346..12418Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 12347..12418 +
1680789179.8769503-CDS-D-mesarthrocarpus_Contig1041:13005..131561680789179.8769503-CDS-D-mesarthrocarpus_Contig1041:13005..13156Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 13006..13156 +
1680789179.886571-CDS-D-mesarthrocarpus_Contig1041:13341..134811680789179.886571-CDS-D-mesarthrocarpus_Contig1041:13341..13481Discosporangium mesarthrocarpum MT17_79CDSD-mesarthrocarpus_Contig1041 13342..13481 +


Sequences
The following sequences are available for this feature:

protein sequence of mRNA_D-mesarthrocarpus_Contig1041.5.2

>prot_D-mesarthrocarpus_Contig1041.5.2 ID=prot_D-mesarthrocarpus_Contig1041.5.2|Name=mRNA_D-mesarthrocarpus_Contig1041.5.2|organism=Discosporangium mesarthrocarpum MT17_79|type=polypeptide|length=454bp
MGTVGQSCPLPVRILISPITLVYYALRVYCFSCIGIYLRRIARTILRALS
RIVCCGVGLTYRDKNFPPGPQSLGSELVAKFQGDSVQWKRADELVSATDK
HFHLFSKGVHADDVKQGAVGDCWLVAAMSTLAASIPGAIAKLFINSEYSS
RGKYKVRLFDIVKGTWCTMVIDDQIPTQQGSPIFAKPNGNELWVILLEKA
IAKYCGSYSAISGGFEAWGLKVLTGNNVFFFRRSSATASSDWSWRRYEFR
FKPSETDIRAASTLKTNEVHDSKKFWSILLAYAAGQSGAMCCSIIGKTVE
GQRRDGLIELHAYSILRCVDLHGQKLIQIRNPWGARGEWKGAWSDGSSQW
RSSPLIKAALKHRESNDGTFWMCWQDFSKVWSEITICARSRDASDLFLDV
NEDLGCPGPTVGCVGGCLRYWVCCRGCRHVVCPVASDAETRAGRSCVGCV
CCV*
back to top

mRNA from alignment at D-mesarthrocarpus_Contig1041:2584..14213+

Legend: UTRpolypeptideCDS
Hold the cursor over a type above to highlight its positions in the sequence below.
>mRNA_D-mesarthrocarpus_Contig1041.5.2 ID=mRNA_D-mesarthrocarpus_Contig1041.5.2|Name=mRNA_D-mesarthrocarpus_Contig1041.5.2|organism=Discosporangium mesarthrocarpum MT17_79|type=mRNA|length=11630bp|location=Sequence derived from alignment at D-mesarthrocarpus_Contig1041:2584..14213+ (Discosporangium mesarthrocarpum MT17_79)
CGAGCGTCGCGCGTTTCTCAAGGAATATCTGCGAGACAAACAACTCGACG AAACAAAGTGTTCGGATAGTCAGGATGGGCACGGTTGGTCAGTCGTGCCC TCTTCCTGTGAGGATCTTGATAAGCCCTATCACTCTGGTGTACTACGCCT TACGGGTCTACTGTTTCTCTTGCATAGGGATCTACCTACGAAGGATAGCT CGGACAATTTTGAGAGCGTTGTCGCGGATCGTTTGCTGTGGTGTCGGCTT GACCTACCGGGACAAGGTGGGATATGCTGGGATAGAGTACGTTATCGAGA TACAGAAAAAAGAAGAGACGTCGCTTGGTGTACCTCTCGGATTGGTCGAT TCCTGCGACAGAGTGCGGCCTGCCGCAGACGTTCGGACTTTTGGAGGCGC ACAACCGCTACCCAACTGTACCCTTTGTTTAGCCATCCTGCTTCACAATC CGCGTTCTTGATCCACCCACTGTGTGGGTATCTTACCAGCCATGCCTTAA CCCCCTGCTTCCCTTTCGAACAAGTCAGAATTTCCCCCCAGGACCGCAAT CACTTGGATCGGAGCTTGTCGCAAAGTTTCAGGGGGACTCTGTGCAGTGG AAGAGGGCAGATGAGCTTGTGTCCGCGACAGATAAACACTTTCATCTTTT CAGCAAGGTCGGGTCATAAGATAGCCCACACTCCCCACCTATAGTATAGC ATTTGAATTTTCCTTGGTTGGTAGCTTTATTGATTGGCCGTAGGTTTATA GCTTTTACTGACATCTGCGATCGAAGTGCTCTGGTGTTGGTAGCAAGGAA TCGTCCCGTGTCTGTGTTTTGTTCCTTGTTTTCCGTAATTTTTGACGCAA ATTCGTTTTTGAATTTTTTGGGACCTTATGAAGGGTTTATCGCAGAAAAA CTGGTGGCATCAATTTTTTAATTCAGGGGAGTTGATTGTCAATTTGAAAT ATGTTGGGAGGAGTATGGAAATCTACTGAAATGAACTACTTCCAAGTAGT GTTGATCAAATAACGGGAAAAAACTTCCCAAACAAGGACAGGGGAGGGGG ATTATGCATTTGGTGTAAGAAGATATGTGCTGGACTTCCACTGGGTCACA GTAGATCCAACAGATTAGGAGTTTTTGATCAACCCAGCCGGGAGTAAACA CGCCTTCAAGTTGTTATCAACTGTTGAGGTGGTGTTGATCCAGTCAACGA GCCACCAGAGAAGCAGTGGTCGTCAGACGTCCCACCCCCGCACCGGTCAG AAATGGCATCTAGCCGTAACTGCCGGATCACTCTGGGGGCTGCTTTTAGC TGAAATTTCTTGGAAGCAGATGCGGAGGGTGATGATAGTCTGTCGGTCAT GTCTATTTGTTCGACTTCCACCTGCCGCACAAGTTGCTGTCTGTCTTGCA CGAACTGCTCCCGCAGTAGCCGTAACGCAGCCCCGGATTTTTGGCGACGA ATGCCTGCAAGATACGGATCTTCAGTGCGGCACGGCGCGGTGGTGCCTGC CACTCAGTCTAGCACTGCATGATACTGTATGGCACGGAGGGGCGGCGAGT GGGGGGATTGCCCCTGGGCCGACGGTAGCAGCACTATGTAGTGCAACAAA GTGACGCCACAGGTGTACAAGGTGATGTCACAAGTGCACAAAGTGATGTC ACAAGTAGCTCTTTGCACCTGTAACGATGTGGCAGTGAGAGTGGTACTAA TAGGCTTGGCTTTGCCTTCGTTGTGTGGCTTTCTTTGGTGTTACCTCGTT GCAAAGAGGTTGAAGTTGTGGTTAATAGGTTAAAGTTGTGGTGAAGAGGT TAAAGTTGTGGTTAATAGGTTAAAGTTGTGGTGAAGAGGTTAAAGTTGTG GTTAATAGGTTAAAGTTCTGGTGAAGAGGTTAAAGTTGTGCTTAATAGGT TGAAATTGTGCTTAATAGGTTGAAATTGTGCTTAATAGGTTAAAGTTGTG CTTTATAGGTTGAAGTTGTGTTTAATTGGTTAAAGTTGTGCTTAACAGGT TAAAGTTGTGCTTAACAGGTTAAAGTTGTGCTTAATAGGTTACAGTCGTG CTTAATAGGTTGAAGTAGTGCTTAATAGGTTAAAGTAGTGGTTAATAGGT TGAACCTCCTGATGCGACACACGGCAAAAGAGTGTATGACGGCAGGCATC TCAGAGGGTACAGCGTGCTCTGAAGTGCTTGGCTGTGAACCCTGGCCCTA ACCCTAGCCCCCCCCCACAAGAAATGGACGCTTCAGCTTGTTGTATTTTT CTTAACATGCTCCAGATGTCGAGCGATGTCGGTCCAGAGGAGCTTCTGCA CATGCACATGTGCGCGGCTACAGCTGTATGTATCGAGTACAAGGGCTCCG TCCTTTAGGATATCAATGTTGTTGTTCACCAGGCCATCTCATCTGCATGT TCCCTGCAGAGGTCATTAAAGGGTCTTGGCCCAACCGACCCGATTTAATC CAACTACAGCACGAAATTCTCCTGGGCTACATAGAGCATCTGGCAACTCA GGGGTGGTCCTTTGGGCATGTACGGGCATATGCCTTTCTACAGTGTGTAG CCATGGTGCCAGGTCTTGCCGAGGGCATCGAGCGTGAAGCTTGACCTGTC CGATTTTTGTTTCTTATTGTTTTTTTTCCAGGGGGTGCATGCCGATGACG TGAAGCAGGGGGCCGTGGGGGACTGCTGGCTGGTCGCAGCGATGTCTACG CTTGCCGCCAGCATACCCGGCGCAATTGCCAAGCTCTTTATCAACAGTGA GTACTCTTCCAGGTGAGGGGAGAAGTCTGACCGCTCACTCTCGTGGTATT GGCGCAGTAGAAACAACAGGAGGGAGGAGGTGTCAAGGTTTTTGGCCTGA TGATGCTTGTAGCTCGCATGAGCGTTGATGTTTGCGCGGGCAAGGGGAGG GAGCCGCTGGGGGATGGCCATCGTTCTACCCTATCAACCCTAGCCCTAGA ACCCCAGCCCTAACCCTAACCCTCAAGGATAGGGGAGGTATGTGGGGATT GTTGCATTATTCAACGCAAGCATGAAGATATAAAGCTCGCAGGGGGGGCT ATCCTCTGCTCTGGAGCTCTGCTCTGCTCCAGAGCTACAAAAAATCGAGT TTTACACTGCCACCCAACGCTGTCACCTGCACGTTTCGATCTCGATTTTT CCACCCATTTTCGAAGGGTTTTTTTTTTTTTCACCGCTTGGCCCTTTCTG TCTTGAAGTACGCGCATCTATGTATCGTGGTGTCTGTCCTCCCTGATCAC ACGGTGAGATCGGCGCTTGAAGGCTTTGGGCCGCCCTTCATTTTGTTTTT TTTTTTTTTTTTTTTGGGGGGGGGGGGGATTTTTGTTGCGCTTTGTGTGC CGTTAAGAGTGGACCTTGGCTACAACAGCGATGAGCCATGTGCATTGCTT CTGAGGGTGGCTGAGCATCAGTGTCAGATGTCCACTCGTGTTCAGCGCGG CACCCCTTTCCTGCAATCCATACCAGGAAGTGCAGCAGCATGCCTGGCCC TGACCCCTCTATGCACATTATCTAAGAGAACACTCTCGGAAGCACTCCAG AGTACATTAGAATGCACGTGTGGGGCATGGTGAGGTACTTGAGGGTAGTT GGGAGTACGGTGCAGTAGGGTGGTTGGGTGGATGTGCATTTATGAAGGAA AAAGAAGCAACAAAGGTCGTGGTGAAATCGATCGAATTGGTATCATGTTC TTTCAAGGTAGTCTTGAAGGATGACTGCAAGTTGTAAGAGTGGTGTCAAC TAGTTTGTCAGTGGATCACACCCATTGATGAAGCACTCCAAAGTGATGTC GTGTCCTTTTTGTCGACAAACCACGCCCATTGATGGAGTACTTTGGAGCG TTAGGGCAGGGTGTTAGGGTTAGAGTTAGGGCCTTACCTCACAAACAGTG GGACTGGTATTAGGGTTAGGGTTAGCACCTACACGGAACGCTGGCGCCCA TGTTTGTCAACAGCACACTGCTGCTGTACATTTTTTCACATCCAATTTTC CCTTCAATGAGACATGATTGTGATCTCTCTACAGTATGACACTGTGCATT GTTCTTCATGTGCTGCTCATGTACTACTACTACTACTACTACTACTACTA CTACTACTACCACTACTACTACTACCACTACTACTACCACTACCACTACT ACCACTACTACTACTACTACTGCAACACCCAATCATGAACAAGGGGCAAG TACAAGGTGCGTCTCTTCGATATTGTCAAGGGCACGTGGTGCACCATGGT TATTGACGACCAGATACCGACACAGCAGGGCAGCCCCATCTTTGCCAAGC CCAACGGCAACGAGCTCTGGGTCATCCTCCTTGAGAAAGCTATCGCAAAG TGAGGGCTCCTGGAGCAGGAACCTTGATGTAATGAAACTTGTTTTTTTTG TTGTCTTTCTTTAGTCGGCAGTGTGGAGGAAAGGGCTCGAGAGCTTGACC AGACAACTCTCTGGCTTCTCGTTACCCATCAGCCTACGCCCCCGTGCCTT AACCCCAAAACCTGACCCTAACCCTGGCTTGGTGAGATGCAATGAAGTGA AATTTGATTTAGAAGAAGGCTTGAGCAAGGGTTTTGCCCGGTGCTTCATA CTTGGAAGCACCACGGTGAAGTTTGGTTGGTTGTAGTCCTCGCAGTCCTG GTAGCTCATTGGTTGTGGCCCTCAGGCCCTCATTGTTCTTCTAGCCCCAG AGCAACTGTAACGAATCATGCTAGTAGTGTGTAGCTGACAGTAAAGGGTT AGTGTTAAGGCCAGGGTGGGCACACTGTCAGAGTGTTAGAAAAAGGGGCG GGTGAGTGGTGGAAAAGCCTTGTCTATGTGATTCCACCGTAGTAGTCTTC TCTCTCAATTGGGTTTTTTTAGAGTTAGGGTGATCAAAGATGCACTGTAC TACTCATTGGTTGGCGACTGTTTCTGGGCAGGTGAATGGTGATGGTTACT GTGTGTAATAATAGGTCAGAGTTGGGTTAGGGTTAGGGCTAGGGGTTATA TAGGGTTAGGGTCAAGTTTAATTTGGCACTCTAGTAGGGTTAGGCTTAGG GGTAGGTGTAGTACTCCAATGCGTTGAAGCGGAAGAGCTGTGGAGTGTAC TTGAGCACACATGCATGTTTTCCTAATGTGGTCCTTTCCTTTTTGATGCC GTTGCTGTGCCACTTGTTTTGTCTTCATCTCAGATATTGTGGGAGCTATT CAGCTATCTCTGGGGGGTTTGAGGCCTGGGGGCTCAAGGTTCTCACGGGC AACAATGTTTTCTTCTTCAGGTAAGCTCAGCCCTAACCCTATAAACCGAA TGTCGCGATGGGTTATTGATCTTGAGTAAGTCCAATGTGGTGACTTTGGT TGAACCACCCCAAACCCTAACCCTCAAAGCGACGAGGAGGCAAAATATAG TGCTTGGAAGTAATACGTCAACTTGCCGTAGCTCGCGAAAAAAAGAGAAT GTTACGTAAGAATAATAATAATAATAATAATAATAATAATAATAGTGATA ACAATAGTGATAACAATAGTGATAATAACAATAATAGTAAAGACAAGAAG GAAAAGAGTAATGGCATCAACCCGCGTCCATGACAGCCGCGGCTAGACCC CCCCCGTGAGGCCTCGTACGGGTTACCCAAGAGTGGTAAGAAAGACCGGC CCCTCACCTTTACAACAGTAGAGCTAGTATTCCGTGGCAGTTGCAGGAGA GCTGTCCTGGTTATTGTTACCAGGTTTATAGACGTACGTAAGCATCCTCT GTGTGTTACCTTCGTCGTGTTTCCCTCCCTAATTCTCTGCATCCAATGGA TGGATGACCTGCACGGTGAGGACCACCGCCTAAGAAACAACCTTGGGGTT GCAAGTGCGGCTAGTACGTGCACCTGTCACTAGCCGATTTTGGAGTCTCA CATCACCAGGAAAGGAGGGAGGCAAAACAGCAGGATCAGGGTTGTATAGA CCTAACCATACCGAACCCTAGAACTAACCATAACCGTAACCATAACCCAG GCCTTAAGGGCAATCCTACCTTTTAACCTTAAAAGTGAATTACCCTTGTG CTACCTGTGATCCCTAGATATGTAATCAAAATCGCCTGTCTTGCCCCTCG CATTGGAAACACCCGAGGGCATGTGCTGTGTTGGTGCGGTAGATCACACA TCCTTTCCCTATATTACACATCATGTCCACAAATCACACAAGAGCCAGAG CCTCTGCACAGCATATACTTATCATACAGCACAGTGGCGCCGTATCCTCC CATGCCACCACCCTAACAGTGATAGTAAAGACCTACTTCGTCAGAACGAG TCCTCAAAAGCCCAGGCTGTTTGTGTGTAAATTGTTTTTTGCTATTTTCT CGTCATCATGCATGTATGTGCCAGGCGGTCTTCAGCGACAGCGTCCAGTG ATTGGTCATGGAGACGGTACGAGTTCCGCTTCAAGCCGTCGGAAACCGAC ATCCGGGCGGCTTCCACTCTGAAGACAAACGAGGTTCACGACAGGTGAGC ACTCCCATCTGCTTACTGCAGCGCACCACACAGCTGCAGTGCAGGTGCAG CGCTGTAAGGGAATAAGAGCCCACCATTCTGTCTGGAGGAAAGGATGAAA TGGCTGCGACCAAGAAGGTTGTGCATGATCGATTTCTGGTTGTGTATGAG TTGTAGGCAGGACGGACCTAGTTAGCCCGTACCTTTCAGCTCTCAAGGAA ATAGCTATCACTATATTCTTTCAAGTCTGAAATAAGGGTCAAACATTGAT TTAAGTCCCTCATCCGTACGTCGGAGAGACTATCCATGAAGGTCTCTCAG AGACATTTTGAATGTAGCTGTCGATAACGAGTAAGTCTCTAACATGCTTA CTTTTTCAACACCAGCATTCAATTAAACATAGTCGACTGCCTCATATTCA ACTGCTCTTCGTGTGCAGGCATTCCGTGGGAAAACTGCAACAAACAGTAT ACTGGTGAAATTGAGAGTGAAGGTTCGAAATGAGAGTGGGGACTCTATAT GAAAGGGGTTGTCCGGACTTTGATTAACGAGCGAAAGCTTGTCAGGGCGA GATCAGCGAAAATCGAAGAGTGGTGCAGATGGGCCAAAAGATTATTTTGG CCGTTTGTTGGGCAAGAGTTTTCAACTCGAAATCATTTGCGACAAATTAG AGCGTATTGTAATCGAAACACGGCATGTTGACGTGACGTATTATGCTAAG AAGTGTATATATGTATGTAGTATGACAGTACAGCCAGAGAGACAGGTAGA ACATGCAGCATACAGTAGGTGGTCCTCAAAGCTGGTGACCGTGTCGCCGG GGGTATAGAATCCCCACTCTCATTTCGAACCTTCACTCTCAATTTTGCCA GTGCACTGTTAACAAGGGATGGAACACCGCCCCTGAAGACAACTTCTAGA ACTTTGCCTTCCTCCTCGAGTTGTGAAAGATGATGTAAGACAGGTTCTGC AGGTACAACAAAGAGAGCGAAATGAACTCTCGTGACACGTTTTACGCTGT TGTAATCTTCCCGACCCCCCAAAAACCGAGAGGTAATAAAAACTTGGAGA CTGTGCATTTAATTCCGAGTGTGACCAACTCAAGACAAAACGGTGAGGTA CTGTCTGTACCTGCCAAAAACACTCGAGGTGTCAAACCACAGGATTCGCT GCAGACAATGATTTCGCACCGATGATAACATACCGGTGGTTTTTGTATTT CCTCGAGAACAAGTCCACGAGAACATGAACCGCTTGGAGCACGATATGCT CTCGGTCTGGGTCTTGGGCCAGGTCCCTGCACTGCAGTCATTGTTCCATC GGATTGAGCTAGACCGGCCCGTGTCGATTACTCAAAGTGTAGTCCGAAGT GAAATACTTGGCTTGAGCAGCTCTACTATGTTAGCTTGAAAAAATGCTCT GTTTCGGTGCAGATGCCGCGGGTGTGTGGAATGGTTTTCGGCGTAGTTGA TTAATTAATGTGTGGATGAAATCTACCCTCCCTGGGGCTGGGATATAAAA TGGCATTGGACTTAAGTGCCTGGGAGTGGAGTTCATTGGCACTTTAATCA ATCAATCGTATCAAAACATGTTATGCTTTACCCTCGGTGTGTTACGATAA CCGAACCTCTTTCTTCGGGGCCTTCGAACAGAGCCAGCAGAGGAGCAGAA CCATATTAGCTGAGCCCTAGAACAGAACCCTAATGGAAAGGAACCATTCA ATCAGAGTACGATGATACACCAGCGGCCACTTTCCAGTTTGTAATAATTG CAGTGACCGTGGGAACGTTTCGTATTCTTTATTAGATGGAGCTGTAGAGC TCTGTAACCAAGTCTCCAATGACAAACCCAACATAAATTGGAAGGAGCGT TGGGTGCATTGCTGTTATGCTGAGAAGGCCAAAGAAGAGTAAGAAAAAGC TCACCCATGCACACCTTGATCTATATATGTCTGTGTGCCCGTGTGTATCT TTGCCATCTCGTGGCCCGCGCACTCCGGTGAATTGATTGGGATCGATCAG CAAGAAGTTCTGGAGCATCCTGCTAGCGTACGCGGCAGGGCAAAGTGGGG CGATGTGCTGCAGCATTATCGGGAAAACGGTGGGTGATACCCTTTGCGCT CCCACGATCAGCCCCCTTTTTGTAGCCCACCTCGGCGCTGTTATATATTA AACCACAACGCACATCCAGCTGCGTGTATGTATAGCTGTCTTATGCTGCA ATATTCCCTCCACCCCGGCACTGTGAAACTGTGTCGGCTGTTTTACAGGG CCACATGAATCCTCCGCATTCTGCTTCATGTTCCTCCTTCAACCCTCTTG TTCCTTTTTTTTCTCCCGCTTTTTTGTTCGCAAACACTGTTTTTTTTCAG GTGGAGGGGCAACGGCGGGATGGTTTGATCGAGCTGCATGCCTACTCTAT CCTGCGGTGTGTCGACCTACACGGGCAAAAGCTCATTCAGATTCGCAACC CTTGGGTGAGAGAAAGTCAAGTCCGCGTTAGGTTTGGGATCACCCAGCCT CTTCATGTCAGAAATCCCTTTTTTGTTCTTGGGGTTAGGGTAGTTTTGTG GCGGCGATCCCGTTGGGGCTGACTCCACCTGTCTGCCTATTCCTGGTTCA CTGGGCGACTGATTGGGTGATTGGGCAGACTACTTGAGGGGGCAGGTCAA CGTTTCTTGTGTGGCATGCCATGTGCGAGGGGCAGTCTTTGTCTTGCGTG GTGCGATCTCGACTGGGACACCTTTTTTACTTGATGGATGATTCTGAACA TTTCGTCCCTCTGCACTCGGGTATGTTGTCCGTCTTGTGTTATCCACCCT TGGTGCGTTTGGCTTAAATGATCAAGTGTGGCTCGGTGAGGCTTGTGTTG ATCACAACTGACAGTACAAAGTTCAGAATCAAACTAACTTCAATCCATGT GAGTTACTTCGTATTTTAATTGTCTAAACTTGAAAAGCAGATACACATGG AACACAGCACCGACACCGTACTTGAAGATGCTCTCAATGTCAGGAAATAC TTTGGTCTGCGTGTTCGTTACTCCGAAGCCACATGTATCTGGCCGCACAC CTGTTTTCCGAAACCACAGAATAACTGCACAATCCCTTCCCCAGCTCTTA TCCCATAGTTAACCACCGATGTCATATTGGTAAATAGAAATGGTACCAGC GACACCTTGACGCAGAACCCTGGCCGCTTCTTGATGCCCCTTCCCTGTGG CTTGGTATTGATTTACCTCACCCCTTTGCTGGCTGAAAATGTGTGGTCAT TTTTTTTTTCCAGGGGGCGCGGGGGGAATGGAAGGGTGCTTGGTCTGACG GATCGTCCCAGTGGAGATCTTCTCCCCTTATCAAGGTCCGGGGTATGCAG CGCCTCCTTCCGCCCCACTTTCACGGGAACAGTTCCCCGAACCCGTGTTA GGGACAGCATAGAGCGACTTACGATATTTATACTTCTGTAGTATCTTGAC ATTTATGTAAATGAGAAAAATACTTGTGACTTGTTCTTTCAAACTCCTCG GAGTTGTAATGATTTGAAGTTGTGGGGTTGAGAGAAAACAAGCACAAGTG TAGAGTTCCACCTCTTCACGCTGGACAGAAAAACGCGGTGCTCGCACCAC TTCGTGCGACAGGTGACCCCCGCAGGAGTCCTTTCTTGGCTCGTCGTTTC TCCTTGTGTGGGTGTTCCACTAGCTGCGTCACTTCTTTTGCTTCGACTTA CAATGCACGCTCTAGAGCACCGCAGGAGGAACGGTCATTGGATGACTGTC GTGTTCCTTTGACAGCCCATGTGCCACACCTTTCCTAATGGCCTGATGAA TCCACCTCTGATGAACTGTTGCGCATTTCAGCTGAGTTCTGCTGTGTTGG CTCTTCCAAGTTTTTCTGTAGGCAAGACTTGGCTCAGATACTTGCTTGGT GTGACCTCTGCTTACTTTTCAGGCAGCTCTCAAGCATCGCGAAAGCAACG ACGGGACCTTTTGGATGTGCTGGCAAGACTTCAGCAAGGTTTGGTCAGAG ATCACCATCTGTGCCCGGTCGAGGGATGCCAGCGACCTGTTCCTTGATGT CAATGAGGATTTGGGCTGTCCAGGTACGAGAGTAGTAGAGCAGGTCCCCC CTATATGCAAGGGATAGAGTTAAGATTACAACTACTGTATGTAGAGTACT TCAAAATAGTGAGTGCTGTGTCAATGCAAGCCACTATGTTTTCACTGGTA AAGCAAATGCTGAGATCCGGTCATCGTGCCCCGGCTTCCGTGCGTCCATT CTTTCCAGGTCCGACAGTCGGGTGTGTTGGTGGCTGTCTCAGGTACTGGG TCTGCTGCAGGGGCTGCCGACACGTGGTATGCCCTGTGGCTAGTGATGCA GAGACACGAGCTGGCCGAAGCTGTGTGGGGTGTGTGTGCTGTGTTTAGCA GCATCCGATGCTGGTGCAAAGCAGTATGGGTGCTTTCTGGTTGCACCCAT GTGCACAACGCTTAAATTGTGGACGCTGAGAAGCCTTAATTGCACGGTTT CGTGTGAAGTATCGAGCGTTGTTGAGATGGCTTGACAGCAGTAACACAGA ACTTGATCTGGCCAGCAACAAATATTGTACTCAGTGAGCGTGCGTGGTTT GGGGCGTTCACAAAATAAGAAGGCGAGGCACATGTCTTTGCCGTGGACAG TATATTGGTGAAATTGAGAGTGAAGGTTCGAAATGAGAGCGGGGACGGGG ACTCTATATGAAAGGAGTTGTCCGGGCTTTGTTTGATGAACGAGCGAAAG CTTGTCGGGGCGAGATCAGCAAATATCGAAGAGTGGTGCAGATGGGCCAA AGAATTATTTTGGCCATTTGTTGGGCAAAAGTTTTCAACTCGAAGTCACC TGCTTCGACGAACTTCACTCTTTGCGTGTTGTGTCTCCGCGCCTTCCCCT TGATCATAAATCGTGATGCACTGCATTTTTCGCGCCCTTGATCAATATTC TTGTGATCTATGCATCGCATACTGATACAAGACGAGCATATATATATAGT ACGACAGTACACGGTGGACGTACTTGCATTGTTTGTTGGGGCAGCAGTAA TCCCCACTCTCGTTTCGAACCTTCACTCTCAGTTTCACCAATATACTGTT CGTAATAAAATGAGCAGGTTTTGCATTCCC
back to top

Coding sequence (CDS) from alignment at D-mesarthrocarpus_Contig1041:2584..14213+

>mRNA_D-mesarthrocarpus_Contig1041.5.2 ID=mRNA_D-mesarthrocarpus_Contig1041.5.2|Name=mRNA_D-mesarthrocarpus_Contig1041.5.2|organism=Discosporangium mesarthrocarpum MT17_79|type=CDS|length=1362bp|location=Sequence derived from alignment at D-mesarthrocarpus_Contig1041:2584..14213+ (Discosporangium mesarthrocarpum MT17_79)
ATGGGCACGGTTGGTCAGTCGTGCCCTCTTCCTGTGAGGATCTTGATAAG
CCCTATCACTCTGGTGTACTACGCCTTACGGGTCTACTGTTTCTCTTGCA
TAGGGATCTACCTACGAAGGATAGCTCGGACAATTTTGAGAGCGTTGTCG
CGGATCGTTTGCTGTGGTGTCGGCTTGACCTACCGGGACAAGAATTTCCC
CCCAGGACCGCAATCACTTGGATCGGAGCTTGTCGCAAAGTTTCAGGGGG
ACTCTGTGCAGTGGAAGAGGGCAGATGAGCTTGTGTCCGCGACAGATAAA
CACTTTCATCTTTTCAGCAAGGGGGTGCATGCCGATGACGTGAAGCAGGG
GGCCGTGGGGGACTGCTGGCTGGTCGCAGCGATGTCTACGCTTGCCGCCA
GCATACCCGGCGCAATTGCCAAGCTCTTTATCAACAGTGAGTACTCTTCC
AGGGGCAAGTACAAGGTGCGTCTCTTCGATATTGTCAAGGGCACGTGGTG
CACCATGGTTATTGACGACCAGATACCGACACAGCAGGGCAGCCCCATCT
TTGCCAAGCCCAACGGCAACGAGCTCTGGGTCATCCTCCTTGAGAAAGCT
ATCGCAAAATATTGTGGGAGCTATTCAGCTATCTCTGGGGGGTTTGAGGC
CTGGGGGCTCAAGGTTCTCACGGGCAACAATGTTTTCTTCTTCAGGCGGT
CTTCAGCGACAGCGTCCAGTGATTGGTCATGGAGACGGTACGAGTTCCGC
TTCAAGCCGTCGGAAACCGACATCCGGGCGGCTTCCACTCTGAAGACAAA
CGAGGTTCACGACAGCAAGAAGTTCTGGAGCATCCTGCTAGCGTACGCGG
CAGGGCAAAGTGGGGCGATGTGCTGCAGCATTATCGGGAAAACGGTGGAG
GGGCAACGGCGGGATGGTTTGATCGAGCTGCATGCCTACTCTATCCTGCG
GTGTGTCGACCTACACGGGCAAAAGCTCATTCAGATTCGCAACCCTTGGG
GGGCGCGGGGGGAATGGAAGGGTGCTTGGTCTGACGGATCGTCCCAGTGG
AGATCTTCTCCCCTTATCAAGGCAGCTCTCAAGCATCGCGAAAGCAACGA
CGGGACCTTTTGGATGTGCTGGCAAGACTTCAGCAAGGTTTGGTCAGAGA
TCACCATCTGTGCCCGGTCGAGGGATGCCAGCGACCTGTTCCTTGATGTC
AATGAGGATTTGGGCTGTCCAGGTCCGACAGTCGGGTGTGTTGGTGGCTG
TCTCAGGTACTGGGTCTGCTGCAGGGGCTGCCGACACGTGGTATGCCCTG
TGGCTAGTGATGCAGAGACACGAGCTGGCCGAAGCTGTGTGGGGTGTGTG
TGCTGTGTTTAG
back to top