Homology
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: D7FSC4_ECTSI (Cellulose synthase (UDP-forming), family GT2 n=2 Tax=Ectocarpus TaxID=2879 RepID=D7FSC4_ECTSI) HSP 1 Score: 1381 bits (3574), Expect = 0.000e+0 Identity = 682/799 (85.36%), Positives = 742/799 (92.87%), Query Frame = 1
Query: 1 SIPQMEDGRGGIPSGHRRMKSSGGFSQRLGKMQEALGMAHHSVPGMG-EKRRAGKQGASYDPRFVQRGERINPQTIETFSSSFLIRGIVLLNIGLGIAYLVWRFTQTQGVEEHLMWXXXTFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMDDLVTGDDGGDLYRLDRNQTYTNMGLDDDISKSKGGDEVEELTHGVSLLADESVHITPGFFQVFRGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVVNDARHQLQPDFLQRTTPYFFDVDEITGQYKWAKVAFVQTPQRFRQELPNDPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALRGRSPDGKTGVDAKDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVFLAQFAAVAGAVYWTLHDGFEHYYKNTLSICAGAFLGMFYLWPMIALQMGVTKPSFWFFKLGAYVILGVAMVILGQF--FDLQIG 2388
S+ ++E+GR +GHRR+KS GGF+QRLGKMQEALGMAH S P G +KRR+GK GA YDPRFVQRGE+INPQT E+FSSSFLIRG+V+LN+ G Y+VWRF +TQ V WXXX FF+VE+FL+ AIW+GH+QRLFAVQRIR TMDQIVSIDPAVGANAVV+ILLPTAGERLDVVLKCLLGASSQRSWP+++ K GRGDGLRVIVLDEKRRKEVY+LTSGVHAL+TQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKD+GLSRAVAMVRQMDDLVTGDDGGDLYRLDRNQ++ N GLD+DISKSK GDEVEELTHGVSLLADESVHITPGFFQVFRGQAKQAACMIYYSRKDAGTP+ISPKAGNMNAAMFP+DDPTSPPLIGPSTIVVVNDARHQLQP+FLQRTTPYFFDVD IT QYKWAKVAFVQTPQRFR++LP+DPLGNHAASQYDVINIGKDGIG VSSSGQGSLWRVEAL+GRSPDGKTGVD KDL LVG +LGFRAE+LIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQP NLTWRIKQVLRWHQGAVQLLY KG RYTS GG+FPT+FHR+YAFDQATYYLQAIPGYVLLLMP+VYG+TGQ PFNT+IT YFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQV+W++LIR NP+HAWVA+VPTWPL LVF AQF A+AGAVYWTLH+GF+ YYKNTLSICAGAFLGMFYLWPM+ALQ+G+ +PSFWFFKLGAYV+LGV+MVILG DLQIG
Sbjct: 14 SMGKVEEGRS---NGHRRVKSGGGFTQRLGKMQEALGMAH-STPASGTDKRRSGKHGA-YDPRFVQRGEKINPQTKESFSSSFLIRGVVVLNVASGFTYMVWRFLKTQDVPPEYKWXXXVFFMVEVFLLFAIWLGHSQRLFAVQRIRTTMDQIVSIDPAVGANAVVAILLPTAGERLDVVLKCLLGASSQRSWPSSSAAKMGRGDGLRVIVLDEKRRKEVYMLTSGVHALSTQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDVGLSRAVAMVRQMDDLVTGDDGGDLYRLDRNQSFLNQGLDEDISKSKEGDEVEELTHGVSLLADESVHITPGFFQVFRGQAKQAACMIYYSRKDAGTPRISPKAGNMNAAMFPIDDPTSPPLIGPSTIVVVNDARHQLQPEFLQRTTPYFFDVDAITQQYKWAKVAFVQTPQRFRKDLPDDPLGNHAASQYDVINIGKDGIGAVSSSGQGSLWRVEALKGRSPDGKTGVDPKDLGLVGKKLGFRAEMLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPTNLTWRIKQVLRWHQGAVQLLYTKGLRYTSFGGSFPTVFHRMYAFDQATYYLQAIPGYVLLLMPVVYGVTGQPPFNTDITSYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVSWTRLIRGNPDHAWVARVPTWPLTLVFAAQFVAIAGAVYWTLHNGFDTYYKNTLSICAGAFLGMFYLWPMMALQVGLGRPSFWFFKLGAYVVLGVSMVILGSIPGLDLQIG 807
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: D7FSC3_ECTSI (Cellulose synthase (UDP-forming), family GT2 n=2 Tax=Ectocarpus TaxID=2879 RepID=D7FSC3_ECTSI) HSP 1 Score: 830 bits (2144), Expect = 3.530e-282 Identity = 443/788 (56.22%), Positives = 535/788 (67.89%), Query Frame = 1
Query: 145 KRRAGKQGASYDPR-FVQRGERINPQTIETFSSSFLIRGIVLLNIGLGIAYLVWRFTQT----QGVEEHLMWXXX---TFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMDDLVTGDDGGDLYRLDR---NQTYT-------NMGLDDDISKSKGGDEVEELTHGV------SLLADESVHITPGFFQVF------------RGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVVNDARHQLQPDFLQRTTPYFFDVD------EITGQYKWAKVAFVQTPQRFRQELPNDPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALRGRSPDGKTGVDAKDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVFLAQFAAVAGAVYWTLHDGFEHYYKNTLSICAGAFLGMFYLWPMIALQMGVTKPSFWFFKLGAYVILGVAMVILGQFFDLQ 2382
KR +G + D F R NPQ F S+ L+R + + N GLG+ YL WR+T T G +++ W FF E FL A+W+G QRLF VQRI+VTMD I S+D VG NA V ILLPTAGE L+VV K L+GA SQR W + PG LRVIVLDEKRR EVY + +GVH + + ++IL AEGV LT F +WC G+ + H++ D L+ AV ++R +D + + D + L+ + TY N S + + E++ +L +++ITPGFF+V+ + K +IYYSRK+AGTPKISPKAGNMNAA+FPVDDPT PL G STIVVVNDARHQL+ +FLQRT PYFF++ G+Y+WAKVAFVQTPQRFR EL NDPLGNHA SQYDVIN GKDGIG VSSSGQGSLWRVEAL+G+ PDGK D +L LVG +LGFR+E+LIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQP++LTWRIKQVLRWHQGAVQLL KG RYTS GG+FPT++HR+YAFDQATYYLQAIPGYVLL+MP+VYG+TG PF T + YF YF PFIVTA+LPT IS+QWR IDSHRLTRDEQTWLSTTYVQIYAFLQV W+ L R +PE+AWVAKVPTWPL VFL Q AVAG VYW + GF +Y N SI A L M LWPM++L +G + PSF++ KL +V LG + V+L F+ +
Sbjct: 233 KRASGMEDDDQDSNLFFDRVYCANPQEKRLFPSAHLVRVLAVANAGLGVLYLHWRYTSTFPPTVGRWDYMSWKLYWWWLFFSAEFFLAIAVWVGLAQRLFPVQRIKVTMDDITSVDDQVGYNARVCILLPTAGENLEVVFKALVGALSQRLWDSGLPGSQT----LRVIVLDEKRRLEVYRVAAGVHRIGELLAGRRIQQILMAEGVTELTQKGFIDWCRNGSGYERKHLYDDKKLNEAVQVLRLLDAMCLANGLTDAFALEARPSSATYNPITAAAWNAAAQAQKSNKPSVEAMAEMSDAARSEAEAKMLGASAMNITPGFFEVYGTHLDPDNMETSKEVQKGLPTLIYYSRKNAGTPKISPKAGNMNAAIFPVDDPTMTPLTGESTIVVVNDARHQLEGNFLQRTVPYFFELAGGHPTVASGGRYRWAKVAFVQTPQRFRMELSNDPLGNHAISQYDVINHGKDGIGAVSSSGQGSLWRVEALKGQRPDGKIVDDPTELDLVGKKLGFRSEMLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPSSLTWRIKQVLRWHQGAVQLLLFKGIRYTSFGGHFPTMWHRLYAFDQATYYLQAIPGYVLLIMPIVYGVTGTPPFVTSLKDYFQYFTPFIVTALLPTAISSQWRKIDSHRLTRDEQTWLSTTYVQIYAFLQVVWTGLTRKSPENAWVAKVPTWPLTFVFLGQVFAVAGGVYWVVQKGFVIWYANFFSIVVVAGLAMHALWPMVSLSLGWSIPSFYYIKLFLWVFLGFSAVVLTNVFNAE 1016
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: A0A836CP13_9STRA (Glyco_trans_2-like domain-containing protein n=1 Tax=Tribonema minus TaxID=303371 RepID=A0A836CP13_9STRA) HSP 1 Score: 757 bits (1955), Expect = 3.030e-253 Identity = 408/814 (50.12%), Positives = 530/814 (65.11%), Query Frame = 1
Query: 1 SIPQMEDGRGGIPSGH-------RRMKSSGGFSQRLGKMQEALGMAHHSVPGMGEKRRAGKQGASYDPRFVQRGERINPQTIETFSSSFLIRGIVLLNIGLGIAYLVWRFTQTQ-GVEEHLM---------WXXXTFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMDDLVTGDDGGDLYRLDRNQTYTNMGLDDDISKSKGGDEVEELTHGVSLLADES---------VHITPGFFQVFRGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVVNDARHQLQPDFLQRTTPYFFDVDEITGQYKWAKVAFVQTPQRFRQELP-NDPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALRGRSPDGKTGVDAKDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVFLAQFAAVAGAVYWTLHDGFEHYYKNTLSICAGAFLGMFYLWPMIALQMGVTKPSFWFFKLGAYVILGVAMVIL 2361
S P M R +P H +R+ S R+ + Q+A G+ + V G + DP F+ + + +PQ + S+F+IR IV+LNI + AYL WR T+T ++++ W F+ VEI L AIWIGHTQRLFAVQR+R+TMD IV D +VG N+ V+ILLPT GE+LDVV+K LLG S R W T+ + + RV+VLDEKRRK V + + V+ALAT + P+ ILQAEGV +T FYEW G+ + H++ D L++A ++ M++ + + + GL++ + KG +V+ + + + S V I PG+ Q F +IYY+R+DAGTPK+SPKAGNMN+A+F +D P PPLIG STIVVVND RHQLQP+FLQRT PYFF++D +Y+WAKVAFVQTPQRF + +DPLGNHAA QYDVIN GKDGIG VSSSGQGSLWRV AL+G DG + D K+ LVGH LGFR+E+LIEDTHTSIE+FR GW S YVNEPGE L+ CTHQP ++ WRIKQVLRWHQGAVQLL+ KG +T GG FPTIFHRIYAFDQATYYLQAIPGY+LLLMP++YG+ GQ PFNT + +F +F PFIVTA+LPTVIS WR +DSHRLTRDEQ WLSTTYVQIYAFL + W ++ AW + PTWPL+ VF +F A+ GA+ W +GFE + N + + A L + LWPM++LQMG PS ++ KL A++++G+ +V++
Sbjct: 270 SRPPMGAPRATVPGDHGAININAKRLPSQ----TRITEFQKASGVKAYGVMDGGV--------SDVDPNFLTKMNKPDPQGMTPLPSAFVIRAIVVLNICVSCAYLWWRVTRTIFQIDDYFFGIPFLPVQGWAW-AFYAVEICLTIAIWIGHTQRLFAVQRVRLTMDDIVREDDSVGYNSRVAILLPTNGEKLDVVMKALLGVVSLRGWDTSVEKCAAQ----RVVVLDEKRRKGVLNMAAAVYALATIVRHPNVLSILQAEGVHAITAKGFYEWWKTGGGYARKHLYNDHFLNQACRLLEYMEEEASTEA----------VAMSVFGLEELPAGVKGKQDVKRNSKKTTQASANSMLQAMNVGNVTIEPGYVQTFNTNVALPT-LIYYTRRDAGTPKVSPKAGNMNSALFALDYPDMPPLIGSSTIVVVNDCRHQLQPEFLQRTVPYFFELDADGQRYRWAKVAFVQTPQRFTNNVQADDPLGNHAAVQYDVINHGKDGIGAVSSSGQGSLWRVAALKGVDADGNSYADVKERGLVGHRLGFRSEMLIEDTHTSIEMFRAGWSSRYVNEPGEHLSICTHQPNSIAWRIKQVLRWHQGAVQLLFFKGIGFTVWGGKFPTIFHRIYAFDQATYYLQAIPGYMLLLMPIIYGVCGQPPFNTTVGEFFLFFTPFIVTAMLPTVISGSWRGVDSHRLTRDEQVWLSTTYVQIYAFLSMCWQQIRCKGTADAWAVRAPTWPLFAVFAGEFGAIIGALVWVSQEGFERWAANLICVIVSASLAIHALWPMVSLQMGWQVPSLYYLKLLAWLLIGIFIVLI 1055
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: A0A835YSM9_9STRA (Cellulose synthase, family GT2 n=1 Tax=Tribonema minus TaxID=303371 RepID=A0A835YSM9_9STRA) HSP 1 Score: 728 bits (1879), Expect = 1.810e-245 Identity = 380/747 (50.87%), Positives = 499/747 (66.80%), Query Frame = 1
Query: 154 AGKQGASYDPRFVQRGERINPQTIETFSSSFLIRGIVLLNIGLGIAYLVWRFTQT-QGVEEHLMWXXXTFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMDDLVTGDDGGDLYRLDRNQTYTNMGLDDDISKSKGGDEVEE---------LTHGVSLLADESVHITPGFFQVFRGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVVNDARHQLQPDFLQRTTPYFFDVDEITGQYKWAKVAFVQTPQRFR-QELPNDPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALRGRSPDGKTGVDAKDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVFLAQFAAVAGAVYWTLHDGFEHYYKNTLSICAGAFLGMFYLWPMIALQMGVTKPSFWFFKLGAYVILGVAMVIL 2361
AG G +P F+ + ++ +PQT+ S+ +IR IV++NIG+ +AYL WR T T ++ + F +T W+ RI++TMD++V D +VG N+ V+ILLPT GE LDVV+K +LG S R W + + RV++LDEKRRK V + + V+ALAT + P+ ILQAEGV+ +T FYEW G+ + H++ D L++A ++ M++ + + GL++ + KG +V+ H + L +V I PG+ Q F IYY+R+DAGTPK+SPKAGNMNAA+F +D P PPLIG STIVVVND RHQLQP+FLQRT PYFF++D +Y+WAKVAFVQTPQRF+ + +DPLGNHAA QYDVIN GKDGIG VSSSGQGSLWRV AL+G DG+ D + SL+GH LGFR+E+LIEDTHTSIE+FR GW S YVNEPGE L+ CTHQP ++ WRIKQVLRWHQGAVQLL+ KG +T GG FPTIFHRIYAFDQATYYLQAIPGY+LLLMP++YG+TG+ PFNT++ +F +F PFIVTA+LPTVIS WR +DSHRLTRDEQ WLSTTYVQIYAFL + W ++ + AW + PTWPL++VF +FAA+ GA+ W +GFE + N +SI A L + LWPM++LQMG PS ++ KL A++I+G+ +V++
Sbjct: 91 AGDGGPHDEPNFLTKVDKPDPQTMTPLPSAMVIRAIVIINIGVSLAYLYWRVTHTITNIDSY------------FFGIT--WLPV--------RIKMTMDELVREDDSVGYNSRVAILLPTNGENLDVVMKAMLGCISLRGWDASVEKCISQ----RVVILDEKRRKGVLNMAAAVYALATIVRHPNVLSILQAEGVQAITAKGFYEWWKTGGGYARKHLYNDHFLNQACRLLEYMEEEAANEA----------VAMSVFGLEELPAGVKGKQDVKRNSKKTTQATANHMLQALNVGNVTIEPGYVQTFNTNVALPT-FIYYTRRDAGTPKVSPKAGNMNAALFALDYPDMPPLIGNSTIVVVNDCRHQLQPEFLQRTIPYFFELDADGQRYRWAKVAFVQTPQRFQTNQQADDPLGNHAAVQYDVINHGKDGIGAVSSSGQGSLWRVAALKGVDADGQQYADTQQRSLIGHRLGFRSEMLIEDTHTSIEMFRAGWGSRYVNEPGEHLSMCTHQPNSIAWRIKQVLRWHQGAVQLLFFKGIGFTCWGGKFPTIFHRIYAFDQATYYLQAIPGYMLLLMPIIYGVTGEPPFNTKVGEFFLFFTPFIVTAMLPTVISGSWRGVDSHRLTRDEQVWLSTTYVQIYAFLSMCWQQIRCKGTDDAWAVRAPTWPLFVVFAGEFAAIVGAMIWVSKEGFEKWAANLISIIVSASLAIHALWPMVSLQMGWQVPSLYYLKLMAWLIIGIFIVVI 800
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: A0A835YVP7_9STRA (Cellulose synthase, family GT2 n=1 Tax=Tribonema minus TaxID=303371 RepID=A0A835YVP7_9STRA) HSP 1 Score: 681 bits (1757), Expect = 6.460e-228 Identity = 365/737 (49.53%), Positives = 473/737 (64.18%), Query Frame = 1
Query: 211 NPQTIETFSSSFLIRGIVLLNIGLGIAYLVWRFTQTQGVEEHLMWXXX---------TFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMD--DLVTGDDGGDLYRLDRNQTYTNMGLDD----DISKSKGGDEVEELTHGVSLLADESVHITPGFFQVFRGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVVNDARHQLQPDFLQRTTPYFFDVDEITG-----QYKWAKVAFVQTPQRFRQEL--PNDPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALRGRSPDGKTGVDAKDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVFLAQFAAVAGAVYWTLHDGFEHYYKNTLSICAGAFLGMFYLWPMIALQMGVTKPSFWFFKLGAYVILGVAMV 2355
+PQ SS LIR ++L N+ + YL WR T T + TF+ VEI L IWIGH+QRLFAV NA V ILLPTAGERLD+V+ LLG SQR W A G+ + +VIVLDEKRRK V + + V+ALAT +PS +ILQAE ++ YEW + G + H++ D L+RA ++ M L G++G L+ L T M D D K VE L +++ + I P F Q F ++YY+R+D GTP++SPKAGNMN+A+FP+D P L+G STI+ VND RHQLQP+FLQRT PYFF + + +Y W +VAFVQTPQRF +++ +DPLGN+AA QYD+IN GKDGIG VSSSG GSLWRVEAL+G + DG D + +L+G E+GFR+E+LIEDTHTSI++FR GW S YVNEPGE L+ CTHQP ++ WRIKQVLRWHQGAVQLL+ KG +TS GG FPTI+HR+YAFDQATYYLQAIPGY+LLLMP++YGITG+SPFNTE+ +F +F P+IV+A+LPT+IS WR +D+++L RDEQ WLSTTYVQ+YAFL + + L E+AW K PTWPL+ VF +F A+ GA++W GF+ + +N LS+ A A L +F LWPM+A+QMG PS + K+ + LGV +V
Sbjct: 10 DPQRTTPMSSHLLIRCVILANLAFSVLYLWWRVTSTITTINSYFFGIKALPVQAWGWTFYAVEICLTIGIWIGHSQRLFAVS----------------SYNARVCILLPTAGERLDIVMLALLGCISQRMW---ACGRKSKSAMFKVIVLDEKRRKAVLQMCAAVYALATLARSPSIVQILQAENAASIDAKGLYEWWRDGGGHARRHLYNDSHLNRACQLLEFMLRLSLCEGNEGN-LFNL------TEMPEDQLQSPDQKKRSARHLVEGLLQALNI-PQGAAKIPPAFMQSFSTNGALPT-LVYYTRRDPGTPRVSPKAGNMNSALFPIDYPDDESLVGDSTIIAVNDCRHQLQPNFLQRTVPYFFKLQQSANDGSGLEYTWDRVAFVQTPQRFPKDMNAEDDPLGNNAAVQYDIINHGKDGIGAVSSSGHGSLWRVEALKGLAADGTRYADPTNRALIGSEVGFRSEMLIEDTHTSIDMFRHGWTSRYVNEPGEHLSTCTHQPDSIAWRIKQVLRWHQGAVQLLFYKGITFTSFGGKFPTIWHRVYAFDQATYYLQAIPGYILLLMPIIYGITGESPFNTEVAEFFLFFTPYIVSAMLPTLISGSWRGVDANKLQRDEQVWLSTTYVQVYAFLSMLATALRCQKHENAWAVKAPTWPLFAVFFGEFCAIGGALFWVARYGFDRWSQNLLSVLASAALAVFALWPMVAMQMGWRIPSAYHLKVLVWATLGVLVV 718
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: A0A835ZCE1_9STRA (Cellulose synthase, family GT2 n=1 Tax=Tribonema minus TaxID=303371 RepID=A0A835ZCE1_9STRA) HSP 1 Score: 679 bits (1753), Expect = 1.780e-224 Identity = 368/756 (48.68%), Positives = 472/756 (62.43%), Query Frame = 1
Query: 211 NPQTIETFSSSFLIRGIVLLNIGLGIAYLVWRFTQTQGVEEHLMWXXX---------TFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMDDLV-TGDDGGDLYRLDRNQTYTNMGLDDDISKSKG--GDEVEELTHGVSLLADESV-----------------------HITPGFFQVFRGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVVNDARHQLQPDFLQRTTPYFFDVDEITGQYKWAKVAFVQTPQRFR--QELPNDPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALRGRSPDGKTGVDAKDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVFLAQFAAVAGAVYWTLHDGFEHYYKNTLSICAGAFLGMFYLWPMIALQMGVTKPSFWFFKLGAYVILGVAMVILGQ 2367
NPQ S+ L+R I+ N+G YL WR T T + + F+ VEI L I IGHTQR+F ++R V M+ +V D VG NA V+ILLP+AGERLD+V+ LLGA SQ +W G LR+I+LDEKRRK V +T+ V+AL T I P ILQAEG+ FY+W G+ + H++ D L++A ++ M+ G+D ++ L ++M +D G +VEE S+L ++ + G+ F ++YY+RKDAGTP++SPKAGN+NAA+F VD P PLIG +TIVVVND RHQL P FLQRT PYFF++D Y WA+VAFVQTPQRF+ Q +DPLGNHAA QYDVIN GKDGIG VSSSG GSLWRVEALRG DG+ D + +G LGFR+++LIEDTHTSI++FR GW S YVNEPGE L+ CTHQP ++ WRIKQVLRWHQGAVQLL+ KG YTS GG FPT++HRIYAFDQATYYLQAIPGY+LLLMP++YG+TG SPF T + +F YF P+IVT +LPTVIS W +D+++L RDEQ WLSTTYVQIYAFL + W+ L E+AW K PTWPL+ VF Q AA+ G ++W GF + +N +S+ A A L M LWPM++LQMG PS + K+ + +LG +V++
Sbjct: 209 NPQAFFALGSTPLLRLIMAANVGFSALYLYWRATSTITTIDTYFYNIKYFPVQIWAWVFYGVEICLTIGILIGHTQRMFPIKRAIVAMEDLVREDDCVGYNARVAILLPSAGERLDIVMLALLGAMSQSTW---RGGNRTTSQMLRIIILDEKRRKGVLNMTAAVYALGTLIRNPEVVTILQAEGIDAENVKGFYDWWKFGGGYARKHLYNDPWLNKACLLLEYMEKHAGNGEDADSIFLL------SDMPIDAAAGSKAALHGKDVEEFAR--SMLQALNIPTPSPDAGXXXXXXXXXXXXXXXXVDAGYVHQFSSNP-DLPTLLYYTRKDAGTPRVSPKAGNLNAAIFAVDYPEDDPLIGDATIVVVNDCRHQLNPTFLQRTVPYFFELDAEGQHYGWARVAFVQTPQRFKPDQMTLDDPLGNHAAVQYDVINRGKDGIGAVSSSGHGSLWRVEALRGADVDGRRYADPTVVDNIGKTLGFRSQMLIEDTHTSIDMFRHGWTSRYVNEPGEHLSICTHQPNSIAWRIKQVLRWHQGAVQLLFYKGISYTSFGGRFPTLWHRIYAFDQATYYLQAIPGYILLLMPIIYGLTGNSPFETRVADFFLYFTPYIVTGMLPTVISGSWGDVDANKLQRDEQVWLSTTYVQIYAFLSMLWTSLRCQKHENAWAIKAPTWPLFTVFFGQVAALGGGLFWVGKYGFRAWAQNLISVFASALLCMHALWPMVSLQMGWKIPSMYIIKILVWALLGGFIVLINH 952
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: A0A835YHN4_9STRA (Cellulose synthase, family GT2 n=1 Tax=Tribonema minus TaxID=303371 RepID=A0A835YHN4_9STRA) HSP 1 Score: 630 bits (1624), Expect = 1.040e-206 Identity = 356/795 (44.78%), Positives = 465/795 (58.49%), Query Frame = 1
Query: 214 PQTIETFSSSFLIRGIVLLNIGLGIAYLVWR-----FTQTQGVEEHLMWXXXT------------FFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMDDLVTGDDGGD-LYRLDRNQT---YTNMGLDDDISKSKGGDEVEELTHGVSLLADESVHITPGFFQVFRGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVVNDARHQLQPDFLQRTTPYFFDVDEITGQYKWAKVAFVQTPQRFR-QELPNDPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALRGRSPDGKTGVDAKDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCG--GNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVFLAQFAAVAGAVYWTLHDGFEH-------------------------------------------------------------YYKNTLSICAGAFLGMFYLWPMIALQMGVTKPSFWFFKLGAYVILG 2343
PQ +LI VLLN+ YL WR T Q W FF E L+ I IGH+QRLFAVQR V MD + +D + NA VS+ LPTAGE+ DVVLK LLG +QR W G + + +R+IVLDEK+RK V LT+ + LA +L P ++ILQ EGV +L + + W G + + L A++ MDD+ ++GG + ++ + + N+ + D K +E++ +V I PG+ + + + +IYYSRK+ GTPK+SPKAGNMNAA+FP+D P PLIG STIVVV+D RHQLQPDFLQRT PYFF++ + + Y WAKVAFVQTPQRF Q+ +DPLGNHAA QYDVIN GKDGIG V SSG GSLWRV ALRG +G+ D +L LVGH+LGFR+E+LIEDTHTS+E+FR GWRSVY+NEP E L+ CTHQP N+ WRIKQVLRWHQGAVQLL+ KG YT+ +PT++HR+Y FDQ TYY+QAIPGY+LL+MP+VYG+TGQ+PF+ I +F F+PFIVTAVLPTVI +D RL+RDEQ WLSTTY+Q+YAF +TW A AW K PTWPL++ F +F A+ G ++W +++ F++ ++ N +S+ A A + +F LWPM+A+Q G S + KL AYVI+G
Sbjct: 43 PQGKRFLRREYLIWATVLLNLATAAYYLYWRVTGGSITDIQNGMPGDQWVPDNPGPIWVRIYAWLFFASEACLIIGIMIGHSQRLFAVQRTVVNMDDLALVDSNITYNARVSVFLPTAGEKPDVVLKALLGCMAQRGW-----GAASKLSYMRIIVLDEKKRKGVLALTAAAYKLAECMLNPELQRILQFEGVLSLNAIDVFAWWKTGGGHARQFLHDHDLLYEICAIMELMDDIAKNENGGKGTWGKPKDPSKVRHFNLEVGDKSYFEKNRATLEDIIM-------PNVTIDPGYIKTYES-SDLLPRVIYYSRKEPGTPKVSPKAGNMNAAIFPIDYPEQVPLIGDSTIVVVDDCRHQLQPDFLQRTVPYFFELHKPSNTYTWAKVAFVQTPQRFPFQKEKDDPLGNHAAMQYDVINHGKDGIGAVGSSGHGSLWRVAALRGLDANGRCYADPSNLRLVGHKLGFRSEMLIEDTHTSLEMFRAGWRSVYINEPNENLSVCTHQPDNIAWRIKQVLRWHQGAVQLLWFKGPWYTTFSPCAQYPTMWHRLYGFDQCTYYMQAIPGYMLLVMPIVYGVTGQAPFSATIFDFFVRFIPFIVTAVLPTVILGNRPGVDMDRLSRDEQVWLSTTYIQMYAFFSMTWQIFTCAKAGDAWTVKAPTWPLFVAFYGEFLAILGGLFWLIYNNFQNNRIQNNSSNAAEAFNINSNKEQIRINNLRPDLINEATKNSTQLRMTNEFIISNLIGYPQIQWFLNFISVVASAAMAIFALWPMVAMQKGWKPLSLYQSKLVAYVIVG 824
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: A0A835YIK8_9STRA (Cellulose synthase, family GT2 n=1 Tax=Tribonema minus TaxID=303371 RepID=A0A835YIK8_9STRA) HSP 1 Score: 497 bits (1280), Expect = 7.650e-158 Identity = 307/695 (44.17%), Positives = 406/695 (58.42%), Query Frame = 1
Query: 304 RFTQTQGVEEHLMWXXXTFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKD-IGLSRAVAMVRQMDDLVTGDDGGDLYRLDRNQTYTNMGLDDDISKSKGGDEVEELTHGVSLLADESVHITPGFFQVFRGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVVNDARHQLQPDFLQRTTPYFFDVD-EITGQYKWAKVAFVQTPQRFRQELPN----DPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALR-GRSPDGKTGVDAKD--LSLVGHE--LGFRAELLIEDTHTSIELFRQGWRSVYVNEP----GEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVFLAQFAA-VAGAVYWTLHDGFEHYYKNTLSICAGAFLGMFYLWPMIALQMGVTKPSFWFFKLGAYVIL 2340
R T++ ++ +W FF E+ L+ + + H R F + R RVTMD++V DP G + V+ILLPTAGE+L V+L+ L G R W ++ R D LRVI+LDEKRR +V L S V+ LA +L R IL+ E V T AFYE + G + + D + R VA+V ++D L+ DG ++RLD T + S + I+PGF + + A+ ++Y+SR DAG P++SPKAGNMN A+F D LI + ++VVNDARH LQP+FLQR PYFF D G Y WA VAFVQTPQRF ++P DPLGN AA+Q+D++N G+DG GV S GQGSLWRV ALR G PDG +D K L+G LGFR+E+LIEDTHTS++L RQGWRSVYV P GEVLA CT P ++TWR+KQVLRWHQGAVQL GF Y G++ + + R++A D TY LQA G +LL+ P+VYG T QSPFN + +YF PF++TA LPT+ + W S R+ RDEQ W +TTYVQ+ A V W KL+R +P AW A P WPLY FLA AA VA YW + GF + + AG F + LWP+++ +GVT P ++ ++ +IL
Sbjct: 14 RLTKSLDGIKYPIWGYI-FFGAEVLLIVGLLVSHVSRAFPIHRERVTMDELVDSDPQTG-DLKVAILLPTAGEKLQVMLQALFGVLQLRLWSSSR----ARCDTLRVIILDEKRRWQVQQLASLVYTLAEVVLDKGVRDILRREEVPASTARAFYELFSD--GLRRHTMNTDNLMFVRGVAIVDEIDKLLHDSDGS-VHRLDGTATTSLAPPXXXXXXXXXXXXXXVRARRASCFVQNT--ISPGFTKTWAKNARLPT-LVYHSRTDAGMPRVSPKAGNMNCAIFRKDG-KGETLIAGAAVIVVNDARHALQPEFLQRALPYFFTRDARRAGAYVWADVAFVQTPQRF-DDVPQWADPDPLGNQAATQFDIVNPGRDGASGVLSCGQGSLWRVAALRDGIRPDGSKYIDTKADREGLIGRTGGLGFRSEVLIEDTHTSLDLLRQGWRSVYVVSPASSKGEVLARCTLPPDSVTWRVKQVLRWHQGAVQLALSHGFAYVFGSGHWASPWQRVFALDAITYVLQAFAGQILLVFPIVYGFTNQSPFNALNLQFATYFFPFLITAALPTMAALGWLKTSSDRVMRDEQVWFATTYVQLQAVCNVIWCKLLRRDPADAWTATCPVWPLYAQFLAIAAAMVANTGYW-IQRGFTSPWVWVSCMGAGLF-ALHSLWPLVSFGLGVTLPPAYYNRVFGMLIL 692
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: D7FIF6_ECTSI (Cellulose synthase (UDP-forming), family GT2 n=1 Tax=Ectocarpus siliculosus TaxID=2880 RepID=D7FIF6_ECTSI) HSP 1 Score: 526 bits (1355), Expect = 6.820e-157 Identity = 309/755 (40.93%), Positives = 436/755 (57.75%), Query Frame = 1
Query: 154 AGKQGASYDPRFVQRGERI-NPQTIETFSSSFLIRGIVLLNIGLGIAYLVWRFTQTQGVEEHLMWXXXTFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMDDLVTGDDGGDLYRLDRNQTYTNMGLDDDISK---SKGGDEVEELTHGVSLLADESVHITPGFFQVFRGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVVNDARHQLQPDFLQRTTPYFFDVDEITGQYKWAKVAFVQTPQRFRQELP---NDPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALR-GRSPDG----KTGVDAKDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVFLAQFAAVAGAVYWTLHDGFEHYYKNT---LSICAGAFLGMFYLWPMIALQMGVTKPSFWFFKL-GAYVILG-VAMVILGQ 2367
+G + + F R + PQ E+ S++ IR I ++N+ AYL WR T++ +H++W F E + + +GH+ R F R +V MD + ID A+GA V ++L+PT GE+ V+LK L G R W + K+ R D LR++VLDEKRR+EV+ L S V+ L+ +L + R+IL EGV ++ FYE H G ++ D+ R + +V ++D L+ +D D+IS+ S ++E S + I PG +V+ + K ++YYSR DAG P+ISPKAGNMN A+F + P PLIG + ++V+ND RH+L P+FLQRT PYFF D+ YKWA +AF+QTPQRF DPLGN AA+Q+D++N G+DG GG S GQGS+WRV+ LR G PDG K G+ + G LGFRAE+LIEDTHTSI+LFRQGW+SVYVN P E LA CT P + WR KQVLRWHQGAVQL KG+ Y G N+ T F +++AFD +Y+LQA G +LL+ P+VYG T +PFNT + YF PFI+T VLPTV + W+ S ++ RDEQ W +T++VQIYA + W + R +P +AW K P WPLYL F A+ AV + D Y+ +S A + LWP+++ +GVT P ++ ++ G V++ V++ +LG+
Sbjct: 1298 SGSDHSENETHFYNRRRHVFEPQHTESAPSAWWIRTIGVINLLCMAAYLWWRITRSLKGVDHIIWAFI-FLSAECIMAIGMIVGHSSRSFPAHREKVYMDDLTDIDEAIGALKV-AVLIPTCGEKTAVMLKALFGNLQLRLWKS----KNARRDTLRILVLDEKRREEVHKLVSLVYTLSEVVLDKTVREILMREGVAPISAKGFYE--HFANGEHGQRMYDDVNFIRGIEVVSEIDKLIAEND-------------------DNISRFSPSVAISRIKERARRASCF--DRKEIQPGQKKVWN-RNKYIPTIVYYSRIDAGQPRISPKAGNMNRAIFSFN-PQEEPLIGEAAVIVINDVRHELYPEFLQRTVPYFFTFDKPRRCYKWANIAFIQTPQRFHDRTDWNDPDPLGNQAATQFDIVNSGRDGAGGALSCGQGSVWRVQVLRDGIRPDGTKFVKKGMPEDQVGQQGG-LGFRAEVLIEDTHTSIDLFRQGWKSVYVNFPNERLACCTLPPDTVKWRWKQVLRWHQGAVQLAMWKGWGYAVLGENWGTTFQKVFAFDAVSYFLQAFAGEILLIFPIVYGFTNSAPFNTWNIEFALYFFPFIITGVLPTVAALGWQKTPSAKVMRDEQIWFATSFVQIYAVMHAVWGTITRKDPSNAWECKCPVWPLYL----HFVAITIAVCFNTADWAARSYEEPWVWVSCIGSALFALHSLWPVVSFGLGVTMPEAFYTRVFGMLVVMTLVSLWLLGE 2016
BLAST of mRNA_A-nodosum_M_contig100.21.1 vs. uniprot
Match: D8LMC3_ECTSI (Cellulose synthase (UDP-forming), family GT2 n=2 Tax=Ectocarpus TaxID=2879 RepID=D8LMC3_ECTSI) HSP 1 Score: 480 bits (1236), Expect = 5.460e-143 Identity = 298/754 (39.52%), Positives = 422/754 (55.97%), Query Frame = 1
Query: 136 MGEKRRAGKQGASYDPRFVQR---GERINPQTIETFSSSFLIRGIVLLNIGLGIAYLVWRFTQTQGVEEHLMWXXXTFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVSIDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGDGLRVIVLDEKRRKEVYVLTSGVH-ALATQILAPSTRKILQAEGVRNLTPLAFYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMDDLVTGDDGGDLYRLD-RNQTYTNMGLDDDISKSKGGDEVEELTHGVSLLADESVHITPGFFQVFRGQAKQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIG-PSTIVVVNDARHQLQPDFLQRTTPYFFDVDEITGQ-YKWAKVAFVQTPQRFRQELPNDPLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALRGRSPDGKTGVDA-KDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQPANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQATYYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVISAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAKVPTWPLYLVF-LAQFAAVAGAVYWTLHDGFEHYYKNTLSICAGAF----LGMFYLWPMIALQMG--VTKPSFWFFKLGAYVILGVAM 2352
M ++R Q + + +R G +P I+ +SF I+ +LN+ G AY+ WR T++ + FF E+ L +W H QR F R +MD +V ID V A V I++PTAGE++ + LLGA SQR W + P S LR+ VLDEK R+EV V T A+ + A S R+ + P + F IF D + +D V G + + + R + YT S S G G A + GF ++FR + MIY +R + GTPK+SPKAGNMNAA+FP + P P+IG P+ +VVVNDARH+L+ +FLQRT PYFF +D TG+ Y+WA V FVQTPQRF DPLGNHA + V N+ KDG+GGV+S GQGSLWRV+ALRG + DGK VD+ +VGH+ GFRAE+LIEDTHTS+E F+Q WRS YV E GE LA C QP + WR+KQV RWH GAVQLL G + C PT H+++ D TYY+QA+ G+ ++LMP+++ I ++PFNT + +F P+I+TA +PT+++ W++++ +R+ DEQ WLST YVQI+AF W+++ ANP++AW P WPL L+F L +A+ VYW + F+ ++ + I +F + M+ +WPM+ L V+ PS + K Y+++ +A+
Sbjct: 797 MPQRRTGAAQATGFTMKSTKRFLAGNYPDPARIKHHPTSFWIKLSAVLNVVAGAAYIWWRATRSMPDNPKSVVWNWLFFAGEVILTFGVWTSHLQRSFPSVRDVCSMDDLVEIDSNVSNEATVCIMVPTAGEKMKNLKHVLLGAYSQRLWVSRLPTSSQ----LRIAVLDEKGRREVSVTTLVARGAMGERKSATSWRRCTVWPRPSSSPPYSRSCAPSFDEPFITPKIFMDY-----FNQMDALDQYVFGPECHPAFDMAVRMENYTGQAKLARASSSSG-------VPGKKKAAKALPRLEEGFKKLFRS-SPNIPSMIYSARANPGTPKVSPKAGNMNAAIFP-NSPGEEPVIGDPARVVVVNDARHRLKTEFLQRTVPYFFKLDRRTGKKYEWADVGFVQTPQRFEDLGDGDPLGNHAVLTFFVSNVSKDGVGGVTSCGQGSLWRVDALRGMAADGKQVVDSVAKPDIVGHDCGFRAEVLIEDTHTSLEFFKQQWRSAYVCEAGETLAVCVEQPNTVAWRVKQVFRWHIGAVQLLLKDGVGFL-CTSRMPTPLHKMFGLDSLTYYIQAVGGFFIILMPIMFSIFQETPFNTVDLEFVYFFFPYIITATIPTILAVGWKNVNPNRVLTDEQFWLSTCYVQIWAFALGVWNRITCANPDNAWNLVCPVWPLGLLFGLLIVSAINTTVYWAFYLSFD---EDGIWIFLASFGACIVVMYSIWPMVKLWWPGWVSLPSAYHQKF-CYILIFIAL 1527
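The Identity and Positives percentages in each match header are the reported residue counts divided by the alignment length. The short sketch below recomputes them for the first two hits; the function name `percent` is illustrative and not part of any BLAST tool.

```python
def percent(count: int, alignment_length: int) -> float:
    """Return count / alignment_length as a percentage, rounded to 2 decimals."""
    return round(100 * count / alignment_length, 2)


# (identical, positive, alignment length) copied from the match headers above.
hits = {
    "D7FSC4_ECTSI": (682, 742, 799),
    "D7FSC3_ECTSI": (443, 535, 788),
}

for name, (identical, positive, length) in hits.items():
    print(name, percent(identical, length), percent(positive, length))
```

For D7FSC4_ECTSI this gives 85.36 and 92.87, matching the header.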
Properties
Property Name | Value |
Stop | 1 |
Start | 1 |
Seed ortholog | 2880.D7FSC4 |
PFAMs | Cellulose_synt,zf-RING_4,zf-UDP |
Model size | 3641 |
Max annot lvl | 2759|Eukaryota |
KEGG rclass | RC00005 |
KEGG ko | ko:K00694,ko:K10999,ko:K20924 |
KEGG TC | 4.D.3.1.2,4.D.3.1.4,4.D.3.1.5,4.D.3.1.6,4.D.3.1.7,4.D.3.1.9 |
KEGG Reaction | R02889 |
KEGG Pathway | ko00500,ko01100,ko02026,map00500,map01100,map02026 |
Hectar predicted targeting category | other localisation |
GOs | GO:0000271,GO:0003674,GO:0003824,GO:0005575,GO:0005622,GO:0005623,GO:0005737,GO:0005739,GO:0005829,GO:0005938,GO:0005975,GO:0005976,GO:0006073,GO:0008150,GO:0008152,GO:0008194,GO:0009058,GO:0009059,GO:0009250,GO:0009653,GO:0009987,GO:0010927,GO:0016020,GO:0016043,GO:0016051,GO:0016740,GO:0016757,GO:0016758,GO:0016759,GO:0016760,GO:0022607,GO:0030154,GO:0030243,GO:0030244,GO:0030435,GO:0030587,GO:0031150,GO:0031154,GO:0032502,GO:0032989,GO:0033692,GO:0034637,GO:0034645,GO:0035251,GO:0042244,GO:0042546,GO:0043170,GO:0043226,GO:0043227,GO:0043229,GO:0043231,GO:0043934,GO:0044042,GO:0044085,GO:0044237,GO:0044238,GO:0044249,GO:0044260,GO:0044262,GO:0044264,GO:0044424,GO:0044444,GO:0044448,GO:0044464,GO:0045177,GO:0045179,GO:0045229,GO:0046527,GO:0048646,GO:0048856,GO:0048869,GO:0051273,GO:0051274,GO:0051703,GO:0051704,GO:0070590,GO:0070726,GO:0071554,GO:0071555,GO:0071704,GO:0071840,GO:0071944,GO:0090702,GO:0099120,GO:0099568,GO:0099738,GO:1901576 |
Exons | 15 |
Evalue | 0.0 |
EggNOG OGs | COG1215@1|root,2QQSH@2759|Eukaryota |
Ec32 ortholog description | Cellulose synthase (UDP-forming), family GT2 |
Ec32 ortholog | Ec-20_004990.1 |
EC | 2.4.1.12 |
Description | Cellulose synthase |
Cds size | 2379 |
COG category | M |
CAZy | GT2 |
BRITE | ko00000,ko00001,ko01000,ko01003,ko02000 |
Relationships
The following UTR feature(s) are a part of this mRNA:
Feature Name | Unique Name | Species | Type | Position |
1680866357.1493304-UTR-A-nodosum_M_contig100:354534..354546 | 1680866357.1493304-UTR-A-nodosum_M_contig100:354534..354546 | Ascophyllum nodosum dioecious | UTR | A-nodosum_M_contig100 354535..354546 + |
1680866357.3222268-UTR-A-nodosum_M_contig100:376116..377366 | 1680866357.3222268-UTR-A-nodosum_M_contig100:376116..377366 | Ascophyllum nodosum dioecious | UTR | A-nodosum_M_contig100 376117..377366 + |
The following CDS feature(s) are a part of this mRNA:
Feature Name | Unique Name | Species | Type | Position |
1680866357.1666534-CDS-A-nodosum_M_contig100:354546..354633 | 1680866357.1666534-CDS-A-nodosum_M_contig100:354546..354633 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 354547..354633 + |
1680866357.1768873-CDS-A-nodosum_M_contig100:355151..355344 | 1680866357.1768873-CDS-A-nodosum_M_contig100:355151..355344 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 355152..355344 + |
1680866357.1852686-CDS-A-nodosum_M_contig100:359770..359940 | 1680866357.1852686-CDS-A-nodosum_M_contig100:359770..359940 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 359771..359940 + |
1680866357.1975322-CDS-A-nodosum_M_contig100:362325..362514 | 1680866357.1975322-CDS-A-nodosum_M_contig100:362325..362514 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 362326..362514 + |
1680866357.2118824-CDS-A-nodosum_M_contig100:371599..371772 | 1680866357.2118824-CDS-A-nodosum_M_contig100:371599..371772 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 371600..371772 + |
1680866357.2206311-CDS-A-nodosum_M_contig100:372324..372640 | 1680866357.2206311-CDS-A-nodosum_M_contig100:372324..372640 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 372325..372640 + |
1680866357.230741-CDS-A-nodosum_M_contig100:372805..372976 | 1680866357.230741-CDS-A-nodosum_M_contig100:372805..372976 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 372806..372976 + |
1680866357.2398806-CDS-A-nodosum_M_contig100:373174..373252 | 1680866357.2398806-CDS-A-nodosum_M_contig100:373174..373252 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 373175..373252 + |
1680866357.2546642-CDS-A-nodosum_M_contig100:373434..373647 | 1680866357.2546642-CDS-A-nodosum_M_contig100:373434..373647 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 373435..373647 + |
1680866357.2660341-CDS-A-nodosum_M_contig100:373925..374060 | 1680866357.2660341-CDS-A-nodosum_M_contig100:373925..374060 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 373926..374060 + |
1680866357.2747073-CDS-A-nodosum_M_contig100:374260..374347 | 1680866357.2747073-CDS-A-nodosum_M_contig100:374260..374347 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 374261..374347 + |
1680866357.2833815-CDS-A-nodosum_M_contig100:374502..374596 | 1680866357.2833815-CDS-A-nodosum_M_contig100:374502..374596 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 374503..374596 + |
1680866357.2940605-CDS-A-nodosum_M_contig100:374939..375079 | 1680866357.2940605-CDS-A-nodosum_M_contig100:374939..375079 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 374940..375079 + |
1680866357.303217-CDS-A-nodosum_M_contig100:375348..375466 | 1680866357.303217-CDS-A-nodosum_M_contig100:375348..375466 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 375349..375466 + |
1680866357.311597-CDS-A-nodosum_M_contig100:375901..376116 | 1680866357.311597-CDS-A-nodosum_M_contig100:375901..376116 | Ascophyllum nodosum dioecious | CDS | A-nodosum_M_contig100 375902..376116 + |
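The Position columns of the UTR and CDS tables can be cross-checked against the "Cds size" and "Model size" properties. The sketch below assumes the Position values are 1-based and inclusive (so a span's length is end - start + 1); the variable and function names are illustrative.

```python
# Position column values (assumed 1-based, inclusive), copied from the tables above.
UTRS = [(354535, 354546), (376117, 377366)]
CDS = [
    (354547, 354633), (355152, 355344), (359771, 359940),
    (362326, 362514), (371600, 371772), (372325, 372640),
    (372806, 372976), (373175, 373252), (373435, 373647),
    (373926, 374060), (374261, 374347), (374503, 374596),
    (374940, 375079), (375349, 375466), (375902, 376116),
]


def total_length(intervals):
    """Sum the lengths of 1-based inclusive (start, end) intervals."""
    return sum(end - start + 1 for start, end in intervals)


cds_total = total_length(CDS)
utr_total = total_length(UTRS)

print(cds_total)              # 2379, the "Cds size" property
print(cds_total + utr_total)  # 3641, the "Model size" property
print(cds_total // 3)         # 793 codons, including the stop codon
```

Under that assumption the 15 CDS segments sum to the reported Cds size, and adding the two UTRs gives the reported Model size.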
Sequences
The following sequences are available for this feature:
protein sequence of mRNA_A-nodosum_M_contig100.21.1
>prot_A-nodosum_M_contig100.21.1 ID=prot_A-nodosum_M_contig100.21.1|Name=mRNA_A-nodosum_M_contig100.21.1|organism=Ascophyllum nodosum dioecious|type=polypeptide|length=793bp
MEDGRGGIPSGHRRMKSSGGFSQRLGKMQEALGMAHHSVPGMGEKRRAGK
QGASYDPRFVQRGERINPQTIETFSSSFLIRGIVLLNIGLGIAYLVWRFT
QTQGVEEHLMWWWWTFFLVEIFLVTAIWIGHTQRLFAVQRIRVTMDQIVS
IDPAVGANAVVSILLPTAGERLDVVLKCLLGASSQRSWPTTAPGKSGRGD
GLRVIVLDEKRRKEVYVLTSGVHALATQILAPSTRKILQAEGVRNLTPLA
FYEWCHEKRGFGKVHIFKDIGLSRAVAMVRQMDDLVTGDDGGDLYRLDRN
QTYTNMGLDDDISKSKGGDEVEELTHGVSLLADESVHITPGFFQVFRGQA
KQAACMIYYSRKDAGTPKISPKAGNMNAAMFPVDDPTSPPLIGPSTIVVV
NDARHQLQPDFLQRTTPYFFDVDEITGQYKWAKVAFVQTPQRFRQELPND
PLGNHAASQYDVINIGKDGIGGVSSSGQGSLWRVEALRGRSPDGKTGVDA
KDLSLVGHELGFRAELLIEDTHTSIELFRQGWRSVYVNEPGEVLAWCTHQ
PANLTWRIKQVLRWHQGAVQLLYLKGFRYTSCGGNFPTIFHRIYAFDQAT
YYLQAIPGYVLLLMPLVYGITGQSPFNTEITPYFSYFVPFIVTAVLPTVI
SAQWRSIDSHRLTRDEQTWLSTTYVQIYAFLQVTWSKLIRANPEHAWVAK
VPTWPLYLVFLAQFAAVAGAVYWTLHDGFEHYYKNTLSICAGAFLGMFYL
WPMIALQMGVTKPSFWFFKLGAYVILGVAMVILGQFFDLQIG*
mRNA from alignment at A-nodosum_M_contig100:354535..377366+
>mRNA_A-nodosum_M_contig100.21.1 ID=mRNA_A-nodosum_M_contig100.21.1|Name=mRNA_A-nodosum_M_contig100.21.1|organism=Ascophyllum nodosum dioecious|type=mRNA|length=22832bp|location=Sequence derived from alignment at A-nodosum_M_contig100:354535..377366+ (Ascophyllum nodosum dioecious)
TCTATTCCTCAGATGGAGGACGGACGTGGTGGAATTCCGTCCGGCCACCG
GCGGATGAAGTCTTCGGGCGGCTTCTCACAAAGGCTGGGCAAAATGCAGG
TGACAGCACCATCGGTTTTCCATCCTTCCGATGTTACCGTACCGCAACAC
CTAAATAGCGATATAACAACTTTTGACTGCAGGGCTACGGAGCGAGGTGC
ACTATGCGTCGGAGTCGAGAACGCTTTTTCCCGATGTAAGGCCGAGGACT
AAAGCTCTTGCGGTAGCTACTCTTCCGAGACTTCAGCGTATGAGCCCGAA
CCCTATACCTGAGAACGATGTAGACCTCCCTCCCTCCGGTTGACTAATTT
TCTGTCAATTCTACAGGTTGACTAACGACAGATTACTGAATGACTAGAAA
CTGTAGCAGGAGTTTAAAGAAAAATCAGAATGCACCTAGACCTTCTGAGC
ATCCCCCAGTCAGGGTTGTTGTTGTTGTTGTTGTTTTCTGCAGTACAGCA
ATTACTGCTGTACAGAAAGGAGAGTTTTTCCCCGTCATCCTTCCCAAGTA
CACATTTTGTGTGGCGTGATGAGCTCTTTTTTTGGGATGATTACTCTTGG
CGGCTGCCCCGTTGCAGGAAGCCTTGGGGATGGCTCATCATTCGGTCCCA
GGGATGGGGGAGAAACGGCGAGCAGGAAAGCAGGGAGCTTCATACGATCC
GCGCTTCGTCCAGAGGGGCGAGAGAATCAACCCTCAAACCATCGAGACTT
TTTCCTCGAGCTTCCTTATCCGCGGGATCGTACTTCTCAACATCGGTTTG
GGAATAGCCTGTGAGTAAGGCCGTCCGTGAGGCACTCATTGGCGACACGC
CAAGGAAGAGAGAAAACACATTTTTGCTCAAAACAGAGAAAAACACACTT
CTAGTAGTCTAGTCTTCGAGAACAAACAAGTTATGTTTATATAGTACCAC
CTGTCGTATTGATAATTTTCGTATCCTTATCTCGCGGCAGGAGTCTGGAC
TTGAGTACTTTTAGTTGGCTGCTGTTTATATTGTACTACCGTACAGTAAC
AGCGTCGGTATACTGGTACTAGATTAATACACGTATATAGAACTCCCAGA
AAACAAGCTCTTTACTGGGATTTTTGTTTTTCAAGTCTTGTACAACAGGC
AGTACTTGACAAGTCGTCTACCGTATAAAAGAGCTGAGCGAACCGTACTT
TTTTTATGGTTGCTCAGGTCAAAACCCCCGCGAACAGTTTGATCCCGATA
AAAAACATTTCAGTTGCTCAGGGAGTTACAGAAAAAGTCTACTACTACTA
ATCTGGGCATCTAGGGCTATCTATCTATCGACTCTACTCCTTTCCCGAGC
CTTCTCTGTGGTCTCCGGTCTAGAATGGAAAACCTCGAGCTCCAGACAAA
GTATTTTTCATCGCGTCTGTTGAAAATTTCGAGCTCTCTAATATTGGACA
ACTCTGTTCAGCGCTCGCCAAAGCTTGGCAAGAGAGCTTTTCGGCCTTTG
CATCGACTTGGCAGTAGGTGTGGACATGTGCTGGTCGTTCTCTCCCCCAA
CCAAATAATGACTTTTTCGATTTTTTTCTGTTCTGTCGTTGGTTGAACGT
TTACCCACAGTGATTGGTATTATCCGAGTACGATCCGGTACGGTAACGTT
GTTTTCTTTTTCTAACTCTGTATACTGCAGGTATACTGTACGATACGTAC
GCCTGCATTGTTGTGGGGAGTCCTCGCTTGGGTGCTTCCTCTACGCGCAC
TGTCCTGTTTTTAAAGAAAAACTCGAACGCATCTAGACCTTCTGAGTTGT
CTCATTATACATGTTCAGTACTAACACTCTGTAAAGCTGCAGCAGTATTG
TGAACAACCCCGTTTTGGGCGCTTAGCGATCTGACAAAATCCTCACACCT
CACCTCAAACTACTTTCGTAGAAAAATACCACTGAAAAAATTAACATCGT
ATTATTATTCGTATTATTACAGTAGTCTAGTATTACAGTAGTCCAATGCT
GTACCAGGTACGATGGTAGGGCAAGGGTGCTAGGGCCAGTTTCTAAACCA
GACCTTAGCGCAATGACTTTGAATACGAGTACAGTATTTACAGCTGATAT
GCCATTTCTTTAGATACGGCGCAACGAAGGCATAACTGTTGGGCAGCGGA
GGTAAAAAGGAATGAAACGAAATATCCTGTACTGTACGTACGAGCACGAA
GGATCGCAGTAGAACAACCTTTAGACACTCTTCCACTTGAGGTGATGCTC
AGAAGGTTTAGATGCGTTCAGATTTTTCCTAAACTAGTCACGTTATAACT
CCCACTGCTCAACTCACTTCTGTTTTGTCGTTACAATCAGTACACTTACA
CGCTTTCGTGTCTACAAACCCGCGGGATTTTATCCGGCAAATTATACCGG
TTTTCAAACCTGTTGCCTAGTCGCTGTGAGCTCAGACCTCTTCCTCGCGA
GCTATTATAGTATTAAGCTATTAATATTAATACCAGAACCAGTATGCTGT
GTTCGAAGTTGCCCGACCATGCCAATTTTTTTTTGAAAGCATGTTACCTG
TTACCAGATACTTTAGGAACGCCCAGAAATTGTTTGTCGTCGGGTGCAGT
TCGAACGAGACCAGCGTACAGTAGGAGCGTAGAAGCGTTCAGGTTTCACG
ACAGAAAATAAACCTGCCTAAAAAAACAGAGAGCCCTAACAATTGATTCG
GTGATTTAGACGGCGTTTTTAACCGGTGATATTGGTAGCCGGTCATAACG
TTTCGAGCCTCCAAAAATGTTGGTCATGTCGTGTCAGCGTGGGCAGTACT
ATAGTATCAAGAAGTATAGTATTATTACTATTATATTCCTCCCAAGCACT
GCGATGGCTCGTTTGGTCTTGTCTGCGGTTCCTACTCCAGATGTCTGATC
CCACCAACGAGCTGACCCATGGGGCGGAACGGCCGAGGATGCTCTCATTA
CGAATTAAAAACTTGAAAATAACCTCAACTCTTTATTCCTCGAAGTCTTC
TGAATCTTTGCAGTGCCGTACTAAAGAGATGGGAAGCGACAGTGAAAATG
ATCTTCAAAATCGTCACATCATGTTGGTTATGTTATTTATGTACTGTAGG
CGCCCGAATGGATTTGTTGTTTTTTGTTGTTGTTTCTCACATTCAGCGCA
TTGGTTGCCAACCCGAAAAAAACTACTTTACACGGTGGCCAATCCCGCTC
GTGGTCTGCTGAACAGAATTTTTTTTTAAAAAAGTCTGGCAGCGCCCCCC
CCCCCCCCGCCGCGCGCGACGCATCCACATGTCATGTCTAGGCGCTACGC
AGGTCGTCGCTACGCAGGTCGTCTCGATCCGTCTCGCGTCCGTACAGGGA
TTCCTACGACTCGTCGTCTAGGCTAATGGGTGTCGCTACGCAAAATGCTA
AGCTTCCGTGTGCGATTGCAACTTTTCCAGTCTCGTTGCCTCTCTCTTCT
CCTGGCGATGTCTGGCCGTGTCTTCCTCTTCTTTTCTCCACGCGGCCATG
AACCTCCGCCCACCCTCCGTGACCGCCTCAACCCAAACCTCAGCGTTTGA
CACCATCGTTTTCCAATCCCCTGTTATGCGTATGATAGTGTATTCTATTA
ATAGTGGTTAACCTCGCAGTGTCTTTCACTTGTGTGGATTCGTTACCTTT
AGTTCGATTCTCGACAGGAAAACGTGTTTTTTATTGATTGTTGATTGTGG
CTAATAAAATAGTCCGTAACGTAACGGAAAGCGTGATAACTTGAGCAGCC
AAAATTCATCCATCGGTCGATGAGATGTAAGAATGCATCTCGATAAAAAA
GGCGTAGGCGAGAACCGGAGATAGATCAAAAGATCCTTAAGTGACCACAG
GAATTAACAGGGTAGATACTGAAGCAACGTTGCAAGAGGGATATACCGGT
AGATATTAATTGTTACGTAACACGGTGAGGCCAGCCCATCGGCCGGGCCA
TGAGTGAGATACAAACTAAACAAAACTCGCTGCACTTGAACATAACTTTT
GCTTGAAATCGAATCGCCCTTCTCTAAAGTTTGTGAACGTTTGCAACAAC
AACCAAAATAGAACAGTCTTGTCAAGACTATGTTCCCACCTAAACGTTTT
ATTAATAGTAATATCACCCCCCTCGTCGCCCGAGTGGTCTATTAATAGTA
GGTAACCTGTACACCTGCTAATGCGGAAGTCGTGAGTTCGGTTTTTGTCA
TCATAATTTGTCTTCTTTCTTCCGTAGCGGTAGCGGACAGCGTGAGAATG
TGAGTAAGTAAAATTCAACCATTGGTTGACGAGTTGTAAGGGTCCTGAAC
TTGAATCGCGTTAAAAAAAACTAAGGCGTGAACCGGGGAGAGATGGGATC
ATTGGTCAGTATGTGACCTCAGGCAGGATGGAATGAGTTCCATCGATAAT
AATACTACAAGTGAGATGAAAAAAATATAAATAAAATATATATAATGTTG
TACATCCATCCTACGGTACTATCGACTCTGACTCACCAATACTTGAACCC
GTACGCAATACAGTATACGATATACAGCATAACGTAACTGAGTTTTATAC
TTGTGTACGACCAAGACCTCCGTAGCGCCTGTAGATGCGTCGCGTGATTT
TATTTATTTTCTCCGAACGAGCAGTGTTTGGGGTGGGGGTGGTATGCTGC
CAGACTTTTTTTTCTCTTTTCGCTGTTCAGCAGACCCCGAGCGGGATTGG
CCACCGTGTAAAGTAGTTTTTTCGGGTTTGCAACCAATACGCTGGATGTG
AGAAACAACAACCAACAACAACAAATGTGTAATGTGTAAAGTATAACGGG
GGATTCTCCCCGATGTCATACTGTTGACCATATGTTGCTACCACCGGGGA
ACCCAATTAGATGTCATGAACAGATTTTGTCCTTACAGTCTAAATTCCAA
CCTAAACCTAAACGTTTTTAAAAGTTTCCCCTCTGATAGGGTGCAGGCTT
AGATCCGTTCCGATTTTTCTTTAAATAGAAAAAAAAAACGTGACTTTGGA
TTAGTTCGAGGAACCAACCTTTAAACGACCCGTTGTTTCACAAATCGAAT
CGCCCTTTTTATCTTGATAACGAAAATGTTTTCATTTTACAACGCCATGA
TTTGTCGAATGACATTTCTGGAAACTACTCGGCGACCCCCTCAAATGTAG
CTTTTTGCGTCTCACATTTCGGCATTTGCTATGATTTTACTCTCATATGG
TCCTCTATGTCTCTCGTTTTTGTCATGAAAACTCAGATTTGGTTTGGCGC
TTTACCCAAACCCAAGGCGTTGAGGAACATCTTATGTGGTGGTGGTGGAC
GTTCTTCCTCGTGGAAATTTTCCTCGTAACCGCAATCTGGATCGGGCACA
CGCAGCGGCTGTTCGCCGTCCAGCGCATCAGGGTGACCATGGACCAGATC
GTTTCGGTAAGTAGAGAAACCTTAACTACTGTACAAGACAGTGCGTAAAA
ATCACTAATCACCAATTATCCTCCTTTCTTAAGAGAGCTCAACTCTACTT
TTGTATATTTTGATAATTTACGTTCACTATTCTTTCTGTTTTCGAGTGAA
ACTGGACCTCTTTCCCGAAGCCCAAGATACGATACGATACGATATGCGTT
GTGTTCCTTCCCTAACCCTGTGACTACGACTATCAAATTTAATAGATGAC
CTAATCTGACTAATCTGGGTAGGCGACAACATTCCGAGTCGATCGAGGTT
GCTCTCGATATGGAGTAAGGAGGTTTGATGAACCGTTTTGATATTGGTCG
GTGCCATTGCTTGTAGAGCCGCATAACTTTTTGTCGCGATTTGGATCCCT
GTTGGTCTGAGTAGCGGGTTGTCACCATGATGTGACCCGTTGACCTTCGT
ACCTCTTTGTGACCACAAGCGGGATTGGCTGATAGGTCTATGGCTGACTT
GTTGTGTCACCGTAGTTTCTTCAGTTGCAAGGACAATACGCTGAATGCGT
GAATCAACAACAACACATCAAGAGGGGCACGACTGACTTACAGTACAGCA
GATAATACCGGTATGCCAGCTATATTTGTGTACCTGGTAGTGTTTAAAGT
GTGTTTCAACCATCATTTCTATCCTTGGCTTTCCTTTTCTTTCTACGGTA
TGTAACGTGTCACTCTACAAAATATCATATTCGTAGTGTACTGGCCTGGG
CGGTGCAAACCCGGCGGTGAATACACCCCGGCCTAGTAGGGGTACAAGGA
TAGAATCATGCGATTTTATTGTCTTGATTGGTGATCCTTGATTCTTTTTG
TTTGTTTAATTTTGCTTGGCCTCGCGTGTCGAGTGGGAGCGATGGGCTCT
ATCTATCTCCCTCCTTTGTCTGACATGTCGCCTGCGACGTCTGACTGTCT
GGAGACTTATTTTTCTGGGGGGTTTTATTTTATTTTCCAATTATCGCGTG
TGTGTTGACTTTTCTGGGTCCTTTTATTTCTTTCTGATATGTGTACCCTT
TCTGGGTAGTTAGCCTGTGACCACGGAAGGCGTCCGCGTCAGTTGGACCA
GTACGGATGTTAGAAAGTACTGAGTAGAGACCGGAACGGCTTTTAATGTG
AAGACAACAACAACAACTGGTGTCCCTCTAGAGCCTCGAGAATGATGGCG
TGGGCAAGAAATACAGGGTCACACAAGAAATCATTTCGTTGCAAACGAAA
CAGTTTTGCCGAGATACCTACTGGTAAGCGTTTTGACACTTTTCCGCCTA
TTATGTGGGATGCTCAGAAGGTCCAGATGCGTCCAGATTTTTCTTTAAAT
ATTTAGTGGTAATTGTACTCGTGTCTAAGCCATAAGCTATTATTTCGTTT
TTTCCGCTATTCAGGTGTTTGCTTATTTAATACGGTACTACAGAACAGAA
CAAAGGTCTAGGTGCATTCAGATTTTTCTTTAAACATCTTGTCGGGTGTC
GGGAACGAGGGCTTGAATAGTCTTAGTATGTTAATAAAAAACGTAATAAA
GATGTACCGCGAACTCCTCGCGAGCCATAATGGCTGTCGGCGAAGCAAGT
TCGAAAATTCCCGATAAACATGCCCTCCGTTCTCATAAAAAGTTTAGAAT
CTTCATTCTAAAGCCCGAAATCCAACCTATCAAAATCTGAAATTTCCGCC
CCTAGGGCACGATGCGAGATTGATTCAAGATTGCTCCTCCCATTTTATTC
TATTTTGGAACTCCACAGGAGAAAGGTAAACACTGCTCTAGGACTAAAAA
AATACAGCAGATTTTAGTTGAATAGAGAAGAGAGAATAATACAGCAGTCC
AATAAGATAGTGCCTTCGTCGCCGTCACGGTTGTTCATTCATTCCTGTAT
TGACTCTATTGACTCTTGTGACTCGAAGGGGGTGCGTGGTTAGTGTGGAA
CGTCTTTGTGCCGACCTACCGCCAGGCGCTTGCTCAAGTGCCTCACCTCA
AGGCACACCTCACGTCCGTTTGACTGATTGCAACGACCCGATTCTTCCAT
TATTTTCTAGCTATAGCTATACTATAGGATAACCGTATGAAACTTGCGCG
TATTCCTCTCAGGCACGGTTAGGAATCCGCTGTACCGTATTAAAGGGGAA
TCAGATTGGATTTCGCGAGTTACTTGCTTGCCAAGATTCCGAATTCCACT
TTCCCGGACTCTTTTATCTTTTATAGAAGTATATATTATTACCTGTTTTA
ACGTTTTAACCCATGGTTTCTCTATCGACTCGCAGTCAGTTTTCTTATTC
AAAAATCAGAAGAAAAAACACAAGCAAACGCATGTTCTGTTTGGTGTTGA
AATTGTGAGTTCTCTTTTTATTCTGATGATTGCGTGGACAGATCGATCCG
GCCGTGGGTGCAAATGCGGTGGTGTCCATCCTGCTTCCGACGGCGGGAGA
ACGCTTGGATGTCGTCCTCAAGTGCCTGCTGGGTGCTTCCAGCCAGAGAT
CGTGGCCTACGACCGCTCCCGGGAAATCAGGCCGAGGCGATGGGCTGCGA
GTCATCGTCCTAGACGAGAAACGGCGTAAGGTAAGGTAGACGAGATCCTA
TCGTTAAATTTGAGAACACACACAACCAACCTACCTACAAATTCCGTATG
AGTCTGAAAAATGAGATAATACAGTAGGCTGATGCGGGGCAGGACGGCCG
ATCTGTTGTGCGAGACCAAATTCCCAGGCCCGAACAGTGAACATTAATTT
TCTCTGTTCAGCTTCCCCAATTGATGCCACAGTAGTCTATTACTTAATAG
TCCTTTCCTGTACCCAAAACATGAATAGCCTTACGTTCTACGAAGTAGTA
AATTGTCAGTGGCTCATTACGGTCAGCATCTATAATAGGTGCTTCGTTAC
CCAGAGCCTATTTCGCACTCTTTGGAGTTGGATATTTTTACTAAAATTTA
GTACATGTAGTCTAGTCTAGTCTAGTTGGGTTAGGGCCACTCTAATACTG
GAACGTTGTATTAGGCTCTAATCAGCTAGCTAGATCTGCCCAAAGTTTGT
CAAACCCGGAACCGTTTGTATCCGAAATTCGTTGCTAACCTAGTTGAGTA
TTGACTGCAGTAGTACCGTTTCGCGGGACCAAATTCTCAGGCACAAACCG
AGATAAGAAATATTCATTTTCCCTGTTTTCCCTGTTCAGCTGACCACGAG
CAGGATTGGCAACCTTACCCGGTTGATTCATACCCTGCTATATGTGATGG
CCATACTACAGAATGTAGTATCAATAGTCCTTTCCTGTACATTCTAACAT
GAATAAAGTAGTCTCACGTTCTTGTAAATTGTAGGTGGTTCACTGCGCCC
AGCATCTGGGTGCTTCGTTACCCAGAGCCTTTTTTGCACTTTTTGGAGTT
GGATGTTTTTACTAGTATCTAGTACATGTCGTCTAGTTGGGGCCACTCTA
GGGCTGGGAACGTTGTAGCTACATCTGTGCAAAGTTTGTCGAGCCCGGAA
CCGTTTGTATCCGAAATTTGTCGCTTACCTAGTCGATAATTGACTACAGT
GGTACCGCCTCGCCGATCCAATTCTCAGGGGTTAGTACGAACGGAGACAG
GGAAATATATTTTTTCCCTGTTTAGCTGACCACGAGCAGGATCGGCAACC
TTACCCGGTTGATCCATACTCTGCTATATGTGATGACCATACTACAGAAT
GTAGTATCAAAAGTATTTTCCTGTACACGAATAGCTTTACGTTTTTGGAA
ATTGTCTGTGGTTCACTCCGCCCAGCACCTGGGTGCTTTGTTACCCAAAG
CCTATTTCGCACTTGGAATGGAATGTTTTTACTAGTATTTAGTACATGTA
GTCTAGTTGGGACCACTCTGGTACTGGGAACGCTGTAGCTAGATCTGCTC
AAAGTTTGTCGGCCTCGGAACCTTTGTATCCGAAATTCGTCACTAATCTA
GTTGAATACTGACTACATTAGTACCGTCTCGCGGAACCCAATTCTCAGAC
ACGAACGGAGACAGACATATTAATTTTCCCTGTTTAGCTGACCACGAGCA
GGATTGGCAATCTTACCCGGTTGATCCACATCCTGCTATATGTGATGACC
ATACATACATACATGGTCTATTTTTTTTAAACGTTGAAATCGGAGTACAG
CAGCTATTCCGTAGCTCTTAAAATATAGTCGTATCAACTACCAGGTACAG
ATAGTACCATCGACAGATACAGTAGTTTGTTGAACGCTTTCACGAAATCC
CAAGGTCGAGGCAGCATTTCAATTCGTCAACTAAATACTATGTTCCCAGA
ATCGCCCATGCCACAAAGCAATAACAGACTTGTAGCTACAGGCTGTACGC
GTGTTGTGCATCTTTTGTTCGATTTAAGACATTCAACATTTCAAACGATT
GAACCAGTACTCGTATAGGACGGAGATGACTATTTTCAATTTCCCTTCTC
GAACCACTTGCACATTTACATTTTTACTGCACGAGTACGTGCAGAACAGT
ATATTTTATATTTTGATGCCCACCTTGCAAAGTACTGTATTTGTGTAGCT
CTAGCTCTATGCATTTCTTCCAAACTACTATTAATACTATTACTAGTAGC
CAGGACCCGCGCTACATTTTGGAGGATATTCTAGGTTTTCCGAATTCTCT
AGAATGTTGTGGCTTGAAAGCCGGTCGCCCGGGTAGCTCAGAGGTAACCT
CGTAACCTTCTGACACGTTGGGACGTGAGTTCGATCCCGGCAAACGGAAT
TTTTCTCACTAGAAAGATAAAAATGCCCAACGAGTGGAGAACGATTTAAT
CGTTAGTAGACAAAATGAAATTCGACTTCACGGTCGACGACGGAAAGGAA
TGGCTGAATCATTCTCGCGAGAAAAATTAAGGCACGCACCGCAGAAGGAA
GGTGGATGAAAAATCCTGTGTGACCTTCCTCCTGTGTGAAAGGGAAGAAA
ACACACACACCTAGCTGGATGAATCTAAAAATATGCCGTATACTGTAGAA
CCCAACACCAACGGTTCGAACGTGCAGCCTCGTCGTGAAATACTCTAGAC
CATACTGTATCCTCTATTATATACATGAACCATGACACAAGTACCGTATT
CACTTTCGTGAAGTTCTGGTGAAACGAAATGGATGAGGGCCGGCACTTGG
CCAATAAAACCGTGTAGGCGACCGACCCCTGCCATAATCATTGCTGTACG
TTACAGTAATAATAAGAGTATTTAGGGTTTCCGGCTCGAAAGTTTTACCG
CCAGTGTCCAACTCCAGAAGTATATCGTACTTCGGATCGCCATTGGGTAC
CTTCCCTTCACCACGGTCGACTGTAACTGTAGCTGCCAATGGCTGTAATG
CTGTATTACTTTTATACTGTTAAATAGTAACAACAGTAACATGATGTGCA
TTGTAATGTTTTCTTATGTTCCTTCCCTCCAACAGCTACAGCTACATTAT
ACCGTGTGCAGTATAGTAATTGCATCGCAGGTTCCGTCTATCTTGTGACC
ACGGCTAGATTCGTAGCGGATCAGTTAATCTGAGATAAACAACAACACTT
GGTCGTTCCAATTGTGATCCGATAGTCTACTGGTATACTCGCTCTGTGTC
GTATAAAAATTGGTTCATGGTGAAACCTGAAAAGGCAGCACCCGTTACTA
GAACTACCTTGAGGACTGATGGCCCTTTGCCGGCGGACTCTTGGCAGTGA
ATGCCATCGCTACGCAATTGCGCGGGTTGGAAACTCGGGACAGACCCGAT
GGCGGTTGACGGTATAAATAAATGGACGCCACCTGATTCAGACTATTAAT
AGTATTAGTAATAATAGTGTGTGATGGAGAATGAGGAGGCCGACGCGGGA
CGGGACGGCCGAACCCGGCTAGCGAGACCAAATTCTCAGGCGCAATAAAC
GGGTACGGGAAATATGAATTTCCCTGTTCAGCTGACCACGAGCACGATTG
ACAACCATACCCGGTTGATCCATACTCTTGCTATATGTATGACCATGTAT
ACGTAAATACTGTGAAACAAAATCAATTGGGCCGACCCAAGGACTCCCGA
AACGGCGTTGGTTGCGCAGATAGGGAAGAATGAGGACCTCGAATGCTCTG
TTGTAGATGTCGTGCCTTTGTGGGCATGTACGAGGTAGTTGCGACCCGGT
GGAAAGCATCGAATGCGCTCGCTACTGGCGCCAAACCAAACTCTTAACCC
TACTTTAACCCTAGCGGATGGCCCTATTTGTGATGTGCGCGTTCGTGTAT
CATGTATCGTGTACCTATCTATCCTATCTGCAATTTTTCTAGGGGAGGTT
TGGTTTGGCTTTATTTGTCTTGTTCTAGTTTTTTTATGCCCTTCTTGTAG
CGCTTTGTGTGATTATGTTACAGATGTTCATCCAATATTTTCTACTATTA
GTCCAACAAATCACGCACCATACTACCGTGGCTACCGTGGCTACCAGAGT
ATTGATGCTGTATTAATATTATTAGGAACTATTTATATTGCTTGGCATGG
TTGAGGCCCGATCCGTTAATGTGATCTACACACGTGTTTCGACCGTTGCT
TTTCTAATTAGAGGAGTCGCCATGTGTTTTTCCTCAATAAATTGTAGTAA
TATCTGCCTTACCGGCTGTGATCGTGATTCGACAAAGTAGGAATTAGTTT
TAACTACTCTTGTTAGTTGTGGTTTTACTCATACTTCTGTTGGCGTTCTG
TAACGGTCGATCGTGTCGATAAAACTTGCAGCTGGAGCGTTGTCTACGAC
AGTTACGCCATTTGATTCTCAAGTCTGAAATCTGGACGCATCTAGACTAT
TTGAGCATCCCCCAGTCAGGGGGGAAAATGTCAGAACGTTTAGGTGGGAT
CATAGGCTGCAAATACGCTCTTCACATGAACCGATCGGGCCTCAACCATA
ACCAGTAATAAACGTGGTGGCCAATCCGGTACTGTACGTGGTCTGCTGGA
CAGGAAAAGATCAGAGGAACATCTACAAAGCTCCAACGACAGCATGAACA
CAAAACAAAAACGACAAAAAGAAAGGAACAATTCCACAAAGCACATGCCA
GAAAAAGACAGGAGAAGGGCACTCGATAGAGAGAGTTTGGTATATTGCCA
GCCGCGAGCCCTCCGATACCCTCCCCGGATCGTAAACATACGCCCAGCAA
GGCCACCCATTATAATATGCAGTATACAGTATTATGAGTATGAATAGAAC
AATGTTGGGATTCTTGAATTGCAATGTTACTCATCTTTTATGGTTTATTT
TTGCCTTTTGTTAATTAAGGCCCATTACGTTACACCGCGCATCAGCTACG
TTGTCCGAACTTGATCACTGTATCAATTACTTGATGCGTTTTTATAGGTG
TAAACTGAAAGATCATGTTTTTTAAACTCGAGTTGTTGTTGTTTATCTCA
CATTAACTGGTACGTAACGAATCCAGCCGTGGTCACAAGATAGACAGAGC
CAAACCAAATAAACCGGGAACACGTTCGGAGAAAAAACAGGGAAAGCAGA
CAGAAGGAAAGGGACAGACAAAAATATAAATAAATGAAACAAAAACCACA
TAAAACATAGTACGCGCAGGGTAATATAACATACTCGTCAGCAGGCTACG
CAAGACAGGCAGTCCTATAAGCAGCGGAATACAACCCACCCAGGCATTGA
TAGGATGACAAGGGCAAGGACAACGAATCCGACACACGGTGAAGGTATAC
AGGCACAAGAAGTATCGCGAGGGATGCCCCAGAAGAAGACAAGACCAGCA
CCCATACCGGACTCACCTATCGTATAAGACGACAAATAAAAACAAAGTAG
AAATAAATATAAAAAATCGCACTCGTTGCTGTTTCGAAAAAATCCGAACA
CATCTAGACCTGACCAACCAAGCATCCCTCAGTCTAGGGGGGAAATGTCA
TCTCGCATTAACTGACCGCTACGAATTCAGCCGTGGTCACAAGATAGACG
GAGCCAAACCAAAACGAACCGGGAACGAACACAATCGAAAAAAAAACGGG
GAGAGCATACAGAAAAAAAGGCGGACGTGAAAGTGGCCTGAAAGTAGCCT
GCAAAGGCCATCGTTGCCAGGAATAAAAGTGACTCAGAAGAGAGAAAACA
GAAAACAAAAGGGAAATAAACCAAGGACAACCGCATGAACATAAGACGCG
TAGGGCAAGAAAAACATTCTCCCCAGCAGGCTACGCAAGAAAGGCACCTG
CTCTGCTCAGTGTTATCATACAATAATCATGGATGAATCTTCTTCATCGA
GTCTGGGGTGATCTCTCTCCTTCATCGGGCCTATAGTGTACCTGGTATAT
TGGAGATTTTGGCTATAGCTATGGCCTTGCTTTGTGAAGTGTGCTGATGC
ATTACTGTTATATTATACTATACTCGTCTGTACCTTGCCTGCCTTGCCTC
GGAGGGCCAGAACGAACTGAAGCAGGGAAGCCTCTATCGGAAGGACCGTA
CAAGTGGTGTCGGATTGGGAACCGTTGATATTACTACGTAGTGATGAGCT
ATACTCTATTAATAGTATTTACTAGTAATCCGGTCGCCCCGGTATGATAG
CTCAGAGGTAACCTCATAACCTTCTGACACGTTGGGACAACGAGTGGAGT
TCGATCCCAATAAACGGGATTTTTTAAAATGTCCAACGAGTGGAGAACGA
TTAAATCGTTAGGCCACTTCTTATGCTGTCACAAAATGTTGTCCTTTTTT
GCCAAAAAGGGACAACTGACAAAAAAACACTTCTTACGCTGTCCTAAAAG
GTCACTTTTGGATCACCAAAAAAAGGACAACAGTTTTGGACAGCCCCATC
CTGTCCTTTTTTGCCGAACTTTTGGACAGGGTTGGGCTGTCCTAAAGTGT
TATCCTTTTTTACTGCTGACTTGAAATTTCCTGGAAATTTCGCCATCACA
GATGGCACACACACCGACACAAGCATCACCGATGTTGAGGTGGGTGCCAC
CTCTGCTGCTGCTCGCCGGCAGTAGGCGTGGTACTTGCTTGACTCCACCC
ACCTAACAGCAGCCTTGACTCTCTTTCGATGGCCGCGCCAGGTGTGGGTG
TCCTGTTGCCAGCAACCGCGTTGACCGTGGCGTTGGTGGCCCTCGATGAG
GTCCAGACGACTTTTAGCTAGAGGTCGAGCCTGTGTATGAACTTCCACGC
GAGAAACTCACAAAATCAACGAAGATCCTTGAGTTGATCAAGACCATATA
TATTGATTGCGTCGAAACGACGACTATGTGCCTTGCACGTTCGTACCGCA
CAAGGTTGGGGACCTCACCGTTTGCGGGTTCAGGAGATGGGCTGACCCCA
CCGTGGTCCTTGTATATATGAGTACAACAATCGTATTATAACATTATTGG
TACATTTTCTAGCCTACGAACACAAGAAGGCTGTTGGCTGTTGTCCTAAA
TGTCTAGCGCAGTACAAGCACAGCACTGCTGTACAAGAAGTTCTACTGTT
GAAAGTACAGTAGTACTATTGCTGTGTACGAAAATAACTGTAACTGTCAG
TAGTAGTGGTAGTGGTGCTGTGTAGAAGCAGCTGTTTGTTGTATCATTAA
TACTAATATTACATTGGCTGATAGTGTGGTCCAAGTACAGTGTACATAAT
TCCTCTGTGACGTTCACCGTACAAACAGCAGTACTACAGTATTACAGTCA
GTACTTATATCATACCCGTGCAGAAGGAAGCGGCGACACGGTGGTGAAGA
GTTCTTCCGTACTTGAAAGGGTGGGAATGGTGAGCGATAGTATATGCACT
GGAAGCGACACAGCACCAACATTACATAGCTGTGGGAGTGGTTGGAGAGG
ATGGAACACATGCTGCTCCGAGCTCTGTACAGCAGAAGCAGTTCATTCGA
TTTCAACAAGTAGAAAATTATAACTGATAAATCTGAAAGATTTCAAAGGG
ACAACAAAAGCTTTTAGTACTGTATATATATACAAAAGCTTGGTTGGAGT
TATGCTGGCTTCTTCCCCAAGTATGTCTAATGTAGCTTCGGCCTATTCAC
TAAAATCCAGCGTGAGAGGTACAACGTTTTTTAGTTTCATGGAATTCATG
GTCTGCGTACTGAGCTGCGGCTGTGTGCGGAGAATTTGCAGCTGTGTCTG
TGTGTGAAAAGATTGCACCCAAGAGGAGCGGTGACAGGATTCAAAAACGT
GCCTGCTCCCGCCCGTCAAACGCCCCCGATAAATCCTGAAAACTACACTT
TTAGATTTCCCCAGGATAAGAAGTGGGTAACCCAAAACAGTTGTCCGAAA
GTGGGTCGGTTCTTTTTTGGCAAAAAAAGACAACATTTTTTGGACAGCAT
AAGAAGTGGCCTTAGTACACAAAATTCGACTTCACGGTCGACGAGGGAAA
GGAACGGCTGAATCCTTCTCGCGAGAAAAATTCAGGCACGCACCGCAGAA
GGAAGGTGGACGGAAAATCCTGTGTGACCCCGGCTGGTACGTCCAACGAA
CTTGATATCCAGAATGAATGGGAAGAAAACGCACACACACACCTTACTGT
AATGTAATACTATTTACTGTAATAGGTAGAAGGGACAGGCCGTTCTTGAA
TGGATCGTCTTTTCCTCGGTTTGTCTACCATTAGTCTACTATGAATACAG
TACACTTTAAATAGTTACACTATATCTGTATTGCATGTTGTTTTGTTAAC
AATGGCTCTGGTCCGACGAAAACAGTCACTGCGATTGGTTTGTCTCTTAT
AAAGTGTCGGTGCATCGATACCTTCTTGCATTTGCAGCCGCCGCGCTTTG
TGAAGAGACAGTACAGTCTCTTACAGAGTATACTGCTTGTTGTGCTTGTT
GTACCGTTCGCCACGTGGAAGTTTTTGTTAGAAGTTTTCTATTAATAGTG
GCTTGTTAACGATCTATGAATATTCCGACCGCTTTCGGGTGAGCAAGGCC
CGAGCCGAGATGGGCCGCGGTTCCACTCGCCGATGAAGTGTGGTGTTTTC
AAGAAAAATCTGAAGGAGTCGAAACCTTATTATTGAAAATAATTGCCAAA
AGGAATACCGGTATTAATATTTTAGTAGTGTGCTGTTGGGCGTGCATGGT
TCGGTATGGTACTATATCGGGGCAGCCTCGGTGAAATCATTCTCCCGAAC
GAATGCGATTCGAAATAGGCTCTCTATGCGTCTGAACCCTATTTCTCCTA
GTCTAATCGACTAATAGGGGTCAGTGATCAGCAATATCTACAAAAAAGAA
TCGAACAGCCTCGGCAGCGTTGTGCCAGTCGGTACTTTTTGTTATGGACC
GTATGTACCGTCTGTACCGTCATATCATAATTGCCGAAGTACCCAAGTAG
TTGCGATATTTGTTTTTCTCGGGGTAAATCCCGACATATTCTCCTTTAGA
AAGAAGCTTCAAGTATTACTGCACCTTCGCAAGAGGTTTTGTTTTCTATT
TGCGTGTCCTTGCGCGACGCGGAATGTGGAATGTTTTTGGAGCCTTATTT
TTGTTGTTGTTGGTCCATGCACCAGGTAACTTGGAACCTTTCTTCTCATT
CGACACGACACCTGCCATAAAACCTGTATTGCGCCGTTTTTATAGTATTC
TATGAATAGAATACCCAGCTGTGTATTGTGAAGGTGAAGGTGACCGAGTC
GCTCCTTCTTACACCTATGTACCGTTTTGTACCTTATTGTTCTATGGCTT
GTCCCGCTCCCGCAGGAGGTATATGTGCTCACGTCGGGAGTGCACGCGTT
AGCCACGCAGATCCTTGCGCCATCGACACGCAAAATCCTTCAAGCTGAGG
GCGTGAGGAATCTCACTCCGCTGGCCTTCTACGAGTGGTGCCATGAGAAG
CGAGGTTTCGGCAAGGTTCACATCTTCAAGGATATCGGGTAAGAAGAGAG
GATGACACGTCGGTTCTTCAACGTTTACACGTGTAAATGTATCATTGCAA
TACAGTGTGCAGACGATTCGAGCTGGTATATGAAGTGTGTAGCGAAATGC
TACTCCTCCCATGTTGTTGTCCTTCCATGTTGTCGAAATATATATATACA
GTATACCAGTATAAATTATCGCGGTCCACGATAAAGTCAAGACTCGAAAT
TTTTTTCCTTGAATCGTTTTTATCCTTGTGCGTGTCCCTCCCTGGCGACG
ATGTCCTTTACTGGCTACCTTCTTGTCCGTTTTTTTCTGTCTGCTTTCCA
TGCTCTTTTTCTCCGAACGTTTTCCCGGTTTGATTGGTTTGATCCCGTCT
ATGTTGTGGTCACGGTTTGATTGGCTGGAATCGTAGCGAATCAATTAATC
TGAGATATACAACAAACTAAACTTATGTCCTGGACTTGAAAATAGTACTA
CGGTTGAGCGAACGATATTGTACGCGCTAACCACTAACGGCTTGGCTTTG
TATCACACCTCCACATGTATTTTGATTATTAATCCGTCAGGCTCAGCCGG
GCGGTGGCTATGGTGCGCCAGATGGACGATCTAGTTACTGGAGACGACGG
AGGAGATCTTTATCGCCTCGACAGGAATCAGACGTACACGAACATGGGCC
TGGACGACGACATCTCGAAGTCTAAGGGTGGAGACGAAGTGGAAGAGCTG
ACACACGGTGTTAGCCTGCTTGCCGACGAAAGCGTGCACATCACTCCTGG
GTTCTTCCAAGTTTTCCGCGGGCAGGCCAAGCAGGCGGCGTGTATGATCT
ATTATTCTAGAAAGGATGCGGGCACCCCCAAGATCAGCCCCAAGGCCGGA
AACATGGTGCGGTTCCTGGGATAACGATGCCATATCTCCTTTCCGCTATA
TAGTTCAGTTCGCCCGTTGCTTGCGATCCCGATATGCGTTTTGACGTTTG
TAGACGAGCGAAATGCTCTCGTACTGATATCCGCTTGTGTTTTTGCCTGT
TCGCCGCCCGCCTCCTCCCAGAATGCTGCTATGTTCCCCGTAGACGACCC
CACCTCCCCTCCGCTGATCGGGCCGAGCACCATCGTGGTCGTCAATGACG
CCCGTCACCAGCTTCAGCCGGATTTCTTGCAAAGGACCACCCCTTACTTC
TTTGACGTTGACGAGATCACCGGACAGTACAAGTGGGCCAAGGTAAGACA
AGATAAGACTGGCCACAGGTTCGCTTTTTGAGGTGTGTGTGTGTGTGCGT
GTCGAACACTGTTCATATGAATACGATATATTCATGGCATTCCCCTGCGT
CTTGAGATGGCATATGAAACGGACTTTTTGATGATCTTGGTCTCCTGATT
TCTGCAGACATCGTTCCCCGTGCGTGTGCCGATCGGACAGGTGGCCTTTG
TTCAGACTCCCCAGCGTTTCCGTCAGGAGTTGCCGAACGATCCCCTGGGA
AACCACGCTGCATCGCAGGTGAGTCAACCCAACCCACCGTGCCTACCTAT
GTTGCTTACAGTTACTGTGCCCCAAACCGATACGCGATGTAGCTTTTTTT
TTCTCCTCTGTTTTCTAGTAATATGTCCGCATATGTTCACCTTTAAGCAT
AAAATTTGGTATTATACTTGATAAAAAAAAAATGTTCGTTTCAATTCCAG
TACGACGTCATCAACATCGGAAAAGATGGTATTGGAGGCGTTTCGTCGAG
TGGACAGGGCAGTCTATGGCGCGTGGAGGCGTTAAGGGGAAGATCTCCTG
ACGGGAAGACGGGGGTCGACGCGAAGGACCTGTCCTTGGTCGGGCACGAA
CTGGGTTTCCGTGCAGAGTTGCTCATCGAAGACACGCACACGTCGATCGA
GCTCTTCCGACAGGTGAGAGAAAACAAAAGTGCGCAACGCCATTGCCTGA
TTGTTGATGAAAAAATATGGCTTTGATTCAGAGCCTACGTACATACCCTT
TCTTCTGGCCCCGGGTTGATGGATGGGACACATACAAGACATGTTAATGG
ATGATACGATCGATCGATCCATACAGTTTGAAATGCATGTCTTGGCTTGC
AGCATGGTATTGTTTAATGGTCGATGGGCCATCGATATCTCAAATAGATT
TCGCCTTTATGTGTGTGCAACCACGTCTTTTTCATTGATAGGGATGGCGC
AGCGTGTACGTGAACGAGCCGGGAGAGGTTCTGGCTTGGTGCACGCATCA
GCCTGCGAACCTTACGTGGCGTATCAAGCAAGTGTTGCGCTGGCATCAAG
GCGCCGTGCAGCTGCTGTACCTCAAGGTGCCTCCTCCCTTTTCGGTGTCA
AGTATTAATATTAATAGTTAACCTTCACGTATGGCGTCTGCTGCTGCTGC
TTGATTGCAAGGCTTTGAACCGTATCGTCTTGTCGCCAAGCAATATCGTT
TCGTTTTTGTCCGTTACTCGTGTGCTTGACTTGACCATTGATCGCTGTGC
TTTTTTACTACGGTACGTTTTTTCAGGGCTTCCGTTACACGAGCTGCGGG
GGTAACTTCCCCACCATCTTTCACCGGATATACGCATTCGATCAAGCCAC
GTACTACCTCCAGGTTAGTCGAGACCACGGATGCCATACATGAACGTTTA
GATTGTGTTTTGTTAGAGGAAGGTTGCATTGTTTTGTCACCACCCTATCT
ATGTATACCGTATATTTTAACTTACATGCTAATCTATTAACGATCGATCG
TTTGGTGCCACATGACAGGCGATCCCGGGTTACGTTCTTCTACTGATGCC
CTTGGTGTACGGTATTACCGGACAATCACCTTTCAACACCGAAATCACAC
CTTACTTTTCCTGTGAGTGTTTGTTGTTGTTGTTGATGATGTTGCTGTCT
TGTCTATCTTGCAGTGAAACTTCACTTTTCATCCGGCCGAAAGATTAGCG
AGCGTTATTGCGTGGCTACACGAATCATTTCGTAATTGTACGCCGCTTCA
ACAAAACCACTTCCCTACATTGATTGCTTGCTTTGGCGTCATAGACCTCT
CCATTCTTTGCTTTTGTTGATCCTTCTCATGACACTTTCCCCCTTAACCC
CCTCCACCCCCCGAACCCCTCCCATACCCTATATGCCACTGTACTGCCGC
TGTATTGGTACAACTACCGCCTTATTTAACCATCTTGTCTACGGTGGATG
CGCAGACTTCGTTCCGTTCATCGTGACGGCCGTTCTACCCACGGTTATCT
CGGCTCAGTGGAGGTCCATCGACTCACACCGCCTTACGCGTGACGAGCAG
ACCTGGCTATCCACAACCTACGTGCAGATCTACGCCTTCCTGCAGGTACT
ATACCATGGTGTACTATAGACTAGTGTATGAGTTAAATACTGTAGATTTC
AAACGTGAAGATGCAAAAGTTACAAAAGCGAAAAGTTCCGAAACCTCTCA
TCCTTGCGTTATTTTAGTTTATCGATACGTTGACAGCTCGATGGTGTTAT
ACTGTACACGATCGAATTCCAAGCCAAGGCATCGTGTCCATGCGGTTGCC
ATGGGCTGTGATTGTGACTGTGGCCCCCGTCGCTTATCCGTTGTCGCGCA
TCCTTCGGGACAAGGTGACGTGGTCCAAGCTCATTCGGGCAAACCCAGAG
CACGCTTGGGTGGCGAAGGTTCCGACGTGGCCGCTGTACCTGGTATTCCT
GGCGCAGTTTGCAGCTGTCGCGGGCGCGGTTTGTAAGTGCTGGCCTTCGT
GAAATATTTATAGTCATTGGGTAAAGCCCCAGGACACAACGCATGAGTTT
CACGTCTTTGAAGCGAAACGGCTGGTTTTGTGATCAGCCACTACTCATCA
TGCGAAGTCGGGGGTTTTTTACAGTTACAGCATAAATTCCAGTTATTTCC
ACGGTGCATAAATTGTCTCCCATATATTTAGAGCTATATGAACACAGGTG
GTACTAGTACATCCAAACAAGACGTCCCCCCCCTACGAGTAGAGGCACAA
CTCAATATTTTTTGGGAACCCATGTTATGTACAACAGTCAAAAGTATGCG
GCTACATTTCTACAAAGCAGCAGCAATCCGCCAAACCACCCGTTTAGCTC
CAGTGCGCGTTACTGATTACCTACTTGCACCTGTGCGCCCGCTCGATCTT
TGTTTCTCTTACGGCAGACTGGACTCTGCATGACGGCTTTGAACACTACT
ACAAGAATACGCTCTCGATATGCGCTGGTGCCTTCCTGGGTATGTTTTAC
CTTTGGCCAATGATCGCTCTTCAGATGGGCGTCACCAAACCCTCTTTCTG
GTTCTTCAAGCTGGGGGCTTACGTGATTCTCGGCGTAGCGATGGTCATAC
TGGGGCAATTCTTCGACCTGCAAATTGGTTGAGCGTGCTCCTTTGCCTGC
CGGCCGTCTCTGTACCCGCCTCTCGTGAGCATATGAAGTCGTACGTATCT
CTAACGGGAGATCGTTGTTCGTTGTTAGCTCGGCCATGTGGCTTGTGCTA
TGATTGTACTTGTACGGCGGTTATGGTATGGGGCGTGCTCGCTGTCGCCG
TTTGGGTGAAGCGATTTGTCGGAGGGATAAGTGTGAAAGATGAAATATGC
TCCATATTCTATTTTTTTAACAAGTGAACGTGCTGCCGGCTAGAGTTAGT
TGAGTTGAATTGAACTCTCGGCCTCTCGTTGTTTAGTTGGCGCAGCGCGT
GCAGCTTTCGGGGCGTGGCTAGCTGCTTTTTCCCGATCAGTCGTACGTAC
GGGACGTTTGAGTGATTCGGTATATGATACGGTTTGTGATTCGGTATATG
ATTCGGTTTGTGATTCGGGCTACGATCCGGTAGTATTTTGTGTTATGGAA
CCAAATCCATGGATTGTCAAATAGATTGACGAATCAGAATGTCGCTTTCT
TTTGGGGGGGGGGGGTCGTTTCCCTTCATTCCCCCTTCAATGAATACGGA
TATTCCGTATTTTTTCGTGTGTCGTGTTTTCCTCGTTAATAAGTACTTGT
ATCACGTTGACTTCATGTCCATCGTCATTTGGGGGTCGAGCTCGCTTTTT
TGTCCCGTGGCGGCGGAGATGCAGCCGGCTGTGTCGCCCTTTGTTCTGTA
GTACTCAGTACAGTGCGATTTTTGTCTTACGTTCCCGACCCATCAGAAGA
AAAGCGCCAGTAAACTACTGTACCGTAGGCAGCAGTACCTCGTAAACTAT
TAATAGTAAGTACAGTATATGGGTGGGTTAGGGCTGACTTCTAGCTCTGT
AGATAATACAAGTTTTTGTTGCAGCTTCTCGTGCATAATCTTGGGCTCTC
TTAAAACAACGAAGAAAATAAAAATATAGCACCGCACGTACCGTTGCTTC
TGCCTCCCCGTGATACTGTAGATGTAATACTTGTGGATTGTGTCGTTTCG
ACCCGTCCCTCCGTCTCCCTTTGAAGGAGAATTGCTACCTGTCCCGTCCC
GTTGCCGGGCGCCGAGTTGCCACATGTTGGTGGAGACCGTGGTTTACCTA
GGCGCGGTGTTGATGTCCTTGGGGAACACCTGTCCTAAGTTGCAGATGTA
GTTATACGTGGCCCACCGGAAGATTCATTTGTTATTTTCGAAGCAAGGCT
CGTCCATGACTGTGCCACAGCAGATTAGCCTG
Coding sequence (CDS) from alignment at A-nodosum_M_contig100:354535..377366+
>mRNA_A-nodosum_M_contig100.21.1 ID=mRNA_A-nodosum_M_contig100.21.1|Name=mRNA_A-nodosum_M_contig100.21.1|organism=Ascophyllum nodosum dioecious|type=CDS|length=2379bp|location=Sequence derived from alignment at A-nodosum_M_contig100:354535..377366+ (Ascophyllum nodosum dioecious)
ATGGAGGACGGACGTGGTGGAATTCCGTCCGGCCACCGGCGGATGAAGTC
TTCGGGCGGCTTCTCACAAAGGCTGGGCAAAATGCAGGAAGCCTTGGGGA
TGGCTCATCATTCGGTCCCAGGGATGGGGGAGAAACGGCGAGCAGGAAAG
CAGGGAGCTTCATACGATCCGCGCTTCGTCCAGAGGGGCGAGAGAATCAA
CCCTCAAACCATCGAGACTTTTTCCTCGAGCTTCCTTATCCGCGGGATCG
TACTTCTCAACATCGGTTTGGGAATAGCCTATTTGGTTTGGCGCTTTACC
CAAACCCAAGGCGTTGAGGAACATCTTATGTGGTGGTGGTGGACGTTCTT
CCTCGTGGAAATTTTCCTCGTAACCGCAATCTGGATCGGGCACACGCAGC
GGCTGTTCGCCGTCCAGCGCATCAGGGTGACCATGGACCAGATCGTTTCG
ATCGATCCGGCCGTGGGTGCAAATGCGGTGGTGTCCATCCTGCTTCCGAC
GGCGGGAGAACGCTTGGATGTCGTCCTCAAGTGCCTGCTGGGTGCTTCCA
GCCAGAGATCGTGGCCTACGACCGCTCCCGGGAAATCAGGCCGAGGCGAT
GGGCTGCGAGTCATCGTCCTAGACGAGAAACGGCGTAAGGAGGTATATGT
GCTCACGTCGGGAGTGCACGCGTTAGCCACGCAGATCCTTGCGCCATCGA
CACGCAAAATCCTTCAAGCTGAGGGCGTGAGGAATCTCACTCCGCTGGCC
TTCTACGAGTGGTGCCATGAGAAGCGAGGTTTCGGCAAGGTTCACATCTT
CAAGGATATCGGGCTCAGCCGGGCGGTGGCTATGGTGCGCCAGATGGACG
ATCTAGTTACTGGAGACGACGGAGGAGATCTTTATCGCCTCGACAGGAAT
CAGACGTACACGAACATGGGCCTGGACGACGACATCTCGAAGTCTAAGGG
TGGAGACGAAGTGGAAGAGCTGACACACGGTGTTAGCCTGCTTGCCGACG
AAAGCGTGCACATCACTCCTGGGTTCTTCCAAGTTTTCCGCGGGCAGGCC
AAGCAGGCGGCGTGTATGATCTATTATTCTAGAAAGGATGCGGGCACCCC
CAAGATCAGCCCCAAGGCCGGAAACATGAATGCTGCTATGTTCCCCGTAG
ACGACCCCACCTCCCCTCCGCTGATCGGGCCGAGCACCATCGTGGTCGTC
AATGACGCCCGTCACCAGCTTCAGCCGGATTTCTTGCAAAGGACCACCCC
TTACTTCTTTGACGTTGACGAGATCACCGGACAGTACAAGTGGGCCAAGG
TGGCCTTTGTTCAGACTCCCCAGCGTTTCCGTCAGGAGTTGCCGAACGAT
CCCCTGGGAAACCACGCTGCATCGCAGTACGACGTCATCAACATCGGAAA
AGATGGTATTGGAGGCGTTTCGTCGAGTGGACAGGGCAGTCTATGGCGCG
TGGAGGCGTTAAGGGGAAGATCTCCTGACGGGAAGACGGGGGTCGACGCG
AAGGACCTGTCCTTGGTCGGGCACGAACTGGGTTTCCGTGCAGAGTTGCT
CATCGAAGACACGCACACGTCGATCGAGCTCTTCCGACAGGGATGGCGCA
GCGTGTACGTGAACGAGCCGGGAGAGGTTCTGGCTTGGTGCACGCATCAG
CCTGCGAACCTTACGTGGCGTATCAAGCAAGTGTTGCGCTGGCATCAAGG
CGCCGTGCAGCTGCTGTACCTCAAGGGCTTCCGTTACACGAGCTGCGGGG
GTAACTTCCCCACCATCTTTCACCGGATATACGCATTCGATCAAGCCACG
TACTACCTCCAGGCGATCCCGGGTTACGTTCTTCTACTGATGCCCTTGGT
GTACGGTATTACCGGACAATCACCTTTCAACACCGAAATCACACCTTACT
TTTCCTACTTCGTTCCGTTCATCGTGACGGCCGTTCTACCCACGGTTATC
TCGGCTCAGTGGAGGTCCATCGACTCACACCGCCTTACGCGTGACGAGCA
GACCTGGCTATCCACAACCTACGTGCAGATCTACGCCTTCCTGCAGGTGA
CGTGGTCCAAGCTCATTCGGGCAAACCCAGAGCACGCTTGGGTGGCGAAG
GTTCCGACGTGGCCGCTGTACCTGGTATTCCTGGCGCAGTTTGCAGCTGT
CGCGGGCGCGGTTTACTGGACTCTGCATGACGGCTTTGAACACTACTACA
AGAATACGCTCTCGATATGCGCTGGTGCCTTCCTGGGTATGTTTTACCTT
TGGCCAATGATCGCTCTTCAGATGGGCGTCACCAAACCCTCTTTCTGGTT
CTTCAAGCTGGGGGCTTACGTGATTCTCGGCGTAGCGATGGTCATACTGG
GGCAATTCTTCGACCTGCAAATTGGTTGA
|