Tomato mild mosaic virus


Basic Information

Genus Begomovirus
NCBI Assembly GCF_000874565.1
Isolate Brazil
Release date 2015/2/13
Submitter Castillo-Urquiza,G.P., Beserra,J.E. Jr., Bruckner,F.P., Lima,A.T., Varsani,A., Alfenas-Zerbini,P., Murilo Zerbini,F., Beserra,J.E.A. Jr., Lima,A.T.M., Zerbini,P.A., Zerbini,F.M.
Host
Download Genome |GFF3 |PEP |CDS

Genomic Organization


JBrowse


Genome

NC_010833
ACCGGATGGCCGCGATTTTTTCCCCCCTACGTGGCGCTCTGGAGGTCGTTCGATCCGATCGAACGTGTCCCCCCCACCAAATCCCCTCGCGCGCTCAATTCAGTTGAGCGCTTTTTTGGAGTCCGCGAAATGGGTTCAGCGCATTTTTTGAGTTCCGCCGTTTAACTTTAATTTAAATTAAAGCCTTGACCAATCATTTCGCGTCTGACGAGTTTATTTAGCCTAAGAGAACTTAACAACGAAGTTGTTGACCAACCGTTATAAATTAATCCTGATTGGACTCGTCAGTCTTTAACTCAGAATGCCGAAGCGGGACGCCCCATGGCGCCTGATGGCGGGAACCTCAAAGGTTAGCCGATCTGTCAACTACTCCCCTCGTGGAGGCCCAAAGTTCAACAAGGCATCTGAGTGGGTCAACAGGCCCATGTATAGAAAGCCCAGGATATATCGGACGCTAAGGTCTCCTGACGTCCCCAAAGGCTGTGAAGGGCCTTGTAAGGTCCAGTCGTTCGAGGCGCGACATGATGTCTCTCATGTCGGGAAGGTCATATGCGTCTCTGACGTGACACGTGGTAATGGTATTACCCATCGTGTGGGTAAGCGTTTCTGTGTTAAGTCTGTATATATTTTAGGGAAGATCTGGATGGACGAAGCGATCAAGCTCAAGAACCACACGAACAGCGTCATGTTCTGGTTAGTTAGAGACAGGAGACCCTACGGAACTCCTATGGATTTTGGACAGGTGTTCAACCTGTTCGACAACGAGCCCAGCACTGCCACGGTTAAGAGCGATCTCCGTGATCGCTTCCAAGTGATGCGCAAGTTTTACGCCAAGGTCACAGGTGGACAGTATGCTAGCAACGAGCAGGCTCTGGTCAAGCGTTTCTGGAAGGTCAACAATCATGTGGTCTACAACCACCAAGAAGCTGCGAAGTACGAGAACCACACAGAGAACGCTATGTTATTGTATATGGCATGTACCCATGCCTCTAACCCTGTGTATGCGAGTCTCAAGATTCGAATCTACTTCTACGATTCCGTTCTCAATTAATAAAATTTAAATTTTATTGAATGATTTTCCAGTACATAGTTTACATATGGTTTGTCTGTTGCAAACCTAACAGCCCTAATTACATTATTAAGGGAGATCACACCTAATTGATCGAGATACAACATAACTAACTGCCTAAACCTATGTAAATATGTCGTCCCAGAAGCTCGAACTGATGTCGTCCAGACTTGGAAGTTCAGGAAGGCCTTGTGTAGACCCAGCATCCTCCTGAGGTTGTGGTTGAACCGTATCTGGATGTGGTACACTCTGGTCGCCGTGTACCGTAGATCCTCTACGTTGTATATCTTGAAATAGAGGGGATTTGATATCTCCCAGATATATACGCCATTCTCTGCCTGAGGTGCAGTGATGCTCTCCCCGGTGCGTGAATCCATGCCCTGCGCAGTTAAGGTGCACGAAGACCGAGCACCCGCAGTCTAGGTCAATTCTGCGTCGTCTGGTAGCCCTCTTCTTCGCAATCCTGTGCTTGGGCTTGATAGAGGGGGGCGTCGAGGAAGATGAATTTAGCATTCTTGATCGTCCAGTTCTTCAGAGCAAGGTTTTCCTCTTTCTCGAGGAAATCTTTATAGCTGGCCCCCTCTCCAGGATTGCAGAGCACGATTGATGGGATACCGCCTTTAATTTGAACAGGCTTTCCGTATTTGCAATTTGACTGCCAGTCCTTTTGAGCTCCCATCAATTCTTTCCAGTGCTTTAGCTTTAGGTAGTGCGGAGGGACGTCATCAATGACGTTATATTCAGCCTCATTTGAATAGACCCTGGAGTTGAAGTCCAGGTGTCCACTGAGATAGTTATGGGCCCCTAAAGCACGAGCCCACATTGTCTTGCCTGTTCGACTATCTCCTTCGACTATCAAACTAACAGGCCTGCTGGGCCGCGCAGCGGCATCTCTTCCAAAATAGTCATCGGCCCACTCTTGCATCTCGTCGGGAACGTGAGTGAATGAGGAGAGTGGAAACGGAGGAGACCAAGTTTCCGGAGCCTTTGCGAAAATGCGATCTAAATTACTATTTAGATTATGGAATTGGAAGAGATACTTTTCCGGCAACTTCTCTCTGATTATGCTCAGCGCCGCTTCTTTCGACGGCGCGTTTAAGGCCTCTGCGGCAGCGTCGTTAGCTGTCTGGCAACCTCCTCTAGCACTTCTTCCGTCGACCTGGAAAACTCCCCATTCGACTGTATCTCCGTCCTTGTCGACATAGGACTTGACGTCGGACGACGACTTAGCCCCCTGAACGTTCGGATGGAAATGTGCTGACCTGGTTGGGGAGACAAGGTCGAAGAATCTATTATTCGTGCACTGGAATTTCCCTTCGAACTGGATGAGGACGTGGAGATGAGGCTCCCCATCGTCGTGGAGCTCTCTAGCTACCTTGATAAATTTCTTCTGTGCAGGAATTTCTAGGGCTAATAATTGGGAAAGTGCCTCTTCTTTGGTTAGGCTGCATTTGGGGTATGTCAAGAAATAGTTCTTGGCATTTATTCTAAAACGTTTGGGCGGTGGCATTTTTGTAATAAGAAGGGTGTACACCGATTGAGAGCCTCTCCAAAACTCATATGAATCGGTGTATGGTGTACAATATATAGTAAGAAGTTCCTAAGGCTAGGTAACACGTGGCGGCCATCCGTTAATATT

NC_010834
ACCGGATGCCCGCGCGATTTTTTCCCCCCCTACGTGGCGCTCTGGAGGTCGTTCGATCCGATCGAACGTGTCCCCACTGGTGTTCTCTCTCCCCTGGTGTCCGTTGGATTTCCTCTACGCCAAATCAGTTGAGCGCGTTTTTGACGTCCGCCAAATGAGTTCAGCGCATTTTTTGAGTTCCGCCTATTGGATGCTGACACGTCGCATCCTATCTATGTAGACGCGCGCTCAACTGTTAGATATTGTCAGTTCGCGATATCAGCTGTAGACCGTTGGATAAATCTGACATGCAATCCAGCTGGATTGTATATTGGAACTTGAATTATTTGGGCGCGACTGACAGAAGACGGCCCCATTGTACAACTCCGTTAGCCAATCGAACAGCTACTTCGGTGACAAGATAAATAATTGATTGACCAGGAATTTAATATTTAAATTCCGTCTGTGAGGACTTATTACCACGTCTATTTACATCAGCAACAACTGTTAACAAATTTTTGTGCATAATTAACTCAGCATGTATCCTGTTAAGTATAGACGTGGCTGGTCGACTACTCAACGACGAAGTTATCGACGGGCTCCTGTGTTCAAGCGGAATGCCGTTAAACGTGCAGATTGGATACGTCGACCGAGTAATTCAATGAAGGCCCATGACGAGCCCAAGATGACAGCCCAGCGGATACATGAGAACCAGTTTGGCCCAGAATTCATGATGGTTCAGAATACAGCAATTTCTACTTACATATCCTTTCCCAATCTTGGTAAGACAGAACCGAACCGGTCTAGGTCATATATTAAGCTAAAACGACTTCGTTTTAAAGGGACTGTCAAGATAGAACGTGTTCAGCCAGATGTCAACATGGACGGTTCGGTCTCGAAAACCGAAGGTGTGTTCTCTCTCGTAGTGGTTGTGGATCGTAAGCCTCATTTGGGCCCTTCTGGATGTTTGCACACCTTTGACGAGCTCTTCGGTGCAAGGATCCACAGCCATGGGAACCTATCTATTACTCCTTCATTGAAGGACCGGTACTACATACGTCATGTGACCAAACATGTACTCTCGGCGGAAAAGGACACCATGATGGTCAACCTCGAAGGGACGACATTCCTATCTAACAGGCGTGTTAGCTGTTGGGCCGGTTTTAAGGATCATGATCATGATTCATGTAATGGGGTTTATGCTAACATAAGCAAGAACGCCCTGTTAGTTTATTACTGTTGGATGTCGGATATTATGTCCAAGGCATCGACATTTGTATCATATGATCTGGATTATGTTGGTTGATCAGGAATAATAATTCATTAATTATTATAAGTTAACAGGGAATTTACAATAACCATACATTTAATTTAAAGACTTGGCCTGCGAAGGCACACAGTTACTGTTAATGCACTCCTGGACCGTGTTCCTCAGCAGTTCGTTTAATTGTCCCAGAGACATCGTAATGTTGGACTGGGCTCTCTGGGCCCCCACGATGGAAGCCGATTCGCCTGGGTCCAACATTGTTGTACCCAGTCTGTTCAGCTCCCTGTATGGATGCGATGTCTCTACTATATCGGACTCCGCATCCGATGGATTGGCACCTACAGTGCTCCTGACAGCCCACGACTCTCCTGGCCTAAGTTGTATTGGGCTGCGAAGCCCAATTGTTGAGGCGGATGCGGATCGGACTGGTTTCCTCTCCCATCTTCCGTATCCCACGTGGCAGAAATCGACATCCTTGTCGGTAAACTGTTTGGACAGGATCTTGACCGTCGGAGCCCTGAAAGGAATATCCACGGAGTGTTTAGCCGTCGACAGCTTCAGTTTACCTTTGAATTTTGCGAAGTGGGTCCTCTGATGGACGTTCGTGTCGCAAACTCTGTAGTACAGCTTCCATGGAATTGGGTCTTTGAGGGAGAAGAAGGAAGATGAAAAGTAGTGGAGATCGATGTTACACCTGATCGGGAAAGTCCACGACGCCTGTAACGACTCATTGTCCGTCATCCTCTTGTCGTGAATCTCCACTACCACCGCCCCAGTGGCGTTAATCGGAACCTGCTGCCTGTACTCTATGACGCAGTGGTCTATTTTCATACATCTACGACTGAGTCTGGCACTTATCTGGGAGGCGGTGGACGGAAACTGCAGTACGATCTCAGTTAGGTCATGAGACAGCTGGTACTCGTCCCTTTGCGACTCTATATAATTGAAGGCGTTCGGAGGATTAGCTAACTGAGATTCCATTGTTTCTGTTTGGTCGCGCAGCGACAGCGATCGACTAAATTGAACAGGAGAAGAGAACAGCTGCGACAGCGAGAAGGATTTAATGTTAACAGAAAGAGTAAAGTAAAGGAGATCGTACAGATTAGGATGATAATTAGGACAATTAAACAAGTTTAATGGCTTATAAGGATAAGAAAAGGAAAAGTTTAGAGAAGAATTAGGGATTTACGTTTCGGAGGAAAGAGTAAAGTTGGGGTTCTGACACTCATAGGGAAGTTAATTGGGAGTTATATTTATAGCTAAACCCGGAAAGGGTTTGATGGCATTTTTGTAATAAGAAGGGTGTACCCCGATTGAGAGCCTCTTCAAAACTCATATGAATCGGTGTATGGTGTACAATATATAGTAAGAAGTTCCTAAGGGCTAGGTACAACGTGGCGGCCATCCGTCTAATATT

Gene Information

NCBI Accession YP_001960945.1
Location 302-1051
Gene Name CP
Protein Name coat protein
Coding Region ATGCCGAAGCGGGACGCCCCATGGCGCCTGATGGCGGGAACCTCAAAGGTTAGCCGATCTGTCAACTACTCCCCTCGTGGAGGCCCAAAGTTCAACAAGGCATCTGAGTGGGTCAACAGGCCCATGTATAGAAAGCCCAGGATATATCGGACGCTAAGGTCTCCTGACGTCCCCAAAGGCTGTGAAGGGCCTTGTAAGGTCCAGTCGTTCGAGGCGCGACATGATGTCTCTCATGTCGGGAAGGTCATATGCGTCTCTGACGTGACACGTGGTAATGGTATTACCCATCGTGTGGGTAAGCGTTTCTGTGTTAAGTCTGTATATATTTTAGGGAAGATCTGGATGGACGAAGCGATCAAGCTCAAGAACCACACGAACAGCGTCATGTTCTGGTTAGTTAGAGACAGGAGACCCTACGGAACTCCTATGGATTTTGGACAGGTGTTCAACCTGTTCGACAACGAGCCCAGCACTGCCACGGTTAAGAGCGATCTCCGTGATCGCTTCCAAGTGATGCGCAAGTTTTACGCCAAGGTCACAGGTGGACAGTATGCTAGCAACGAGCAGGCTCTGGTCAAGCGTTTCTGGAAGGTCAACAATCATGTGGTCTACAACCACCAAGAAGCTGCGAAGTACGAGAACCACACAGAGAACGCTATGTTATTGTATATGGCATGTACCCATGCCTCTAACCCTGTGTATGCGAGTCTCAAGATTCGAATCTACTTCTACGATTCCGTTCTCAATTAA
Protein Sequence MPKRDAPWRLMAGTSKVSRSVNYSPRGGPKFNKASEWVNRPMYRKPRIYRTLRSPDVPKGCEGPCKVQSFEARHDVSHVGKVICVSDVTRGNGITHRVGKRFCVKSVYILGKIWMDEAIKLKNHTNSVMFWLVRDRRPYGTPMDFGQVFNLFDNEPSTATVKSDLRDRFQVMRKFYAKVTGGQYASNEQALVKRFWKVNNHVVYNHQEAAKYENHTENAMLLYMACTHASNPVYASLKIRIYFYDSVLN

NCBI Accession YP_001960946.1
Location 1048-1446
Gene Name Ren
Protein Name replication enhancer protein
Coding Region ATGGATTCACGCACCGGGGAGAGCATCACTGCACCTCAGGCAGAGAATGGCGTATATATCTGGGAGATATCAAATCCCCTCTATTTCAAGATATACAACGTAGAGGATCTACGGTACACGGCGACCAGAGTGTACCACATCCAGATACGGTTCAACCACAACCTCAGGAGGATGCTGGGTCTACACAAGGCCTTCCTGAACTTCCAAGTCTGGACGACATCAGTTCGAGCTTCTGGGACGACATATTTACATAGGTTTAGGCAGTTAGTTATGTTGTATCTCGATCAATTAGGTGTGATCTCCCTTAATAATGTAATTAGGGCTGTTAGGTTTGCAACAGACAAACCATATGTAAACTATGTACTGGAAAATCATTCAATAAAATTTAAATTTTATTAA
Protein Sequence MDSRTGESITAPQAENGVYIWEISNPLYFKIYNVEDLRYTATRVYHIQIRFNHNLRRMLGLHKAFLNFQVWTTSVRASGTTYLHRFRQLVMLYLDQLGVISLNNVIRAVRFATDKPYVNYVLENHSIKFKFY

NCBI Accession YP_001960947.1
Location 1193-1582
Gene Name Trap
Protein Name trans-activating protein
Coding Region ATGCTAAATTCATCTTCCTCGACGCCCCCCTCTATCAAGCCCAAGCACAGGATTGCGAAGAAGAGGGCTACCAGACGACGCAGAATTGACCTAGACTGCGGGTGCTCGGTCTTCGTGCACCTTAACTGCGCAGGGCATGGATTCACGCACCGGGGAGAGCATCACTGCACCTCAGGCAGAGAATGGCGTATATATCTGGGAGATATCAAATCCCCTCTATTTCAAGATATACAACGTAGAGGATCTACGGTACACGGCGACCAGAGTGTACCACATCCAGATACGGTTCAACCACAACCTCAGGAGGATGCTGGGTCTACACAAGGCCTTCCTGAACTTCCAAGTCTGGACGACATCAGTTCGAGCTTCTGGGACGACATATTTACATAG
Protein Sequence MLNSSSSTPPSIKPKHRIAKKRATRRRRIDLDCGCSVFVHLNCAGHGFTHRGEHHCTSGREWRIYLGDIKSPLFQDIQRRGSTVHGDQSVPHPDTVQPQPQEDAGSTQGLPELPSLDDISSSFWDDIFT

NCBI Accession YP_001960948.1
Location 1494-2579
Gene Name Rep
Protein Name replication-associated protein
Coding Region ATGCCACCGCCCAAACGTTTTAGAATAAATGCCAAGAACTATTTCTTGACATACCCCAAATGCAGCCTAACCAAAGAAGAGGCACTTTCCCAATTATTAGCCCTAGAAATTCCTGCACAGAAGAAATTTATCAAGGTAGCTAGAGAGCTCCACGACGATGGGGAGCCTCATCTCCACGTCCTCATCCAGTTCGAAGGGAAATTCCAGTGCACGAATAATAGATTCTTCGACCTTGTCTCCCCAACCAGGTCAGCACATTTCCATCCGAACGTTCAGGGGGCTAAGTCGTCGTCCGACGTCAAGTCCTATGTCGACAAGGACGGAGATACAGTCGAATGGGGAGTTTTCCAGGTCGACGGAAGAAGTGCTAGAGGAGGTTGCCAGACAGCTAACGACGCTGCCGCAGAGGCCTTAAACGCGCCGTCGAAAGAAGCGGCGCTGAGCATAATCAGAGAGAAGTTGCCGGAAAAGTATCTCTTCCAATTCCATAATCTAAATAGTAATTTAGATCGCATTTTCGCAAAGGCTCCGGAAACTTGGTCTCCTCCGTTTCCACTCTCCTCATTCACTCACGTTCCCGACGAGATGCAAGAGTGGGCCGATGACTATTTTGGAAGAGATGCCGCTGCGCGGCCCAGCAGGCCTGTTAGTTTGATAGTCGAAGGAGATAGTCGAACAGGCAAGACAATGTGGGCTCGTGCTTTAGGGGCCCATAACTATCTCAGTGGACACCTGGACTTCAACTCCAGGGTCTATTCAAATGAGGCTGAATATAACGTCATTGATGACGTCCCTCCGCACTACCTAAAGCTAAAGCACTGGAAAGAATTGATGGGAGCTCAAAAGGACTGGCAGTCAAATTGCAAATACGGAAAGCCTGTTCAAATTAAAGGCGGTATCCCATCAATCGTGCTCTGCAATCCTGGAGAGGGGGCCAGCTATAAAGATTTCCTCGAGAAAGAGGAAAACCTTGCTCTGAAGAACTGGACGATCAAGAATGCTAAATTCATCTTCCTCGACGCCCCCCTCTATCAAGCCCAAGCACAGGATTGCGAAGAAGAGGGCTACCAGACGACGCAGAATTGA
Protein Sequence MPPPKRFRINAKNYFLTYPKCSLTKEEALSQLLALEIPAQKKFIKVARELHDDGEPHLHVLIQFEGKFQCTNNRFFDLVSPTRSAHFHPNVQGAKSSSDVKSYVDKDGDTVEWGVFQVDGRSARGGCQTANDAAAEALNAPSKEAALSIIREKLPEKYLFQFHNLNSNLDRIFAKAPETWSPPFPLSSFTHVPDEMQEWADDYFGRDAAARPSRPVSLIVEGDSRTGKTMWARALGAHNYLSGHLDFNSRVYSNEAEYNVIDDVPPHYLKLKHWKELMGAQKDWQSNCKYGKPVQIKGGIPSIVLCNPGEGASYKDFLEKEENLALKNWTIKNAKFIFLDAPLYQAQAQDCEEEGYQTTQN

NCBI Accession YP_001960949.1
Location 2165-2422
Gene Name AC4
Protein Name AC4
Coding Region ATGGGGAGCCTCATCTCCACGTCCTCATCCAGTTCGAAGGGAAATTCCAGTGCACGAATAATAGATTCTTCGACCTTGTCTCCCCAACCAGGTCAGCACATTTCCATCCGAACGTTCAGGGGGCTAAGTCGTCGTCCGACGTCAAGTCCTATGTCGACAAGGACGGAGATACAGTCGAATGGGGAGTTTTCCAGGTCGACGGAAGAAGTGCTAGAGGAGGTTGCCAGACAGCTAACGACGCTGCCGCAGAGGCCTTAA
Protein Sequence MGSLISTSSSSSKGNSSARIIDSSTLSPQPGQHISIRTFRGLSRRPTSSPMSTRTEIQSNGEFSRSTEEVLEEVARQLTTLPQRP

NCBI Accession YP_001960950.1
Location 518-1285
Gene Name NSP
Protein Name nuclear shuttle protein
Coding Region ATGTATCCTGTTAAGTATAGACGTGGCTGGTCGACTACTCAACGACGAAGTTATCGACGGGCTCCTGTGTTCAAGCGGAATGCCGTTAAACGTGCAGATTGGATACGTCGACCGAGTAATTCAATGAAGGCCCATGACGAGCCCAAGATGACAGCCCAGCGGATACATGAGAACCAGTTTGGCCCAGAATTCATGATGGTTCAGAATACAGCAATTTCTACTTACATATCCTTTCCCAATCTTGGTAAGACAGAACCGAACCGGTCTAGGTCATATATTAAGCTAAAACGACTTCGTTTTAAAGGGACTGTCAAGATAGAACGTGTTCAGCCAGATGTCAACATGGACGGTTCGGTCTCGAAAACCGAAGGTGTGTTCTCTCTCGTAGTGGTTGTGGATCGTAAGCCTCATTTGGGCCCTTCTGGATGTTTGCACACCTTTGACGAGCTCTTCGGTGCAAGGATCCACAGCCATGGGAACCTATCTATTACTCCTTCATTGAAGGACCGGTACTACATACGTCATGTGACCAAACATGTACTCTCGGCGGAAAAGGACACCATGATGGTCAACCTCGAAGGGACGACATTCCTATCTAACAGGCGTGTTAGCTGTTGGGCCGGTTTTAAGGATCATGATCATGATTCATGTAATGGGGTTTATGCTAACATAAGCAAGAACGCCCTGTTAGTTTATTACTGTTGGATGTCGGATATTATGTCCAAGGCATCGACATTTGTATCATATGATCTGGATTATGTTGGTTGA
Protein Sequence MYPVKYRRGWSTTQRRSYRRAPVFKRNAVKRADWIRRPSNSMKAHDEPKMTAQRIHENQFGPEFMMVQNTAISTYISFPNLGKTEPNRSRSYIKLKRLRFKGTVKIERVQPDVNMDGSVSKTEGVFSLVVVVDRKPHLGPSGCLHTFDELFGARIHSHGNLSITPSLKDRYYIRHVTKHVLSAEKDTMMVNLEGTTFLSNRRVSCWAGFKDHDHDSCNGVYANISKNALLVYYCWMSDIMSKASTFVSYDLDYVG

NCBI Accession YP_001960951.1
Location 1346-2227
Gene Name MP
Protein Name movement protein
Coding Region ATGGAATCTCAGTTAGCTAATCCTCCGAACGCCTTCAATTATATAGAGTCGCAAAGGGACGAGTACCAGCTGTCTCATGACCTAACTGAGATCGTACTGCAGTTTCCGTCCACCGCCTCCCAGATAAGTGCCAGACTCAGTCGTAGATGTATGAAAATAGACCACTGCGTCATAGAGTACAGGCAGCAGGTTCCGATTAACGCCACTGGGGCGGTGGTAGTGGAGATTCACGACAAGAGGATGACGGACAATGAGTCGTTACAGGCGTCGTGGACTTTCCCGATCAGGTGTAACATCGATCTCCACTACTTTTCATCTTCCTTCTTCTCCCTCAAAGACCCAATTCCATGGAAGCTGTACTACAGAGTTTGCGACACGAACGTCCATCAGAGGACCCACTTCGCAAAATTCAAAGGTAAACTGAAGCTGTCGACGGCTAAACACTCCGTGGATATTCCTTTCAGGGCTCCGACGGTCAAGATCCTGTCCAAACAGTTTACCGACAAGGATGTCGATTTCTGCCACGTGGGATACGGAAGATGGGAGAGGAAACCAGTCCGATCCGCATCCGCCTCAACAATTGGGCTTCGCAGCCCAATACAACTTAGGCCAGGAGAGTCGTGGGCTGTCAGGAGCACTGTAGGTGCCAATCCATCGGATGCGGAGTCCGATATAGTAGAGACATCGCATCCATACAGGGAGCTGAACAGACTGGGTACAACAATGTTGGACCCAGGCGAATCGGCTTCCATCGTGGGGGCCCAGAGAGCCCAGTCCAACATTACGATGTCTCTGGGACAATTAAACGAACTGCTGAGGAACACGGTCCAGGAGTGCATTAACAGTAACTGTGTGCCTTCGCAGGCCAAGTCTTTAAATTAA
Protein Sequence MESQLANPPNAFNYIESQRDEYQLSHDLTEIVLQFPSTASQISARLSRRCMKIDHCVIEYRQQVPINATGAVVVEIHDKRMTDNESLQASWTFPIRCNIDLHYFSSSFFSLKDPIPWKLYYRVCDTNVHQRTHFAKFKGKLKLSTAKHSVDIPFRAPTVKILSKQFTDKDVDFCHVGYGRWERKPVRSASASTIGLRSPIQLRPGESWAVRSTVGANPSDAESDIVETSHPYRELNRLGTTMLDPGESASIVGAQRAQSNITMSLGQLNELLRNTVQECINSNCVPSQAKSLN