CoVDB Coronavirus Database (v3)   
Strain
Snake_MG752895 (Region: USA;  Strain: Ball python nidovirus 1 isolate 148, complete genome.;  Date: 30-Nov-16)
Gene
pp1a replicase polyprotein
Description
Annotated in NCBI,  pp1a replicase polyprotein
GenBank Accession
Full name
Replicase polyprotein 1a      
Alternative Name
ORF1a polyprotein
 

Sequence

CDS
ATGGTCTTAATTGTACCACTCGCCTACACGAACAGGCCACCAATCAGTAAACCTCGAAGATATTCAGTCCTTCCAAATTACTATCAAAAGTATGGAAGTGATATGATCAAAAGAAACATCTATTACCGAGGCATCGATACCACACCAGCACCAATGAGCAGAGTGGCAAGAGGCAACTACGTACAAAGCGTACAAGACCAGCCTAACACCAAGAAAAATAAAAAAGTTACATTTAACGAGAACGTAAAAGTCGTGGAAGTCACGAACAACTCGGAAAGCCACGCAATGGCTGACAGTAAAACAGCCAGTAATGCACAACCAAAAGTGCAGGCAGCTAACAACAGCGCCCCCAATTTTACACCATACGAGGTAGCTAACAACAGCTCCCTCCAAAAGAAAACTTTTGCACAAGTAGCTAGAAATAGTGAAGTGCCGGCAGCTAGTAATAGTGCCCCTCAAGAAGAAGTAAAATTCTCGTTTGTTGCGGCTATAAAAAGAAGAGGCCGAAGATCAGCAAAAATAAAAGCTGAGGCGGAAATCAGTCCCGTTAAAGAAACTGAAGAGTCTGCCCAAAAAGCAACTCCCAATTTCGAGCCCGAAAACGCTCATGTTAAGGAGACAGCTGTAAAAGCTGCTCCCAATTTCATGGTGGCAACAGTCCCCATAAGTAAGATCAGCTATGCAGCGGTTATGGCCAATGCAGTAAATGCTATCAATAACACCAAAGAAGCAACTGCAGCCGCAGTAGCCAATGCAGTAAATGCTAATTTAGGAGAAGGAACTCCTGTAGTGTCGGCGGGGATCCCCAAGGGTAAACGCACCGTCGACAGTAGCTACGAGCTACAAGCTACGAATCCATCGGTGGCAACTCCTGTCCTCGTTAACGAGGCCTCTGAAGACGTAGAAGTTGCAAAAGTAGAGGAACCAACCTCTATAGCTCACGAACAACAGCACAAGACTGCACCCACACCAACCAGCAGCAACACCACACCGAACTATGCAGCAGCTGTAAGAGCCAAAGTAAAGTCAGCGACAACCGCTGTAGCTACAACAAGTAGCGATGTAACAACACCAGAAGGAAGTCCAAGAAAGGCACCGCAACAAGCCGCAACCAACTACGTAGCCGCAGTAAAAGCCAAGACAACCGACGTAGCTACGACAAGTAGCAATGTAACAACATCAAAAGTTGGAACCCCAACCTCAACCTGTGGGGCTATGACTCCGAATGAAAACAGGATAGTAGCAACAGTTGATGAACTACTATCCAGCAACACCGCTGCAATAGTCACTAAAAACAAGAAAGAAAGCGGTGTGCCACTCGACCGCAACACCGCAGCACCGAGTCTACAGAAAATAACAAAAAGCGGTGAACAGCACAACTACACTACAGCTGACCAACAAGCAAGCTTGCAACAAACACCAGATCAAGTCGCCAGAAGCTACATGAGCAGCTTAGTTTCACTCTTCAACAAAACACAAGTAGAAGAGCCAGAAAAGGACATCGCAGAAGCTATCGCAAACTTAGCAACAAGCTACGCAGTGGCAGTAACTGACACCAGTACAAACACCAACGGCAAGAAAACCGACGGTACCAGTGTCAGTTCCAACCAAGGTAGTAAGCAACAACTCAACTACCAGCATAGCAACCAGAACACTAAGTTTTTATGCGAAGCCCTCAGCGAGCCCACTGCCACAACGACAGTCCACGGCTTCATCCAACCACACGACTTCGCTTATAGCACCAACTGCTCAAGCGTCAACGTTAGCAACCACCACCTGCAAGCAGACTGTAGTCAAGTCAGCCCAAGCTCCCACCAGTGCGTCAATACAGCACGAGAACTCATACTAAGTGCGCTCACTGACATCACTCAAGAAGAATATAACATATTAAAATTCACCGAAGTGACGAACTTCGCAGGACTCAAAGGAATACTCCCAAGAGGATACACCTTCAACGCCACACAACACGTCAACAACTTCACCATCGCGATCTGGAAACACAACGAGCTTCCAATCACCCACGTGGCAGTACTCAACACCGACAGCATAAGCACCGTCATCTACGAAACTAAAGACAACGAAGTCGACCTCAGTGAATTTTCAGAGCCACACCTCTTCGAGCTCGTCAAAATCAACCAAGAAAGCACCGAAAGAGTCACTAGCACGAAGTACGAAGAAATAGGACGCAACAACTTGTTCCATTGCTCAACCGAGCTCCCACGAGACTACATCCAAGTACCAACATTCAGCACGAAAGCTAACATCAAAGTCAAACCAAAGTCGAAGACAATCAACCTCAAGAAAACATATCAACAGTTTAAAGAAGAATTGAACGAGTTGTCAATGACAATACCATTCAATCCAAAAGAAAGCCAGAAAACACCAATCTACTCACAACTCAACAACGACCACAGGTACTTCAACGGAGAGTTTGTAACAAGCGGAGCTGGAAACCAACATTTAACTACCGTAACAGACGTCACCGAAAATGAAACAGTCAGACAGCTAGACATGGCAATAAAAACAGCCAGTTCTAAACTAGAAGTACGTCAACTTAAACACCAAGTTAAGGTTCCCAGAAAAGTCGAAAACAAAATCATAAAAGGAACCAACATTTGGTCGCCACTAAAAGTCTCCGAGACAGTAAGATTGGGAAAAATGTGTAACAAAAACCACAACTTAGAATGTAAGAAACAAATTGAGAAAAAGTTGTGTAGAATGTGTAAAAATCAATGGTATGTTCAAAGAACAGGAGGAGTTTTTGACACTGAAATCGGACAAACACTAGTCCACATGCAGCCATGTACTGCTTGTGCGATTGAGTTTACTGATTATGAGTGTGATTGTGAGAAGCAAAACATCGAAGTTTGTGCTTTTAAAAGAAGTGAGCTAAATCTACACCAGTTGAACGAATTTAAAAAGAATAAACAAATACAATTTGGAAGAGAAAAAAATCAAAAGCAAGGAAATAGAGGAAGAAATTTTGATGAAATTCAACGAAGTGAAGATTTACCACAACAATTTTTCCATCATAATCACTTAGCACTTAGAGGTGCAAAGAAAGTTGTGAGATTAAATAAATATTCTTTTGTGAAAGCTTATTTAAAAATTTGGGATTTGCCAGCCTTTGGGCCACCATCAAGCCAGCAGCAGCAGCAGCGGCAGCAGCATCCTAGGCAGCAACAACAGCAGCAGCAACAGCAGCCTGGGTATCAGCGTCAGCAGCAACCCCAGCAGCAGTGTCAGCAACAACCTAGGCAGCAACAACAGCAGCAGCAACAGCAGCCTGGGTATCAGCGTCAGCAGCAACCCCGGCAGCAGTGCCAGCAACAACAGAGGCAGCAACAGCAGCCTGGGCAGCAACTTCAGCAGCAACCCAGGCAACAATGGCCACAGCAAAAGCAACAGTACCAAAGGCAGCAGCCTAGGCAACAATGGCAACAACAACAAGGCCAGCCAGGRCAGCAGCAGCAGCAGCAGCCTAGGCAACAATGGCAACAACAACAAGGCCAGCCAGGACAGCAGCATCAGCAGCAGCAACAATGGCAACGGCAGCAAAGCCAACAGCCACAAGAAAATAATTATCAACAAATGCAACAACAACAACAACAACAACAGCAAGAAGAAATAATAAAAAATTTAAAAATAAAGTTGCAGCAGCAAGAAGAAATGAATGGAAAATTATTAAAGCAGCAAGAAGAAATGAATAAAAATTTGGAAAATATGCAAAAGCAAATGCAGCAACAAATGCAGCAACAGCAGCAAATGCAAATGATGTATATGAATCAAATGCAACAAATGCAAAATCAAATGCAGCAGCAAAATCAACAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAKCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCAAGTCCAAAGCCAGGCCCAAAGGCAGCAACAACAAAAACAACAACAGGAAATTGTGCAGCCACAAGGCCAGGCCCAAAGACAGCAGCAGCAGAAACAGCAGCGAGATCAGCAGCAGCAACAGCAGCAACAACATCATGAACAGCAAGATCAACCACAACAGATGGACCAAGAACAAGCAAGTAACTACATCAACGACTTAGTGACATCGCAATGGTTCGAAATAGTCACCAAAGAGATGTCTAGGCACTATAGCAAAAATGCAATTTGCCGCCATAATCTTTATGGAAAATGTCGTTTTGGTAGTAAATGTAAAATGGCCCATATCAGCCTCAATGGCAAGAACGGACCACTACCAACCCAAGAACAACAAGAACAGCTCCAAGCAACAAAGCTACTACCTTGCCGCAACTATCTCCAAGGTTATTGTAGCTATGGTGATAAGTGCAACAACTTGCATAGTTACATTAACATCAACAGAGTCAACCAAGAGCTGAATAAAGGTAATTTGTTCTTATTTGTTGTGCCCAAAGGCTTTAAAACAAATTATGTCATCGTCAGCGAGCAACAGATGCAGCTACAACAACAGCTGCAGAAATTGCAAGCCGTAAATGATAGTAAACAAAGCCAACAGCAACAGCAACAGGAAGAGCAGACAAAACACCAGCAACCAGCTCAGCAACAGCAGCAACAGCTGCACGAGCAGCAGGAGCAAGAGCCAGAGCAACTGCAGCAAGGGACGCAACAGCGTCAAGAGCAGCAAGTGCAGGAGCAAAAGCCAATGCAGCAAGAGCTGTTACAGCAGCAACAAGTGGTCGATCAAAAACAGCTGCAGCAAGAATTCCAAAATAAAATCAAAACTCAGCAAGAAGAACTTCAAAAGAAAAAGGACTTAAAGAAAGAAGAAAGAAGGCAGCAACGAGAGCTGGCATACAAAAAGTACGACCAGCAACAGTCAAAAGAAAGCTTAAGCCAGAATAGCCAAGAAAGCAAGGACCAAGAAAGCATGAAAAGTCAGAAAGACCAGGAATGTATAAGCCAAGCAAGCCAGGAAAGCCAGAAAAGTCAAGCAAGTCAGAAAAGCCAGAAAAGTCAAGCAAGTCAGAAAAGCCAGAAAAGCCAAGCAAGTCAGAAGAGTCAGAAAAGTCAAGCAAGTCAGAAGAAGGAAAGCGAGAAGACCAAGGAAAGCTCAAAACAAGAAAGGCAACAAGAAGTTTTACAACAGCAGCAAGTACAACAGCAAGCTCAACAACAACAGCAAGTACCACAGACACAGCAACAACAGCACGAGCAACTGCAAGTACAACCGGAAAGCCAACAACAGCAGGCAATCCAGCAGCAAGAAACTCAACAAGAGCAAGCGCAACAAGACGAAGTTCAAGCCACTCCGAACACACCGCAGCCACAACGGCAGCTGTATCAACCCAAAGTCATCGACCTCACGAAGGGCATACCACTCAACCTCAAGAAACCAGATGACGAATGCAAAACATGCCAAGATGCTACGAAGTATCTTGTAAAGCAAGGTCTCAGAAGAGTCGTCACTATCTTCGAAAGAAACAGTGGCAAACAGATGGAAAAATTCGCAATCGTCGTGGCCCTCCAAGGCCACTATGGAATTTTTGAGTCAGGCAATCCACGTGAGCTTTATAACGGAGAAGTTTACAACCAAGTTTTGTCATCCTCAGTCGAAGCTTACTTTATACCAAACCCACAACGCCAACAACAACAGCCACAGCAGACTGAAGACAAGCTGCGACAACAGCAAAAGCAACAAGAAAACGAGCAAAAGCAACAAGGTAACAACCTTCAGCTGCAGCAACAACAGCAAGAAAGCAACAACCTGCACCGGCAGCAATGCCCGCAAGGAGCCAAAAAGCAACAGCAGCAACCGCAACAGCACCAAGATAGTAACAATCAGCAACAAGAAAGGCAACAAGAAGAAATGCAACAACGACAGCAACAACAACAGCCACAGGTTGGAGAGTCCAAGTTCGAGATCACTAACATCGAAAAGTTAAGCCCGCTGACAGAAGTCAGTAGTGGTTCTTTCACTTCTACAAGTGACACCGACAACGACGCTGACAACAACGTACTCCAACAGAACGTCAACCAAGCTGTCAAGTATCAGAACTTCGTCAAAGAGTTCCGCAACGCCCAAGCAAACTTCAAATACCTACCTCCAAAATACAAAGACAAGCTCGTCCAATACAAAAAGCACAAATTTGATATTTTGTATGCAAGTAAGTTCGTCGAGATGGAACCTAACCAAACGAAAAGGTACCACAACGAGAACTTCGTATTTTGTAAAGAGGAAGGAGGTGTCTTATATGGTTACTACCATAGCGACACCAGTGAAGACGTTTATGAGGGAGTTGATTTCCCATGTATCGTCTATCGAGAAGTATTGCAATCTGTTGACTGTTATGTAAGTAAATGCAGTGAGTTCATTTATCACAAGAACGACATCTTCACATCAAGCGTCAACTTACCACTCCAGTACCAAGTCGAAACTCACATTTGGCCAGCATCCAAATGCAGAGCCGAAGGTATCAAAATCGATAAGAAAACACCACTCAACACCAAGGTCATTGTCTGCTACCGCTATTGTCAACTTGTCGAAGGCAAAAACACCTTCCGACTTCACGTCAAAGACTTGCTAGAACAACACTCCAGTCAAGCCGGCGACGACGTAACAGTTATCACAGACCCACTGCGAATCGCAAAATCACAAGTAAGCTTATTGCTGAAATTAATGGAAAACCACCTCCGCCACGGCGTCCACATGAGCATCAACAACTATCTCAACTGTCAGTCCACCGAGTTTGCACTCGTTTTTAACAACTTTGACTTCAGCATTACGAAATTCGCACTTGCACAATACCAGCTATGTGCTAACGGCAACGAGCTAGAACCAGACCTGCACGGCGTTTTCATTAAACAACTGTGTAGTCAGGTCAACTTCAAATATGTCGCCCCTGACGAGCTAACATCAAGTAATTATGGCAAAGTCATCTTGAATATCATCAAGCAGAACGTCGTGCGTGTTGACAATAAGATACCTTATGAATTTGACATGCCAGACAGCTTGGAAGTCAAACCCCAAGTCGATGATATCGATTACGAAAAGCTAATCGGTAGAGGAGCTTATGGTAGAGTTTTTGGTAGTAAATCAGGAAAGTTTGTATTTAAGCTACAAGGCTTACAAAGCTCGCAATATGAGCATAAGATTTTAAACTTAGTTAAACACCTCAATGTACCACAAGTCGTCGAGCATCACGAATTCCAAGAACAAGGAGTCGGTATAATAAAAATGAAAGCACTTGAGCTGAAAACTCTCGACATTCTGAAAGTCGATGAGAATCAAGATTTAACTACCAGGCAAGATATGTTGATGCAGTGTTTAGAACACCTCGAACAGGTTTTGAAAGTCGGCGTCGTCCAAAACGATTACCATATGGCTAATATGGCATGGACGAGTGAAGGAAGACTTATCGTGTTAGATTGGGGAATTGCAAAAACAAGAAAAGAAGGTCAAGAGCAAGAGTTCAAAGCAATTGTGTATGCTTGGTATGTCAATTTAATCATTGACGTATTAGCTAACATATTCTTGGACTATTATGATGAAGTTTATTACATCACGAAACAGAAAGACCTGTTGTGTTACTTCGAATGGTTCATGGACGAGGAGTTATTCTCCAAAACACACGAGCTATTCGGAGATGAATATCACGACATGCTCCACCATCCACCAGTCCAAGAGGACGAAGAAGAGGAAGAGAGCGATTGGGAATCTGAGTATTCGGACAACCAAGAATTGTCAGAAGACCAACAGGTACAGGAAGTAGAATATGGGCAACCTCAACAGCAATGCCATGAAGTAAAACCTCAGGAGCAATTCCAAGAAGAAAAACCCCAACAGGTGAAACCTCAGGAAAAACCTCAGGACGTCAAACCTTTTGAGCAAGTTCAACAAGAGGAAAAGCCCCAACAGGTGAAACCTGAGGAAAAACCTCAGGAAGTAAAACCTCAGGAGAAACTTCAACAAGAGGAAAAGCCCCAACAGGTGAAACCTCAGGAAAAACCTCAGGAAGTAAAACCTCAGGAGAAACTYCAACAAGAGGAAAAGCCCCAACAGGTGAAACCTCAGGAAAAACCTCAGGAAGTAAAACCTCAGGAGAAACTCCAACAAGAGGAAAAGCCCCAACAGGTGAAACCTCAGGAAAAACCTCAGGAAGTAAAACCTCAGGAGAAACTTCAACAAGAGGAAAAGCCCCAACAGGTGAAACCTAAGGAAAAACCTCAGGAAGTAAAACCTCAGGAGAAACTTCAACAAGAGGAAAAGCCCCAACAGGTGAAACCTAAGGAAAAACCTCAGGAAGTAAAACCTCAGGAGCAATTTCAAAAGGAAAAAAACCAGGATGTGAAACTTCAGGAAAAACCCCAGGACATCAAGCCCTTTGAGCAAGTTCAAAGCCTAACTATAGCCACTCAGCTACCAAGCAACCATACCAACACAAGCAGCTTATTAACTTTGACTCAACATAAAATCAATTGCGTCAACAACGAGATTAACCAAGTCCAGCAACAACTCACTCGACTCAAGAAAGGAAAAGCAAAAGCACAAAAAGCAACACTCAAAGTCAAACTCGAAAAACTCAATAGCAAGAAAAGCGAACTCGAGAAGCAGTTGAATGGTAAGTCAAGAACATCCAAGTCACCAAAGAAAGCAGACCAGCAAGCAACTTCTAACAAGCCACAGCAACGTGAACAATTACAACACAAGCAACTCAACGACATACCAAACGTCAACAGTGGCATGACTAAAGTCGAGCAACAAGCGATCGACTGGTTCGAGATGACAACATCACCAGCCAGTTCGAGAGCTAGTAGCTCAAGTGGACCATCAGCGGCAACAGCCAACGAGCAGTCCATAACTGCCAAACCAAGCCACAAGCATGCCACAGTAGCTATCACGACATCTAAGCCAGCACCAGCAACAACAGTGCAACATAGACTCAGCGAGCTCCAGAAGATGGTATTCAGCCTGAAAAAAGTTGACACAAGCTTATTGGACAAACACCAACTCTCCTGCCATAACAAAGTACTCCAAGAGTGCTTTGAGCAAATCCAACAACTTGCTCACGAGGTCGACAGCCAAGGCACCAGCAACTCCAGTGAGAGTTCTAGTTTACTTCCAAGTCAATCTCACCACGAAGTCGAAGCACCAAGTCAACCAAGCCGAGACATCTTAGCTACAAGCAGGCAGAAACGACCAAAGGTATTTTGCATCCGTGAAGTGTCTGACATCGACATCAAAGTCAGCACAACAGCTACAAGCTGCAAGCCAGGAGTCAACCGATGCTTCATCTACGCAGCTGAGGCAGCTCTCGCAACTCTCGGCCAGTATCTCACACAAGAGACACTCAACAGCTTGTACAAACTCGCCGACAAAGGCCAACACGATGCATCTGAAGTCATCGCTAGAAGCGGTCTACCAATCATCCAGACTGCCAAGTGCATTCTGCATGGAACTTGTAAGCTCATGACACCATGCTGTAGCAATGTCGTACCTAACACGACACGTACGAACATCTTGGTCAAAGAACACATCGCCAGTACATCCGCGTATGGTCTCACGACACTCGAGCCACTCGCCTACTTGCACTACTCAGGTAATGCACTCGCAGGCCACTGGAAATGTTCTTACAACACAGCACAAGGAGTCAAGAAAATGAAGGACAGTGGTGTCACGAGTGTGATCAACAAGCACCCTAAAGCAACATGGACACTCTGCAAAGTCGCATTTAACACCGAGCTCGCACTCACGAATGACGACATCACGAGCAACGAAAACGTCACCATCATCAGCCAGAACATCAACCTCGTGGCATCAATGCTCACAGCCCACGGTGATCCAGTATACCACGATGCCACAACACTCGCCATCGTCCGAGACAAAGATACAATCACAGTCATCACACCAGACGCAGCCGCCAACCACGACTTCACAGCGAACTCCAACGTCTACTGTCTCGAGTATCATTTACAAGTTTCCAAGAAACTCAAAGCAAGCAAATGTCTAGTCAAAGAACCACAAGCACCAGTAAGCAAGTTCGAACTAAACAACAAGAACAAAGATAGCACCATCAAGGCAATGCAACGCAACTTAAGGCAAGAGCTCACAAAGTACTATAGAAACAACCTCACAACAGGTGGTCCAATCAAGACACTCGTGGTCGGCATCGGCAAAGGCAACGACGCTCCACACTACGCTGCCGCAAACGGCGGAAATGGAATCATCTATGACGGATGCGATATCAGCAACGAGTCTCTCAACATCTGTGCCACGAAGGTGCCAAGTACTACAACGTTGCACCAGAGTGACATCAACACAAGACAGTTCGATACATTATGCAAAAAGTACGACTTGCTAGTCAGTACTTTTAGCTGGCATTTTAAAGACAAGATCAAGTGCGGCAAACACCACGAGTTCCACATCATACCAGTCTACAATCCAGACACGTACGACACGTGGAGCCAGTACTATGACGTGACAGTCCACTCGAAGGACGACACGAGCATCACGCACACGATCAACATGCCAACATGTCGGACAACTGAGACACTCCACACTATAAAGTGGTGGTATCAACAGTACTGTAGCAATATTACAATCAAGTCACTCCAGACCGAGTTCAACTCAAGCAACAGACACTTTAGCAATTTTATTGTCGTGATCCACACCGGACACAAGACATCAAGCACTAGTATGAAGACAATTGCAATCACACAATTAAGTTTAGACAAAGGAAACGACTCAGCAAGTAGCTCCAGCAGCGTATCTAGCACGAGCGATTCCACGACTAGCAATACAACTGTAACCGCAAGCTCAGAACAAGCAATAAGCAACCAAGTCAAAGTACCACAACTCCCAGCCGACAACCACGACTGTAACACAATGCAACCAAGCGACGCAGACGATGAGAAGACGGAATTCGAGCAGTGTCAAAGTAGGGGTAACATTCCCATCACTATACACTCATGCGAATCTTCTGACTTCACGGACACTGTGTACGCTACAAGACCAGAAAACGTCAACGCGTTTTTTTGCCGAGAATTTTGCACCAGTTGCTTTGTTCATTTGCCAAAGAATGCAGTCGTCGATAGACAACTCCTCACGAACGCTCAAGTCTACCTCAAGTGGAGCGGTATCAGCATCAACGAGCTCAACTATCAGCTCATTTCAACTCTGCACAAACGCGAAGTCATCTGTCTCGACCACCTCGAGCCAGCACCAGTAGCCACAGACGACAACTGGGTCATCAGACGCACTACGACTAAACCAGTCAAAGTGCTCTACATGTCCCACGACGTCTACAAGACCATCGACTCAACACAAGCAGCAATCTTCCACCGCGACTTCGACATCTTCACACACCAAATGAGCAGATTCACCAAGAGGCCAGTCCAGAGCGTCGTCATCCACACCAGCAGCAAAGACGAAGCAGAGTCAATCTGCAACACACTCGAACACTACAACATCAGCAACATCAAGATCGCCGAACACAGAGTCGGCCAGTTCGAGACGCTCGCATTCAACCGCAAGACATGCCAGTACGAGTCACTCCAAATCAACAAGAAACGAGCTGGCATTTGGGAGCAAATCAAAAGTCACTGTTATGACATCAAGAAACCGTACCTCATCGAGTGGAACGACATCACCGACGAGTGTAGTCACGAAGACAAGTACAAAGCAATCGCTAGCTTTAGCGACAAACACGGCCTCGACAGCTACCAACGGCTCATCGGCGCCGACAAGTGCGTCTTCGTAGACTACAAAACCAGCACCGACACGAACCCAGACAAGCTCGACGACTACTACCGTAGCCGCCACGCAACTGAAACAGGCAAGGAAGTGGTCTATACAGCGGTCTTCAAGCTACCACCGACAGATAACAACGAGTCCAGGACAGTCTTCGCGGCTCTCGCAGTCACGAAAGACGGTAAATCACTGTACATCAACGGTGATAAAACCCAGATACCAGCTTCAGACATCAACAAGGCACTGTATCACATCATGCAAGCCAAGTACGGCAACAACACAAAGATCATCGGCGAGGTTTACTCCAAGTTCTTTAGCCCTGAAAAGGAAGATGTCGCAGCTTACAGCTACAAAGACCTCTTCACGATCTTACCGTTCTTGCTATTTGTTTTGCAACCTGGATTCTTTACGTTCGCTTTGTCACTCGTGATCATGGCTTTCTTTTTATTCGACAGACAAGCACTGCGCAACTTCGCCACAACAACGCACCAGCTGATCAACACAGTCCTCGAGTACGAAACCGCAGCAGCCATGAGCTATTGTCGTTTCAGTAACTATCGAGTCACTAAGTACATACTCGCCACAGCCAACTCTGTGATCTGTTTGTACTTGGTCCTCTATACTTACTGGTCATGTGTGTCTAACCACGACCTCATTGACACGACGCTACCGCCGTACCACAGCTACGTGCAACGCATACTCGCATTTCTAGACTACATACCATCAGTCCACGAGTACGCAAACGCAGTGACGCTCAGGGAACTCTGCGGATCTAATATCATCTGTAGACTTGGATCACCGTATAACAAGCTGTATCTCGACCAGTACAAGCACTATGTCACAAACAAGAGCCACATCTTCCGCGACAAGCTAATGTCAATGTTCTCCTTCTATACACCATGGGGAATGCTAATGTACTGTCTCGACTTCAAGCCGTCAGTGCATTTAGTCTGTTATAACATCTTAATGACAGCCTTCATCGCGTATTTAAGCTTTCGCCGATTCTTCACTTGCTGTAAACCATCTGGCCCATTTTGTGCCAAACACGCAATGCTCCGCACGAACAACGTCCAGTACTACTGCAGCGGCAAGCCTCACAACATCCAGGCTATCAAAGTCAAGTACTGCACTAAGCACAAATGGTGGTGCAACACCACCCAGGAGCATCTGCTGCCGCACCCAATCGCCAGAGTCATCGAAGCTTCCACACAGATGAAGTCTAACTTCATCAAGTCTGACACTTACTTCAAGTATGTCAGCCAGATACCAGATCGTGAGCTCCCAGACAACTTTAAAGACTATAACCACAACTTTGTCTATAGCACCAAGAACTTGTCGTACCTCGAATGGACACAACGAGCAGCCCTCTTCGCATATTGTTCGAGCACGAAGGTTAAGATCTCAAGTCACGACTCAGGTTCCATCACGAACCAGAAAGTCGAAATGACACAACAGATGCTCGACTACTTCAAGACCAACGAAGACTGGAAGTATAACTACGTCGCGGCCCAACAAGCCGGTCGTAGGAATATCTTGCATGTCGGATTTCTTGAAAGTATGACCGAACAGGAACTTAGCGAATTCTATGAATTCTGCTGTCAGTACGACCAAAGCTACACCAAGACTTGCATCAAGCGAGATAACTTTCTCAACGGTAAAGTACCTGACTGTTTTAAGTCAAGCCGCTTATTAAGTACCAAGTATCACAGTGACACAATCTATCTCGACAGCTTCGCAGTCGAGCACCACGACGACATCAAACAGCAAGCGAGTCAACACCACTTCTTAGTGTCGCAAGGCAAACCTAAGAAGTACATCAACTCGAAACGCAACATCTTCATCGCACTCGCAGTCTTACTAGCGATCACACTCACACGAGTCGCATATCGCAAAGTCGTCAAGATCAACACTCCAGCAGGACTCAACCCAACAGGCAAAGACTACGCAAAAGGAACACTGTATCTTCATGAGAACATCATCGCAACACCAGTCGCATACGGCAACAACCGCAGAGTCCAAGCATGGCACTTCAGCAACGGGACGTTCGCTTTCACCGAAAAGCGCAGCACTCTCATAGCTAAGACGTCATGCGGAGAATTCCAAAACAGTTTTCTCCAGATCGAGACTTATCACCAGTCATGCGACCAATATTGGCCACTGGCCATCAACTTTTTTAAGTTGTCCATTTATTGGTTCCAAGCCGACAAGGCGTATACAACACGCCATGGTGATTATCAATCACAGGAAGGTGCAATATGCTTTGGATTGTCAGACTCTCTAGTCTGCCACGAGACAATGTCGATTTATACACCAGCTGCATTCGTCTGGATAACATCTAGCGTCGTGCTGTGCTTCGTGCTCAGCATCTTCTGCTACATCAAGATGAGAGGTTGCTTTGGTAACTATACCAGCTCAGTGCTCATCCTCATGACAGTACACGCTATCACCGTAATTTTGTACTTGTTCGTGCCCGGACTTGCATTCGTATTCGTCATTGCATTGTTTTTTATTCCAGCATATCGACTCGTCATTGGCACCTATAGTTTTGTGGTTTGTTCCATGTTTCTCGGTCTCAACTTGGTCGTATTGTTTGGCCTATATACAATCGTCTTTCTCACCATTTTCTGGTTCTCACGAAACCAGACCAACGACGTCGAGTATACGCCATCAGGAGTCGTCTTCAGTTCCAACTTCAGCGGAATCGCCAAAAACGCATTCCTGCTGACCCCTGAAAACATACCCACAATAATCTCAGTGACAGGCAAAAACTACGCCCAGTTACTCGAAATGTCAACAGGACCACAGCTGAAACCAGAGACAGCTCTTGCTAATGCAATCCTCAAGTGTAGCATCACGAATACCAAGATGCTTTACGAGCCGCCTAAGAGCACAAGACTGCCAGTCTACTTGCAGTCTAAACTCAACAGGCTCTCGGACATCGTCGTCAACAGCGCCCAGCTCACTAACATCTGTGGCATCCATGACGGAACAGGCCTCATCGGACACGGCATCTTCACGACACCGACCACAGTTCTCACCGCAAGACACTGCTACACCACAGCAGCCCAAGTATTCTATCGAGGCCGACTGTTGCAAGTCAAGGACCACCACGAAGTCGGCTTCAACATAATCTTCACAGTCGACAAGCAAGACAACATCAAGCCAATTGCAATCGACAACCACCACGAACTCGAACTCGGCACCCAGTATACGCACGTAGTCTCACCACTTGACGACCAAGCAACAGTCACTGTGCACCAACTGTATCCAACACCGTCAGGCCACTTCGCCCACGCATCCACGATCGCAGGCGAATCTGGCTCACCGATTTTTTACCACAACAAGCTAGTCGGCATCCACCAGGCAATGGTCAAGTCTAAAGAACACACCGGAGCCCACGCAATCGCATGCCGAGTCGATGGTACGCCATTCGACAGCAGCTTCCATGATACACTCATGGCAGCAGGTAAGATCCAATTCGATGGCAACGCACTTTTGAATTACTATTTCACCACCGAATGTCGGAAAGCAATTTCAGTCCGTGATTTTAACAACCAGATCACCGAAGCTAACAGGCTTATCAGCTACTATAGTCACATCAGTATCATCAACGACCCTGTATTATTACAAGCAGAGCCACGAGACTTAAAACCTCTCATAGAATTTTTGAAAGACTCACCAGTCATCAGAGAACACCACTGTAAACCATATAAGATGGGTAATGACATCAAGCTTCAGACAAGAATCAGCCAACGCATCATCGTACACTGTACACTCAGCAACTTACTGAGCTTTTGTATGACACTCACATATTTTCTCACAACTGCAATCTACGGTACGCTCACGCTCAAGAACTTCATCGACTTACTACTCGCAGGCTTCATGCTCACGGTAGTTTTTCGGAGTAGGCATGTCGTGTTCTTACTCACAACAGGAGCATATTTTGTCAACATGCTCGAGTTGTTTTTGTATGTCTGTCATACAAATATCCAAGTCATCAGAAACATTTTTTCAAGCAACGACGAAATTTACGCCCACGTCGTCCAAACGGTTGTGCGTTTTGGTCTCCAAGACGCCATGTTATGTACAGTCATTCTCATCGTCATGATGCTCAAGTTCATTTTATTGCCACTGCGCTGTGTCATTTTTACCATCATCTATTGGACACTTGTCTATTGCTTCGGTGTCGTCAATTTCTACACCATCGCAGTATTCGTGTTCAGTTTTGCTAATACATCAAGCTGGTTTACTTGTCTGACACTGTTTTTACAAAACACAATCTATTTTCCAGTTTGGTTTACACTCAACATTGTGTTGTCCATGCGCATCGGACTTCCCAAATGGGTCATTAATCGATACCACCAAATCACATCCGACAGCGTCCGAGTCAACCGAGGGTATTTTTGTCACCATCTCGCAACTTATCAAAGGCCACCATCATTTCTTGAAGTGCTCATAAGTCAGCTATTTTATTCGCAGGACGACGTCATCGAGTATATCCCACAGTCCAAGATTGTGCCATACTCAGTTGTCGGTTCTGCCACACACTACCCTAACGTCAACGCCAAGAGTCTCGCACTCATGTCAGGTAGTGACAGTAAAACGCTCTATGCTCACTTCATCTCCGCAGTCGAGGCAGTCTTACAATCAACGACAGCCGCCGACCAGCAGGCATTTATGGAATGGTGTGCAACTACATGTGACCAGGAAACCCTCGAGAAATGGCTCGAAGATAACCCAGAAGACATTCAGAACAAGTTACGCACGAAACGCCGGAACATCATCCACGCCAGGATCATGTTTTTGAAAGCCAAAGAAGAAAAACTCCGCAAGCAAATCAACCTCATGCAACTCGAACAAGTCCGAGGCATGATGCGTAGTGAGTTGTCTATCAAGCTCTGCGACGTACTCAACCGATCAGTCGCCGAAATGCAAGAAAAGGCAAATCTGCGTACTAAAACGTTCGGCAAAGGTATCATCGCAGCAAGCACACTCACAGTTCCAGAGGTACTCGTAGTCACCAACACGAATGGCAGAGACAAAATCAGCTGGGATGACGAATCCGAATGCTTCTGCTTCGAATTCGAAGAGGCAATCTATCACATCGCAGAGCTGAACACAAACACAGGAACAGCTATCAAGACCGAAGCTGAACTCAACGCACTCACAGCCACGAACTTTCCACTGTACGGCAAATTACACGACTTCTCATTCGACGGAGCAATCAACCAAGCTAACATCGGCTACACTATCAAACTCCACCAGATCAAAATCAACAGATCAAGCTCAGGCATCAAGATCGAATATGCCGCAAGCTCAGACAAAGCAACGAAGCCGATACTCGAAGAAGTCCAAGAAGCAAACGACAACACAGTTTTACTCGTGGCAGACGGCATCCTCAGACCATTCAAAGTCAATTCTAGCGTTTCGGCTCCAGTCCTCGCAGCAGTCATGACAAAGCTACGTCTCGACAGCCCAGAACTGCAAGCCATCAAGCTCGGGGGACTCGAAAATGTCACTGAGCATGTCGCCACATCCAACCAGCCGCTCAGGTGTCGAGGGTACACCACGTACTTCGCACCGTCACTGTGTAAATACTGCCGTACGAACATCGAACATAAATGCAAATACCAACAGTTCGTGCAAATTCCTGTCAACGAAGATCCAGACAAGTACCTCAGCAGCCACGACATCTGTCAACACAACAAGTTCAACTGTGACACATGTCACGGTAAAGTGACTATCCAAGCAAGGCCAAGCACCACAGTCAAGCCGGCTGACAAACTCCGCGAGCTGCGCAAACTGGCAAAAAACTCGTCTCGTCCGCAGTGA
Protein
MVLIVPLAYTNRPPISKPRRYSVLPNYYQKYGSDMIKRNIYYRGIDTTPAPMSRVARGNYVQSVQDQPNTKKNKKVTFNENVKVVEVTNNSESHAMADSKTASNAQPKVQAANNSAPNFTPYEVANNSSLQKKTFAQVARNSEVPAASNSAPQEEVKFSFVAAIKRRGRRSAKIKAEAEISPVKETEESAQKATPNFEPENAHVKETAVKAAPNFMVATVPISKISYAAVMANAVNAINNTKEATAAAVANAVNANLGEGTPVVSAGIPKGKRTVDSSYELQATNPSVATPVLVNEASEDVEVAKVEEPTSIAHEQQHKTAPTPTSSNTTPNYAAAVRAKVKSATTAVATTSSDVTTPEGSPRKAPQQAATNYVAAVKAKTTDVATTSSNVTTSKVGTPTSTCGAMTPNENRIVATVDELLSSNTAAIVTKNKKESGVPLDRNTAAPSLQKITKSGEQHNYTTADQQASLQQTPDQVARSYMSSLVSLFNKTQVEEPEKDIAEAIANLATSYAVAVTDTSTNTNGKKTDGTSVSSNQGSKQQLNYQHSNQNTKFLCEALSEPTATTTVHGFIQPHDFAYSTNCSSVNVSNHHLQADCSQVSPSSHQCVNTARELILSALTDITQEEYNILKFTEVTNFAGLKGILPRGYTFNATQHVNNFTIAIWKHNELPITHVAVLNTDSISTVIYETKDNEVDLSEFSEPHLFELVKINQESTERVTSTKYEEIGRNNLFHCSTELPRDYIQVPTFSTKANIKVKPKSKTINLKKTYQQFKEELNELSMTIPFNPKESQKTPIYSQLNNDHRYFNGEFVTSGAGNQHLTTVTDVTENETVRQLDMAIKTASSKLEVRQLKHQVKVPRKVENKIIKGTNIWSPLKVSETVRLGKMCNKNHNLECKKQIEKKLCRMCKNQWYVQRTGGVFDTEIGQTLVHMQPCTACAIEFTDYECDCEKQNIEVCAFKRSELNLHQLNEFKKNKQIQFGREKNQKQGNRGRNFDEIQRSEDLPQQFFHHNHLALRGAKKVVRLNKYSFVKAYLKIWDLPAFGPPSSQQQQQRQQHPRQQQQQQQQQPGYQRQQQPQQQCQQQPRQQQQQQQQQPGYQRQQQPRQQCQQQQRQQQQPGQQLQQQPRQQWPQQKQQYQRQQPRQQWQQQQGQPGQQQQQQPRQQWQQQQGQPGQQHQQQQQWQRQQSQQPQENNYQQMQQQQQQQQQEEIIKNLKIKLQQQEEMNGKLLKQQEEMNKNLENMQKQMQQQMQQQQQMQMMYMNQMQQMQNQMQQQNQQQQQQQQQQQQQQQQQQQQQQQQQQQXQQQQQQQQQQQQVQSQAQRQQQQKQQQEIVQPQGQAQRQQQQKQQRDQQQQQQQQHHEQQDQPQQMDQEQASNYINDLVTSQWFEIVTKEMSRHYSKNAICRHNLYGKCRFGSKCKMAHISLNGKNGPLPTQEQQEQLQATKLLPCRNYLQGYCSYGDKCNNLHSYININRVNQELNKGNLFLFVVPKGFKTNYVIVSEQQMQLQQQLQKLQAVNDSKQSQQQQQQEEQTKHQQPAQQQQQQLHEQQEQEPEQLQQGTQQRQEQQVQEQKPMQQELLQQQQVVDQKQLQQEFQNKIKTQQEELQKKKDLKKEERRQQRELAYKKYDQQQSKESLSQNSQESKDQESMKSQKDQECISQASQESQKSQASQKSQKSQASQKSQKSQASQKSQKSQASQKKESEKTKESSKQERQQEVLQQQQVQQQAQQQQQVPQTQQQQHEQLQVQPESQQQQAIQQQETQQEQAQQDEVQATPNTPQPQRQLYQPKVIDLTKGIPLNLKKPDDECKTCQDATKYLVKQGLRRVVTIFERNSGKQMEKFAIVVALQGHYGIFESGNPRELYNGEVYNQVLSSSVEAYFIPNPQRQQQQPQQTEDKLRQQQKQQENEQKQQGNNLQLQQQQQESNNLHRQQCPQGAKKQQQQPQQHQDSNNQQQERQQEEMQQRQQQQQPQVGESKFEITNIEKLSPLTEVSSGSFTSTSDTDNDADNNVLQQNVNQAVKYQNFVKEFRNAQANFKYLPPKYKDKLVQYKKHKFDILYASKFVEMEPNQTKRYHNENFVFCKEEGGVLYGYYHSDTSEDVYEGVDFPCIVYREVLQSVDCYVSKCSEFIYHKNDIFTSSVNLPLQYQVETHIWPASKCRAEGIKIDKKTPLNTKVIVCYRYCQLVEGKNTFRLHVKDLLEQHSSQAGDDVTVITDPLRIAKSQVSLLLKLMENHLRHGVHMSINNYLNCQSTEFALVFNNFDFSITKFALAQYQLCANGNELEPDLHGVFIKQLCSQVNFKYVAPDELTSSNYGKVILNIIKQNVVRVDNKIPYEFDMPDSLEVKPQVDDIDYEKLIGRGAYGRVFGSKSGKFVFKLQGLQSSQYEHKILNLVKHLNVPQVVEHHEFQEQGVGIIKMKALELKTLDILKVDENQDLTTRQDMLMQCLEHLEQVLKVGVVQNDYHMANMAWTSEGRLIVLDWGIAKTRKEGQEQEFKAIVYAWYVNLIIDVLANIFLDYYDEVYYITKQKDLLCYFEWFMDEELFSKTHELFGDEYHDMLHHPPVQEDEEEEESDWESEYSDNQELSEDQQVQEVEYGQPQQQCHEVKPQEQFQEEKPQQVKPQEKPQDVKPFEQVQQEEKPQQVKPEEKPQEVKPQEKLQQEEKPQQVKPQEKPQEVKPQEKLQQEEKPQQVKPQEKPQEVKPQEKLQQEEKPQQVKPQEKPQEVKPQEKLQQEEKPQQVKPKEKPQEVKPQEKLQQEEKPQQVKPKEKPQEVKPQEQFQKEKNQDVKLQEKPQDIKPFEQVQSLTIATQLPSNHTNTSSLLTLTQHKINCVNNEINQVQQQLTRLKKGKAKAQKATLKVKLEKLNSKKSELEKQLNGKSRTSKSPKKADQQATSNKPQQREQLQHKQLNDIPNVNSGMTKVEQQAIDWFEMTTSPASSRASSSSGPSAATANEQSITAKPSHKHATVAITTSKPAPATTVQHRLSELQKMVFSLKKVDTSLLDKHQLSCHNKVLQECFEQIQQLAHEVDSQGTSNSSESSSLLPSQSHHEVEAPSQPSRDILATSRQKRPKVFCIREVSDIDIKVSTTATSCKPGVNRCFIYAAEAALATLGQYLTQETLNSLYKLADKGQHDASEVIARSGLPIIQTAKCILHGTCKLMTPCCSNVVPNTTRTNILVKEHIASTSAYGLTTLEPLAYLHYSGNALAGHWKCSYNTAQGVKKMKDSGVTSVINKHPKATWTLCKVAFNTELALTNDDITSNENVTIISQNINLVASMLTAHGDPVYHDATTLAIVRDKDTITVITPDAAANHDFTANSNVYCLEYHLQVSKKLKASKCLVKEPQAPVSKFELNNKNKDSTIKAMQRNLRQELTKYYRNNLTTGGPIKTLVVGIGKGNDAPHYAAANGGNGIIYDGCDISNESLNICATKVPSTTTLHQSDINTRQFDTLCKKYDLLVSTFSWHFKDKIKCGKHHEFHIIPVYNPDTYDTWSQYYDVTVHSKDDTSITHTINMPTCRTTETLHTIKWWYQQYCSNITIKSLQTEFNSSNRHFSNFIVVIHTGHKTSSTSMKTIAITQLSLDKGNDSASSSSSVSSTSDSTTSNTTVTASSEQAISNQVKVPQLPADNHDCNTMQPSDADDEKTEFEQCQSRGNIPITIHSCESSDFTDTVYATRPENVNAFFCREFCTSCFVHLPKNAVVDRQLLTNAQVYLKWSGISINELNYQLISTLHKREVICLDHLEPAPVATDDNWVIRRTTTKPVKVLYMSHDVYKTIDSTQAAIFHRDFDIFTHQMSRFTKRPVQSVVIHTSSKDEAESICNTLEHYNISNIKIAEHRVGQFETLAFNRKTCQYESLQINKKRAGIWEQIKSHCYDIKKPYLIEWNDITDECSHEDKYKAIASFSDKHGLDSYQRLIGADKCVFVDYKTSTDTNPDKLDDYYRSRHATETGKEVVYTAVFKLPPTDNNESRTVFAALAVTKDGKSLYINGDKTQIPASDINKALYHIMQAKYGNNTKIIGEVYSKFFSPEKEDVAAYSYKDLFTILPFLLFVLQPGFFTFALSLVIMAFFLFDRQALRNFATTTHQLINTVLEYETAAAMSYCRFSNYRVTKYILATANSVICLYLVLYTYWSCVSNHDLIDTTLPPYHSYVQRILAFLDYIPSVHEYANAVTLRELCGSNIICRLGSPYNKLYLDQYKHYVTNKSHIFRDKLMSMFSFYTPWGMLMYCLDFKPSVHLVCYNILMTAFIAYLSFRRFFTCCKPSGPFCAKHAMLRTNNVQYYCSGKPHNIQAIKVKYCTKHKWWCNTTQEHLLPHPIARVIEASTQMKSNFIKSDTYFKYVSQIPDRELPDNFKDYNHNFVYSTKNLSYLEWTQRAALFAYCSSTKVKISSHDSGSITNQKVEMTQQMLDYFKTNEDWKYNYVAAQQAGRRNILHVGFLESMTEQELSEFYEFCCQYDQSYTKTCIKRDNFLNGKVPDCFKSSRLLSTKYHSDTIYLDSFAVEHHDDIKQQASQHHFLVSQGKPKKYINSKRNIFIALAVLLAITLTRVAYRKVVKINTPAGLNPTGKDYAKGTLYLHENIIATPVAYGNNRRVQAWHFSNGTFAFTEKRSTLIAKTSCGEFQNSFLQIETYHQSCDQYWPLAINFFKLSIYWFQADKAYTTRHGDYQSQEGAICFGLSDSLVCHETMSIYTPAAFVWITSSVVLCFVLSIFCYIKMRGCFGNYTSSVLILMTVHAITVILYLFVPGLAFVFVIALFFIPAYRLVIGTYSFVVCSMFLGLNLVVLFGLYTIVFLTIFWFSRNQTNDVEYTPSGVVFSSNFSGIAKNAFLLTPENIPTIISVTGKNYAQLLEMSTGPQLKPETALANAILKCSITNTKMLYEPPKSTRLPVYLQSKLNRLSDIVVNSAQLTNICGIHDGTGLIGHGIFTTPTTVLTARHCYTTAAQVFYRGRLLQVKDHHEVGFNIIFTVDKQDNIKPIAIDNHHELELGTQYTHVVSPLDDQATVTVHQLYPTPSGHFAHASTIAGESGSPIFYHNKLVGIHQAMVKSKEHTGAHAIACRVDGTPFDSSFHDTLMAAGKIQFDGNALLNYYFTTECRKAISVRDFNNQITEANRLISYYSHISIINDPVLLQAEPRDLKPLIEFLKDSPVIREHHCKPYKMGNDIKLQTRISQRIIVHCTLSNLLSFCMTLTYFLTTAIYGTLTLKNFIDLLLAGFMLTVVFRSRHVVFLLTTGAYFVNMLELFLYVCHTNIQVIRNIFSSNDEIYAHVVQTVVRFGLQDAMLCTVILIVMMLKFILLPLRCVIFTIIYWTLVYCFGVVNFYTIAVFVFSFANTSSWFTCLTLFLQNTIYFPVWFTLNIVLSMRIGLPKWVINRYHQITSDSVRVNRGYFCHHLATYQRPPSFLEVLISQLFYSQDDVIEYIPQSKIVPYSVVGSATHYPNVNAKSLALMSGSDSKTLYAHFISAVEAVLQSTTAADQQAFMEWCATTCDQETLEKWLEDNPEDIQNKLRTKRRNIIHARIMFLKAKEEKLRKQINLMQLEQVRGMMRSELSIKLCDVLNRSVAEMQEKANLRTKTFGKGIIAASTLTVPEVLVVTNTNGRDKISWDDESECFCFEFEEAIYHIAELNTNTGTAIKTEAELNALTATNFPLYGKLHDFSFDGAINQANIGYTIKLHQIKINRSSSGIKIEYAASSDKATKPILEEVQEANDNTVLLVADGILRPFKVNSSVSAPVLAAVMTKLRLDSPELQAIKLGGLENVTEHVATSNQPLRCRGYTTYFAPSLCKYCRTNIEHKCKYQQFVQIPVNEDPDKYLSSHDICQHNKFNCDTCHGKVTIQARPSTTVKPADKLRELRKLAKNSSRPQ

Summary

Function
The papain-like proteinase 1 (PL1-PRO) and papain-like proteinase 2 (PL2-PRO) are responsible for the cleavages located at the N-terminus of the replicase polyprotein. In addition, PLP2 possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. Antagonizes innate immune induction of type I interferon by blocking the phosphorylation, dimerization and subsequent nuclear translocation of host IRF-3 (By similarity).
The main proteinase 3CL-PRO is responsible for the majority of cleavages as it cleaves the C-terminus of replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function (By similarity).
Nsp7-nsp8 hexadecamer may possibly confer processivity to the polymerase, maybe by binding to dsRNA or by producing primers utilized by the latter.
Nsp9 is a ssRNA-binding protein.
binds to the 40S ribosomal subunit and inhibits host translation. The nsp1-40S ribosome complex further induces an endonucleolytic cleavage near the 5'UTR of host mRNAs, targeting them for degradation. By suppressing host gene expression, nsp1 facilitates efficient viral gene expression in infected cells and evasion from host immune response (By similarity).
Catalytic Activity
TSAVLQ-|-SGFRK-NH(2) and SGVTFQ-|-GKFKK the two peptides corresponding to the two self-cleavage sites of the SARS 3C-like proteinase are the two most reactive peptide substrates. The enzyme exhibits a strong preference for substrates containing Gln at P1 position and Leu at P2 position.
Thiol-dependent hydrolysis of ester, thioester, amide, peptide and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-residue protein attached to proteins as an intracellular targeting signal).
Subunit
3CL-PRO exists as monomer and homodimer. Eight copies of nsp7 and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is a dimer. Nsp10 forms a dodecamer (By similarity).
Miscellaneous
Produced by conventional translation.
Similarity
Belongs to the coronaviruses polyprotein 1ab family.
Keywords
Activation of host autophagy by virus   Decay of host mRNAs by virus   Eukaryotic host gene expression shutoff by virus   Eukaryotic host translation shutoff by virus   Host cytoplasm   Host gene expression shutoff by virus   Host membrane   Host mRNA suppression by virus   Host-virus interaction   Hydrolase   Inhibition of host innate immune response by virus   Inhibition of host interferon signaling pathway by virus   Inhibition of host IRF3 by virus   Inhibition of host ISG15 by virus   Inhibition of host RLR pathway by virus   Membrane   Metal-binding   Modulation of host ubiquitin pathway by viral deubiquitinase   Modulation of host ubiquitin pathway by virus   Protease   Repeat   Ribosomal frameshifting   RNA-binding   Thiol protease   Transmembrane   Transmembrane helix   Ubl conjugation pathway   Viral immunoevasion   Zinc   Zinc-finger  
Feature
chain  Replicase polyprotein 1a
Uniprot
Pfam
PF08716   nsp7
PF08717   nsp8
PF11963   DUF3477
PF09401   NSP10
PF08715   Viral_protease
PF01661   Macro
PF16251   NAR
PF16348   Corona_NSP4_C
PF08710   nsp9
PF01831   Peptidase_C16
PF05409   Peptidase_C30
Interpro
IPR002705   Pept_C30/C16_B_coronavir
IPR014829   NSP8
IPR014827   Viral_protease
IPR002589   Macro_dom
IPR036499   NSP9_sf
IPR032505   Corona_NSP4_C
IPR008740   Peptidase_C30
IPR037204   NSP7_sf
IPR032592   NAR_dom
IPR014828   NSP7
IPR036333   NSP10_sf
IPR014822   NSP9
IPR009003   Peptidase_S1_PA
IPR013016   Peptidase_C30/C16
IPR022570   B-CoV_NSP1
IPR037230   NSP8_sf
IPR038123   NSP4_C_sf
IPR018995   RNA_synth_NSP10_coronavirus
IPR038083   R1a/1ab
SUPFAM
SSF159936   SSF159936
SSF144246   SSF144246
SSF50494   SSF50494
SSF101816   SSF101816
SSF143076   SSF143076
SSF140367   SSF140367
ProteinModelPortal
PDB
5YNQ     E-value=0.016     Score=39.3     Identity=36.25%     Cov(Q)=1.36%     Cov(P)=57.14%

Ontologies

Subcellular Location

From MSLVP
Capsid
From Uniprot
Host membrane  
   nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).   With evidence from 1 publications.

Topology

Length:
5865
Number of predicted TMHs:
12
Exp number of AAs in TMHs:
278.91362
Exp number, first 60 AAs:
0
Total prob of N-in:
0.00016
outside
1  -  4049
TMhelix
4050  -  4072
inside
4073  -  4109
TMhelix
4110  -  4132
outside
4133  -  4234
TMhelix
4235  -  4254
inside
4255  -  4533
TMhelix
4534  -  4551
outside
4552  -  4689
TMhelix
4690  -  4712
inside
4713  -  4739
TMhelix
4740  -  4762
outside
4763  -  4776
TMhelix
4777  -  4799
inside
4800  -  5182
TMhelix
5183  -  5205
outside
5206  -  5224
TMhelix
5225  -  5247
inside
5248  -  5280
TMhelix
5281  -  5303
outside
5304  -  5306
TMhelix
5307  -  5329
inside
5330  -  5335
TMhelix
5336  -  5358
outside
5359  -  5865
 
 
Copyright@ 2018-2023    Any Comments and suggestions mail to:  zhuzl@cqu.edu.cn, mg@cau.edu.cn   渝ICP备19006517号

渝公网安备 50010602502065号

In processing...
Login to ASFVdb
Email
Password
Please go to Regist if without an account.
If you have forgotten your password, you can once again Regist an account with a registed or new email.
Change my password
Enter new password
Reenter new password
Regist an account of ASFVdb
It is required that you provide your institutional e-mail address (with edu or org in the domain) as confirmation of your affiliation.
Enter email
Reenter email
First Name
Last Name
Institution
You can directly go to if with an account.
Registraion Success
Your password has been sent to your email.
Please check it and login later.
Welcome to use ASFVdb.