CoVDB Coronavirus Database Go to the lastest version >>
Strain
EUROPE_HS_2000_NC_002645 (Region: Europe;  Strain: Human coronavirus 229E, complete genome.;  Date: 2000)
Gene
replicase polyprotein 1ab
Description
Annotated in NCBI,  replicase polyprotein 1ab
GenBank Accession
Full name
Replicase polyprotein 1ab      
Alternative Name
ORF1ab polyprotein
 

Sequence

CDS
ATGGCCTGCAACCGTGTGACACTTGCCGTAGCAAGTGATTCTGAAATTTCTGCAAATGGCTGTTCTACTATTGCGCAAGCCGTCCGCCGTTATAGCGAGGCCGCTAGCAATGGTTTTAGGGCATGCCGATTTGTTTCATTAGATTTGCAGGATTGCATCGTTGGCATTGCAGACGATACATATGTTATGGGTCTGCATGGCAATCAGACGTTGTTTTGCAACATAATGAAATTTTCTGACCGTCCTTTTATGCTTCATGGGTGGTTGGTTTTTTCCAATTCAAATTACCTTTTGGAGGAATTTGATGTTGTCTTCGGTAAGAGAGGTGGTGGTAATGTGACATACACTGACCAGTATCTCTGTGGCGCCGATGGCAAACCTGTTATGAGTGAAGATTTATGGCAGTTTGTTGACCATTTCGGTGAGAACGAAGAAATTATCATCAATGGTCATACTTACGTTTGTGCTTGGCTTACTAAGCGTAAGCCCTTAGATTACAAACGTCAGAACAACCTTGCCATTGAAGAGATTGAATATGTGCATGGTGATGCTTTGCATACACTACGCAATGGTTCTGTTCTTGAAATGGCTAAGGAAGTGAAGACATCTAGTAAAGTTGTGTTAAGCGATGCTCTTGACAAACTTTACAAAGTCTTTGGTTCTCCTGTTATGACAAATGGTTCCAACATCCTAGAGGCCTTTACTAAACCTGTGTTTATTAGTGCATTAGTTCAATGTACTTGTGGTACCAAGTCTTGGTCTGTTGGTGATTGGACCGGTTTTAAATCCTCTTGTTGCAACGTGATCAGTAATAAACTGTGTGTTGTTCCCGGTAATGTTAAACCTGGTGATGCTGTGATTACCACTCAGCAAGCTGGTGCTGGTATTAAGTATTTTTGTGGCATGACTCTTAAGTTTGTTGCAAATATTGAAGGTGTCTCTGTTTGGAGAGTGATTGCTCTTCAGAGTGTGGATTGCTTTGTTGCTTCTTCCACTTTTGTAGAAGAGGAACATGTTAATAGAATGGATACATTCTGCTTCAATGTACGCAATAGTGTTACTGATGAGTGTCGTCTGGCCATGTTGGGTGCTGAAATGACTAGTAATGTCAGAAGACAAGTTGCTTCAGGTGTCATAGACATTAGTACCGGTTGGTTTGATGTTTATGATGACATCTTTGCTGAAAGCAAACCATGGTTTGTTCGCAAGGCTGAAGACATTTTTGGCCCTTGTTGGTCCGCTCTTGCTTCTGCACTTAAACAACTTAAAGTCACTACAGGTGAACTTGTGAGATTTGTTAAGTCTATTTGCAATTCAGCTGTTGCTGTCGTGGGTGGTACTATACAAATTCTCGCTAGTGTGCCTGAGAAGTTTTTGAATGCGTTTGACGTGTTTGTCACAGCTATTCAAACTGTCTTTGACTGTGCTGTTGAAACTTGTACTATTGCCGGTAAAGCATTTGACAAGGTTTTTGACTATGTTTTGCTTGATAATGCGCTTGTAAAACTTGTCACCACAAAGCTTAAGGGTGTTCGTGAACGTGGCCTTAATAAAGTTAAGTATGCAACAGTTGTTGTTGGTTCCACTGAAGAAGTTAAATCTTCACGTGTTGAACGTAGCACTGCTGTACTTACAATCGCCAACAATTATTCCAAACTTTTTGATGAAGGGTATACTGTTGTAATTGGCGATGTGGCGTACTTTGTTAGTGACGGCTACTTCCGTCTTATGGCCAGTCCAAATAGTGTGTTGACTACTGCAGTCTATAAACCATTGTTTGCTTTTAATGTGAATGTTATGGGTACTAGACCTGAAAAATTTCCAACCACTGTGACTTGTGAAAATTTAGAGTCTGCTGTTTTGTTTGTTAATGACAAAATTACTGAATTCCAATTGGATTACAGTATTGATGTCATTGATAATGAAATAATTGTCAAACCTAATATCAGCCTATGTGTTCCACTTTATGTGAGAGACTATGTTGACAAATGGGATGATTTTTGCAGACAATATAGTAACGAGTCTTGGTTTGAGGATGATTACAGGGCTTTTATCAGTGTTTTGGACATCACTGATGCTGCTGTGAAAGCTGCAGAGTCTAAAGCTTTCGTTGATACTATTGTTCCACCTTGCCCATCTATTTTGAAAGTTATAGATGGAGGCAAAATATGGAATGGTGTTATTAAAAATGTTAACTCTGTTAGAGACTGGCTTAAGTCTTTGAAGTTAAATCTCACACAACAGGGTTTGCTTGGAACATGTGCAAAGCGTTTTAAACGTTGGCTTGGCATTTTGCTAGAGGCCTATAATGCGTTTTTAGACACTGTGGTTTCTACTGTTAAAATTGGTGGCTTGACCTTTAAAACATATGCTTTTGATAAACCTTACATTGTGATACGTGATATCGTGTGTAAGGTTGAAAATAAAACAGAAGCAGAATGGATTGAGCTTTTTCCACATAATGACAGGATTAAGTCTTTTAGTACTTTCGAGAGTGCTTACATGCCAATTGCAGACCCTACACATTTTGACATTGAAGAAGTTGAACTTTTAGATGCAGAGTTTGTAGAACCAGGCTGTGGTGGTATTTTGGCAGTAATAGATGAGCACGTCTTTTATAAGAAGGATGGTGTTTATTATCCATCAAATGGTACTAACATTCTACCTGTTGCATTTACAAAAGCCGCTGGTGGTAAAGTTTCATTTTCTGATGACGTTGAAGTAAAAGACATTGAACCTGTTTACAGAGTCAAGCTTTGCTTTGAGTTTGAAGATGAAAAACTTGTAGATGTTTGTGAAAAGGCAATTGGCAAGAAAATTAAACATGAAGGTGACTGGGATAGCTTTTGTAAGACTATTCAATCAGCACTTTCTGTTGTTTCTTGCTATGTAAATCTACCTACTTATTACATTTATGATGAAGAAGGCGGTAATGACTTGAGTTTGCCCGTTATGATTTCTGAATGGCCTCTTTCTGTTCAACAAGCTCAACAAGAAGCTACTTTACCTGATATTGCTGAGGATGTTGTTGACCAAGTTGAAGAAGTCAATAGCATTTTTGACATTGAGACAGTGGATGTTAAACATGATGTGAGTCCTTTTGAAATGCCATTTGAAGAGTTAAATGGTTTAAAGATACTCAAACAATTGGATAACAACTGCTGGGTTAACTCAGTTATGTTACAAATACAATTAACTGGTATACTTGATGGTGACTATGCTATGCAGTTTTTTAAAATGGGCCGAGTTGCCAAGATGATTGAACGCTGCTACACTGCTGAGCAATGTATACGTGGTGCTATGGGTGATGTTGGTTTGTGTATGTATAGACTGCTTAAAGACTTACACACTGGTTTTATGGTTATGGATTATAAATGTAGTTGTACCAGTGGTAGGCTTGAAGAATCGGGAGCTGTTTTGTTTTGTACGCCCACTAAGAAGGCGTTTCCTTATGGTACTTGTCTAAATTGTAACGCACCTCGCATGTGTACAATTAGGCAGTTACAAGGTACCATAATATTTGTGCAACAAAAACCAGAACCTGTTAATCCTGTTTCTTTTGTTGTTAAACCAGTCTGCTCATCAATTTTTCGTGGTGCTGTGTCTTGTGGTCATTACCAGACTAACATCTATTCACAAAATTTGTGTGTGGATGGTTTTGGTGTTAACAAGATTCAGCCCTGGACAAATGATGCACTTAATACTATTTGTATTAAGGATGCAGATTATAATGCAAAAGTTGAAATATCTGTTACACCAATTAAAAATACAGTTGATACAACACCTAAGGAAGAATTTGTTGTTAAAGAGAAGTTGAACGCCTTCCTCGTTCATGACAATGTAGCTTTCTACCAAGGTGATGTTGATACTGTTGTTAATGGTGTTGACTTTGACTTTATTGTAAATGCTGCTAATGAGAACCTTGCTCATGGTGGAGGACTTGCCAAAGCTTTAGATGTGTACACTAAAGGTAAACTTCAACGTTTATCTAAAGAACACATTGGATTAGCGGGTAAAGTAAAAGTTGGTACAGGAGTTATGGTTGAGTGTGATAGCCTTAGAATTTTTAATGTTGTTGGTCCACGCAAGGGTAAACATGAACGTGATTTACTCATAAAAGCTTACAACACTATTAATAATGAACAAGGCACACCTTTAACACCAATTTTGAGCTGTGGTATTTTTGGTATCAAACTCGAAACTTCATTAGAAGTTTTGCTTGATGTTTGTAATACAAAAGAAGTTAAAGTTTTTGTTTATACAGACACAGAGGTTTGTAAGGTTAAGGATTTTGTGTCTGGTTTAGTGAATGTTCAAAAAGTTGAGCAACCTAAAATAGAACCAAAACCAGTGTCCGTAATTAAAGTTGCACCCAAGCCTTACAGGGTAGATGGTAAATTTAGTTACTTTACAGAAGACTTGTTGTGTGTCGCTGATGACAAACCCATTGTTTTGTTTACTGACTCTATGCTTACTTTGGATGACCGTGGTTTAGCTCTAGACAATGCACTTAGTGGTGTGCTTAGTGCTGCTATTAAGGATTGTGTTGACATAAATAAAGCTATACCTTCTGGTAATCTTATTAAGTTTGATATAGGTTCTGTTGTTGTCTACATGTGTGTTGTGCCATCCGAAAAGGACAAACATTTAGATAATAATGTTCAACGATGCACACGTAAGTTGAATAGACTTATGTGTGATATAGTTTGTACTATACCAGCTGACTACATCTTGCCATTGGTGTTGTCTAGTTTGACTTGTAATGTTTCTTTTGTAGGTGAACTTAAAGCTGCTGAAGCTAAAGTTATAACTATAAAGGTGACAGAGGATGGTGTTAATGTTCATGATGTGACCGTGACAACAGACAAGTCATTTGAACAACAAGTTGGTGTTATTGCTGATAAGGACAAAGATCTTTCTGGTGCAGTACCAAGTGATCTTAACACATCTGAATTGCTTACTAAAGCAATAGATGTTGATTGGGTCGAATTTTATGGCTTTAAAGATGCTGTTACTTTTGCAACAGTTGATCATAGTGCTTTTGCCTATGAAAGTGCTGTTGTTAATGGTATTAGAGTGTTAAAAACTAGTGATAATAATTGTTGGGTGAATGCTGTTTGTATTGCACTACAGTATTCGAAACCCCATTTTATTTCACAAGGTCTTGATGCTGCGTGGAATAAATTTGTTTTAGGCGATGTTGAAATTTTTGTTGCATTTGTTTACTATGTTGCAAGACTAATGAAAGGTGACAAGGGTGATGCTGAAGACACTTTGACTAAGTTGTCTAAGTATCTTGCTAATGAAGCTCAAGTTCAATTAGAACATTATAGTTCTTGTGTTGAATGTGATGCTAAATTTAAAAACTCTGTTGCATCTATCAATTCTGCTATAGTTTGTGCTAGTGTCAAACGTGATGGTGTGCAAGTTGGTTATTGTGTCCATGGTATTAAGTACTATTCACGTGTTAGAAGTGTTAGAGGTAGAGCTATTATAGTCAGTGTCGAACAGCTTGAACCGTGTGCTCAGTCTAGACTTTTGAGTGGTGTTGCTTATACTGCTTTTTCTGGACCTGTTGACAAAGGTCATTATACTGTTTATGATACTGCAAAGAAATCAATGTATGATGGTGATCGTTTTGTTAAACATGATCTTTCTCTGCTGTCTGTCACATCAGTTGTTATGGTTGGTGGTTATGTTGCACCTGTTAATACAGTGAAACCTAAACCAGTCATTAATCAACTTGATGAAAAGGCACAGAAGTTCTTTGATTTTGGTGATTTTTTGATTCATAATTTTGTTATTTTTTTCACATGGTTATTGAGTATGTTTACTTTGTGTAAAACTGCAGTAACTACAGGTGATGTTAAAATAATGGCCAAAGCACCACAAAGGACGGGTGTTGTTTTAAAACGTAGTCTTAAATATAACTTAAAAGCGTCAGCAGCTGTTCTTAAATCTAAGTGGTGGCTGCTTGCTAAGTTTACGAAACTACTGTTACTCATATATACATTGTACTCAGTAGTTTTGCTTTGTGTACGTTTTGGACCGTTTAATTTTTGTAGTGAGACTGTTAATGGTTATGCTAAGTCAAACTTTGTCAAGGATGATTACTGTGATGGTTCATTGGGCTGCAAGATGTGTCTTTTTGGTTACCAAGAGTTAAGTCAATTTAGCCATTTGGATGTTGTGTGGAAGCATATAACAGACCCTTTGTTTAGTAATATGCAACCTTTCATTGTCATGGTTTTGCTGCTTATATTTGGTGACAATTATTTGAGATGCTTCTTGCTGTATTTTGTTGCTCAGATGATAAGCACAGTTGGTGTTTTTCTAGGTTACAAGGAAACAAATTGGTTCTTGCACTTTATTCCATTTGATGTTATTTGTGATGAACTGCTTGTCACTGTTATTGTTATTAAGGTTATTTCTTTTGTCAGACATGTGCTTTTTGGTTGTGAAAACCCAGATTGTATTGCGTGTTCTAAGAGTGCTAGACTTAAGAGATTCCCTGTTAACACAATTGTCAATGGTGTGCAACGTTCATTTTATGTTAATGCAAATGGTGGTAGTAAGTTTTGTAAGAAACATAGATTTTTCTGTGTTGATTGTGACTCTTATGGTTATGGCAGCACGTTTATAACACCCGAAGTTTCTAGAGAACTTGGTAACATTACCAAAACAAATGTGCAACCAACAGGGCCGGCCTATGTCATGATTGACAAAGTGGAGTTTGAAAATGGTTTTTACAGATTGTATTCCTGTGAAACATTTTGGCGTTACAACTTTGATATAACTGAAAGCAAGTATTCTTGCAAAGAGGTTTTTAAAAATTGTAATGTTTTGGATGATTTCATCGTGTTTAACAATAATGGGACCAATGTAACGCAGGTTAAAAATGCTAGTGTTTACTTTTCACAGTTGTTGTGTAGGCCCATTAAATTAGTTGACAGTGAACTTTTGTCCACTTTGTCAGTTGATTTTAATGGTGTCTTACACAAGGCATACATTGATGTACTACGTAATAGCTTTGGTAAAGATCTTAATGCTAATATGTCTTTAGCCGAGTGCAAGAGAGCTTTAGGCCTGTCTATTAGTGATCATGAATTTACTAGTGCTATTTCTAATGCACATCGTTGTGACGTGTTGTTATCTGATTTGTCATTTAACAACTTTGTCAGTTCGTATGCTAAACCTGAGGAAAAATTATCAGCTTATGACTTGGCGTGTTGTATGCGTGCAGGTGCTAAGGTTGTTAATGCCAATGTTCTGACAAAGGACCAAACTCCTATTGTTTGGCATGCAAAGGATTTTAACAGTCTTTCTGCTGAAGGTCGCAAGTATATTGTAAAAACTAGCAAAGCTAAGGGTTTGACTTTCTTGTTGACAATTAATGAAAACCAAGCTGTCACGCAAATACCTGCAACTAGCATTGTTGCTAAGCAAGGTGCTGGTGATGCTGGCCATTCATTAACATGGCTGTGGCTACTGTGTGGTCTTGTGTGTTTGATTCAATTCTACTTGTGCTTTTTCATGCCCTATTTTATGTACGATATCGTGAGTAGTTTTGAGGGTTATGATTTTAAGTATATAGAAAATGGTCAGTTGAAGAATTTTGAAGCGCCACTTAAATGCGTCAGAAACGTTTTTGAAAACTTTGAGGACTGGCATTATGCTAAGTTTGGCTTCACACCTTTAAACAAGCAAAGCTGTCCTATTGTAGTTGGAGTTTCTGAAATTGTTAATACTGTCGCTGGCATTCCATCTAATGTGTATCTTGTTGGTAAAACTTTAATTTTTACACTACAAGCTGCTTTTGGTAATGCTGGTGTTTGTTATGACATTTTTGGAGTCACAACACCTGAAAAGTGCATTTTTACTTCTGCTTGTACTAGATTAGAAGGTTTGGGTGGTAACAATGTTTATTGTTATAACACAGCGCTTATGGAAGGTTCTTTGCCTTACAGTTCAATACAAGCTAATGCATATTATAAATATGACAATGGCAATTTTATTAAGTTGCCAGAAGTTATTGCACAAGGCTTTGGTTTTAGAACAGTGCGTACTATTGCCACCAAATACTGCCGCGTAGGTGAATGTGTTGAATCCAATGCAGGTGTGTGTTTTGGCTTTGACAAGTGGTTTGTTAACGATGGACGTGTTGCCAATGGTTACGTTTGTGGTACTGGTTTGTGGAACCTTGTATTTAACATACTTTCCATGTTTTCATCTTCATTCTCTGTTGCTGCAATGTCAGGTCAAATTTTACTTAATTGTGCATTAGGTGCTTTTGCTATTTTTTGTTGTTTTCTTGTGACAAAGTTTAGACGCATGTTTGGTGACCTTTCTGTAGGTGTTTGCACTGTTGTTGTGGCTGTTTTGCTTAACAATGTCTCTTACATTGTAACTCAGAATTTAGTAACAATGATTGCTTATGCCATATTGTATTTCTTTGCTACTAGAAGCTTACGCTATGCATGGATTTGGTGTGCTGCATATTTAATTGCGTATATTTCTTTTGCTCCATGGTGGTTGTGTGCTTGGTACTTTCTTGCTATGTTGACAGGTTTGTTACCTAGTTTGCTGAAGCTTAAAGTTTCGACAAATCTTTTCGAAGGTGACAAATTTGTAGGTACATTTGAAAGTGCTGCTGCAGGAACATTTGTCATTGACATGCGTTCTTATGAGAAACTTGCTAATAGCATCTCTCCAGAAAAGTTGAAAAGTTATGCTGCTAGCTATAATAGATATAAGTACTATAGTGGTAATGCAAATGAAGCTGATTACCGTTGCGCTTGTTATGCCTATTTAGCAAAAGCAATGTTGGACTTTTCGCGTGATCATAATGACATCTTGTACACACCTCCGACTGTCAGTTATGGTTCTACATTACAGGCTGGTTTGCGCAAAATGGCACAACCATCTGGCTTTGTGGAGAAATGTGTTGTCCGTGTCTGCTATGGAAACACTGTGTTGAATGGGTTGTGGCTTGGTGATATTGTTTATTGCCCACGTCATGTTATCGCATCTAACACAACTTCTGCTATAGATTATGATCACGAATATAGTATTATGCGGTTGCATAATTTTTCTATAATATCTGGTACAGCATTTCTTGGTGTTGTAGGTGCTACTATGCATGGAGTAACTCTTAAAATTAAGGTTTCACAGACTAACATGCACACACCTAGACATTCTTTTAGAACACTAAAATCTGGTGAAGGTTTTAACATCTTAGCATGCTATGATGGTTGTGCTCAAGGTGTTTTTGGTGTGAACATGAGAACTAATTGGACTATCCGTGGTTCATTTATTAATGGTGCGTGTGGTTCCCCTGGCTACAATCTTAAAAATGGCGAGGTGGAATTTGTTTATATGCATCAAATTGAACTCGGAAGTGGTAGCCATGTAGGTTCTAGCTTTGATGGTGTTATGTATGGTGGTTTTGAAGACCAACCTAATCTTCAAGTTGAATCTGCAAACCAGATGTTAACAGTTAATGTGGTTGCATTTCTTTATGCTGCTATATTGAATGGTTGCACATGGTGGCTTAAAGGTGAAAAATTGTTTGTGGAGCATTATAATGAGTGGGCACAGGCTAATGGTTTCACAGCTATGAATGGTGAAGACGCTTTTTCCATTCTTGCTGCTAAAACTGGTGTCTGTGTGGAAAGATTACTTCATGCTATTCAAGTTTTGAATAATGGCTTTGGTGGTAAACAAATTTTGGGTTATTCTAGTCTCAATGATGAGTTCAGTATTAATGAAGTTGTCAAACAAATGTTTGGTGTTAACCTGCAAAGTGGTAAAACCACTAGTATGTTTAAATCCATAAGCTTATTTGCTGGCTTCTTTGTCATGTTCTGGGCTGAATTATTTGTTTATACCACCACTATTTGGGTTAACCCTGGTTTTCTTACTCCGTTTATGATTTTGCTTGTTGCTTTGTCACTCTGTCTTACATTTGTTGTTAAACATAAGGTTTTGTTTTTGCAAGTGTTTTTGTTGCCTTCAATTATTGTGGCTGCTATTCAAAACTGTGCTTGGGACTACCATGTTACAAAGGTGTTGGCAGAGAAGTTTGATTATAATGTTTCTGTTATGCAAATGGACATCCAGGGTTTTGTTAACATTTTTATTTGTCTTTTTGTTGCACTGTTGCATACTTGGCGCTTTGCTAAAGAGCGTTGTACACATTGGTGCACTTATTTGTTCTCACTCATTGCTGTTTTATACACTGCATTGTATAGTTATGACTACGTTAGTTTGCTGGTTATGCTACTTTGTGCAATTTCTAATGAATGGTATATTGGTGCTATTATTTTTAGAATTTGTCGTTTTGGTGTTGCATTTTTACCAGTGGAATACGTGTCTTACTTTGATGGTGTTAAAACTGTGCTGTTGTTTTACATGTTGTTAGGCTTTGTTAGCTGTATGTACTATGGTTTGTTGTACTGGATTAACAGGTTCTGTAAGTGCACATTAGGTGTTTATGATTTCTGTGTTAGTCCAGCCGAATTTAAGTATATGGTTGCTAATGGTTTGAATGCACCAAATGGCCCTTTTGATGCGCTCTTTCTGTCTTTTAAACTAATGGGTATTGGCGGTCCTAGAACCATTAAAGTTTCTACTGTACAGTCTAAATTGACTGATCTTAAGTGCACAAACGTCGTTCTAATGGGCATTTTGTCTAACATGAACATAGCTTCTAATTCAAAGGAGTGGGCATATTGTGTTGAAATGCACAATAAAATAAACTTGTGTGACGACCCTGAAACTGCTCAAGAGTTATTGCTGGCGTTGTTGGCCTTTTTCTTGTCTAAGCATAGTGATTTTGGTCTTGGTGATCTTGTCGATTCTTATTTTGAGAACGACTCCATTTTGCAAAGTGTTGCATCTTCTTTTGTTGGTATGCCATCTTTTGTTGCATATGAAACAGCAAGACAAGAGTATGAAAATGCTGTTGCAAATGGTTCCTCACCACAAATAATCAAACAATTGAAGAAGGCTATGAATGTTGCAAAAGCTGAGTTTGACAGGGAATCATCTGTTCAAAAGAAAATTAACAGAATGGCTGAACAAGCTGCTGCAGCTATGTACAAAGAAGCACGTGCTGTTAATAGAAAATCAAAAGTTGTTAGTGCCATGCATAGTTTACTCTTTGGCATGCTCCGACGTTTGGACATGTCTAGTGTTGACACTATCCTTAATATGGCACGTAATGGTGTTGTCCCTCTTTCCGTTATCCCTGCTACTTCTGCAGCCAGGCTCGTCGTCGTAGTACCAGATCATGATTCATTTGTGAAAATGATGGTAGATGGTTTTGTGCACTACGCTGGTGTTGTTTGGACATTACAGGAAGTTAAGGATAATGATGGTAAGAATGTGCATCTTAAAGATGTTACAAAGGAAAACCAGGAAATACTTGTTTGGCCTCTGATTTTGACTTGTGAACGTGTCGTTAAATTGCAGAACAATGAAATAATGCCGGGCAAGATGAAGGTCAAGGCCACCAAAGGTGAAGGTGATGGAGGCATTACTAGTGAAGGTAATGCTCTATACAACAATGAAGGTGGACGTGCATTCATGTATGCATATGTGACTACGAAGCCTGGCATGAAGTATGTTAAATGGGAACATGACTCTGGTGTGGTTACAGTTGAATTGGAACCACCTTGCAGATTTGTTATAGACACACCTACTGGACCCCAAATTAAGTATCTTTATTTTGTTAAGAATCTTAACAATTTAAGGAGAGGTGCTGTTTTGGGTTACATTGGTGCCACTGTGAGATTGCAAGCTGGCAAACAGACTGAGTTTGTTTCAAACTCCCATTTATTAACACATTGTTCTTTTGCTGTTGACCCAGCTGCAGCCTATCTTGATGCTGTTAAACAAGGCGCAAAACCTGTTGGCAATTGTGTAAAGATGTTGACTAATGGTTCTGGTAGCGGTCAGGCTATTACTTGTACCATTGATTCCAACACTACGCAGGACACATATGGTGGCGCGTCTGTTTGTATTTATTGCAGAGCACATGTTGCACATCCAACCATGGACGGTTTTTGTCAGTACAAAGGCAAGTGGGTACAAGTGCCTATAGGTACAAATGACCCTATAAGATTTTGTCTTGAAAATACTGTTTGTAAAGTTTGTGGTTGTTGGCTTAATCATGGCTGTACATGTGACCGGACTGCTATCCAAAGTTTTGATAACAGTTATTTAAACGAGTCCGGGGCTCTAGTGCCGCTCGACTAGAGCCCTGTAATGGTACAGACATAGATTACTGTGTCCGTGCATTTGACGTTTACAATAAAGATGCGTCTTTTATCGGAAAAAATCTGAAGTCCAATTGTGTGCGCTTCAAGAATGTAGATAAGGATGACGCGTTCTATATTGTTAAACGTTGCATTAAGTCAGTTATGGACCACGAGCAGTCCATGTATAACTTACTTAAAGGCTGTAATGCTGTTGCTAAGCATGATTTCTTTACTTGGCATGAGGGCAGAACCATTTATGGTAATGTTAGTAGACAGGATCTTACTAAATACACCATGATGGATTTGTGCTTCGCTCTGCGTAACTTTGATGAAAAAGACTGTGAAGTTTTTAAGGAGATATTGGTTCTTACTGGTTGTTGTAGTACTGATTACTTTGAAATGAAGAATTGGTTTGACCCCATAGAAAATGAGGACATACACCGTGTGTATGCTGCTTTAGGTAAGGTAGTTGCAAATGCAATGCTTAAGTGTGTTGCTTTTTGCGACGAAATGGTGCTCAAAGGAGTTGTTGGTGTTTTGACCTTAGACAACCAAGATCTTAATGGGAATTTCTATGACTTCGGTGACTTTGTATTGTGTCCTCCTGGAATGGGAATACCCTACTGCACGTCATACTATTCTTATATGATGCCTGTTATGGGTATGACTAATTGTTTAGCTAGTGAGTGCTTTATGAAAAGTGACATCTTTGGTCAAGACTTCAAAACTTTTGATTTGTTGAAATATGATTTCACAGAACATAAGGAGGTTTTGTTTAACAAGTACTTTAAGTATTGGGGACAGGATTATCATCCTGATTGTGTTGATTGCCATGACGAGATGTGTATTTTGCATTGTTCAAATTTTAACACACTCTTCGCAACCACAATTCCAAACACGGCTTTTGGACCTCTATGCAGAAAAGTGTTTATTGATGGTGTACCCGTAGTTGCTACTGCTGGTTACCACTTTAAACAATTAGGACTTGTGTGGAACAAAGATGTTAACACTCATTCTACCAGACTTACTATTACTGAACTCTTACAGTTTGTGACAGATCCAACGCTTATAGTTGCGTCATCGCCTGCCTTGGTGGATAAACGCACTGTTTGTTTTTCTGTCGCTGCTTTGAGTACAGGATTAACATCCCAAACAGTAAAACCTGGCCATTTTAATAAGGAGTTTTATGACTTCTTACGTTCTCAGGGGTTTTTCGATGAGGGTTCAGAATTAACATTGAAGCATTTCTTTTTTACACAAAAGGGTGATGCTGCAATTAAAGATTTTGATTATTATCGTTACAACAGACCTACTATGCTGGATATTGGACAAGCTCGCGTAGCATATCAAGTGGCAGCTCGCTATTTTGACTGTTACGAGGGTGGCTGTATTACATCTAGAGAGGTTGTTGTTACAAACCTTAATAAAAGCGCTGGTTGGCCCCTTAATAAGTTTGGTAAAGCTGGTTTATATTATGAGTCTATTAGTTATGAGGAACAAGATGCTATTTTTTCATTAACAAAGCGTAATATTCTCCCTACTATGACTCAGTTAAATCTTAAATACGCCATATCTGGTAAGGAACGCGCACGTACAGTGGGTGGCGTCTCTTTATTAGCTACTATGACTACAAGACAGTTTCATCAGAAATGTCTGAAATCCATAGTAGCTACCAGAAATGCCACCGTTGTTATCGGCACTACCAAGTTTTATGGCGGGTGGGATAATATGTTAAAGAACCTGATGGCCGATGTTGATGATCCTAAATTGATGGGATGGGACTATCCTAAGTGTGATAGAGCTATGCCCTCAATGATTCGTATGTTGTCGGCTATGATCTTAGGTTCTAAGCATGTCACATGTTGTACGGCTAGTGATAAATTTTATAGACTTAGTAATGAGCTTGCTCAAGTTTTGACCGAGGTTGTTTATTCAAATGGTGGGTTTTATTTTAAACCTGGTGGTACAACTTCTGGTGATGCAACTACAGCCTACGCCAATTCTGTCTTTAATATATTTCAGGCTGTAAGTTCTAACATTAATTGCGTTTTGAGCGTTAACTCGTCAAATTGCAATAATTTTAATGTTAAGAAGTTACAGAGACAACTTTATGATAATTGCTATAGAAATAGTAATGTTGATGAATCTTTTGTGGATGACTTTTATGGTTATTTGCAAAAGCATTTTTCTATGATGATTCTTTCTGATGATAGTGTTGTGTGCTATAATAAAACTTATGCTGGACTTGGTTACATTGCTGATATTAGTGCTTTTAAAGCCACTTTGTATTATCAGAATGGTGTGTTTATGAGTACAGCTAAGTGTTGGACTGAGGAAGATCTTTCTATAGGACCTCATGAATTTTGCTCACAGCACACTATGCAGATTGTAGATGAAAATGGTAAGTATTATCTACCATATCCAGATCCTAGCCGTATTATTTCTGCTGGTGTTTTTGTGGATGACATCACTAAGACTGATGCTGTCATTCTTTTGGAACGCTATGTTTCTCTGGCTATAGATGCCTACCCATTGTCTAAGCATCCTAAACCTGAGTACAGGAAGGTGTTTTACGCATTGTTAGACTGGGTCAAACATCTCAACAAGACTCTTAACGAAGGTGTTTTGGAGTCTTTTTCTGTTACACTTTTAGATGAACATGAGTCTAAGTTTTGGGATGAAAGCTTTTATGCTAGTATGTATGAGAAGTCTACAGTATTACAAGCTGCTGGTCTTTGTGTAGTATGTGGTTCTCAAACAGTTCTAAGATGCGGTGATTGTTTACGCAGACCGATGTTGTGCACTAAGTGCGCCTATGATCATGTGTTTGGCACTGATCATAAGTTCATTTTAGCTATTACACCATATGTGTGTAACACATCTGGCTGCAATGTAAATGACGTTACAAAACTGTATCTTGGAGGTTTGAATTATTACTGTGTAGACCACAAACCACATCTTTCATTCCCACTGTGTTCAGCTGGTAATGTCTTTGGTTTGTACAAAAGTTCTGCTTTGGGTTCCATGGACATTGATGTCTTTAACAAACTTTCTACCTCTGATTGGTCTGACATTCGCGACTACAAGCTTGCTAATGATGCAAAAGAGTCACTAAGGTTGTTTGCAGCTGAAACGGTCAAGGCTAAAGAGGAAAGTGTTAAGTCATCATACGCTTATGCTACCCTAAAGGAGATTGTAGGTCCTAAGGAACTTTTGCTCTTATGGGAAAGTGGAAAAGCCAAACCACCGTTAAACCGTAATTCTGTTTTTACATGCTTCCAAATTACAAAAGACTCCAAGTTTCAAGTTGGTGAGTTTGTGTTTGAGAAAGTAGATTACGGTTCTGATACGGTTACTTACAAATCCACTGCTACTACTAAGTTAGTACCAGGTATGTTGTTTATTTTGACTTCTCATAATGTTGCTCCACTTAGAGCGCCAACAATGGCAAACCAGGAGAAATATTCTACCATTTACAAGTTGCACCCATCATTTAATGTTAGTGATGCTTATGCAAATCTTGTACCTTATTACCAACTTATTGGCAAACAGCGTATAACCACAATACAGGGTCCTCCTGGTAGTGGAAAATCGCATTGTTCTATTGGTATTGGTGTGTATTACCCTGGAGCGAGGATCGTGTTCACCGCTTGTTCTCACGCTGCTGTTGATTCGCTCTGTGCAAAAGCTGTCACAGCCTATAGTGTTGATAAGTGTACACGTATTATTCCTGCACGTGCCAGAGTTGAGTGTTATAGTGGTTTTAAACCTAACAATAATAGTGCACAATACGTGTTTAGTACTGTTAATGCGTTACCTGAAGTTAATGCAGACATTGTTGTCGTGGATGAGGTGTCTATGTGCACTAACTATGACTTGTCTGTGATTAACCAGCGTATATCATATAAACACATTGTATATGTTGGTGATCCTCAACAGCTTCCAGCTCCTAGAGTTCTTATCTCTAAAGGTGTTATGGAACCAATTGACTATAATGTTGTGACACAACGTATGTGTGCTATAGGACCCGATGTCTTTTTACACAAGTGTTACAGATGTCCTGCTGAAATAGTTAACACTGTTTCAGAGCTTGTTTATGAAAACAAGTTTGTACCTGTCAAAGAAGCTAGTAAGCAGTGCTTCAAAATCTTTGAACGCGGTAGTGTTCAGGTAGACAATGGCTCCAGTATAAATAGGCGTCAACTTGATGTTGTTAAGCGATTTATACATAAAAACTCCACATGGAGCAAGGCTGTGTTTATCTCACCTTACAATAGTCAAAATTATGTAGCTGCCAGGCTTTTAGGCTTACAAACTCAGACAGTGGATTCTGCTCAAGGTAGTGAATATGACTATGTTATATTCGCACAGACATCAGATACTGCTCATGCCTGTAATGCCAATCGTTTTAACGTTGCCATTACTAGAGCAAAGAAAGGTATTTTCTGTATTATGTCTGACAGAACTTTGTTTGATGCACTTAAGTTCTTTGAAATCACTATGACAGATTTACAGTCTGAAAGTAGTTGTGGTTTGTTTAAGGATTGTGCACGTAACCCTATTGATTTACCACCAAGTCATGCCACTACTTATTTGTCATTGTCTGATAGATTTAAGACTAGTGGTGACTTGGCTGTTCAAATAGGTAACAACAATGTTTGTACCTATGAACATGTGATTTCATATATGGGTTTCAGGTTTGATGTTAGCATGCCTGGTAGTCATAGTTTGTTCTGTACTAGAGACTTTGCCATGCGTCATGTCAGAGGTTGGTTAGGAATGGATGTGGAAGGTGCACATGTCACAGGTGACAATGTTGGCACTAATGTACCTCTACAAGTTGGTTTTTCCAATGGTGTTGATTTTGTAGCTCAACCTGAAGGTTGTGTTCTAACAAACACTGGCAGTGTTGTAAAACCTGTTCGTGCTCGTGCACCACCTGGAGAACAATTCACTCACATTGTACCTCTGTTACGCAAGGGACAACCTTGGAGTGTGTTGAGAAAACGTATTGTTCAAATGATAGCAGATTTTCTTGCTGGCTCATCTGATGTACTGGTGTTTGTACTTTGGGCTGGCGGTTTAGAGTTGACCACTATGCGTTATTTTGTTAAGATTGGAGCTGTTAAACATTGCCAATGTGGTACTGTTGCAACATGCTACAATTCTGTTAGTAATGACTATTGTTGCTTTAAACATGCATTGGGCTGTGACTATGTTTATAATCCATATGTCATAGATATTCAACAATGGGGTTATGTTGGTTCACTCTCCACTAATCACCATGCAATTTGTAATGTTCATAGAAATGAGCATGTTGCTTCTGGTGATGCTATTATGACTAGATGTTTGGCTGTGTATGACTGCTTTGTTAAGAATGTGGATTGGTCAATTACCTACCCTATGATAGCTAATGAAAATGCCATAAACAAGGGCGGTCGCACTGTGCAGAGTCATATTATGCGTGCTGCTATTAAATTGTACAACCCTAAAGCAATCCATGACATTGGTAATCCTAAGGGTATTCGTTGTGCTGTAACTGATGCCAAGTGGTATTGTTATGACAAGAACCCTATTAATTCTAATGTGAAAACATTGGAGTATGATTACATGACACATGGCCAAATGGATGGCTTGTGTTTGTTTTGGAATTGTAATGTGGATATGTACCCTGAATTCTCAATTGTTTGCAGGTTTGACACACGTACACGATCTACATTGAACCTTGAAGGTGTAAATGGTGGGTCATTGTATGTCAATAATCATGCATTTCACACTCCTGCTTATGATAAACGTGCTATGGCTAAATTGAAACCAGCACCGTTTTTCTACTATGACGACGGTTCATGTGAGGTTGTTCACGATCAAGTTAACTATGTTCCTTTGAGAGCCACTAATTGCATTACCAAGTGTAATATTGGTGGTGCTGTATGTTCTAAGCACGCTAATCTCTATAGAGCATATGTTGAGTCATATAACATTTTTACTCAAGCTGGTTTTAATATTTGGGTTCCTACCACGTTTGATTGTTATAATTTGTGGCAGACATTCACAGAGGTCAATTTACAAGGTTTAGAGAACATTGCTTTTAACGTTGTTAATAAAGGTTCATTTGTTGGTGCTGATGGTGAATTACCAGTAGCCATTAGTGGTGATAAAGTGTTCGTACGTGATGGTAACACTGATAATTTAGTCTTTGTTAACAAAACATCACTGCCTACAAACATAGCATTTGAACTTTTTGCTAAGAGGAAGGTTGGTTTAACACCACCTCTCAGTATTCTCAAAAACCTTGGTGTTGTCGCCACATATAAGTTTGTCTTGTGGGATTATGAAGCTGAGCGTCCCTTGACAAGCTTTACTAAGTCTGTTTGTGGTTATACAGACTTTGCAGAGGATGTTTGTACTTGTTACGATAATAGTATACAAGGTTCATACGAACGTTTTACTCTGTCAACTAATGCTGTGTTATTCTCTGCTACTGCTGTGAAAACAGGTGGTAAGAGTTTGCCGGCTATTAAATTGAATTTTGGAATGCTTAATGGTAATGCAATTGCTACTGTCAAATCAGAAGATGGTAACATAAAAAATATTAACTGGTTTGTTTACGTACGCAAAGATGGCAAACCTGTTGATCATTATGATGGTTTTTATACCCAAGGTCGTAATTTACAAGACTTTTTGCCTCGCAGCACAATGGAAGAAGACTTTTTGAACATGGATATAGGCGTGTTTATTCAAAAGTATGGTCTAGAGGATTTCAACTTCGAGCACGTTGTGTATGGTGATGTTTCAAAAACTACTCTAGGCGGTTTACACTTGTTGATTTCACAAGTACGTCTGAGTAAAATGGGCATCTTAAAGGCAGAGGAGTTTGTGGCAGCATCTGACATAACACTCAAATGTTGTACTGTGACTTATCTTAATGATCCTAGTTCTAAGACTGTTTGTACTTACATGGATTTGTTGTTGGATGATTTTGTTTCTGTATTGAAGTCTTTGGATTTGACTGTTGTATCCAAGGTTCATGAGGTCATAATTGACAACAAACCATGGAGATGGATGCTATGGTGTAAAGATAATGCCGTTGCTACATTCTATCCTCAGTTGCAGAGTGCAGAATGGAAATGCGGGTATTCTATGCCTGGTATTTATAAGACACAACGTATGTGCTTAGAACCATGTAATTTGTATAATTATGGTGCAGGTTTGAAGTTGCCCAGTGGCATTATGTTCAATGTTGTTAAATACACTCAATTGTGTCAATATTTTAACAGTACCACGTTATGTGTTCCTCATAATATGAGAGTGTTACACTTGGGTGCTGGCTCTGATTATGGTGTTGCACCAGGAACTGCTGTTCTTAAAAGGTGGTTGCCGCACGACGCAATTGTTGTTGACAACGATGTTGTTGACTATGTGAGTGACGCTGATTTTAGTGTTACTGGTGATTGTGCAACCGTTTATTTGGAAGACAAGTTTGACTTGTTAATCTCTGATATGTACGATGGTAGGACAAAGGCAATTGATGGTGAAAATGTTTCGAAAGAAGGATTTTTCACTTACATCAATGGTTTCATTTGTGAAAAACTTGCCATCGGAGGTTCGATTGCTATTAAAGTAACAGAGTATAGCTGGAATAAGAAATTGTATGAACTTGTACAAAGATTTTCTTTTTGGACTATGTTTTGCACTTCTGTTAATACGTCATCATCAGAAGCCTTTGTTGTCGGAATTAACTATCTTGGTGATTTCGCACAAGGACCTTTTATAGATGGTAACATAATACACGCAAATTATGTATTTTGGCGTAACTCCACTGTTATGAGTTTGTCCTACAACTCTGTTTTAGACCTGAGTAAATTTAATTGCAAACACAAAGCGACTGTTGTTGTGCAATTAAAGGATAGTGATATTAATGAAATGGTGCTTAGTCTTGTTAGGAGTGGTAAGTTGCTTGTAAGGGGTAATGGCAAGTGTTTGAGTTTTAGTAATCATTTAGTCTCAACTAAATAA
Protein
MACNRVTLAVASDSEISANGCSTIAQAVRRYSEAASNGFRACRFVSLDLQDCIVGIADDTYVMGLHGNQTLFCNIMKFSDRPFMLHGWLVFSNSNYLLEEFDVVFGKRGGGNVTYTDQYLCGADGKPVMSEDLWQFVDHFGENEEIIINGHTYVCAWLTKRKPLDYKRQNNLAIEEIEYVHGDALHTLRNGSVLEMAKEVKTSSKVVLSDALDKLYKVFGSPVMTNGSNILEAFTKPVFISALVQCTCGTKSWSVGDWTGFKSSCCNVISNKLCVVPGNVKPGDAVITTQQAGAGIKYFCGMTLKFVANIEGVSVWRVIALQSVDCFVASSTFVEEEHVNRMDTFCFNVRNSVTDECRLAMLGAEMTSNVRRQVASGVIDISTGWFDVYDDIFAESKPWFVRKAEDIFGPCWSALASALKQLKVTTGELVRFVKSICNSAVAVVGGTIQILASVPEKFLNAFDVFVTAIQTVFDCAVETCTIAGKAFDKVFDYVLLDNALVKLVTTKLKGVRERGLNKVKYATVVVGSTEEVKSSRVERSTAVLTIANNYSKLFDEGYTVVIGDVAYFVSDGYFRLMASPNSVLTTAVYKPLFAFNVNVMGTRPEKFPTTVTCENLESAVLFVNDKITEFQLDYSIDVIDNEIIVKPNISLCVPLYVRDYVDKWDDFCRQYSNESWFEDDYRAFISVLDITDAAVKAAESKAFVDTIVPPCPSILKVIDGGKIWNGVIKNVNSVRDWLKSLKLNLTQQGLLGTCAKRFKRWLGILLEAYNAFLDTVVSTVKIGGLTFKTYAFDKPYIVIRDIVCKVENKTEAEWIELFPHNDRIKSFSTFESAYMPIADPTHFDIEEVELLDAEFVEPGCGGILAVIDEHVFYKKDGVYYPSNGTNILPVAFTKAAGGKVSFSDDVEVKDIEPVYRVKLCFEFEDEKLVDVCEKAIGKKIKHEGDWDSFCKTIQSALSVVSCYVNLPTYYIYDEEGGNDLSLPVMISEWPLSVQQAQQEATLPDIAEDVVDQVEEVNSIFDIETVDVKHDVSPFEMPFEELNGLKILKQLDNNCWVNSVMLQIQLTGILDGDYAMQFFKMGRVAKMIERCYTAEQCIRGAMGDVGLCMYRLLKDLHTGFMVMDYKCSCTSGRLEESGAVLFCTPTKKAFPYGTCLNCNAPRMCTIRQLQGTIIFVQQKPEPVNPVSFVVKPVCSSIFRGAVSCGHYQTNIYSQNLCVDGFGVNKIQPWTNDALNTICIKDADYNAKVEISVTPIKNTVDTTPKEEFVVKEKLNAFLVHDNVAFYQGDVDTVVNGVDFDFIVNAANENLAHGGGLAKALDVYTKGKLQRLSKEHIGLAGKVKVGTGVMVECDSLRIFNVVGPRKGKHERDLLIKAYNTINNEQGTPLTPILSCGIFGIKLETSLEVLLDVCNTKEVKVFVYTDTEVCKVKDFVSGLVNVQKVEQPKIEPKPVSVIKVAPKPYRVDGKFSYFTEDLLCVADDKPIVLFTDSMLTLDDRGLALDNALSGVLSAAIKDCVDINKAIPSGNLIKFDIGSVVVYMCVVPSEKDKHLDNNVQRCTRKLNRLMCDIVCTIPADYILPLVLSSLTCNVSFVGELKAAEAKVITIKVTEDGVNVHDVTVTTDKSFEQQVGVIADKDKDLSGAVPSDLNTSELLTKAIDVDWVEFYGFKDAVTFATVDHSAFAYESAVVNGIRVLKTSDNNCWVNAVCIALQYSKPHFISQGLDAAWNKFVLGDVEIFVAFVYYVARLMKGDKGDAEDTLTKLSKYLANEAQVQLEHYSSCVECDAKFKNSVASINSAIVCASVKRDGVQVGYCVHGIKYYSRVRSVRGRAIIVSVEQLEPCAQSRLLSGVAYTAFSGPVDKGHYTVYDTAKKSMYDGDRFVKHDLSLLSVTSVVMVGGYVAPVNTVKPKPVINQLDEKAQKFFDFGDFLIHNFVIFFTWLLSMFTLCKTAVTTGDVKIMAKAPQRTGVVLKRSLKYNLKASAAVLKSKWWLLAKFTKLLLLIYTLYSVVLLCVRFGPFNFCSETVNGYAKSNFVKDDYCDGSLGCKMCLFGYQELSQFSHLDVVWKHITDPLFSNMQPFIVMVLLLIFGDNYLRCFLLYFVAQMISTVGVFLGYKETNWFLHFIPFDVICDELLVTVIVIKVISFVRHVLFGCENPDCIACSKSARLKRFPVNTIVNGVQRSFYVNANGGSKFCKKHRFFCVDCDSYGYGSTFITPEVSRELGNITKTNVQPTGPAYVMIDKVEFENGFYRLYSCETFWRYNFDITESKYSCKEVFKNCNVLDDFIVFNNNGTNVTQVKNASVYFSQLLCRPIKLVDSELLSTLSVDFNGVLHKAYIDVLRNSFGKDLNANMSLAECKRALGLSISDHEFTSAISNAHRCDVLLSDLSFNNFVSSYAKPEEKLSAYDLACCMRAGAKVVNANVLTKDQTPIVWHAKDFNSLSAEGRKYIVKTSKAKGLTFLLTINENQAVTQIPATSIVAKQGAGDAGHSLTWLWLLCGLVCLIQFYLCFFMPYFMYDIVSSFEGYDFKYIENGQLKNFEAPLKCVRNVFENFEDWHYAKFGFTPLNKQSCPIVVGVSEIVNTVAGIPSNVYLVGKTLIFTLQAAFGNAGVCYDIFGVTTPEKCIFTSACTRLEGLGGNNVYCYNTALMEGSLPYSSIQANAYYKYDNGNFIKLPEVIAQGFGFRTVRTIATKYCRVGECVESNAGVCFGFDKWFVNDGRVANGYVCGTGLWNLVFNILSMFSSSFSVAAMSGQILLNCALGAFAIFCCFLVTKFRRMFGDLSVGVCTVVVAVLLNNVSYIVTQNLVTMIAYAILYFFATRSLRYAWIWCAAYLIAYISFAPWWLCAWYFLAMLTGLLPSLLKLKVSTNLFEGDKFVGTFESAAAGTFVIDMRSYEKLANSISPEKLKSYAASYNRYKYYSGNANEADYRCACYAYLAKAMLDFSRDHNDILYTPPTVSYGSTLQAGLRKMAQPSGFVEKCVVRVCYGNTVLNGLWLGDIVYCPRHVIASNTTSAIDYDHEYSIMRLHNFSIISGTAFLGVVGATMHGVTLKIKVSQTNMHTPRHSFRTLKSGEGFNILACYDGCAQGVFGVNMRTNWTIRGSFINGACGSPGYNLKNGEVEFVYMHQIELGSGSHVGSSFDGVMYGGFEDQPNLQVESANQMLTVNVVAFLYAAILNGCTWWLKGEKLFVEHYNEWAQANGFTAMNGEDAFSILAAKTGVCVERLLHAIQVLNNGFGGKQILGYSSLNDEFSINEVVKQMFGVNLQSGKTTSMFKSISLFAGFFVMFWAELFVYTTTIWVNPGFLTPFMILLVALSLCLTFVVKHKVLFLQVFLLPSIIVAAIQNCAWDYHVTKVLAEKFDYNVSVMQMDIQGFVNIFICLFVALLHTWRFAKERCTHWCTYLFSLIAVLYTALYSYDYVSLLVMLLCAISNEWYIGAIIFRICRFGVAFLPVEYVSYFDGVKTVLLFYMLLGFVSCMYYGLLYWINRFCKCTLGVYDFCVSPAEFKYMVANGLNAPNGPFDALFLSFKLMGIGGPRTIKVSTVQSKLTDLKCTNVVLMGILSNMNIASNSKEWAYCVEMHNKINLCDDPETAQELLLALLAFFLSKHSDFGLGDLVDSYFENDSILQSVASSFVGMPSFVAYETARQEYENAVANGSSPQIIKQLKKAMNVAKAEFDRESSVQKKINRMAEQAAAAMYKEARAVNRKSKVVSAMHSLLFGMLRRLDMSSVDTILNMARNGVVPLSVIPATSAARLVVVVPDHDSFVKMMVDGFVHYAGVVWTLQEVKDNDGKNVHLKDVTKENQEILVWPLILTCERVVKLQNNEIMPGKMKVKATKGEGDGGITSEGNALYNNEGGRAFMYAYVTTKPGMKYVKWEHDSGVVTVELEPPCRFVIDTPTGPQIKYLYFVKNLNNLRRGAVLGYIGATVRLQAGKQTEFVSNSHLLTHCSFAVDPAAAYLDAVKQGAKPVGNCVKMLTNGSGSGQAITCTIDSNTTQDTYGGASVCIYCRAHVAHPTMDGFCQYKGKWVQVPIGTNDPIRFCLENTVCKVCGCWLNHGCTCDRTAIQSFDNSYLNRVRGSSAARLEPCNGTDIDYCVRAFDVYNKDASFIGKNLKSNCVRFKNVDKDDAFYIVKRCIKSVMDHEQSMYNLLKGCNAVAKHDFFTWHEGRTIYGNVSRQDLTKYTMMDLCFALRNFDEKDCEVFKEILVLTGCCSTDYFEMKNWFDPIENEDIHRVYAALGKVVANAMLKCVAFCDEMVLKGVVGVLTLDNQDLNGNFYDFGDFVLCPPGMGIPYCTSYYSYMMPVMGMTNCLASECFMKSDIFGQDFKTFDLLKYDFTEHKEVLFNKYFKYWGQDYHPDCVDCHDEMCILHCSNFNTLFATTIPNTAFGPLCRKVFIDGVPVVATAGYHFKQLGLVWNKDVNTHSTRLTITELLQFVTDPTLIVASSPALVDKRTVCFSVAALSTGLTSQTVKPGHFNKEFYDFLRSQGFFDEGSELTLKHFFFTQKGDAAIKDFDYYRYNRPTMLDIGQARVAYQVAARYFDCYEGGCITSREVVVTNLNKSAGWPLNKFGKAGLYYESISYEEQDAIFSLTKRNILPTMTQLNLKYAISGKERARTVGGVSLLATMTTRQFHQKCLKSIVATRNATVVIGTTKFYGGWDNMLKNLMADVDDPKLMGWDYPKCDRAMPSMIRMLSAMILGSKHVTCCTASDKFYRLSNELAQVLTEVVYSNGGFYFKPGGTTSGDATTAYANSVFNIFQAVSSNINCVLSVNSSNCNNFNVKKLQRQLYDNCYRNSNVDESFVDDFYGYLQKHFSMMILSDDSVVCYNKTYAGLGYIADISAFKATLYYQNGVFMSTAKCWTEEDLSIGPHEFCSQHTMQIVDENGKYYLPYPDPSRIISAGVFVDDITKTDAVILLERYVSLAIDAYPLSKHPKPEYRKVFYALLDWVKHLNKTLNEGVLESFSVTLLDEHESKFWDESFYASMYEKSTVLQAAGLCVVCGSQTVLRCGDCLRRPMLCTKCAYDHVFGTDHKFILAITPYVCNTSGCNVNDVTKLYLGGLNYYCVDHKPHLSFPLCSAGNVFGLYKSSALGSMDIDVFNKLSTSDWSDIRDYKLANDAKESLRLFAAETVKAKEESVKSSYAYATLKEIVGPKELLLLWESGKAKPPLNRNSVFTCFQITKDSKFQVGEFVFEKVDYGSDTVTYKSTATTKLVPGMLFILTSHNVAPLRAPTMANQEKYSTIYKLHPSFNVSDAYANLVPYYQLIGKQRITTIQGPPGSGKSHCSIGIGVYYPGARIVFTACSHAAVDSLCAKAVTAYSVDKCTRIIPARARVECYSGFKPNNNSAQYVFSTVNALPEVNADIVVVDEVSMCTNYDLSVINQRISYKHIVYVGDPQQLPAPRVLISKGVMEPIDYNVVTQRMCAIGPDVFLHKCYRCPAEIVNTVSELVYENKFVPVKEASKQCFKIFERGSVQVDNGSSINRRQLDVVKRFIHKNSTWSKAVFISPYNSQNYVAARLLGLQTQTVDSAQGSEYDYVIFAQTSDTAHACNANRFNVAITRAKKGIFCIMSDRTLFDALKFFEITMTDLQSESSCGLFKDCARNPIDLPPSHATTYLSLSDRFKTSGDLAVQIGNNNVCTYEHVISYMGFRFDVSMPGSHSLFCTRDFAMRHVRGWLGMDVEGAHVTGDNVGTNVPLQVGFSNGVDFVAQPEGCVLTNTGSVVKPVRARAPPGEQFTHIVPLLRKGQPWSVLRKRIVQMIADFLAGSSDVLVFVLWAGGLELTTMRYFVKIGAVKHCQCGTVATCYNSVSNDYCCFKHALGCDYVYNPYVIDIQQWGYVGSLSTNHHAICNVHRNEHVASGDAIMTRCLAVYDCFVKNVDWSITYPMIANENAINKGGRTVQSHIMRAAIKLYNPKAIHDIGNPKGIRCAVTDAKWYCYDKNPINSNVKTLEYDYMTHGQMDGLCLFWNCNVDMYPEFSIVCRFDTRTRSTLNLEGVNGGSLYVNNHAFHTPAYDKRAMAKLKPAPFFYYDDGSCEVVHDQVNYVPLRATNCITKCNIGGAVCSKHANLYRAYVESYNIFTQAGFNIWVPTTFDCYNLWQTFTEVNLQGLENIAFNVVNKGSFVGADGELPVAISGDKVFVRDGNTDNLVFVNKTSLPTNIAFELFAKRKVGLTPPLSILKNLGVVATYKFVLWDYEAERPLTSFTKSVCGYTDFAEDVCTCYDNSIQGSYERFTLSTNAVLFSATAVKTGGKSLPAIKLNFGMLNGNAIATVKSEDGNIKNINWFVYVRKDGKPVDHYDGFYTQGRNLQDFLPRSTMEEDFLNMDIGVFIQKYGLEDFNFEHVVYGDVSKTTLGGLHLLISQVRLSKMGILKAEEFVAASDITLKCCTVTYLNDPSSKTVCTYMDLLLDDFVSVLKSLDLTVVSKVHEVIIDNKPWRWMLWCKDNAVATFYPQLQSAEWKCGYSMPGIYKTQRMCLEPCNLYNYGAGLKLPSGIMFNVVKYTQLCQYFNSTTLCVPHNMRVLHLGAGSDYGVAPGTAVLKRWLPHDAIVVDNDVVDYVSDADFSVTGDCATVYLEDKFDLLISDMYDGRTKAIDGENVSKEGFFTYINGFICEKLAIGGSIAIKVTEYSWNKKLYELVQRFSFWTMFCTSVNTSSSEAFVVGINYLGDFAQGPFIDGNIIHANYVFWRNSTVMSLSYNSVLDLSKFNCKHKATVVVQLKDSDINEMVLSLVRSGKLLVRGNGKCLSFSNHLVSTK

Summary

Function
The papain-like proteinase 1 (PLP1) and papain-like proteinase 2 (PLP2) are responsible for the cleavages located at the N-terminus of the replicase polyprotein. In addition, PLP2 possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. PLP2 also antagonizes innate immune induction of type I interferon by blocking the nuclear translocation of host IRF-3 (By similarity).
The main proteinase 3CL-PRO is responsible for the majority of cleavages as it cleaves the C-terminus of replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function (By similarity).
The exoribonuclease acts on both ssRNA and dsRNA in a 3' to 5' direction.
Nsp7-nsp8 hexadecamer may possibly confer processivity to the polymerase, maybe by binding to dsRNA or by producing primers utilized by the latter.
Nsp9 is a ssRNA-binding protein.
NendoU is a Mn(2+)-dependent, uridylate-specific enzyme, which leaves 2'-3'-cyclic phosphates 5' to the cleaved bond.
Catalytic Activity
Thiol-dependent hydrolysis of ester, thioester, amide, peptide and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-residue protein attached to proteins as an intracellular targeting signal).
a ribonucleoside 5'-triphosphate + RNA(n) = diphosphate + RNA(n+1)
ATP + H2O = ADP + H(+) + phosphate
Subunit
3CL-PRO exists as monomer and homodimer. Eight copies of nsp7 and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is a dimer. Nsp10 forms a dodecamer (By similarity).
Miscellaneous
Produced by -1 ribosomal frameshifting at the 1a-1b genes boundary.
Similarity
Belongs to the coronaviruses polyprotein 1ab family.
Keywords
3D-structure   Activation of host autophagy by virus   ATP-binding   Endonuclease   Exonuclease   Helicase   Host cytoplasm   Host membrane   Host-virus interaction   Hydrolase   Inhibition of host innate immune response by virus   Inhibition of host IRF3 by virus   Inhibition of host RLR pathway by virus   Membrane   Metal-binding   Methyltransferase   Modulation of host ubiquitin pathway by viral deubiquitinase   Modulation of host ubiquitin pathway by virus   Nuclease   Nucleotide-binding   Nucleotidyltransferase   Protease   Reference proteome   Repeat   Ribosomal frameshifting   RNA-binding   RNA-directed RNA polymerase   Thiol protease   Transferase   Transmembrane   Transmembrane helix   Ubl conjugation pathway   Viral immunoevasion   Viral RNA replication   Zinc   Zinc-finger  
Feature
chain  Non-structural protein 1
Uniprot
Pfam
PF05409   Peptidase_C30
PF08710   nsp9
PF01661   Macro
PF06471   NSP11
PF09401   NSP10
PF13087   AAA_12
PF08716   nsp7
PF16348   Corona_NSP4_C
PF08715   Viral_protease
PF06478   Corona_RPol_N
PF06460   NSP16
PF08717   nsp8
Interpro
IPR027417   P-loop_NTPase
IPR014829   NSP8
IPR007094   RNA-dir_pol_PSvirus
IPR037204   NSP7_sf
IPR014827   Viral_protease
IPR037227   EndoU-like
IPR036499   NSP9_sf
IPR013016   Peptidase_C30/C16
IPR009003   Peptidase_S1_PA
IPR027351   (+)RNA_virus_helicase_core_dom
IPR042515   Nsp15_N
IPR037230   NSP8_sf
IPR009461   Coronavirus_NSP16
IPR014828   NSP7
IPR038123   NSP4_C_sf
IPR009469   RNA_pol_N_coronovir
IPR027352   CV_ZBD
IPR018995   RNA_synth_NSP10_coronavirus
IPR041679   DNA2/NAM7-like_AAA
IPR002589   Macro_dom
IPR036333   NSP10_sf
IPR009466   NSP11
IPR029063   SAM-dependent_MTases
IPR014822   NSP9
IPR032505   Corona_NSP4_C
IPR008740   Peptidase_C30
SUPFAM
SSF144246   SSF144246
SSF50494   SSF50494
SSF101816   SSF101816
SSF143076   SSF143076
SSF142877   SSF142877
SSF53335   SSF53335
SSF52540   SSF52540
SSF140367   SSF140367
ProteinModelPortal
PDB
6NUS     E-value=0,     Score=3002

Ontologies

KEGG

Subcellular Location

From MSLVP
Capsid
From Uniprot
Host membrane  
   nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).   With evidence from 5 publications.
Host endoplasmic reticulum-Golgi intermediate compartment   The helicase interacts with the N protein in membranous complexes and colocalizes with sites of synthesis of new viral RNA.   With evidence from 5 publications.

Topology

Length:
6758
Number of predicted TMHs:
18
Exp number of AAs in TMHs:
437.01000999999
Exp number, first 60 AAs:
0.00212
Total prob of N-in:
0.00325
outside
1  -  1928
TMhelix
1929  -  1951
inside
1952  -  1997
TMhelix
1998  -  2020
outside
2021  -  2066
TMhelix
2067  -  2089
inside
2090  -  2095
TMhelix
2096  -  2114
outside
2115  -  2492
TMhelix
2493  -  2515
inside
2516  -  2720
TMhelix
2721  -  2740
outside
2741  -  2749
TMhelix
2750  -  2772
inside
2773  -  2778
TMhelix
2779  -  2796
outside
2797  -  2799
TMhelix
2800  -  2819
inside
2820  -  2825
TMhelix
2826  -  2848
outside
2849  -  3031
TMhelix
3032  -  3051
inside
3052  -  3277
TMhelix
3278  -  3300
outside
3301  -  3303
TMhelix
3304  -  3323
inside
3324  -  3327
TMhelix
3328  -  3345
outside
3346  -  3364
TMhelix
3365  -  3387
inside
3388  -  3399
TMhelix
3400  -  3418
outside
3419  -  3422
TMhelix
3423  -  3445
inside
3446  -  3465
TMhelix
3466  -  3488
outside
3489  -  6758
 
 

Population Genetic Test Statistics

Pi
0.1606797
Theta
0.3279168
Tajima's D
-1.8202704
CLR
5.057011
Interpretation
No evidence of Selection

Genomic alignment in the CDS region

 
 

Multiple alignment of Orthologues

 
 

Orthologous in Strains

Strain Availability Status Gene
CHINA_HS_2019_MN908947    
CHINA_AVIAN_2008_NC_016995    
CHINA_AVIAN_2007_NC_016991    
CHINA_MURINE_2015_NC_035191    
CANADA_AVIAN_2007_NC_010800    
USA_PIG_2000_NC_038861    
CHINA_AVIAN_2007_NC_011549    
ITALY_PIG_2009_NC_028806    
GERMANY_PIG_2012_LT545990    
CHINA_AVIAN_2007_NC_016992    
CHINA_BAT_2005_NC_009657    
CANADA_HS_2003_NC_004718    
CHINA_BAT_2014_NC_030886    
CHINA_BAT_2005_NC_018871    
USA_MURINE_2009_NC_012936    
CHINA_RABBIT_2006_NC_017083    
UK_PIG_2000_NC_003436    
ROMANIA_PIG_2015_LT898435    
ROMANIA_PIG_2015_LT898436    
GERMANY_PIG_2015_LT898444    
GERMANY_PIG_2015_LT898414    
GERMANY_PIG_2015_LT898439    
GERMANY_PIG_2015_LT898413    
GERMANY_PIG_2015_LT898412    
GERMANY_PIG_2015_LT898411    
GERMANY_PIG_2015_LT898416    
GERMANY_PIG_2015_LT898443    
GERMANY_PIG_2015_LT898420    
GERMANY_PIG_2015_LT898408    
GERMANY_PIG_2015_LT898423    
GERMANY_PIG_2015_LT898432    
GERMANY_PIG_2015_LT898409    
GERMANY_PIG_2015_LT898425    
GERMANY_PIG_2015_LT898446    
GERMANY_PIG_2014_LT898438    
GERMANY_PIG_2014_LT898440    
GERMANY_PIG_2014_LT900501    
GERMANY_PIG_2014_LT898427    
GERMANY_PIG_2014_LT898415    
GERMANY_PIG_2014_LT898421    
GERMANY_PIG_2014_LT898431    
GERMANY_PIG_2014_LT898410    
GERMANY_PIG_2014_LT900498    
GERMANY_PIG_2014_LT898430    
GERMANY_PIG_1978_LT897799    
GERMANY_PIG_2014_LT898447    
GERMANY_PIG_2014_LT898426    
GERMANY_PIG_2014_LT898417    
GERMANY_PIG_2014_LT900500    
GERMANY_PIG_2014_LT898445    
AUSTRIA_PIG_2015_LT898418    
AUSTRIA_PIG_2015_LT898441    
AUSTRIA_PIG_2015_LT898433    
AUSTRIA_PIG_2015_LT900502    
GERMANY_PIG_2015_LT900499    
BELGIUM_PIG_1980_LT906620    
BELGIUM_PIG_1977_LT905450    
SWITZERLAND_PIG_2003_LT905451    
BELGIUM_PIG_1978_LT906581    
UK_PIG_1987_LT906582    
CHINA_PIG_2009_NC_016990    
CHINA_PIG_2010_NC_039208    
KENYA_BAT_2010_KY073745    
KENYA_BAT_2010_NC_032107    
CHINA_AVIAN_2007_NC_016994    
EUROPE_MURINE_2004_AY700211    
CHINA_AVIAN_2007_NC_011550    
USA_MURINE_1997_NC_001846    
USA_MURINE_1998_NC_023760    
MID_EAST_HS_2012_NC_019843    
CHINA_AVIAN_2007_NC_016993    
CHINA_MURINE_2013_NC_032730    
USA_HS_2004_AY585228    
NETHERLAND_HS_2004_NC_005831    
CHINA_HS_2004_NC_006577    
EUROPE_HS_2000_NC_002645 Protein
Protein
NP_073549.1
NP_073550.1
NETHERLAND_FERRET_2010_NC_030292    
JAPAN_FERRET_2013_LC119077    
USA_FELINE_2005_NC_002306    
CHINA_MURINE_2011_NC_034972    
CHINA_AVIAN_2007_NC_016996    
ARABIA_CAMEL_2015_NC_028752 Protein
Protein
YP_009194637.1
YP_009194638.1
CHINA_AVIAN_2007_NC_011547    
CHINA_BAT_2012_NC_028824    
CHINA_BAT_2013_NC_028814    
CHINA_BAT_2013_NC_028833    
CHINA_BAT_2011_NC_028811    
USA_BOVIN_2001_NC_003045    
CHINA_MURINE_2012_NC_026011    
GERMANY_ERINACEINAE_2012_NC_022643    
GERMANY_ERINACEINAE_2012_NC_039207    
UK_HS_2012_NC_038294    
AMERICA_WHALE_2007_NC_010646    
CHINA_BAT_2013_NC_025217    
UGANDA_BAT_2013_NC_034440    
CHINA_BAT_2006_NC_009021    
CHINA_BAT_2008_NC_010438    
CHINA_BAT_2006_NC_009020    
CHINA_BAT_2006_NC_009019    
CHINA_BAT_2006_NC_009988    
USA_BAT_2006_NC_022103    
BULGARIA_BAT_2008_NC_014470    
CHINA_BAT_2008_NC_010437    
USA_AVIAN_2004_NC_001451    
Copyright@ 2018-2023    Any Comments and suggestions mail to:  zhuzl@cqu.edu.cn, mg@cau.edu.cn   渝ICP备19006517号

渝公网安备 50010602502065号

In processing...
Login to ASFVdb
Email
Password
Please go to Regist if without an account.
If you have forgotten your password, you can once again Regist an account with a registed or new email.
Change my password
Enter new password
Reenter new password
Regist an account of ASFVdb
It is required that you provide your institutional e-mail address (with edu or org in the domain) as confirmation of your affiliation.
Enter email
Reenter email
First Name
Last Name
Institution
You can directly go to if with an account.
Registraion Success
Your password has been sent to your email.
Please check it and login later.
Welcome to use ASFVdb.