P19811 · RPOA_EAVBU
- ProteinReplicase polyprotein 1ab
- Generep
- StatusUniProtKB reviewed (Swiss-Prot)
- Amino acids3175 (go to sequence)
- Protein existenceEvidence at protein level
- Annotation score5/5
Function
function
The replicase polyprotein 1ab is a multifunctional protein: it contains the activities necessary for the transcription of negative stranded RNA, leader RNA, subgenomic mRNAs and progeny virion RNA as well as proteinases responsible for the cleavage of the polyprotein into functional products.
Nsp1 is essential for viral subgenomic mRNA synthesis.
Nsp2 cysteine proteinase which cleaves the nsp2/nsp3 site in the polyprotein. Also displays deubiquitinating and deISGylase activities. The deubiquitinating activity cleaves both ubiquitinated and ISGylated products and may therefore regulate ubiquitin and ISG15 dependent host innate immunity.
The 3C-like serine proteinase chain is responsible for the majority of cleavages as it cleaves the C-terminus of the polyprotein.
The helicase chain, which contains a zinc finger structure, displays RNA and DNA duplex-unwinding activities with 5' to 3' polarity.
Uridylate-specific endoribonuclease nsp11
Plays a role in viral transcription/replication and prevents the simultaneous activation of host cell dsRNA sensors, such as MDA5/IFIH1, OAS, and PKR (By similarity).
Acts by degrading the 5'-polyuridines generated during replication of the poly(A) region of viral genomic and subgenomic RNAs. Catalyzes a two-step reaction in which a 2'3'-cyclic phosphate (2'3'-cP) is first generated by 2'-O transesterification, which is then hydrolyzed to a 3'-phosphate (3'-P) (PubMed:19297500).
If not degraded, poly(U) RNA would hybridize with poly(A) RNA tails and activate host dsRNA sensors (By similarity).
Acts by degrading the 5'-polyuridines generated during replication of the poly(A) region of viral genomic and subgenomic RNAs. Catalyzes a two-step reaction in which a 2'3'-cyclic phosphate (2'3'-cP) is first generated by 2'-O transesterification, which is then hydrolyzed to a 3'-phosphate (3'-P) (PubMed:19297500).
If not degraded, poly(U) RNA would hybridize with poly(A) RNA tails and activate host dsRNA sensors (By similarity).
Miscellaneous
Nsp1 contains an inactivated papain-like cysteine proteinase domain due to a Lys instead of a Cys in position 73.
Catalytic activity
- a ribonucleoside 5'-triphosphate + RNA(n) = diphosphate + RNA(n+1)
Uridylate-specific endoribonuclease nsp11
uridylyl-uridylyl-ribonucleotide-RNA = a 3'-end uridylyl-2',3'-cyclophospho-uridine-RNA + a 5'-end dephospho-ribonucleoside-RNA
Features
Showing features for active site, site, binding site.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Active site | 164 | For Nsp1 papain-like cysteine proteinase activity | ||||
Sequence: C | ||||||
Active site | 230 | For Nsp1 papain-like cysteine proteinase activity | ||||
Sequence: H | ||||||
Site | 260-261 | Cleavage; by PCP | ||||
Sequence: GG | ||||||
Active site | 270 | For Nsp2 cysteine proteinase activity | ||||
Sequence: C | ||||||
Binding site | 319 | Zn2+ 1 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Active site | 332 | For Nsp2 cysteine proteinase activity | ||||
Sequence: H | ||||||
Binding site | 349 | Zn2+ 1 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 354 | Zn2+ 1 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 356 | Zn2+ 1 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Site | 831-832 | Cleavage; by Nsp2 cysteine proteinase | ||||
Sequence: GG | ||||||
Site | 1064-1065 | Cleavage; by 3CLSP | ||||
Sequence: EG | ||||||
Active site | 1103 | Charge relay system; for 3C-like serine proteinase activity | ||||
Sequence: H | ||||||
Active site | 1129 | Charge relay system; for 3C-like serine proteinase activity | ||||
Sequence: D | ||||||
Active site | 1184 | Charge relay system; for 3C-like serine proteinase activity | ||||
Sequence: S | ||||||
Site | 1268-1269 | Cleavage; by 3CLSP; in major pathway | ||||
Sequence: ES | ||||||
Site | 1430-1431 | Cleavage; by 3CLSP; in minor pathway | ||||
Sequence: EG | ||||||
Site | 1452-1453 | Cleavage; by 3CLSP; in minor pathway | ||||
Sequence: ES | ||||||
Site | 1575-1576 | Cleavage; by 3CLSP | ||||
Sequence: EA | ||||||
Site | 1677-1678 | Cleavage; by 3CLSP | ||||
Sequence: EG | ||||||
Site | 2370-2371 | Cleavage; by 3CLSP | ||||
Sequence: ES | ||||||
Binding site | 2374 | Zn2+ 2 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2377 | Zn2+ 2 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2387 | Zn2+ 3 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2392 | Zn2+ 2 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2395 | Zn2+ 2 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2399 | Zn2+ 3 (UniProtKB | ChEBI) | ||||
Sequence: H | ||||||
Binding site | 2402 | Zn2+ 3 (UniProtKB | ChEBI) | ||||
Sequence: H | ||||||
Binding site | 2403 | Zn2+ 3 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2412 | Zn2+ 4 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2414 | Zn2+ 4 (UniProtKB | ChEBI) | ||||
Sequence: H | ||||||
Binding site | 2423 | Zn2+ 4 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2426 | Zn2+ 4 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Site | 2429 | Involved in mRNA transcription process | ||||
Sequence: S | ||||||
Binding site | 2528-2535 | ATP (UniProtKB | ChEBI) | ||||
Sequence: GPPGSGKT | ||||||
Site | 2837-2838 | Cleavage; by 3CLSP | ||||
Sequence: QS | ||||||
Active site | 2963 | |||||
Sequence: H | ||||||
Active site | 2978 | |||||
Sequence: H | ||||||
Active site | 3007 | |||||
Sequence: K | ||||||
Site | 3056-3057 | Cleavage; by 3CLSP | ||||
Sequence: EG |
GO annotations
Keywords
- Molecular function
- Biological process
- Ligand
Enzyme and pathway databases
Protein family/group databases
Names & Taxonomy
Protein names
- Recommended nameReplicase polyprotein 1ab
- Alternative names
- Cleaved into 14 chains
Gene names
Organism names
- Taxonomic lineageViruses > Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales > Arnidovirineae > Arteriviridae > Equarterivirinae > Alphaarterivirus > Alphaarterivirus equid
- Virus hosts
Accessions
- Primary accessionP19811
- Secondary accessions
Proteomes
Subcellular Location
UniProt Annotation
GO Annotation
Nsp1 papain-like cysteine proteinase
Nsp2 cysteine proteinase
Host membrane ; Multi-pass membrane protein
Non-structural protein 3
Host membrane ; Multi-pass membrane protein
Non-structural protein 5-6-7
Host membrane ; Multi-pass membrane protein
3C-like serine proteinase
RNA-directed RNA polymerase
Helicase
Features
Showing features for transmembrane.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Transmembrane | 530-550 | Helical | ||||
Sequence: AVIACLLPIWPSLALLLSFAI | ||||||
Transmembrane | 551-571 | Helical | ||||
Sequence: GLIPSVGNNVVLTALLVSSAN | ||||||
Transmembrane | 625-645 | Helical | ||||
Sequence: FDAAYVPCTVFDLCSFAILYL | ||||||
Transmembrane | 829-849 | Helical | ||||
Sequence: LIGGWIYGICYFVLVVVSTFT | ||||||
Transmembrane | 903-923 | Helical | ||||
Sequence: YWIAAVTSGLVILLVCHRLAI | ||||||
Transmembrane | 935-955 | Helical | ||||
Sequence: LVLLVFPWASVGLLLACSLAG | ||||||
Transmembrane | 977-997 | Helical | ||||
Sequence: LVTMGYWACVAALAVYSLMGL | ||||||
Transmembrane | 1291-1311 | Helical | ||||
Sequence: AYLPYVLGFFAANFFLPKSVG | ||||||
Transmembrane | 1333-1353 | Helical | ||||
Sequence: CLFHLVCATVTGNVISLWFYI | ||||||
Transmembrane | 1355-1375 | Helical | ||||
Sequence: AAGTSYLSEMWFGGYPTMLFV | ||||||
Transmembrane | 1385-1405 | Helical | ||||
Sequence: GWAIGTVLAVCSITMLAAALG |
Keywords
- Cellular component
Phenotypes & Variants
Features
Showing features for mutagenesis.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Mutagenesis | 25 | Complete loss of transcription. | ||||
Sequence: C → A | ||||||
Mutagenesis | 44 | Complete loss of transcription. | ||||
Sequence: C → A | ||||||
Mutagenesis | 164 | Complete loss of PCP proteinase activity. | ||||
Sequence: C → G or S | ||||||
Mutagenesis | 219 | No effect. | ||||
Sequence: H → A, G, or V | ||||||
Mutagenesis | 230 | Complete loss of PCP proteinase activity. | ||||
Sequence: H → A, G, or V | ||||||
Mutagenesis | 260 | Complete loss of nsp1-nsp2 cleavage. | ||||
Sequence: G → V | ||||||
Mutagenesis | 270 | Complete loss of CP2 activity. | ||||
Sequence: C → A, H, R, or S | ||||||
Mutagenesis | 271 | Complete loss of CP2 activity. | ||||
Sequence: G → W | ||||||
Mutagenesis | 291 | No effect. | ||||
Sequence: D → E or N | ||||||
Mutagenesis | 295 | No effect. | ||||
Sequence: D → E or N | ||||||
Mutagenesis | 296 | No effect. | ||||
Sequence: D → E or N | ||||||
Mutagenesis | 297 | No effect. | ||||
Sequence: E → D or Q | ||||||
Mutagenesis | 319 | Complete loss of CP2 activity. | ||||
Sequence: C → A, H, or P | ||||||
Mutagenesis | 332 | Complete loss of CP2 activity. | ||||
Sequence: H → C, I, N, or Y | ||||||
Mutagenesis | 344 | No effect. | ||||
Sequence: C → A | ||||||
Mutagenesis | 344 | Almost complete loss of CP2 activity. | ||||
Sequence: C → H | ||||||
Mutagenesis | 349 | Complete loss of CP2 activity. | ||||
Sequence: C → A, H, or S | ||||||
Mutagenesis | 354 | Complete loss of CP2 activity. | ||||
Sequence: C → A, H, or P | ||||||
Mutagenesis | 356 | Complete loss of CP2 activity. | ||||
Sequence: C → A | ||||||
Mutagenesis | 356 | No effect. | ||||
Sequence: C → H | ||||||
Mutagenesis | 831 | Complete loss of nsp2-nsp3 cleavage. | ||||
Sequence: G → P | ||||||
Mutagenesis | 1064 | Complete loss of nsp3-nsp4 cleavage. | ||||
Sequence: E → P | ||||||
Mutagenesis | 1103 | Complete loss of nsp3-nsp4, nsp4-nsp5 and nsp5-nsp6 cleavages. | ||||
Sequence: H → G or R | ||||||
Mutagenesis | 1117 | No effect. | ||||
Sequence: D → N or T | ||||||
Mutagenesis | 1129 | Complete loss of nsp3-nsp4 and nsp5-nsp6 cleavages. | ||||
Sequence: D → E | ||||||
Mutagenesis | 1129 | Complete loss of nsp3-nsp4, nsp4-nsp5 and nsp5-nsp6 cleavages. | ||||
Sequence: D → K or V | ||||||
Mutagenesis | 1179 | Partial loss of nsp4-nsp5 cleavage. | ||||
Sequence: T → G or S | ||||||
Mutagenesis | 1179 | Increased nsp5-nsp6 cleavage. | ||||
Sequence: T → N | ||||||
Mutagenesis | 1184 | Complete loss of nsp3-nsp4, nsp4-nsp5 and nsp5-nsp6 cleavages. | ||||
Sequence: S → C, F, I, or T | ||||||
Mutagenesis | 1198 | Complete loss of nsp4-nsp5 and nsp5-nsp6 cleavages. | ||||
Sequence: H → L, R, or Y | ||||||
Mutagenesis | 1268 | Complete loss of nsp3-nsp4 and nsp4-nsp5 cleavages. | ||||
Sequence: E → P | ||||||
Mutagenesis | 1430 | No effect. | ||||
Sequence: E → P | ||||||
Mutagenesis | 1677 | Complete loss of nsp5-nsp6 cleavage. | ||||
Sequence: E → P | ||||||
Mutagenesis | 2351 | No effect. | ||||
Sequence: D → P | ||||||
Mutagenesis | 2370 | Complete loss of nsp9-nsp10 cleavage. | ||||
Sequence: E → P | ||||||
Mutagenesis | 2429 | RNA can replicate efficiently but does not produce the subgenomic mRNAs required for structural protein expression. | ||||
Sequence: S → P | ||||||
Mutagenesis | 2534 | Abolishes ATPase and helicase activity. | ||||
Sequence: K → Q | ||||||
Mutagenesis | 2800 | No effect. | ||||
Sequence: E → P | ||||||
Mutagenesis | 2819 | No effect. | ||||
Sequence: D → P | ||||||
Mutagenesis | 2835 | Almost complete loss of nsp9-nsp10 cleavage. | ||||
Sequence: E → D, P, or Q | ||||||
Mutagenesis | 2837 | No effect. | ||||
Sequence: Q → D or N | ||||||
Mutagenesis | 2837 | Increased nsp9-nsp10 cleavage. | ||||
Sequence: Q → E | ||||||
Mutagenesis | 2837 | Almost complete loss of nsp9-nsp10 cleavage. | ||||
Sequence: Q → P | ||||||
Mutagenesis | 3056 | Complete loss of nsp10-nsp11 cleavage. | ||||
Sequence: E → P |
PTM/Processing
Features
Showing features for chain, glycosylation.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Chain | PRO_0000036621 | 1-260 | Nsp1 papain-like cysteine proteinase | |||
Sequence: MATFSATGFGGSFVRDWSLDLPDACEHGAGLCCEVDGSTLCAECFRGCEGMEQCPGLFMGLLKLASPVPVGHKFLIGWYRAAKVTGRYNFLELLQHPAFAQLRVVDARLAIEEASVFISTDHASAKRFPGARFALTPVYANAWVVSPAANSLIVTTDQEQDGFCWLKLLPPDRREAGLRLYYNHYREQRTGWLSKTGLRLWLGDLGLGINASSGGLKFHIMRGSPQRAWHITTRSCKLKSYYVCDISEADWSCLPAGNYG | ||||||
Chain | PRO_0000036619 | 1-3175 | Replicase polyprotein 1ab | |||
Sequence: MATFSATGFGGSFVRDWSLDLPDACEHGAGLCCEVDGSTLCAECFRGCEGMEQCPGLFMGLLKLASPVPVGHKFLIGWYRAAKVTGRYNFLELLQHPAFAQLRVVDARLAIEEASVFISTDHASAKRFPGARFALTPVYANAWVVSPAANSLIVTTDQEQDGFCWLKLLPPDRREAGLRLYYNHYREQRTGWLSKTGLRLWLGDLGLGINASSGGLKFHIMRGSPQRAWHITTRSCKLKSYYVCDISEADWSCLPAGNYGGYNPPGDGACGYRCLAFMNGATVVSAGCSSDLWCDDELAYRVFQLSPTFTVTIPGGRVCPNAKYAMICDKQHWRVKRAKGVGLCLDESCFRGICNCQRMSGPPPAPVSAAVLDHILEAATFGNVRVVTPEGQPRPVPAPRVRPSANSSGDVKDPAPVPPVPKPRTKLATPNPTQAPIPAPRTRLQGASTQEPLASAGVASDSAPKWRVAKTVYSSAERFRTELVQRARSVGDVLVQALPLKTPAVQRYTMTLKMMRSRFSWHCDVWYPLAVIACLLPIWPSLALLLSFAIGLIPSVGNNVVLTALLVSSANYVASMDHQCEGAACLALLEEEHYYRAVRWRPITGALSLVLNLLGQVGYVARSTFDAAYVPCTVFDLCSFAILYLCRNRCWRCFGRCVRVGPATHVLGSTGQRVSKLALIDLCDHFSKPTIDVVGMATGWSGCYTGTAAMERQCASTVDPHSFDQKKAGATVYLTPPVNSGSALQCLNVMWKRPIGSTVLGEQTGAVVTAVKSISFSPPCCVSTTLPTRPGVTVVDHALYNRLTASGVDPALLRVGQGDFLKLNPGFRLIGGWIYGICYFVLVVVSTFTCLPIKCGIGTRDPFCRRVFSVPVTKTQEHCHAGMCASAEGISLDSLGLTQLQSYWIAAVTSGLVILLVCHRLAISALDLLTLASPLVLLVFPWASVGLLLACSLAGAAVKIQLLATLFVNLFFPQATLVTMGYWACVAALAVYSLMGLRVKVNVPMCVTPAHFLLLARSAGQSREQMLRVSAAAPTNSLLGVARDCYVTGTTRLYIPKEGGMVFEGLFRSPKARGNVGFVAGSSYGTGSVWTRNNEVVVLTASHVVGRANMATLKIGDAMLTLTFKKNGDFAEAVTTQSELPGNWPQLHFAQPTTGPASWCTATGDEEGLLSGEVCLAWTTSGDSGSAVVQGDAVVGVHTGSNTSGVAYVTTPSGKLLGADTVTLSSLSKHFTGPLTSIPKDIPDNIIADVDAVPRSLAMLIDGLSNRESSLSGPQLLLIACFMWSYLNQPAYLPYVLGFFAANFFLPKSVGRPVVTGLLWLCCLFTPLSMRLCLFHLVCATVTGNVISLWFYITAAGTSYLSEMWFGGYPTMLFVPRFLVYQFPGWAIGTVLAVCSITMLAAALGHTLLLDVFSASGRFDRTFMMKYFLEGGVKESVTASVTRAYGKPITQESLTATLAALTDDDFQFLSDVLDCRAVRSAMNLRAALTSFQVAQYRNILNASLQVDRDAARSRRLMAKLADFAVEQEVTAGDRVVVIDGLDRMAHFKDDLVLVPLTTKVVGGSRCTICDVVKEEANDTPVKPMPSRRRRKGLPKGAQLEWDRHQEEKRNAGDDDFAVSNDYVKRVPKYWDPSDTRGTTVKIAGTTYQKVVDYSGNVHYVEHQEDLLDYVLGKGSYEGLDQDKVLDLTNMLKVDPTELSSKDKAKARQLAHLLLDLANPVEAVNQLNLRAPHIFPGDVGRRTFADSKDKGFVALHSRTMFLAARDFLFNIKFVCDEEFTKTPKDTLLGYVRACPGYWFIFRRTHRSLIDAYWDSMECVYALPTISDFDVSPGDVAVTGERWDFESPGGGRAKRLTADLVHAFQGFHGASYSYDDKVAAAVSGDPYRSDGVLYNTRWGNIPYSVPTNALEATACYRAGCEAVTDGTNVIATIGPFPEQQPIPDIPKSVLDNCADISCDAFIAPAAETALCGDLEKYNLSTQGFVLPSVFSMVRAYLKEEIGDAPPLYLPSTVPSKNSQAGINGAEFPTKSLQSYCLIDDMVSQSMKSNLQTATMATCKRQYCSKYKIRSILGTNNYIGLGLRACLSGVTAAFQKAGKDGSPIYLGKSKFDPIPAPDKYCLETDLESCDRSTPALVRWFATNLIFELAGQPELVHSYVLNCCHDLVVAGSVAFTKRGGLSSGDPITSISNTIYSLVLYTQHMLLCGLEGYFPEIAEKYLDGSLELRDMFKYVRVYIYSDDVVLTTPNQHYAASFDRWVPHLQALLGFKVDPKKTVNTSSPSFLGCRFKQVDGKCYLASLQDRVTRSLLYHIGAKNPSEYYEAAVSIFKDSIICCDEDWWTDLHRRISGAARTDGVEFPTIEMLTSFRTKQYESAVCTVCGAAPVAKSACGGWFCGNCVPYHAGHCHTTSLFANCGHDIMYRSTYCTMCEGSPKQMVPKVPHPILDHLLCHIDYGSKEELTLVVADGRTTSPPGRYKVGHKVVAVVADVGGNIVFGCGPGSHIAVPLQDTLKGVVVNKALKNAAASEYVEGPPGSGKTFHLVKDVLAVVGSATLVVPTHASMLDCINKLKQAGADPYFVVPKYTVLDFPRPGSGNITVRLPQVGTSEGETFVDEVAYFSPVDLARILTQGRVKGYGDLNQLGCVGPASVPRNLWLRHFVSLEPLRVCHRFGAAVCDLIKGIYPYYEPAPHTTKVVFVPNPDFEKGVVITAYHKDRGLGHRTIDSIQGCTFPVVTLRLPTPQSLTRPRAVVAVTRASQELYIYDPFDQLSGLLKFTKEAEAQDLIHGPPTACHLGQEIDLWSNEGLEYYKEVNLLYTHVPIKDGVIHSYPNCGPACGWEKQSNKISCLPRVAQNLGYHYSPDLPGFCPIPKELAEHWPVVSNDRYPNCLQITLQQVCELSKPCSAGYMVGQSVFVQTPGVTSYWLTEWVDGKARALPDSLFSSGRFETNSRAFLDEAEEKFAAAHPHACLGEINKSTVGGSHFIFSQYLPPLLPADAVALVGASLAGKAAKAACSVVDVYAPSFEPYLHPETLSRVYKIMIDFKPCRLMVWRNATFYVQEGVDAVTSALAAVSKLIKVPANEPVSFHVASGYRTNALVAPQAKISIGAYAAEWALSTEPPPAGYAIVRRYIVKRLLSSTEVFLCRRGVVSSTSVQTICALEGCKPLFNFLQIGSVIGPV | ||||||
Chain | PRO_0000036622 | 261-831 | Nsp2 cysteine proteinase | |||
Sequence: GYNPPGDGACGYRCLAFMNGATVVSAGCSSDLWCDDELAYRVFQLSPTFTVTIPGGRVCPNAKYAMICDKQHWRVKRAKGVGLCLDESCFRGICNCQRMSGPPPAPVSAAVLDHILEAATFGNVRVVTPEGQPRPVPAPRVRPSANSSGDVKDPAPVPPVPKPRTKLATPNPTQAPIPAPRTRLQGASTQEPLASAGVASDSAPKWRVAKTVYSSAERFRTELVQRARSVGDVLVQALPLKTPAVQRYTMTLKMMRSRFSWHCDVWYPLAVIACLLPIWPSLALLLSFAIGLIPSVGNNVVLTALLVSSANYVASMDHQCEGAACLALLEEEHYYRAVRWRPITGALSLVLNLLGQVGYVARSTFDAAYVPCTVFDLCSFAILYLCRNRCWRCFGRCVRVGPATHVLGSTGQRVSKLALIDLCDHFSKPTIDVVGMATGWSGCYTGTAAMERQCASTVDPHSFDQKKAGATVYLTPPVNSGSALQCLNVMWKRPIGSTVLGEQTGAVVTAVKSISFSPPCCVSTTLPTRPGVTVVDHALYNRLTASGVDPALLRVGQGDFLKLNPGFRLIG | ||||||
Chain | PRO_0000036623 | 832-1064 | Non-structural protein 3 | |||
Sequence: GWIYGICYFVLVVVSTFTCLPIKCGIGTRDPFCRRVFSVPVTKTQEHCHAGMCASAEGISLDSLGLTQLQSYWIAAVTSGLVILLVCHRLAISALDLLTLASPLVLLVFPWASVGLLLACSLAGAAVKIQLLATLFVNLFFPQATLVTMGYWACVAALAVYSLMGLRVKVNVPMCVTPAHFLLLARSAGQSREQMLRVSAAAPTNSLLGVARDCYVTGTTRLYIPKEGGMVFE | ||||||
Chain | PRO_0000036624 | 1065-1268 | 3C-like serine proteinase | |||
Sequence: GLFRSPKARGNVGFVAGSSYGTGSVWTRNNEVVVLTASHVVGRANMATLKIGDAMLTLTFKKNGDFAEAVTTQSELPGNWPQLHFAQPTTGPASWCTATGDEEGLLSGEVCLAWTTSGDSGSAVVQGDAVVGVHTGSNTSGVAYVTTPSGKLLGADTVTLSSLSKHFTGPLTSIPKDIPDNIIADVDAVPRSLAMLIDGLSNRE | ||||||
Chain | PRO_0000423106 | 1269-1430 | Non-structural protein 5 | |||
Sequence: SSLSGPQLLLIACFMWSYLNQPAYLPYVLGFFAANFFLPKSVGRPVVTGLLWLCCLFTPLSMRLCLFHLVCATVTGNVISLWFYITAAGTSYLSEMWFGGYPTMLFVPRFLVYQFPGWAIGTVLAVCSITMLAAALGHTLLLDVFSASGRFDRTFMMKYFLE | ||||||
Chain | PRO_0000036625 | 1269-1677 | Non-structural protein 5-6-7 | |||
Sequence: SSLSGPQLLLIACFMWSYLNQPAYLPYVLGFFAANFFLPKSVGRPVVTGLLWLCCLFTPLSMRLCLFHLVCATVTGNVISLWFYITAAGTSYLSEMWFGGYPTMLFVPRFLVYQFPGWAIGTVLAVCSITMLAAALGHTLLLDVFSASGRFDRTFMMKYFLEGGVKESVTASVTRAYGKPITQESLTATLAALTDDDFQFLSDVLDCRAVRSAMNLRAALTSFQVAQYRNILNASLQVDRDAARSRRLMAKLADFAVEQEVTAGDRVVVIDGLDRMAHFKDDLVLVPLTTKVVGGSRCTICDVVKEEANDTPVKPMPSRRRRKGLPKGAQLEWDRHQEEKRNAGDDDFAVSNDYVKRVPKYWDPSDTRGTTVKIAGTTYQKVVDYSGNVHYVEHQEDLLDYVLGKGSYE | ||||||
Chain | PRO_0000423107 | 1431-1452 | Non-structural protein 6 | |||
Sequence: GGVKESVTASVTRAYGKPITQE | ||||||
Chain | PRO_0000423108 | 1453-1575 | Non-structural protein 7-alpha | |||
Sequence: SLTATLAALTDDDFQFLSDVLDCRAVRSAMNLRAALTSFQVAQYRNILNASLQVDRDAARSRRLMAKLADFAVEQEVTAGDRVVVIDGLDRMAHFKDDLVLVPLTTKVVGGSRCTICDVVKEE | ||||||
Glycosylation | 1501 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Chain | PRO_0000423109 | 1576-1677 | Non-structural protein 7-beta | |||
Sequence: ANDTPVKPMPSRRRRKGLPKGAQLEWDRHQEEKRNAGDDDFAVSNDYVKRVPKYWDPSDTRGTTVKIAGTTYQKVVDYSGNVHYVEHQEDLLDYVLGKGSYE | ||||||
Chain | PRO_0000036627 | 1678-1727 | Non-structural protein 8 | |||
Sequence: GLDQDKVLDLTNMLKVDPTELSSKDKAKARQLAHLLLDLANPVEAVNQLN | ||||||
Chain | PRO_0000036626 | 1678-2370 | RNA-directed RNA polymerase | |||
Sequence: GLDQDKVLDLTNMLKVDPTELSSKDKAKARQLAHLLLDLANPVEAVNQLNLRAPHIFPGDVGRRTFADSKDKGFVALHSRTMFLAARDFLFNIKFVCDEEFTKTPKDTLLGYVRACPGYWFIFRRTHRSLIDAYWDSMECVYALPTISDFDVSPGDVAVTGERWDFESPGGGRAKRLTADLVHAFQGFHGASYSYDDKVAAAVSGDPYRSDGVLYNTRWGNIPYSVPTNALEATACYRAGCEAVTDGTNVIATIGPFPEQQPIPDIPKSVLDNCADISCDAFIAPAAETALCGDLEKYNLSTQGFVLPSVFSMVRAYLKEEIGDAPPLYLPSTVPSKNSQAGINGAEFPTKSLQSYCLIDDMVSQSMKSNLQTATMATCKRQYCSKYKIRSILGTNNYIGLGLRACLSGVTAAFQKAGKDGSPIYLGKSKFDPIPAPDKYCLETDLESCDRSTPALVRWFATNLIFELAGQPELVHSYVLNCCHDLVVAGSVAFTKRGGLSSGDPITSISNTIYSLVLYTQHMLLCGLEGYFPEIAEKYLDGSLELRDMFKYVRVYIYSDDVVLTTPNQHYAASFDRWVPHLQALLGFKVDPKKTVNTSSPSFLGCRFKQVDGKCYLASLQDRVTRSLLYHIGAKNPSEYYEAAVSIFKDSIICCDEDWWTDLHRRISGAARTDGVEFPTIEMLTSFRTKQYE | ||||||
Chain | PRO_0000036628 | 2371-2837 | Helicase | |||
Sequence: SAVCTVCGAAPVAKSACGGWFCGNCVPYHAGHCHTTSLFANCGHDIMYRSTYCTMCEGSPKQMVPKVPHPILDHLLCHIDYGSKEELTLVVADGRTTSPPGRYKVGHKVVAVVADVGGNIVFGCGPGSHIAVPLQDTLKGVVVNKALKNAAASEYVEGPPGSGKTFHLVKDVLAVVGSATLVVPTHASMLDCINKLKQAGADPYFVVPKYTVLDFPRPGSGNITVRLPQVGTSEGETFVDEVAYFSPVDLARILTQGRVKGYGDLNQLGCVGPASVPRNLWLRHFVSLEPLRVCHRFGAAVCDLIKGIYPYYEPAPHTTKVVFVPNPDFEKGVVITAYHKDRGLGHRTIDSIQGCTFPVVTLRLPTPQSLTRPRAVVAVTRASQELYIYDPFDQLSGLLKFTKEAEAQDLIHGPPTACHLGQEIDLWSNEGLEYYKEVNLLYTHVPIKDGVIHSYPNCGPACGWEKQ | ||||||
Chain | PRO_0000036629 | 2838-3056 | Uridylate-specific endoribonuclease nsp11 | |||
Sequence: SNKISCLPRVAQNLGYHYSPDLPGFCPIPKELAEHWPVVSNDRYPNCLQITLQQVCELSKPCSAGYMVGQSVFVQTPGVTSYWLTEWVDGKARALPDSLFSSGRFETNSRAFLDEAEEKFAAAHPHACLGEINKSTVGGSHFIFSQYLPPLLPADAVALVGASLAGKAAKAACSVVDVYAPSFEPYLHPETLSRVYKIMIDFKPCRLMVWRNATFYVQE | ||||||
Chain | PRO_0000036630 | 3057-3175 | Non-structural protein 12 | |||
Sequence: GVDAVTSALAAVSKLIKVPANEPVSFHVASGYRTNALVAPQAKISIGAYAAEWALSTEPPPAGYAIVRRYIVKRLLSSTEVFLCRRGVVSSTSVQTICALEGCKPLFNFLQIGSVIGPV |
Post-translational modification
Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. There are two alternative pathways for processing. Either nsp4-5 is cleaved, which represents the major pathway or the nsp5-6 and nsp6-7 are processed, which represents the minor pathway. The major pathway occurs when nsp2 acts as a cofactor for nsp4.
Keywords
- PTM
PTM databases
Interaction
Subunit
Nsp1 interacts with cellular transcription cofactor SND1/p100.
Binary interactions
Type | Entry 1 | Entry 2 | Number of experiments | Intact | |
---|---|---|---|---|---|
BINARY | P19811 | rep PRO_0000036626 P19811 | 4 | EBI-27070211, EBI-27070280 |
Structure
Family & Domains
Features
Showing features for zinc finger, domain, region, compositional bias.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Zinc finger | 25-44 | C4-type; atypical | ||||
Sequence: CEHGAGLCCEVDGSTLCAEC | ||||||
Domain | 66-156 | Peptidase C31 | ||||
Sequence: SPVPVGHKFLIGWYRAAKVTGRYNFLELLQHPAFAQLRVVDARLAIEEASVFISTDHASAKRFPGARFALTPVYANAWVVSPAANSLIVTT | ||||||
Domain | 157-260 | Peptidase C32 | ||||
Sequence: DQEQDGFCWLKLLPPDRREAGLRLYYNHYREQRTGWLSKTGLRLWLGDLGLGINASSGGLKFHIMRGSPQRAWHITTRSCKLKSYYVCDISEADWSCLPAGNYG | ||||||
Region | 261-339 | OTU-like | ||||
Sequence: GYNPPGDGACGYRCLAFMNGATVVSAGCSSDLWCDDELAYRVFQLSPTFTVTIPGGRVCPNAKYAMICDKQHWRVKRAK | ||||||
Domain | 261-360 | Peptidase C33 | ||||
Sequence: GYNPPGDGACGYRCLAFMNGATVVSAGCSSDLWCDDELAYRVFQLSPTFTVTIPGGRVCPNAKYAMICDKQHWRVKRAKGVGLCLDESCFRGICNCQRMS | ||||||
Region | 386-451 | Disordered | ||||
Sequence: VVTPEGQPRPVPAPRVRPSANSSGDVKDPAPVPPVPKPRTKLATPNPTQAPIPAPRTRLQGASTQE | ||||||
Region | 530-645 | HD1 | ||||
Sequence: AVIACLLPIWPSLALLLSFAIGLIPSVGNNVVLTALLVSSANYVASMDHQCEGAACLALLEEEHYYRAVRWRPITGALSLVLNLLGQVGYVARSTFDAAYVPCTVFDLCSFAILYL | ||||||
Region | 829-997 | HD2 | ||||
Sequence: LIGGWIYGICYFVLVVVSTFTCLPIKCGIGTRDPFCRRVFSVPVTKTQEHCHAGMCASAEGISLDSLGLTQLQSYWIAAVTSGLVILLVCHRLAISALDLLTLASPLVLLVFPWASVGLLLACSLAGAAVKIQLLATLFVNLFFPQATLVTMGYWACVAALAVYSLMGL | ||||||
Domain | 1065-1268 | Peptidase S32 | ||||
Sequence: GLFRSPKARGNVGFVAGSSYGTGSVWTRNNEVVVLTASHVVGRANMATLKIGDAMLTLTFKKNGDFAEAVTTQSELPGNWPQLHFAQPTTGPASWCTATGDEEGLLSGEVCLAWTTSGDSGSAVVQGDAVVGVHTGSNTSGVAYVTTPSGKLLGADTVTLSSLSKHFTGPLTSIPKDIPDNIIADVDAVPRSLAMLIDGLSNRE | ||||||
Region | 1291-1405 | HD3 | ||||
Sequence: AYLPYVLGFFAANFFLPKSVGRPVVTGLLWLCCLFTPLSMRLCLFHLVCATVTGNVISLWFYITAAGTSYLSEMWFGGYPTMLFVPRFLVYQFPGWAIGTVLAVCSITMLAAALG | ||||||
Region | 1577-1614 | Disordered | ||||
Sequence: NDTPVKPMPSRRRRKGLPKGAQLEWDRHQEEKRNAGDD | ||||||
Compositional bias | 1593-1614 | Basic and acidic residues | ||||
Sequence: LPKGAQLEWDRHQEEKRNAGDD | ||||||
Domain | 1716-1883 | NiRAN | ||||
Sequence: LANPVEAVNQLNLRAPHIFPGDVGRRTFADSKDKGFVALHSRTMFLAARDFLFNIKFVCDEEFTKTPKDTLLGYVRACPGYWFIFRRTHRSLIDAYWDSMECVYALPTISDFDVSPGDVAVTGERWDFESPGGGRAKRLTADLVHAFQGFHGASYSYDDKVAAAVSGD | ||||||
Domain | 2116-2251 | RdRp catalytic | ||||
Sequence: KYCLETDLESCDRSTPALVRWFATNLIFELAGQPELVHSYVLNCCHDLVVAGSVAFTKRGGLSSGDPITSISNTIYSLVLYTQHMLLCGLEGYFPEIAEKYLDGSLELRDMFKYVRVYIYSDDVVLTTPNQHYAAS | ||||||
Domain | 2371-2438 | AV ZBD | ||||
Sequence: SAVCTVCGAAPVAKSACGGWFCGNCVPYHAGHCHTTSLFANCGHDIMYRSTYCTMCEGSPKQMVPKVP | ||||||
Domain | 2496-2661 | +RNA virus helicase ATP-binding | ||||
Sequence: PGSHIAVPLQDTLKGVVVNKALKNAAASEYVEGPPGSGKTFHLVKDVLAVVGSATLVVPTHASMLDCINKLKQAGADPYFVVPKYTVLDFPRPGSGNITVRLPQVGTSEGETFVDEVAYFSPVDLARILTQGRVKGYGDLNQLGCVGPASVPRNLWLRHFVSLEPL | ||||||
Domain | 2662-2793 | +RNA virus helicase C-terminal | ||||
Sequence: RVCHRFGAAVCDLIKGIYPYYEPAPHTTKVVFVPNPDFEKGVVITAYHKDRGLGHRTIDSIQGCTFPVVTLRLPTPQSLTRPRAVVAVTRASQELYIYDPFDQLSGLLKFTKEAEAQDLIHGPPTACHLGQE | ||||||
Domain | 2840-2930 | AV-Nsp11N/CoV-Nsp15M | ||||
Sequence: KISCLPRVAQNLGYHYSPDLPGFCPIPKELAEHWPVVSNDRYPNCLQITLQQVCELSKPCSAGYMVGQSVFVQTPGVTSYWLTEWVDGKAR | ||||||
Domain | 2932-3054 | NendoU | ||||
Sequence: LPDSLFSSGRFETNSRAFLDEAEEKFAAAHPHACLGEINKSTVGGSHFIFSQYLPPLLPADAVALVGASLAGKAAKAACSVVDVYAPSFEPYLHPETLSRVYKIMIDFKPCRLMVWRNATFYV |
Domain
The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane.
The OTU-like region is responsible for the deubiquitinating and deISGylation activities of Nsp2.
Sequence similarities
Belongs to the arteriviridae polyprotein family.
Keywords
- Domain
Family and domain databases
Sequence & Isoform
- Sequence statusComplete
This entry describes 2 isoforms produced by Ribosomal frameshifting.
P19811-1
This isoform has been chosen as the canonical sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
- NameReplicase polyprotein 1ab
- Synonymspp1ab
- NoteProduced by -1 ribosomal frameshifting at the 1a-1b genes boundary.
- Length3,175
- Mass (Da)345,384
- Last updated2004-04-13 v3
- ChecksumCB536C1F5091045C
P19811-2
- NameReplicase polyprotein 1a
- Synonymspp1a, ORF1a polyprotein
- NoteProduced by -1 ribosomal frameshifting at the 1a-1b genes boundary.
- Differences from canonical
- 1728-3175: Missing
Features
Showing features for compositional bias, alternative sequence.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Compositional bias | 1593-1614 | Basic and acidic residues | ||||
Sequence: LPKGAQLEWDRHQEEKRNAGDD | ||||||
Alternative sequence | VSP_032887 | 1728-3175 | in isoform Replicase polyprotein 1a | |||
Sequence: Missing |
Keywords
- Coding sequence diversity
- Technical term
Sequence databases
Nucleotide Sequence | Protein Sequence | Molecule Type | Status | |
---|---|---|---|---|
X53459 EMBL· GenBank· DDBJ | CAC42774.2 EMBL· GenBank· DDBJ | Genomic RNA | ||
X53459 EMBL· GenBank· DDBJ | CAC42775.2 EMBL· GenBank· DDBJ | Genomic RNA | ||
X52277 EMBL· GenBank· DDBJ | CAA36520.1 EMBL· GenBank· DDBJ | Genomic RNA |