P0C6F6 · R1A_BC512
- ProteinReplicase polyprotein 1a
- StatusUniProtKB reviewed (Swiss-Prot)
- Amino acids4128 (go to sequence)
- Protein existenceInferred from homology
- Annotation score5/5
Function
function
The papain-like proteinase 1 (PLP1) and papain-like proteinase 2 (PLP2) are responsible for the cleavages located at the N-terminus of the replicase polyprotein. In addition, PLP2 possesses a deubiquitinating/deISGylating activity and processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. PLP2 also antagonizes innate immune induction of type I interferon by blocking the nuclear translocation of host IRF-3 (By similarity).
3C-like proteinase
Responsible for the majority of cleavages as it cleaves the C-terminus of replicase polyprotein at 11 sites. Recognizes substrates containing the core sequence [ILMVF]-Q-|-[SGACN]. Inhibited by the substrate-analog Cbz-Val-Asn-Ser-Thr-Leu-Gln-CMK. Also contains an ADP-ribose-1''-phosphate (ADRP)-binding function (By similarity).
Nsp7-nsp8 hexadecamer may possibly confer processivity to the polymerase, maybe by binding to dsRNA or by producing primers utilized by the latter.
Nsp9 is a ssRNA-binding protein.
Miscellaneous
Bat coronavirus 512/2005 is highly similar to porcine epidemic diarrhea virus (PEDV).
Catalytic activity
Features
Showing features for site, active site, binding site.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Site | 110-111 | Cleavage; by PL1-PRO | ||||
Sequence: GT | ||||||
Site | 897-898 | Cleavage; by PL1-PRO | ||||
Sequence: GG | ||||||
Active site | 1103 | For PL1-PRO activity | ||||
Sequence: C | ||||||
Active site | 1252 | For PL1-PRO activity | ||||
Sequence: H | ||||||
Active site | 1265 | For PL1-PRO activity | ||||
Sequence: D | ||||||
Active site | 1737 | For PL2-PRO activity | ||||
Sequence: C | ||||||
Active site | 1902 | For PL2-PRO activity | ||||
Sequence: H | ||||||
Active site | 1915 | For PL2-PRO activity | ||||
Sequence: D | ||||||
Binding site | 2194 | Zn2+ 1 (UniProtKB | ChEBI) | ||||
Sequence: H | ||||||
Binding site | 2199 | Zn2+ 1 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2204 | Zn2+ 1 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2207 | Zn2+ 1 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2240 | Zn2+ 2 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2243 | Zn2+ 2 (UniProtKB | ChEBI) | ||||
Sequence: H | ||||||
Binding site | 2247 | Zn2+ 2 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 2250 | Zn2+ 2 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Site | 2530-2531 | Cleavage; by PL2-PRO | ||||
Sequence: GA | ||||||
Site | 3012-3013 | Cleavage; by 3CL-PRO | ||||
Sequence: QA | ||||||
Active site | 3053 | For 3CL-PRO activity | ||||
Sequence: H | ||||||
Active site | 3156 | For 3CL-PRO activity | ||||
Sequence: C | ||||||
Site | 3314-3315 | Cleavage; by 3CL-PRO | ||||
Sequence: QS | ||||||
Site | 3590-3591 | Cleavage; by 3CL-PRO | ||||
Sequence: QS | ||||||
Site | 3673-3674 | Cleavage; by 3CL-PRO | ||||
Sequence: QS | ||||||
Site | 3868-3869 | Cleavage; by 3CL-PRO | ||||
Sequence: QN | ||||||
Site | 3976-3977 | Cleavage; by 3CL-PRO | ||||
Sequence: QA | ||||||
Binding site | 4050 | Zn2+ 3 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 4053 | Zn2+ 3 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 4059 | Zn2+ 3 (UniProtKB | ChEBI) | ||||
Sequence: H | ||||||
Binding site | 4066 | Zn2+ 3 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 4092 | Zn2+ 4 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 4095 | Zn2+ 4 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 4103 | Zn2+ 4 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Binding site | 4105 | Zn2+ 4 (UniProtKB | ChEBI) | ||||
Sequence: C | ||||||
Site | 4111-4112 | Cleavage; by 3CL-PRO | ||||
Sequence: QS |
GO annotations
Aspect | Term | |
---|---|---|
Cellular Component | host cell membrane | |
Cellular Component | host cell perinuclear region of cytoplasm | |
Cellular Component | membrane | |
Molecular Function | cysteine-type deubiquitinase activity | |
Molecular Function | cysteine-type endopeptidase activity | |
Molecular Function | omega peptidase activity | |
Molecular Function | RNA binding | |
Molecular Function | transferase activity | |
Molecular Function | zinc ion binding | |
Biological Process | induction by virus of host autophagy | |
Biological Process | proteolysis | |
Biological Process | symbiont-mediated perturbation of host ubiquitin-like protein modification | |
Biological Process | symbiont-mediated suppression of host cytoplasmic pattern recognition receptor signaling pathway via inhibition of IRF3 activity | |
Biological Process | viral genome replication | |
Biological Process | viral protein processing | |
Biological Process | viral translational frameshifting | |
Biological Process | virus-mediated perturbation of host defense response |
Keywords
- Molecular function
- Biological process
- Ligand
Names & Taxonomy
Protein names
- Recommended nameReplicase polyprotein 1a
- Short namespp1a
- Alternative names
- Cleaved into 11 chains
Gene names
Organism names
- Taxonomic lineageViruses > Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales > Cornidovirineae > Coronaviridae > Orthocoronavirinae > Alphacoronavirus > Pedacovirus > Alphacoronavirus scotophili
- Virus hosts
Accessions
- Primary accessionP0C6F6
- Secondary accessions
Proteomes
Subcellular Location
UniProt Annotation
GO Annotation
Non-structural protein 3
Host membrane ; Multi-pass membrane protein
Non-structural protein 4
Host membrane ; Multi-pass membrane protein
Non-structural protein 6
Host membrane ; Multi-pass membrane protein
Non-structural protein 7
Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).
Non-structural protein 8
Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).
Non-structural protein 9
Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).
Non-structural protein 10
Note: nsp7, nsp8, nsp9 and nsp10 are localized in cytoplasmic foci, largely perinuclear. Late in infection, they merge into confluent complexes (By similarity).
Features
Showing features for transmembrane.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Transmembrane | 1973-1993 | Helical | ||||
Sequence: FVSRNIIVLIVYLFSLLAICF | ||||||
Transmembrane | 2036-2056 | Helical | ||||
Sequence: YIKVFLKFSLVLYTLYALMFM | ||||||
Transmembrane | 2119-2139 | Helical | ||||
Sequence: IGNILPLFYLVFLIIFGGFFV | ||||||
Transmembrane | 2141-2161 | Helical | ||||
Sequence: IGITYFIMQYINAAGVALGYQ | ||||||
Transmembrane | 2164-2184 | Helical | ||||
Sequence: VWLLHLLPFNSMGNIIVVAFI | ||||||
Transmembrane | 2543-2563 | Helical | ||||
Sequence: FFWHLCVLIVVLFVATSLLDF | ||||||
Transmembrane | 2634-2654 | Helical | ||||
Sequence: VPAGVFLYGKSLIFAMSTIFG | ||||||
Transmembrane | 2669-2689 | Helical | ||||
Sequence: DSCIFNSACTTLSGIGGRNVY | ||||||
Transmembrane | 2769-2789 | Helical | ||||
Sequence: GSDYVCGTGFFSLLFNVIGMF | ||||||
Transmembrane | 2802-2822 | Helical | ||||
Sequence: ILLNCVVAFTAVMACFAFTKF | ||||||
Transmembrane | 2829-2849 | Helical | ||||
Sequence: MSFGVLSVGLCTVVNNLSYVV | ||||||
Transmembrane | 2878-2898 | Helical | ||||
Sequence: VGFAISYCFLAPWWVVLAYLI | ||||||
Transmembrane | 3351-3371 | Helical | ||||
Sequence: GYITPVFLAIIVASSALMLLV | ||||||
Transmembrane | 3376-3396 | Helical | ||||
Sequence: LFLQLYLLPSLCIVSGYNIFK | ||||||
Transmembrane | 3414-3434 | Helical | ||||
Sequence: FGGFNVTGVLNISLCCFVMGL | ||||||
Transmembrane | 3443-3463 | Helical | ||||
Sequence: PNKIFSYVVAVLTVLYTYYYS | ||||||
Transmembrane | 3466-3486 | Helical | ||||
Sequence: VLGLILTSMSGFTNYWFIGTA | ||||||
Transmembrane | 3488-3507 | Helical | ||||
Sequence: YKLATYVLPHTSLLDSFDAI | ||||||
Transmembrane | 3511-3531 | Helical | ||||
Sequence: VFLYLLLGYCNCVYYGSLYWI |
Keywords
- Cellular component
PTM/Processing
Features
Showing features for chain, disulfide bond.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Chain | PRO_0000338051 | 1-110 | Non-structural protein 1 | |||
Sequence: MASNHISLAFANDEEISAIGFGSVEEAVSYYSDAAVNGFDQCRFVSLGLQDAVVGVEDDDVVMLITGVTQLRAYLGTFGDRPLNLRGWLLFSNCNYFLEELDLVFGRCGG | ||||||
Chain | PRO_0000338050 | 1-4128 | Replicase polyprotein 1a | |||
Sequence: MASNHISLAFANDEEISAIGFGSVEEAVSYYSDAAVNGFDQCRFVSLGLQDAVVGVEDDDVVMLITGVTQLRAYLGTFGDRPLNLRGWLLFSNCNYFLEELDLVFGRCGGTTIPVDQFMCGADGAPVIQEGDWTFMDYFQDSNQFTLNGITYVKAWDVDRKPNDYAKQNVTCIRRITYITDHRHVLADGTTMKTARHPKVNKSVVLDSPFDQIYKEVGSPFMGNGSTFVEMLKDPAFFHALITCECGRSEWTVGDWKGYNSLCCNIKCKPITIVTPKAVPGAVVITKAGIGAGLKCYNNVFLKHIIDLVVPGTNLGWGVWRIAKVQSKDDVATSGNVLVDDPEDRLDPCYFGNDGPFATKFKFQLLANSFDDEVKGAIVQGVVHVNTAICDVVKDILGLPWFVKKLGSLVTVMWDQFVAGVQSMKICTLKVVQLAKALSCATMSVVKGVITLVAEVPEIFKRLFYTLTSALKSLCTSSCDALVVAGKSFAKIGDYVLLPSALVRLVSSKVKGKAQSGIKQLQFATVVLGDTHKVESDRVEFSSVNLKMVDEEFPLNPVGHTVAVGNQAFFCSDGLYRFMADRDLVITSPIFKPELELEPIFECDAIPGFPKVAASNVAELCVKVDTLLFNYDKIYKKYSTIIKGDRCYIQCTHTFKAPSYYFDDDEFVELCTKYYKLPDFDAFYNAVHAATDMDQFCALCTSGFEVFIPRVPDCPPILNDIDGGSIWTSFILSVRSATDFIKTLKIDLGLNGVVVFVTKKFRKAGALLQKLYNAFLDTVTSFIKVAGVAFKYCATCVPKIVINGCYHTVTRLFAKDLQIPTEDGVADFNTFNHCVFPVNPTRIETDSLELEEVDFVEPGVDGKLVILDDYSFYSDGTNYYPSDGKGVVASCFKKKGGGVVTISDEVQVRTIDPVYKVRLEYEFEDETLVKVCEKAIGTKLKVTGDWSNLLETLEKAMDVVRQHLDVPDYFVYDEEGGTDLNLTIMVSQWPLSSDSEDDFKAVDDEPNANTDETVDTFAEDVAETQNVQQDVTQDEVEAVCDLVVKATEEGPIEHEELSEDQKEVQQALAFIEDKPVVVKPDVFAFSYASYGGLKVLNQSSNNCWVSSALVQLQLTGLLDSDEMQLFNAGRVSPMVKRCYESQRAIFGSLGDVSACLESLLKDRDGMSITCTIDCGCGPGVRVYENAIFRFTPLKTAFPMGRCLICSKTLMHTITQMKGTGIFCRDATALDVDTLVVKPLCAAVYVGAQDGGHYLTNMYDANMAVDGHGRHPIKFNTINTLCYKDVDWEVSNGSCDVKPFLTYKNIEFYQGELSALLSVNHDFVVNAANEQLSHGGGIAKALDDLTKGELQVLSNQYVSRNGSIKVGSGVLIKCKEHSILNVVGPRKGKHAAELLTKAYTFVFKQKGVPLMPLLSVGIFKVPITESLAAFLACVGDRVCKCFCYTDKERLAIQNFVTSFQTEQPVEPLPVIQEVKGVQLEKPVPDVKVENPCEPFRIEGDAKFYDLTPSMVQSLQVTRLVSFTNSDLCLGSFVRDCDGYVQGSLGGAIANYKKSNPVLPAGNCVTLKCDGFISFTFVILPKEGDTNYEKNFNRAIAKFLKLKGSLLVVVEDSSVFNKISHASVAGYVAKPALVDTLFEAKPVQVVVTQDQRSFHTVELSTSQTYGQQLGDCVVEDKKVTNLKPVSKDKVVSVVPNVDWDKHYGFVDAGIFHTLDHTMFVFDNNVVNGKRVLRTSDNNCWINAVCLQLQFANAKFKPKGLQQLWESYCTGDVAMFVHWLYWITGVEKGEPSDAENTLNIISRFLKPQGSVEMLRATSTTCDGTCSTKRVVSTPVVNASVLKVGLDDGNCVHGLPLVDRVVSVNGTVIITNVGDTPGKPVVATENLLLDGVSYTVFQDSTTGVGHYTVFDKEAKLMFDGDVLKPCDLNVSPVTSVVVCNNKKIVVQDPVKRVELDASKFLDTMNVASEKFFTFGDFVSRNIIVLIVYLFSLLAICFRALKKRDMKVMAGVPERTGIILKRSVKYNYKALKFFFRLKFQYIKVFLKFSLVLYTLYALMFMFIRFTPVGTPICKRYTDGYANSTFDKNDYCGNVLCKICLYGYEELSDFTHTRVIWQHLKDPLIGNILPLFYLVFLIIFGGFFVRIGITYFIMQYINAAGVALGYQDNVWLLHLLPFNSMGNIIVVAFIVTRILLFLKHVLFGCDKPSCIACSKSAKLTRVPLQTILQGVTKSFYVNANGGKKFCKKHNFFCVDCDSYGYGCTFINDVIAPELSNVTKLNVIPTGPATIIIDKVEFSNGFYYLYSGSTFWKYNFDITEAKYACKDVLKNCNILTDFVVFNNSGSNVTQVKNACVYFSQLLCKPIKLVDSALLASLNVDFSANLHKAFVEVLSNSFGKDLSNCSNMNECRESLGLSDVPEEEFSAAVSEAHRYDVLISDVSFNNLIVSYAKPEEKLAVHDIANCMRVGAKVVNHNVLTKDNVPVVWLAKDFIALSEEARKYIVRTTKTKGINFMLTFNDRRMHLTIPTISVANKKGAGLPSLFTRLYSFFWHLCVLIVVLFVATSLLDFSAQVTSDTQYDFKYIENGVLKVFEKPLDCVHNAFVNFNEWHNAKFGSIPTNSRRCPIVVGTSDEVRYIPGVPAGVFLYGKSLIFAMSTIFGTSGLCFDDRGLTDPDSCIFNSACTTLSGIGGRNVYCYREGVVDNAKLYSSLLPHSYYRLMDGNHIVLPEIITRGFGIRTIKTQAMTYCRTGECIDSQAGVCVGLDRFFVYSKTPGSDYVCGTGFFSLLFNVIGMFSNSIPVTVMSGQILLNCVVAFTAVMACFAFTKFKRLFGDMSFGVLSVGLCTVVNNLSYVVTQNSIGMLAYATLYFLCTKGVRYSWVWHVGFAISYCFLAPWWVVLAYLICALLEFLPNLFKLKVSTQLFEGDKFVGSFESAASGTFVLDMHSYQKLANSISTEKLKQYCASYNRYKYYSGSASEADYRLACFAHLAKAMSDFANDHMDKLYTPPTVSYNSTLQAGLRKMAQPSGIVEGCIVRVSYGNLTLNGLWLGDTVICPRHVIASNTTNVIDYDHAMSLVRLHNFSISSGNMFLGVISASMRGTLLHIKVNQSNVNTPNYTYKVLKPGDSFNILACYDGSAAGVYGVNMRTNYTIRGSFISGACGSPGYNINNGVVEFCYMHHLELGSGCHVGSDMDGTMYGKYEDQPTLQIEGASNLVTENVCSWLYGALINGDRWWLSSVSVGVDTYNEWALRNGMTALKNVDCFSLLVAKTGVDVGRLLASIQKLHGNFGGKSILGCTSLCDEFTLSEVVKQMYGVTLQSGKVSRAFRNASIVCCLLFLFLSEMLNHSKLFWINPGYITPVFLAIIVASSALMLLVKHKLLFLQLYLLPSLCIVSGYNIFKDYHFYTYMLEEFDYKVPFGGFNVTGVLNISLCCFVMGLHTFRFLQTPNKIFSYVVAVLTVLYTYYYSTDVLGLILTSMSGFTNYWFIGTATYKLATYVLPHTSLLDSFDAIKAVVFLYLLLGYCNCVYYGSLYWINRFCKLTLGCYEFKVSAAEFKYMVANGLRAPTGVFDALILSLKLIGVGGRKTIKISSVQSKLTDLKCTNVVLLGCLSNMNIAANSREWAYCVDLHNKINLCNDAEAAQEMLLALLAFFLSKNSAFGVDELLDSYFNDSSVLQSVAATYVNLPSYLAYETARQSYEDALANGSPPQLVKQLRHAMNVAKSEFDREASTQRKLDRMAEQAASQMYKEARAVNRKSKVVSAMHSLLFGMLRRLDMSSVDTILSLAKDGVVPLSIIPAVSATKLNIVVSDIESYSKIQREGCVHYAGVIWSVVDIKDNDGKPVHAKEVVTSNVESLAWPLFLNCERIIKLQNNEIIPSKIKQRPIKAEGEGVVADGNALYSNEGGRTFMYAFISDKPDLKVVKWEFDGGSNAIELEPPCKFLVEAPSGPVVKYLYFVRNLNNLRRGAVLGFIGATVRLQAGKQTEQATNSSLLTLCAFAVDPPKTYLDAVKSGHRPVGNCVKMLANGSGNGQAITNGVEASTNQDSYGGASVCLYCRAHVEHPDMDGFCKLRGKYVQVPLGTLDPIRFVLENTVCKVCGCWQANGCTCDRAVIQSVDSGYLNECGALVQLD | ||||||
Chain | PRO_0000338052 | 111-897 | Non-structural protein 2 | |||
Sequence: TTIPVDQFMCGADGAPVIQEGDWTFMDYFQDSNQFTLNGITYVKAWDVDRKPNDYAKQNVTCIRRITYITDHRHVLADGTTMKTARHPKVNKSVVLDSPFDQIYKEVGSPFMGNGSTFVEMLKDPAFFHALITCECGRSEWTVGDWKGYNSLCCNIKCKPITIVTPKAVPGAVVITKAGIGAGLKCYNNVFLKHIIDLVVPGTNLGWGVWRIAKVQSKDDVATSGNVLVDDPEDRLDPCYFGNDGPFATKFKFQLLANSFDDEVKGAIVQGVVHVNTAICDVVKDILGLPWFVKKLGSLVTVMWDQFVAGVQSMKICTLKVVQLAKALSCATMSVVKGVITLVAEVPEIFKRLFYTLTSALKSLCTSSCDALVVAGKSFAKIGDYVLLPSALVRLVSSKVKGKAQSGIKQLQFATVVLGDTHKVESDRVEFSSVNLKMVDEEFPLNPVGHTVAVGNQAFFCSDGLYRFMADRDLVITSPIFKPELELEPIFECDAIPGFPKVAASNVAELCVKVDTLLFNYDKIYKKYSTIIKGDRCYIQCTHTFKAPSYYFDDDEFVELCTKYYKLPDFDAFYNAVHAATDMDQFCALCTSGFEVFIPRVPDCPPILNDIDGGSIWTSFILSVRSATDFIKTLKIDLGLNGVVVFVTKKFRKAGALLQKLYNAFLDTVTSFIKVAGVAFKYCATCVPKIVINGCYHTVTRLFAKDLQIPTEDGVADFNTFNHCVFPVNPTRIETDSLELEEVDFVEPGVDGKLVILDDYSFYSDGTNYYPSDGKGVVASCFKKKGG | ||||||
Chain | PRO_0000338053 | 898-2530 | Non-structural protein 3 | |||
Sequence: GVVTISDEVQVRTIDPVYKVRLEYEFEDETLVKVCEKAIGTKLKVTGDWSNLLETLEKAMDVVRQHLDVPDYFVYDEEGGTDLNLTIMVSQWPLSSDSEDDFKAVDDEPNANTDETVDTFAEDVAETQNVQQDVTQDEVEAVCDLVVKATEEGPIEHEELSEDQKEVQQALAFIEDKPVVVKPDVFAFSYASYGGLKVLNQSSNNCWVSSALVQLQLTGLLDSDEMQLFNAGRVSPMVKRCYESQRAIFGSLGDVSACLESLLKDRDGMSITCTIDCGCGPGVRVYENAIFRFTPLKTAFPMGRCLICSKTLMHTITQMKGTGIFCRDATALDVDTLVVKPLCAAVYVGAQDGGHYLTNMYDANMAVDGHGRHPIKFNTINTLCYKDVDWEVSNGSCDVKPFLTYKNIEFYQGELSALLSVNHDFVVNAANEQLSHGGGIAKALDDLTKGELQVLSNQYVSRNGSIKVGSGVLIKCKEHSILNVVGPRKGKHAAELLTKAYTFVFKQKGVPLMPLLSVGIFKVPITESLAAFLACVGDRVCKCFCYTDKERLAIQNFVTSFQTEQPVEPLPVIQEVKGVQLEKPVPDVKVENPCEPFRIEGDAKFYDLTPSMVQSLQVTRLVSFTNSDLCLGSFVRDCDGYVQGSLGGAIANYKKSNPVLPAGNCVTLKCDGFISFTFVILPKEGDTNYEKNFNRAIAKFLKLKGSLLVVVEDSSVFNKISHASVAGYVAKPALVDTLFEAKPVQVVVTQDQRSFHTVELSTSQTYGQQLGDCVVEDKKVTNLKPVSKDKVVSVVPNVDWDKHYGFVDAGIFHTLDHTMFVFDNNVVNGKRVLRTSDNNCWINAVCLQLQFANAKFKPKGLQQLWESYCTGDVAMFVHWLYWITGVEKGEPSDAENTLNIISRFLKPQGSVEMLRATSTTCDGTCSTKRVVSTPVVNASVLKVGLDDGNCVHGLPLVDRVVSVNGTVIITNVGDTPGKPVVATENLLLDGVSYTVFQDSTTGVGHYTVFDKEAKLMFDGDVLKPCDLNVSPVTSVVVCNNKKIVVQDPVKRVELDASKFLDTMNVASEKFFTFGDFVSRNIIVLIVYLFSLLAICFRALKKRDMKVMAGVPERTGIILKRSVKYNYKALKFFFRLKFQYIKVFLKFSLVLYTLYALMFMFIRFTPVGTPICKRYTDGYANSTFDKNDYCGNVLCKICLYGYEELSDFTHTRVIWQHLKDPLIGNILPLFYLVFLIIFGGFFVRIGITYFIMQYINAAGVALGYQDNVWLLHLLPFNSMGNIIVVAFIVTRILLFLKHVLFGCDKPSCIACSKSAKLTRVPLQTILQGVTKSFYVNANGGKKFCKKHNFFCVDCDSYGYGCTFINDVIAPELSNVTKLNVIPTGPATIIIDKVEFSNGFYYLYSGSTFWKYNFDITEAKYACKDVLKNCNILTDFVVFNNSGSNVTQVKNACVYFSQLLCKPIKLVDSALLASLNVDFSANLHKAFVEVLSNSFGKDLSNCSNMNECRESLGLSDVPEEEFSAAVSEAHRYDVLISDVSFNNLIVSYAKPEEKLAVHDIANCMRVGAKVVNHNVLTKDNVPVVWLAKDFIALSEEARKYIVRTTKTKGINFMLTFNDRRMHLTIPTISVANKKG | ||||||
Disulfide bond | 2068↔2094 | |||||
Sequence: CKRYTDGYANSTFDKNDYCGNVLCKIC | ||||||
Disulfide bond | 2086↔2091 | |||||
Sequence: CGNVLC | ||||||
Chain | PRO_0000338054 | 2531-3012 | Non-structural protein 4 | |||
Sequence: AGLPSLFTRLYSFFWHLCVLIVVLFVATSLLDFSAQVTSDTQYDFKYIENGVLKVFEKPLDCVHNAFVNFNEWHNAKFGSIPTNSRRCPIVVGTSDEVRYIPGVPAGVFLYGKSLIFAMSTIFGTSGLCFDDRGLTDPDSCIFNSACTTLSGIGGRNVYCYREGVVDNAKLYSSLLPHSYYRLMDGNHIVLPEIITRGFGIRTIKTQAMTYCRTGECIDSQAGVCVGLDRFFVYSKTPGSDYVCGTGFFSLLFNVIGMFSNSIPVTVMSGQILLNCVVAFTAVMACFAFTKFKRLFGDMSFGVLSVGLCTVVNNLSYVVTQNSIGMLAYATLYFLCTKGVRYSWVWHVGFAISYCFLAPWWVVLAYLICALLEFLPNLFKLKVSTQLFEGDKFVGSFESAASGTFVLDMHSYQKLANSISTEKLKQYCASYNRYKYYSGSASEADYRLACFAHLAKAMSDFANDHMDKLYTPPTVSYNSTLQ | ||||||
Chain | PRO_0000338055 | 3013-3314 | 3C-like proteinase | |||
Sequence: AGLRKMAQPSGIVEGCIVRVSYGNLTLNGLWLGDTVICPRHVIASNTTNVIDYDHAMSLVRLHNFSISSGNMFLGVISASMRGTLLHIKVNQSNVNTPNYTYKVLKPGDSFNILACYDGSAAGVYGVNMRTNYTIRGSFISGACGSPGYNINNGVVEFCYMHHLELGSGCHVGSDMDGTMYGKYEDQPTLQIEGASNLVTENVCSWLYGALINGDRWWLSSVSVGVDTYNEWALRNGMTALKNVDCFSLLVAKTGVDVGRLLASIQKLHGNFGGKSILGCTSLCDEFTLSEVVKQMYGVTLQ | ||||||
Chain | PRO_0000338056 | 3315-3590 | Non-structural protein 6 | |||
Sequence: SGKVSRAFRNASIVCCLLFLFLSEMLNHSKLFWINPGYITPVFLAIIVASSALMLLVKHKLLFLQLYLLPSLCIVSGYNIFKDYHFYTYMLEEFDYKVPFGGFNVTGVLNISLCCFVMGLHTFRFLQTPNKIFSYVVAVLTVLYTYYYSTDVLGLILTSMSGFTNYWFIGTATYKLATYVLPHTSLLDSFDAIKAVVFLYLLLGYCNCVYYGSLYWINRFCKLTLGCYEFKVSAAEFKYMVANGLRAPTGVFDALILSLKLIGVGGRKTIKISSVQ | ||||||
Chain | PRO_0000338057 | 3591-3673 | Non-structural protein 7 | |||
Sequence: SKLTDLKCTNVVLLGCLSNMNIAANSREWAYCVDLHNKINLCNDAEAAQEMLLALLAFFLSKNSAFGVDELLDSYFNDSSVLQ | ||||||
Chain | PRO_0000338058 | 3674-3868 | Non-structural protein 8 | |||
Sequence: SVAATYVNLPSYLAYETARQSYEDALANGSPPQLVKQLRHAMNVAKSEFDREASTQRKLDRMAEQAASQMYKEARAVNRKSKVVSAMHSLLFGMLRRLDMSSVDTILSLAKDGVVPLSIIPAVSATKLNIVVSDIESYSKIQREGCVHYAGVIWSVVDIKDNDGKPVHAKEVVTSNVESLAWPLFLNCERIIKLQ | ||||||
Chain | PRO_0000338059 | 3869-3976 | Non-structural protein 9 | |||
Sequence: NNEIIPSKIKQRPIKAEGEGVVADGNALYSNEGGRTFMYAFISDKPDLKVVKWEFDGGSNAIELEPPCKFLVEAPSGPVVKYLYFVRNLNNLRRGAVLGFIGATVRLQ | ||||||
Chain | PRO_0000338060 | 3977-4111 | Non-structural protein 10 | |||
Sequence: AGKQTEQATNSSLLTLCAFAVDPPKTYLDAVKSGHRPVGNCVKMLANGSGNGQAITNGVEASTNQDSYGGASVCLYCRAHVEHPDMDGFCKLRGKYVQVPLGTLDPIRFVLENTVCKVCGCWQANGCTCDRAVIQ | ||||||
Chain | PRO_0000338061 | 4112-4128 | Non-structural protein 11 | |||
Sequence: SVDSGYLNECGALVQLD |
Post-translational modification
Specific enzymatic cleavages in vivo by its own proteases yield mature proteins. 3CL-PRO and PL-PRO proteinases are autocatalytically processed (By similarity).
Keywords
- PTM
Interaction
Subunit
3CL-PRO exists as monomer and homodimer. Eight copies of nsp7 and eight copies of nsp8 assemble to form a heterohexadecamer. Nsp9 is a dimer. Nsp10 forms a dodecamer (By similarity).
Family & Domains
Features
Showing features for domain, zinc finger, region.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Domain | 2-109 | CoV Nsp1 globular | ||||
Sequence: ASNHISLAFANDEEISAIGFGSVEEAVSYYSDAAVNGFDQCRFVSLGLQDAVVGVEDDDVVMLITGVTQLRAYLGTFGDRPLNLRGWLLFSNCNYFLEELDLVFGRCG | ||||||
Domain | 112-368 | CoV Nsp2 N-terminal | ||||
Sequence: TIPVDQFMCGADGAPVIQEGDWTFMDYFQDSNQFTLNGITYVKAWDVDRKPNDYAKQNVTCIRRITYITDHRHVLADGTTMKTARHPKVNKSVVLDSPFDQIYKEVGSPFMGNGSTFVEMLKDPAFFHALITCECGRSEWTVGDWKGYNSLCCNIKCKPITIVTPKAVPGAVVITKAGIGAGLKCYNNVFLKHIIDLVVPGTNLGWGVWRIAKVQSKDDVATSGNVLVDDPEDRLDPCYFGNDGPFATKFKFQLLAN | ||||||
Domain | 396-785 | CoV Nsp2 middle | ||||
Sequence: ILGLPWFVKKLGSLVTVMWDQFVAGVQSMKICTLKVVQLAKALSCATMSVVKGVITLVAEVPEIFKRLFYTLTSALKSLCTSSCDALVVAGKSFAKIGDYVLLPSALVRLVSSKVKGKAQSGIKQLQFATVVLGDTHKVESDRVEFSSVNLKMVDEEFPLNPVGHTVAVGNQAFFCSDGLYRFMADRDLVITSPIFKPELELEPIFECDAIPGFPKVAASNVAELCVKVDTLLFNYDKIYKKYSTIIKGDRCYIQCTHTFKAPSYYFDDDEFVELCTKYYKLPDFDAFYNAVHAATDMDQFCALCTSGFEVFIPRVPDCPPILNDIDGGSIWTSFILSVRSATDFIKTLKIDLGLNGVVVFVTKKFRKAGALLQKLYNAFLDTVTSFIKV | ||||||
Domain | 776-897 | CoV Nsp2 C-terminal | ||||
Sequence: LDTVTSFIKVAGVAFKYCATCVPKIVINGCYHTVTRLFAKDLQIPTEDGVADFNTFNHCVFPVNPTRIETDSLELEEVDFVEPGVDGKLVILDDYSFYSDGTNYYPSDGKGVVASCFKKKGG | ||||||
Domain | 898-993 | Ubiquitin-like 1 | ||||
Sequence: GVVTISDEVQVRTIDPVYKVRLEYEFEDETLVKVCEKAIGTKLKVTGDWSNLLETLEKAMDVVRQHLDVPDYFVYDEEGGTDLNLTIMVSQWPLSS | ||||||
Domain | 1069-1302 | Peptidase C16 1 | ||||
Sequence: AFIEDKPVVVKPDVFAFSYASYGGLKVLNQSSNNCWVSSALVQLQLTGLLDSDEMQLFNAGRVSPMVKRCYESQRAIFGSLGDVSACLESLLKDRDGMSITCTIDCGCGPGVRVYENAIFRFTPLKTAFPMGRCLICSKTLMHTITQMKGTGIFCRDATALDVDTLVVKPLCAAVYVGAQDGGHYLTNMYDANMAVDGHGRHPIKFNTINTLCYKDVDWEVSNGSCDVKPFLTY | ||||||
Zinc finger | 1174-1205 | C4-type 1; degenerate | ||||
Sequence: CGCGPGVRVYENAIFRFTPLKTAFPMGRCLIC | ||||||
Domain | 1303-1467 | Macro | ||||
Sequence: KNIEFYQGELSALLSVNHDFVVNAANEQLSHGGGIAKALDDLTKGELQVLSNQYVSRNGSIKVGSGVLIKCKEHSILNVVGPRKGKHAAELLTKAYTFVFKQKGVPLMPLLSVGIFKVPITESLAAFLACVGDRVCKCFCYTDKERLAIQNFVTSFQTEQPVEPL | ||||||
Domain | 1638-1693 | Ubiquitin-like 2 | ||||
Sequence: AKPVQVVVTQDQRSFHTVELSTSQTYGQQLGDCVVEDKKVTNLKPVSKDKVVSVVP | ||||||
Domain | 1699-1965 | Peptidase C16 2 | ||||
Sequence: KHYGFVDAGIFHTLDHTMFVFDNNVVNGKRVLRTSDNNCWINAVCLQLQFANAKFKPKGLQQLWESYCTGDVAMFVHWLYWITGVEKGEPSDAENTLNIISRFLKPQGSVEMLRATSTTCDGTCSTKRVVSTPVVNASVLKVGLDDGNCVHGLPLVDRVVSVNGTVIITNVGDTPGKPVVATENLLLDGVSYTVFQDSTTGVGHYTVFDKEAKLMFDGDVLKPCDLNVSPVTSVVVCNNKKIVVQDPVKRVELDASKFLDTMNVASE | ||||||
Zinc finger | 1816-1849 | C4-type 2; degenerate | ||||
Sequence: TTCDGTCSTKRVVSTPVVNASVLKVGLDDGNCVH | ||||||
Region | 1973-2184 | HD1 | ||||
Sequence: FVSRNIIVLIVYLFSLLAICFRALKKRDMKVMAGVPERTGIILKRSVKYNYKALKFFFRLKFQYIKVFLKFSLVLYTLYALMFMFIRFTPVGTPICKRYTDGYANSTFDKNDYCGNVLCKICLYGYEELSDFTHTRVIWQHLKDPLIGNILPLFYLVFLIIFGGFFVRIGITYFIMQYINAAGVALGYQDNVWLLHLLPFNSMGNIIVVAFI | ||||||
Domain | 2052-2116 | 3Ecto | ||||
Sequence: ALMFMFIRFTPVGTPICKRYTDGYANSTFDKNDYCGNVLCKICLYGYEELSDFTHTRVIWQHLKD | ||||||
Region | 2190-2280 | Y1 | ||||
Sequence: LFLKHVLFGCDKPSCIACSKSAKLTRVPLQTILQGVTKSFYVNANGGKKFCKKHNFFCVDCDSYGYGCTFINDVIAPELSNVTKLNVIPTG | ||||||
Domain | 2190-2530 | CoV Nsp3 Y | ||||
Sequence: LFLKHVLFGCDKPSCIACSKSAKLTRVPLQTILQGVTKSFYVNANGGKKFCKKHNFFCVDCDSYGYGCTFINDVIAPELSNVTKLNVIPTGPATIIIDKVEFSNGFYYLYSGSTFWKYNFDITEAKYACKDVLKNCNILTDFVVFNNSGSNVTQVKNACVYFSQLLCKPIKLVDSALLASLNVDFSANLHKAFVEVLSNSFGKDLSNCSNMNECRESLGLSDVPEEEFSAAVSEAHRYDVLISDVSFNNLIVSYAKPEEKLAVHDIANCMRVGAKVVNHNVLTKDNVPVVWLAKDFIALSEEARKYIVRTTKTKGINFMLTFNDRRMHLTIPTISVANKKG | ||||||
Region | 2194-2207 | ZF1 | ||||
Sequence: HVLFGCDKPSCIAC | ||||||
Region | 2240-2250 | ZF2 | ||||
Sequence: CKKHNFFCVDC | ||||||
Region | 2281-2370 | Y2 | ||||
Sequence: PATIIIDKVEFSNGFYYLYSGSTFWKYNFDITEAKYACKDVLKNCNILTDFVVFNNSGSNVTQVKNACVYFSQLLCKPIKLVDSALLASL | ||||||
Region | 2281-2530 | CoV-Y | ||||
Sequence: PATIIIDKVEFSNGFYYLYSGSTFWKYNFDITEAKYACKDVLKNCNILTDFVVFNNSGSNVTQVKNACVYFSQLLCKPIKLVDSALLASLNVDFSANLHKAFVEVLSNSFGKDLSNCSNMNECRESLGLSDVPEEEFSAAVSEAHRYDVLISDVSFNNLIVSYAKPEEKLAVHDIANCMRVGAKVVNHNVLTKDNVPVVWLAKDFIALSEEARKYIVRTTKTKGINFMLTFNDRRMHLTIPTISVANKKG | ||||||
Region | 2371-2428 | Y3 | ||||
Sequence: NVDFSANLHKAFVEVLSNSFGKDLSNCSNMNECRESLGLSDVPEEEFSAAVSEAHRYD | ||||||
Region | 2429-2530 | Y4 | ||||
Sequence: VLISDVSFNNLIVSYAKPEEKLAVHDIANCMRVGAKVVNHNVLTKDNVPVVWLAKDFIALSEEARKYIVRTTKTKGINFMLTFNDRRMHLTIPTISVANKKG | ||||||
Region | 2543-2898 | HD2 | ||||
Sequence: FFWHLCVLIVVLFVATSLLDFSAQVTSDTQYDFKYIENGVLKVFEKPLDCVHNAFVNFNEWHNAKFGSIPTNSRRCPIVVGTSDEVRYIPGVPAGVFLYGKSLIFAMSTIFGTSGLCFDDRGLTDPDSCIFNSACTTLSGIGGRNVYCYREGVVDNAKLYSSLLPHSYYRLMDGNHIVLPEIITRGFGIRTIKTQAMTYCRTGECIDSQAGVCVGLDRFFVYSKTPGSDYVCGTGFFSLLFNVIGMFSNSIPVTVMSGQILLNCVVAFTAVMACFAFTKFKRLFGDMSFGVLSVGLCTVVNNLSYVVTQNSIGMLAYATLYFLCTKGVRYSWVWHVGFAISYCFLAPWWVVLAYLI | ||||||
Domain | 2917-3012 | Nsp4C | ||||
Sequence: LFEGDKFVGSFESAASGTFVLDMHSYQKLANSISTEKLKQYCASYNRYKYYSGSASEADYRLACFAHLAKAMSDFANDHMDKLYTPPTVSYNSTLQ | ||||||
Domain | 3013-3314 | Peptidase C30 | ||||
Sequence: AGLRKMAQPSGIVEGCIVRVSYGNLTLNGLWLGDTVICPRHVIASNTTNVIDYDHAMSLVRLHNFSISSGNMFLGVISASMRGTLLHIKVNQSNVNTPNYTYKVLKPGDSFNILACYDGSAAGVYGVNMRTNYTIRGSFISGACGSPGYNINNGVVEFCYMHHLELGSGCHVGSDMDGTMYGKYEDQPTLQIEGASNLVTENVCSWLYGALINGDRWWLSSVSVGVDTYNEWALRNGMTALKNVDCFSLLVAKTGVDVGRLLASIQKLHGNFGGKSILGCTSLCDEFTLSEVVKQMYGVTLQ | ||||||
Region | 3351-3531 | HD3 | ||||
Sequence: GYITPVFLAIIVASSALMLLVKHKLLFLQLYLLPSLCIVSGYNIFKDYHFYTYMLEEFDYKVPFGGFNVTGVLNISLCCFVMGLHTFRFLQTPNKIFSYVVAVLTVLYTYYYSTDVLGLILTSMSGFTNYWFIGTATYKLATYVLPHTSLLDSFDAIKAVVFLYLLLGYCNCVYYGSLYWI | ||||||
Domain | 3591-3673 | RdRp Nsp7 cofactor | ||||
Sequence: SKLTDLKCTNVVLLGCLSNMNIAANSREWAYCVDLHNKINLCNDAEAAQEMLLALLAFFLSKNSAFGVDELLDSYFNDSSVLQ | ||||||
Domain | 3674-3868 | RdRp Nsp8 cofactor | ||||
Sequence: SVAATYVNLPSYLAYETARQSYEDALANGSPPQLVKQLRHAMNVAKSEFDREASTQRKLDRMAEQAASQMYKEARAVNRKSKVVSAMHSLLFGMLRRLDMSSVDTILSLAKDGVVPLSIIPAVSATKLNIVVSDIESYSKIQREGCVHYAGVIWSVVDIKDNDGKPVHAKEVVTSNVESLAWPLFLNCERIIKLQ | ||||||
Domain | 3869-3976 | Nsp9 ssRNA-binding | ||||
Sequence: NNEIIPSKIKQRPIKAEGEGVVADGNALYSNEGGRTFMYAFISDKPDLKVVKWEFDGGSNAIELEPPCKFLVEAPSGPVVKYLYFVRNLNNLRRGAVLGFIGATVRLQ | ||||||
Domain | 3977-4115 | ExoN/MTase coactivator | ||||
Sequence: AGKQTEQATNSSLLTLCAFAVDPPKTYLDAVKSGHRPVGNCVKMLANGSGNGQAITNGVEASTNQDSYGGASVCLYCRAHVEHPDMDGFCKLRGKYVQVPLGTLDPIRFVLENTVCKVCGCWQANGCTCDRAVIQSVDS | ||||||
Zinc finger | 4050-4066 | |||||
Sequence: CLYCRAHVEHPDMDGFC | ||||||
Zinc finger | 4092-4105 | |||||
Sequence: CKVCGCWQANGCTC |
Domain
The hydrophobic domains (HD) could mediate the membrane association of the replication complex and thereby alter the architecture of the host cell membrane.
Sequence similarities
Belongs to the coronaviruses polyprotein 1ab family.
Keywords
- Domain
Family and domain databases
Sequence & Isoform
- Sequence statusComplete
This entry describes 2 isoforms produced by Ribosomal frameshifting.
P0C6F6-1
This isoform has been chosen as the canonical sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
- NameReplicase polyprotein 1a
- Synonymspp1a, ORF1a polyprotein
- NoteProduced by conventional translation.
- Length4,128
- Mass (Da)456,581
- Last updated2008-06-10 v1
- Checksum550324DCD9D1820F
P0C6W0-1
The sequence of this isoform can be found in the external entry linked below. Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.
View isoform- NameReplicase polyprotein 1ab
- Synonymspp1ab
Keywords
- Coding sequence diversity