A0A8F0ZWG1 · A0A8F0ZWG1_9BETC
- ProteinORF1ab polyprotein
- GeneORF1ab
- StatusUniProtKB unreviewed (TrEMBL)
- Organism
- Amino acids7081 (go to sequence)
- Protein existenceInferred from homology
- Annotation score5/5
Function
Catalytic activity
- ATP + H2O = ADP + H+ + phosphate
- ATP + H2O = ADP + H+ + phosphate
- uridylyl-uridylyl-ribonucleotide-RNA = a 3'-end uridylyl-2',3'-cyclophospho-uridine-RNA + a 5'-end dephospho-ribonucleoside-RNA
Cofactor
Protein has several cofactor binding sites:
Features
Showing features for active site, binding site.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Active site | 6000 | |||||
Sequence: D | ||||||
Active site | 6002 | |||||
Sequence: E | ||||||
Active site | 6101 | |||||
Sequence: E | ||||||
Active site | 6178 | |||||
Sequence: H | ||||||
Active site | 6183 | |||||
Sequence: D | ||||||
Binding site | 6241-6247 | S-adenosyl-L-methionine (UniProtKB | ChEBI) | ||||
Sequence: DIGNPKA | ||||||
Active site | 6671 | |||||
Sequence: H | ||||||
Active site | 6686 | |||||
Sequence: H | ||||||
Active site | 6726 | |||||
Sequence: K |
GO annotations
Keywords
- Molecular function
- Biological process
- Ligand
Names & Taxonomy
Protein names
- Recommended nameORF1ab polyprotein
Gene names
Organism names
- Organism
- Strain
- Taxonomic lineageViruses > Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales > Cornidovirineae > Coronaviridae > Orthocoronavirinae > Betacoronavirus > Sarbecovirus
Accessions
- Primary accessionA0A8F0ZWG1
Proteomes
Subcellular Location
UniProt Annotation
GO Annotation
Features
Showing features for transmembrane.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Transmembrane | 2211-2231 | Helical | ||||
Sequence: ILTLVIWLLLIGVSLSSLVYL | ||||||
Transmembrane | 2356-2379 | Helical | ||||
Sequence: LVQMAPISAMVRMYIFFASFYYVW | ||||||
Transmembrane | 2754-2775 | Helical | ||||
Sequence: WLKFMLKVTIGLTVLVTILYCI | ||||||
Transmembrane | 3028-3050 | Helical | ||||
Sequence: ISASVIAGGFIAIVITCLAYYFM | ||||||
Transmembrane | 3062-3086 | Helical | ||||
Sequence: VIASNALLFLMSFTVLCLTPSYAFV | ||||||
Transmembrane | 3093-3115 | Helical | ||||
Sequence: LYLYFTFYLANDVSFLAHVQWFV | ||||||
Transmembrane | 3121-3142 | Helical | ||||
Sequence: VPFWITVLYGLCVFLKHFYWFF | ||||||
Transmembrane | 3567-3590 | Helical | ||||
Sequence: WLLLTALTSLLILVQTTQWSLFFF | ||||||
Transmembrane | 3596-3614 | Helical | ||||
Sequence: FLPFVAGIVAMAAISMMFV | ||||||
Transmembrane | 3621-3642 | Helical | ||||
Sequence: LCLFLLPSLATIAYFNVVYMPA | ||||||
Transmembrane | 3668-3686 | Helical | ||||
Sequence: IMYVLASLLLILMTARTVY | ||||||
Transmembrane | 3716-3740 | Helical | ||||
Sequence: LAMWALIISVTSNYSGVVTTIMFVA | ||||||
Transmembrane | 3752-3774 | Helical | ||||
Sequence: PILFITGNTLQCIMLIYCVLGYF |
Keywords
- Cellular component
PTM/Processing
Features
Showing features for disulfide bond.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Disulfide bond | 2267↔2273 | |||||
Sequence: CSSYIPC |
Keywords
- PTM
Structure
Family & Domains
Features
Showing features for domain, region.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Domain | 12-127 | CoV Nsp1 globular | ||||
Sequence: THVQLSLPVLQVRDVLVRGFGDSVEGALAEARQHLIDGTCGIIEVEKGVLPQLEQPFVFVKRSDARAAPHGHVVVELVAELDGVQYGRSGETLGVLVPHVGEVPVAYRKILLRKNG | ||||||
Domain | 148-179 | BetaCoV Nsp1 C-terminal | ||||
Sequence: DLVTDPFEDFEFNCNTKYSGGVTRNLMRELNG | ||||||
Domain | 183-456 | CoV Nsp2 N-terminal | ||||
Sequence: TRYIDNNFCGPDGYPLECVKDLLARAGKASCTLAEQLDYLETKRGIYCCRDHDHEIAWFTERSDVSYAQQTPFEIKIAKKFDSFTGECPKFVFPLNSTVHVLQPRVDKKKMDGFMGRIRSVYPVSSPCECNTMHLSILMKCSHCDETSWQSGDFVNATCEYCGTKNAVDDGPSTCGYLPQNPVVKISCPACHNPDMGPEHSVAEYHNESGLKTILRKGGRTIQFGGCVFSYVGCYNKCAFWVPRASANIGNRHTGIVGAETEALNDSLREVLHK | ||||||
Domain | 458-688 | CoV Nsp2 middle | ||||
Sequence: KIVINVVGDFKLTEEVAIILASFSASTSAFIDTIKGLDFKKFKQIVESCGNFKVARGEYKKGAWNIGESTSILTPLHAFSTQAASVVRSIFSRTMKTADCSVQVLQSRAVDILSSISEYSLRLIDAMSFTCELATDNLIVMAYVTGSLVQMTNQWFVNIFGMVHDQLKPVLNWLEDSLKSGVEFLKDGWEIFKVISQCACDIVSGQLVALTTEVKDCVKAFFALVNKFLAL | ||||||
Domain | 690-818 | CoV Nsp2 C-terminal | ||||
Sequence: ADTIIIGGARLRALNLGETFIAHSKGLYKKCVRPKGDAGLLMPLKAPKEVVFLEGATLPTEVIAEEVVLKTGELQPLDEPASADAKNPLVGTPVCINGLMLLEIKDTTKYCALSPNMLATNNLFTLKGG | ||||||
Domain | 821-929 | Ubiquitin-like | ||||
Sequence: TKVTFGDDTVIEIQGYKTIKVTFELDERVDKALNEKCSDYTVELGTDVKELACVVADNVVKTLTPLTDLLTPLGIDLDEWSVATFYLFDEAGDYKLASSMYCSFYPPSE | ||||||
Domain | 1012-1178 | Macro | ||||
Sequence: TNNANGYLKLTDNVFIKNVDIVEEALNMQPQVIVNAANVYLKHGGGVAGALNKATKGAMQLESDKYIADRGPLQVGGSCVLSGHNLATNCLHVVGPNLTRGEDISLLAQAYKNFNQFNCLLAPLLSAGIFGADPLVSLEACIQNVNSTVFLVVFDKAMYDKLISDFL | ||||||
Domain | 1215-1343 | Macro | ||||
Sequence: NVVPCVDEVTTTLEETKFLTKNLLVYIDINGKPYHDSLSLLGDMDLTFLTKDAPHIVGDIIISDFITAVVIPTKKAGGTMSMLCKVLKNIPTNQYITTYPGQGCAGYTIEEAKAALKKAKSAFYILPSN | ||||||
Domain | 1351-1478 | Macro | ||||
Sequence: ILGTVAWNLREMLTHAEETRKLMPVCMDTRAIISTIQRRFKGIKVQEGVIDYGVRFYFYTSKTPIANVIAAINNLNETIITMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPEAVTAYNGYLTSSS | ||||||
Domain | 1480-1552 | DPUP | ||||
Sequence: TAEEHFIETVSLAGSYKDWSYIGQSTTLGIAFLKRGDVVIYHTTNKPILFHKEGETIGLEDLKKYLALRELRT | ||||||
Domain | 1550-1605 | Ubiquitin-like | ||||
Sequence: LRTIKVFTTVDNINLHTQIVDMSMTYGQQFGPIYLDGADVTKIKPHTNHDGKTFYV | ||||||
Domain | 1619-1883 | Peptidase C16 | ||||
Sequence: HYHTTDESFLGRYMSALNLTKKWKYPQVGGLTSIKWADNNCYLATTLLTLQQIEVKFNPPALQDAYYRARAGDAANFCALILAYSNKTVGELGDVRETMAHLLRHVNLDTCKRVLNTVCKTCGQQQKTLHNLEAVMYMGTLSYDELKTGVKVPCVCGKDAIQYLVQQESPFVMMSAPPATHELEMGTFLCASEYTGDYLCGHYKHITTKETIYSIDGALLTKMSEYKGFVTDVFYKESSYYTTISPVVYKLDGSIYTELNPNLDN | ||||||
Domain | 1896-2006 | Nucleic acid-binding | ||||
Sequence: PIDLIPNQPFPNASYDNFKFVCENSKFADDLNQIAGYSKPATRELQVTFYPDLTGDVVAIDYRHYTSAFKKGAKLVHKPILWHVNKTTNKATYKPNTWCIRCLWSTKPIDT | ||||||
Domain | 2031-2140 | G2M | ||||
Sequence: DVTEEVVETPTVQKDIIECNVKQLETVGNVILKPSQEGTKVTEELGHEDMMTAYIEKTSLTIKRPNELSELLGLKTLLTHGVAAINSVPWNTIIMYAKPFLNTTASLTTD | ||||||
Domain | 2232-2302 | 3Ecto | ||||
Sequence: GSSLSLLMSVAGLPSYCNLYREGYLNSTNVTGTSYCSSYIPCMICLSDLDSLDLYPALATIQVTISSFNWD | ||||||
Region | 2380-2470 | Y1 | ||||
Sequence: RCYVHVVDGCTSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGRGFCKLHNWNCVNCDTFCSGSTFISDEVARDLSFQFKRTINPTD | ||||||
Domain | 2380-2748 | CoV Nsp3 Y | ||||
Sequence: RCYVHVVDGCTSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGRGFCKLHNWNCVNCDTFCSGSTFISDEVARDLSFQFKRTINPTDQSSYIVDNVVVKNGSLHLYFDKEGQKTYERHPLSHFVNLDNLRASNSKGSLPINVIVFDGKSKCEEAAAKVASVYYSQLMCQPILLLDQALISDIGDSTEVAVKMFNAYVNTFAATFNAPIDKLKALITTAESELTKGVALDNVLSTFIAAARQGFVDSDVDVKDIMECLKLSHQSDIEFTGDSCNNYMLTYNKIDNMTPRDLGACIDCNARQINAQVAKSHNISLIWNVRDFMSLSEQLRKQVRSAAKKNGLPFKLTCATTRQVVNVVTTKISLKGG | ||||||
Region | 2384-2397 | ZF1 | ||||
Sequence: HVVDGCTSSTCMMC | ||||||
Region | 2430-2440 | ZF2 | ||||
Sequence: CKLHNWNCVNC | ||||||
Region | 2471-2565 | Y2 | ||||
Sequence: QSSYIVDNVVVKNGSLHLYFDKEGQKTYERHPLSHFVNLDNLRASNSKGSLPINVIVFDGKSKCEEAAAKVASVYYSQLMCQPILLLDQALISDI | ||||||
Region | 2471-2748 | CoV-Y | ||||
Sequence: QSSYIVDNVVVKNGSLHLYFDKEGQKTYERHPLSHFVNLDNLRASNSKGSLPINVIVFDGKSKCEEAAAKVASVYYSQLMCQPILLLDQALISDIGDSTEVAVKMFNAYVNTFAATFNAPIDKLKALITTAESELTKGVALDNVLSTFIAAARQGFVDSDVDVKDIMECLKLSHQSDIEFTGDSCNNYMLTYNKIDNMTPRDLGACIDCNARQINAQVAKSHNISLIWNVRDFMSLSEQLRKQVRSAAKKNGLPFKLTCATTRQVVNVVTTKISLKGG | ||||||
Region | 2566-2647 | Y3 | ||||
Sequence: GDSTEVAVKMFNAYVNTFAATFNAPIDKLKALITTAESELTKGVALDNVLSTFIAAARQGFVDSDVDVKDIMECLKLSHQSD | ||||||
Region | 2648-2748 | Y4 | ||||
Sequence: IEFTGDSCNNYMLTYNKIDNMTPRDLGACIDCNARQINAQVAKSHNISLIWNVRDFMSLSEQLRKQVRSAAKKNGLPFKLTCATTRQVVNVVTTKISLKGG | ||||||
Domain | 3150-3248 | Nsp4C | ||||
Sequence: IVFDGVSFSTFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSDVLYQPPQTSITSAVLQ | ||||||
Domain | 3249-3554 | Peptidase C30 | ||||
Sequence: SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDIVYCPRHVICTAEDMLNPNYDDLLIRKSNHNFIVRAGNVQLRVIGHSMHNSVLRLKVDTANAKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNYTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLDGNFYGPFEDRQTAQSAGTDTVITVNVLAWLYAAVINGDRWFLNHFTTTLNDFNLIAMKFNYEPLTQEQVDILGPLSAQTGIPVLSMCAALKEMLQNGMNGRTILGSAILEDEFTPFDVVRQCSGVTFQ | ||||||
Domain | 3845-3927 | RdRp Nsp7 cofactor | ||||
Sequence: SKMSDVKCTSVVLLSVLQQLRIESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ | ||||||
Domain | 3928-4125 | RdRp Nsp8 cofactor | ||||
Sequence: AIASEFSSLPSYAAYATAQEAYEQAVANGDSDTVLKKLKKALNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVVPDYNTYKNTCEGNTFTYASALWEIQQIVDADSKVVQLSEITPDNSNIMAWPLIVTALRANSAVRLQ | ||||||
Domain | 4126-4238 | Nsp9 ssRNA-binding | ||||
Sequence: NNELSPVALRQMSCAAGTTQNNCNEGSALAYYNTSKGGRFVLALLSDIQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQ | ||||||
Domain | 4239-4377 | ExoN/MTase coactivator | ||||
Sequence: AGNATEVPANSTVLSFCAFAVDAAKAYKDYLTSGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGYCELKGKYVQIPTTCANDPVGFVLRNTVCAVCGMWKGYGCSCDQLREPLLQ | ||||||
Domain | 4384-4638 | NiRAN | ||||
Sequence: FLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNDKIAGFAKFLKTNCCRFQEKDEDGNLIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDEYFNKKDWYDFVENPDILRVYANLGERIRQALLKTVQFCDAMRDAGIVGVLTLDNQDLNGNWYDFGDFIQATPGSGVPVVDSYYSLLMPILTLTRALTAESHVDTDL | ||||||
Domain | 4643-4741 | Nsp12 Interface | ||||
Sequence: IKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSS | ||||||
Domain | 4742-5309 | Nsp12 RNA-dependent RNA polymerase | ||||
Sequence: RLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSIELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAITANVNALLSTDGNKIADKYIRNLQHRLYECLYRNRDVDIDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSIMLTNDNTSRYWEPEFYEAMYTPHTVLQ | ||||||
Domain | 4989-5151 | RdRp catalytic | ||||
Sequence: PHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAITANVNALLSTDGNKIADKYIRNLQHRLYECLYRNRDVDIDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQG | ||||||
Domain | 5310-5393 | CV ZBD | ||||
Sequence: AVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLC | ||||||
Domain | 5566-5917 | +RNA virus helicase C-terminal | ||||
Sequence: NISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYENKLKAHKEKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFAQTTETAHSCNVNRFNVAITRAKVGILCIMSDRDLYDSLQFTSLEVPRRNVANLQAENVTGL | ||||||
Domain | 5982-6197 | ExoN | ||||
Sequence: MFITREEAIRHVRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLRNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHIASCDAIMTRCLAVHECFV | ||||||
Domain | 6206-6437 | N7-MTase | ||||
Sequence: YPIIGDELKINAACRKVQHMVVKAALLADKFSVLHDIGNPKAIKCVPQAEVEWKFYDAQPCGDKAYKIEELYYSYATHSDKFTDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFVNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSTTCITRCNLGGAVCKHHANEYRLYLDAYNMMISAGFSLWVYKQFDTYNLWNTFTKLQ | ||||||
Region | 6324-6338 | GpppA-binding | ||||
Sequence: CDGGSLYVNKHAFHT | ||||||
Domain | 6438-6498 | Nsp15 N-terminal oligomerization | ||||
Sequence: SLENVAFNVVNKGHFDGQQGEVPVSIINNTVYTKVDGVDVELFENKTTLPVNVAFELWAKR | ||||||
Domain | 6499-6624 | AV-Nsp11N/CoV-Nsp15M | ||||
Sequence: NIKPVPEVKILNNLGVDIAANTVIWDYKRDAPAYMSTIGVCTMTDIAKKPTEPTCASLTVLFDGRVEGQVDLFRNARNGVLYTEGSVKGLQSSVGPKHASLNGVTLVGEAVKTQFNYYKKVDGVIQ | ||||||
Domain | 6641-6780 | NendoU | ||||
Sequence: KPRSQMENDFLELAMDEFIHRYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLTKRSMESPLVLEDFIPLDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVNIIKSQDLSVVSKVVKVTIDYTEISFMLWCKDGHAETFYP | ||||||
Domain | 6785-7079 | Nidovirus-type SAM-dependent 2'-O-MTase | ||||
Sequence: SQAWQPGVAMPNLYKMQRMLLDKCDIQNYGEAATLPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDINDFVSDSDATLIGDCATIHTANKWDLIISDMYDPKTKNVTCENDSKEGFFTYICGFIQQRLALGGSVAIKITEHSWNADLYRLMGHFAWWTAFVTNVNASSSEAFLIGCNYLGKQCELIDGYVMHANYVFWRNTNPIQLSSYSLFDMSKFPLKLKGTAVMSLKESQINDMILSLLSKGRLIIRENNKVVFSNDVLV |
Sequence similarities
Belongs to the coronaviruses polyprotein 1ab family.
Keywords
- Domain
Family and domain databases
Sequence
- Sequence statusComplete
- Length7,081
- Mass (Da)791,648
- Last updated2022-01-19 v1
- ChecksumCAC716B8946C4170
Keywords
- Coding sequence diversity