A0A883GPX6 · A0A883GPX6_SARS2
- ProteinORF1ab polyprotein
- GeneORF1ab
- StatusUniProtKB unreviewed (TrEMBL)
- Amino acids7096 (go to sequence)
- Protein existenceInferred from homology
- Annotation score5/5
Function
Catalytic activity
- ATP + H2O = ADP + H+ + phosphate
- ATP + H2O = ADP + H+ + phosphate
- uridylyl-uridylyl-ribonucleotide-RNA = a 3'-end uridylyl-2',3'-cyclophospho-uridine-RNA + a 5'-end dephospho-ribonucleoside-RNA
Cofactor
Protein has several cofactor binding sites:
Features
Showing features for active site, binding site.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Active site | 6015 | |||||
Sequence: D | ||||||
Active site | 6017 | |||||
Sequence: E | ||||||
Active site | 6116 | |||||
Sequence: E | ||||||
Active site | 6193 | |||||
Sequence: H | ||||||
Active site | 6198 | |||||
Sequence: D | ||||||
Binding site | 6256-6262 | S-adenosyl-L-methionine (UniProtKB | ChEBI) | ||||
Sequence: DIGNPKA | ||||||
Active site | 6686 | |||||
Sequence: H | ||||||
Active site | 6701 | |||||
Sequence: H | ||||||
Active site | 6741 | |||||
Sequence: K |
GO annotations
Keywords
- Molecular function
- Biological process
- Ligand
Names & Taxonomy
Protein names
- Recommended nameORF1ab polyprotein
Gene names
Organism names
- Strain
- Taxonomic lineageViruses > Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales > Cornidovirineae > Coronaviridae > Orthocoronavirinae > Betacoronavirus > Sarbecovirus > Severe acute respiratory syndrome coronavirus
- Virus hosts
Accessions
- Primary accessionA0A883GPX6
Subcellular Location
UniProt Annotation
GO Annotation
Features
Showing features for transmembrane.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Transmembrane | 2229-2254 | Helical | ||||
Sequence: IIIWFLLLSVCLGSLIYSTAALGVLM | ||||||
Transmembrane | 2340-2359 | Helical | ||||
Sequence: VLGLAAIMQLFFSYFAVHFI | ||||||
Transmembrane | 2371-2394 | Helical | ||||
Sequence: LVQMAPISAMVRMYIFFASFYYVW | ||||||
Transmembrane | 2773-2790 | Helical | ||||
Sequence: LIKVTLVFLFVAAIFYLI | ||||||
Transmembrane | 3043-3065 | Helical | ||||
Sequence: ISASIVAGGIVAIVVTCLAYYFM | ||||||
Transmembrane | 3077-3100 | Helical | ||||
Sequence: VVAFNTLLFLMSFTVLCLTPVYSF | ||||||
Transmembrane | 3120-3140 | Helical | ||||
Sequence: VSFLAHIQWMVMFTPLVPFWI | ||||||
Transmembrane | 3582-3605 | Helical | ||||
Sequence: WLLLTILTSLLVLVQSTQWSLFFF | ||||||
Transmembrane | 3611-3629 | Helical | ||||
Sequence: FLPFAMGIIAMSAFAMMFV | ||||||
Transmembrane | 3636-3657 | Helical | ||||
Sequence: LCLFLLXXXATVAYFNMVYMPA | ||||||
Transmembrane | 3683-3701 | Helical | ||||
Sequence: VMYASAVVLLILMTARTVY |
Keywords
- Cellular component
PTM/Processing
Features
Showing features for disulfide bond.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Disulfide bond | 2282↔2288 | |||||
Sequence: CTGSIPC |
Keywords
- PTM
Structure
Family & Domains
Features
Showing features for domain, region, compositional bias, coiled coil.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Domain | 12-127 | CoV Nsp1 globular | ||||
Sequence: THVQLSLPVLQVRDVLVRGFGDSVEEVXXXXXXXXXXXXXXXXXXXXXXXXXLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRKVLLRKNG | ||||||
Domain | 148-179 | BetaCoV Nsp1 C-terminal | ||||
Sequence: ELGTDPYEDFQENWNTKHSSGVTRELMRELNG | ||||||
Domain | 183-456 | CoV Nsp2 N-terminal | ||||
Sequence: TRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATTXXXLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGRTIAFGGCVFXXXXXXXKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQK | ||||||
Domain | 458-688 | CoV Nsp2 middle | ||||
Sequence: KVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGAWNXXXXXXXXXXXXXXXXXXXXVVRSIFSRTLETAQNSVRVLQKAAXXILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSTCACEIVGGQIVTCAKEIKESVQXXXXXXXXFLAL | ||||||
Domain | 690-818 | CoV Nsp2 C-terminal | ||||
Sequence: ADSIIIGGAKLKALNLGETFVTHSKGLYRKXXXXXXXXXXXXXXXXXXXXXXXXGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNTFTLKGG | ||||||
Domain | 821-929 | Ubiquitin-like | ||||
Sequence: TKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDE | ||||||
Region | 926-1000 | Disordered | ||||
Sequence: PPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQTT | ||||||
Compositional bias | 927-947 | Acidic residues | ||||
Sequence: PDEDEEEGDCEEEEFEPSTQY | ||||||
Compositional bias | 984-1000 | Polar residues | ||||
Sequence: SQQTVGQQDGSEDNQTT | ||||||
Domain | 1025-1194 | Macro | ||||
Sequence: VNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNXXXXXXSDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMK | ||||||
Domain | 1231-1359 | Macro | ||||
Sequence: KIKACVEEVTTTLEETKFLTXXXXXYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALXXVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSI | ||||||
Domain | 1367-1494 | Macro | ||||
Sequence: ILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSS | ||||||
Domain | 1496-1561 | DPUP | ||||
Sequence: TPEEHFIETISLAGSYKDWSYSGQSTQLGIXXLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLS | ||||||
Domain | 1565-1620 | Ubiquitin-like | ||||
Sequence: VRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYV | ||||||
Domain | 1634-1898 | Peptidase C16 | ||||
Sequence: YYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVXXXXXXXXDN | ||||||
Domain | 1911-2021 | Nucleic acid-binding | ||||
Sequence: PIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWXXXXXXX | ||||||
Domain | 2046-2155 | G2M | ||||
Sequence: PVSEEVVENPTIQKDVLECNVKXXXXVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGXXXXXXVPWDTIANYAKPFLNKVVSTTTN | ||||||
Domain | 2247-2317 | 3Ecto | ||||
Sequence: TAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETIQITISSFKWD | ||||||
Region | 2395-2485 | Y1 | ||||
Sequence: KSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTD | ||||||
Domain | 2395-2763 | CoV Nsp3 Y | ||||
Sequence: KSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILXXXXXXXSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVXXXXALKGG | ||||||
Region | 2399-2412 | ZF1 | ||||
Sequence: HVVDGCNSSTCMMC | ||||||
Region | 2445-2455 | ZF2 | ||||
Sequence: CKLHNWNCVNC | ||||||
Region | 2486-2580 | Y2 | ||||
Sequence: QSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILXXXXXXXSDV | ||||||
Region | 2486-2763 | CoV-Y | ||||
Sequence: QSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILXXXXXXXSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVXXXXALKGG | ||||||
Region | 2581-2662 | Y3 | ||||
Sequence: GDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSD | ||||||
Region | 2663-2763 | Y4 | ||||
Sequence: IEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVXXXXALKGG | ||||||
Domain | 3165-3263 | Nsp4C | ||||
Sequence: VVFNGVSFSTFEEAALCTXXXXXXXXXXXXXXXXLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSDVLYQPPQTSITSAVLQ | ||||||
Domain | 3264-3569 | Peptidase C30 | ||||
Sequence: SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQ | ||||||
Domain | 3860-3942 | RdRp Nsp7 cofactor | ||||
Sequence: XXXXXXXXXXXXXXXXXXXXXXXSSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ | ||||||
Domain | 3943-4140 | RdRp Nsp8 cofactor | ||||
Sequence: AIASEFSSLPSYAXXXXXXXXYEQAVANGDSEVVLKKLKKSLNVAXXXXXXXXXXXXXXXXXXXQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQ | ||||||
Coiled coil | 3977-4019 | |||||
Sequence: LKKLKKSLNVAXXXXXXXXXXXXXXXXXXXQAMTQMYKQARSE | ||||||
Domain | 4141-4253 | Nsp9 ssRNA-binding | ||||
Sequence: NNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMXXXXXAATVRLQ | ||||||
Domain | 4254-4392 | ExoN/MTase coactivator | ||||
Sequence: AGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQ | ||||||
Domain | 4399-4653 | NiRAN | ||||
Sequence: FLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKXXXXXYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVDTDL | ||||||
Domain | 4658-4756 | Nsp12 Interface | ||||
Sequence: IKWDLLKYDFTEERLKLFDRYFKXXXXXXXXXXVNCLDDRCILHCANFNVLFSTVFPLTSFGPLVRKIFVDGVPFVVSTGYHXXXXXXVHNQDVNLHSS | ||||||
Domain | 4757-5324 | Nsp12 RNA-dependent RNA polymerase | ||||
Sequence: RLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRXXXXXXLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQ | ||||||
Domain | 5004-5166 | RdRp catalytic | ||||
Sequence: PHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQG | ||||||
Domain | 5325-5408 | CV ZBD | ||||
Sequence: AVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLC | ||||||
Domain | 5581-5932 | +RNA virus helicase C-terminal | ||||
Sequence: NISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGL | ||||||
Domain | 5997-6212 | ExoN | ||||
Sequence: MFIXXXXXIRHVRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVASCDAIMTRCLAVHECFV | ||||||
Domain | 6221-6452 | N7-MTase | ||||
Sequence: YPIIGDELKINAACRKVQHMVVKAALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFVNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYLDAYNMMISAGFSXXVYKQFDTYNLWNTFTRLQ | ||||||
Region | 6339-6353 | GpppA-binding | ||||
Sequence: CDGGSLYVNKHAFHT | ||||||
Domain | 6453-6513 | Nsp15 N-terminal oligomerization | ||||
Sequence: SLENVAFNVVNKGHFDGQQGEVPVSIINNTVYTKVDGVDVELFENKTTLPVNVAFELWAKR | ||||||
Domain | 6514-6639 | AV-Nsp11N/CoV-Nsp15M | ||||
Sequence: NIKPVPEVKILNNLGVDIAANTVIWDYKRDAPAHISTIGVCSMTDIAKKPTETICAPLTVFFDGRVDGQVDLFRNARNGVLITEGSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQ | ||||||
Domain | 6656-6795 | NendoU | ||||
Sequence: KPRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGXLHLLIGLAKRFKESPFELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTIDYTEISFMLWCKDGHVETFYP | ||||||
Domain | 6800-7094 | Nidovirus-type SAM-dependent 2'-O-MTase | ||||
Sequence: SQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSATLPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYICGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYLGKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMILSLLSKGRLIIRENNRVVISSDVLV |
Sequence similarities
Belongs to the coronaviruses polyprotein 1ab family.
Keywords
- Domain
Family and domain databases
Sequence
- Sequence statusComplete
- Length7,096
- Mass (Da)793,060
- Last updated2021-09-29 v1
- ChecksumAA43A9F795152E6B
Features
Showing features for compositional bias.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Compositional bias | 927-947 | Acidic residues | ||||
Sequence: PDEDEEEGDCEEEEFEPSTQY | ||||||
Compositional bias | 984-1000 | Polar residues | ||||
Sequence: SQQTVGQQDGSEDNQTT |
Keywords
- Coding sequence diversity