A0A859GSJ7 · A0A859GSJ7_SARS2
- Protein: ORF1ab polyprotein
- Gene: ORF1ab
- Status: UniProtKB unreviewed (TrEMBL)
- Amino acids: 7096
- Protein existence: Inferred from homology
- Annotation score: 5/5
Function
Catalytic activity
- ATP + H2O = ADP + H+ + phosphate
- uridylyl-uridylyl-ribonucleotide-RNA = a 3'-end uridylyl-2',3'-cyclophospho-uridine-RNA + a 5'-end dephospho-ribonucleoside-RNA
Cofactor
Protein has several cofactor binding sites:
Features
Showing features for active site, binding site.
Type | Position(s) | Description | Sequence
---|---|---|---
Active site | 6015 | | D
Active site | 6017 | | E
Active site | 6116 | | E
Active site | 6193 | | H
Active site | 6198 | | D
Binding site | 6256-6262 | S-adenosyl-L-methionine | DIGNPKA
Active site | 6686 | | H
Active site | 6701 | | H
Active site | 6741 | | K
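The active-site positions above can be cross-checked against the domain spans listed under Family & Domains. The following is a minimal sketch in Python, assuming only three domain ranges copied from this entry (ExoN, N7-MTase, NendoU). The helper name `enclosing_domain` is hypothetical and is not part of any UniProt tooling.

```python
# Domain spans copied from the Family & Domains table of this entry
# (1-based, inclusive polyprotein coordinates).
DOMAINS = {
    "ExoN": (5997, 6212),
    "N7-MTase": (6221, 6452),
    "NendoU": (6656, 6795),
}

# Active-site positions copied from the Function feature table.
ACTIVE_SITES = [6015, 6017, 6116, 6193, 6198, 6686, 6701, 6741]


def enclosing_domain(position):
    """Return the name of the domain whose span contains `position`, or None."""
    for name, (start, end) in DOMAINS.items():
        if start <= position <= end:
            return name
    return None


# Map every active-site position to the domain that contains it.
site_to_domain = {pos: enclosing_domain(pos) for pos in ACTIVE_SITES}
for pos, name in site_to_domain.items():
    print(pos, name)
```

With these spans, the first five active sites fall inside the ExoN domain and the last three inside NendoU. The S-adenosyl-L-methionine binding site (6256-6262) lies inside the N7-MTase span.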
GO annotations
Keywords
- Molecular function
- Biological process
- Ligand
Names & Taxonomy
Protein names
- Recommended name: ORF1ab polyprotein
Gene names
Organism names
- Strain
- Taxonomic lineage: Viruses > Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales > Cornidovirineae > Coronaviridae > Orthocoronavirinae > Betacoronavirus > Sarbecovirus > Severe acute respiratory syndrome coronavirus
- Virus hosts
Accessions
- Primary accession: A0A859GSJ7
Subcellular Location
Features
Showing features for transmembrane.
Type | Position(s) | Description | Sequence
---|---|---|---
Transmembrane | 2229-2253 | Helical | IIIWFLLLSVCLGSLIYSTAALGVL
Transmembrane | 2328-2351 | Helical | FLAYILFTRFFYVLGLAAIMQLFF
Transmembrane | 2363-2387 | Helical | WLMWLIINLVQMAPISAMVRMYIFF
Transmembrane | 2773-2790 | Helical | LIKVTLVFLFVAAIFYLI
Transmembrane | 3043-3065 | Helical | ISASIVAGGIVAIVVTCLAYYFM
Transmembrane | 3077-3098 | Helical | VVAFNTLLFLMSFTVLCLTPVY
Transmembrane | 3104-3124 | Helical | VYSVIYLYLTFYLTNDVSFLA
Transmembrane | 3136-3154 | Helical | VPFWITIAYIICISTKHFY
Transmembrane | 3582-3605 | Helical | WLLLTILTSLLVLVQSTQWSLFFF
Transmembrane | 3611-3629 | Helical | FLPFAMGIIAMSAFAMMFV
Transmembrane | 3636-3657 | Helical | LCLFLLPSLATVAYFNMVYMPA
Transmembrane | 3683-3701 | Helical | VMYASAVVLLILMTARTVY
Transmembrane | 3731-3755 | Helical | ISMWALIISVTSNYSGVVTTVMFLA
Transmembrane | 3767-3789 | Helical | PIFFITGNTLQCIMLVYCFLGYF
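Feature positions in this entry appear to be 1-based and inclusive, so a span `start-end` should cover `end - start + 1` residues. The sketch below checks that assumption against three rows copied from the transmembrane table above. The function names are hypothetical.

```python
# A few rows copied from the transmembrane table above:
# (start, end, sequence), with 1-based inclusive coordinates.
TM_HELICES = [
    (2229, 2253, "IIIWFLLLSVCLGSLIYSTAALGVL"),
    (2773, 2790, "LIKVTLVFLFVAAIFYLI"),
    (3611, 3629, "FLPFAMGIIAMSAFAMMFV"),
]


def span_length(start, end):
    """Number of residues in a 1-based, inclusive span."""
    return end - start + 1


def is_consistent(start, end, sequence):
    """True if the listed sequence is exactly as long as its span."""
    return span_length(start, end) == len(sequence)


for start, end, seq in TM_HELICES:
    print(f"{start}-{end}: span={span_length(start, end)} "
          f"len={len(seq)} consistent={is_consistent(start, end, seq)}")
```

A mismatch here would point to a truncated sequence or an off-by-one in the coordinates, which is worth ruling out before slicing the full polyprotein by position.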
Keywords
- Cellular component
PTM/Processing
Features
Showing features for disulfide bond.
Type | Position(s) | Description | Sequence
---|---|---|---
Disulfide bond | 2282↔2288 | | CTGSIPC
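The disulfide bond links residues 2282 and 2288, both of which lie inside the 3Ecto domain (2247-2317) listed under Family & Domains. As a rough illustration, polyprotein coordinates can be converted to domain-local indices by subtracting the domain start. The sketch below assumes 1-based inclusive coordinates, and the helper names `residue_at` and `subsequence` are hypothetical.

```python
# 3Ecto domain as listed under Family & Domains (positions 2247-2317).
ECTO_START = 2247
ECTO_SEQ = (
    "TAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETIQITISSFKWD"
)


def residue_at(position, start, sequence):
    """Residue at a 1-based polyprotein position within a domain sequence."""
    return sequence[position - start]


def subsequence(first, last, start, sequence):
    """Residues first..last (1-based, inclusive) within a domain sequence."""
    return sequence[first - start : last - start + 1]


# Both ends of the annotated disulfide bond should be cysteines.
print(residue_at(2282, ECTO_START, ECTO_SEQ))
print(residue_at(2288, ECTO_START, ECTO_SEQ))
print(subsequence(2282, 2288, ECTO_START, ECTO_SEQ))
```

Under these assumptions both bonded positions resolve to cysteine, and the extracted span matches the `CTGSIPC` sequence shown in the disulfide bond table.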
Keywords
- PTM
Structure
Family & Domains
Features
Showing features for domain, region, compositional bias.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Domain | 12-127 | CoV Nsp1 globular | ||||
Sequence: THVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQHLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRKVLLRKNG | ||||||
Domain | 148-179 | BetaCoV Nsp1 C-terminal | ||||
Sequence: ELGTDPYEDFQENWNTKHSSGVTRELMRELNG | ||||||
Domain | 183-456 | CoV Nsp2 N-terminal | ||||
Sequence: TRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDIFNGECPNFVFPLNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQK | ||||||
Domain | 458-688 | CoV Nsp2 middle | ||||
Sequence: KVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLVNKFLAL | ||||||
Domain | 690-818 | CoV Nsp2 C-terminal | ||||
Sequence: ADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNTFTLKGG | ||||||
Domain | 821-929 | Ubiquitin-like | ||||
Sequence: TKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDE | ||||||
Region | 926-1000 | Disordered | ||||
Sequence: PPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQTT | ||||||
Compositional bias | 927-947 | Acidic residues | ||||
Sequence: PDEDEEEGDCEEEEFEPSTQY | ||||||
Compositional bias | 984-1000 | Polar residues | ||||
Sequence: SQQTVGQQDGSEDNQTT | ||||||
Domain | 1025-1194 | Macro | ||||
Sequence: VNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMK | ||||||
Domain | 1231-1359 | Macro | ||||
Sequence: KIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSI | ||||||
Domain | 1367-1494 | Macro | ||||
Sequence: ILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSS | ||||||
Domain | 1496-1561 | DPUP | ||||
Sequence: TPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLS | ||||||
Domain | 1565-1620 | Ubiquitin-like | ||||
Sequence: VRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYV | ||||||
Domain | 1634-1898 | Peptidase C16 | ||||
Sequence: YYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDN | ||||||
Domain | 1911-2021 | Nucleic acid-binding | ||||
Sequence: PIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWSTKPVET | ||||||
Domain | 2046-2155 | G2M | ||||
Sequence: PVSEEVVENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTN | ||||||
Domain | 2247-2317 | 3Ecto | ||||
Sequence: TAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETIQITISSFKWD | ||||||
Region | 2395-2485 | Y1 | ||||
Sequence: KSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTD | ||||||
Domain | 2395-2763 | CoV Nsp3 Y | ||||
Sequence: KSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGG | ||||||
Region | 2399-2412 | ZF1 | ||||
Sequence: HVVDGCNSSTCMMC | ||||||
Region | 2445-2455 | ZF2 | ||||
Sequence: CKLHNWNCVNC | ||||||
Region | 2486-2580 | Y2 | ||||
Sequence: QSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDV | ||||||
Region | 2486-2763 | CoV-Y | ||||
Sequence: QSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGG | ||||||
Region | 2581-2662 | Y3 | ||||
Sequence: GDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSD | ||||||
Region | 2663-2763 | Y4 | ||||
Sequence: IEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGG | ||||||
Domain | 3165-3263 | Nsp4C | ||||
Sequence: VVFNGVSFSTFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSDVLYQPPQTSITSAVLQ | ||||||
Domain | 3264-3569 | Peptidase C30 | ||||
Sequence: SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQ | ||||||
Domain | 3860-3942 | RdRp Nsp7 cofactor | ||||
Sequence: SKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ | ||||||
Domain | 3943-4140 | RdRp Nsp8 cofactor | ||||
Sequence: AIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQ | ||||||
Domain | 4141-4253 | Nsp9 ssRNA-binding | ||||
Sequence: NNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQ | ||||||
Domain | 4254-4392 | ExoN/MTase coactivator | ||||
Sequence: AGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQ | ||||||
Domain | 4399-4653 | NiRAN | ||||
Sequence: FLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVDTDL | ||||||
Domain | 4658-4756 | Nsp12 Interface | ||||
Sequence: IKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLFSTVFPLTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSS | ||||||
Domain | 4757-5324 | Nsp12 RNA-dependent RNA polymerase | ||||
Sequence: RLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQ | ||||||
Domain | 5004-5166 | RdRp catalytic | ||||
Sequence: PHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQG | ||||||
Domain | 5325-5408 | CV ZBD | ||||
Sequence: AVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLC | ||||||
Domain | 5581-5932 | +RNA virus helicase C-terminal | ||||
Sequence: NISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGL | ||||||
Domain | 5997-6212 | ExoN | ||||
Sequence: MFITREEAIRHVRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVASCDAIMTRCLAVHECFV | ||||||
Domain | 6221-6452 | N7-MTase | ||||
Sequence: YPIIGDELKINAACRKVQHMVVKAALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKQFDTYNLWNTFTRLQ | ||||||
Region | 6339-6353 | GpppA-binding | ||||
Sequence: XXXXXXXXXXXXXXX | ||||||
Domain | 6453-6513 | Nsp15 N-terminal oligomerization | ||||
Sequence: SLENVAFNVVNKGHFDGQQGEVPVSIINNTVYTKVDGVDVELFENKTTLPVNVAFELWAKR | ||||||
Domain | 6514-6639 | AV-Nsp11N/CoV-Nsp15M | ||||
Sequence: NIKPVPEVKILNNLGVDIAANTVIWDYKRDAPAHISTIGVCSMTDIAKKPTETICAPLTVFFDGRVDGQVDLFRNARNGVLITEGSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQ | ||||||
Domain | 6656-6795 | NendoU | ||||
Sequence: KPRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPFELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTIDYTEISFMLWCKDGHVETFYP | ||||||
Domain | 6800-7094 | Nidovirus-type SAM-dependent 2'-O-MTase | ||||
Sequence: SQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSATLPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYICGFIQQKLALXGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYLGKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMILSLLSKGRLIIRENNRVVISSDVLV |
Sequence similarities
Belongs to the coronaviruses polyprotein 1ab family.
Keywords
- Domain
Family and domain databases
Sequence
- Sequence status: Complete
- Length: 7,096
- Mass (Da): 794,070
- Last updated: 2021-09-29 v1
- Checksum: 25A7FB5E451164C5
Features
Showing features for compositional bias.
Type | Position(s) | Description | Sequence
---|---|---|---
Compositional bias | 927-947 | Acidic residues | PDEDEEEGDCEEEEFEPSTQY
Compositional bias | 984-1000 | Polar residues | SQQTVGQQDGSEDNQTT
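The "Acidic residues" label on region 927-947 can be checked with a simple residue count. The sketch below treats aspartate (D) and glutamate (E) as the acidic residues, which is a standard convention but is not stated in this entry. The function name `acidic_fraction` is hypothetical.

```python
from collections import Counter

# Assumption: "acidic" means aspartate (D) and glutamate (E).
ACIDIC = {"D", "E"}

# Compositional-bias region 927-947, copied from the table above.
BIAS_REGION = "PDEDEEEGDCEEEEFEPSTQY"


def acidic_fraction(sequence):
    """Fraction of residues in `sequence` that are D or E."""
    counts = Counter(sequence)
    acidic = sum(counts[residue] for residue in ACIDIC)
    return acidic / len(sequence)


print(f"{acidic_fraction(BIAS_REGION):.2f}")
```

Under this definition, 12 of the 21 residues in the region are acidic.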
Keywords
- Coding sequence diversity