A0A109NT92 · A0A109NT92_9ALPC
- ProteinORF1ab polyprotein
- Gene1b
- StatusUniProtKB unreviewed (TrEMBL)
- Organism
- Amino acids6781 (go to sequence)
- Protein existenceInferred from homology
- Annotation score5/5
Function
function
Forms a primer, NSP9-pU, which is utilized by the polymerase for the initiation of RNA chains. Interacts with ribosome signal recognition particle RNA (SRP). Together with NSP8, suppress protein integration into the cell membrane, thereby disrupting host immune defenses.
RNA-directed RNA polymerase that catalyzes the transcription of viral genomic and subgenomic RNAs. Acts in complex with nsp7 and nsp8 to transcribe both the minus and positive strands of genomic RNA. The kinase-like NiRAN domain of NSP12 attaches one or more nucleotides to the amino terminus of NSP9, forming a covalent RNA-protein intermediate that serves as transcription/replication primer. Subgenomic RNAs (sgRNAs) are formed by discontinuous transcription: The polymerase has the ability to pause at transcription-regulating sequences (TRS) and jump to the leader TRS, resulting in a major deletion. This creates a series of subgenomic RNAs that are replicated, transcribed and translated. In addition, Nsp12 is a subunit of the viral RNA capping enzyme that catalyzes the RNA guanylyltransferase reaction for genomic and sub-genomic RNAs. Subsequently, the NiRAN domain transfers RNA to GDP, and forms the core cap structure GpppA-RNA.
Catalytic activity
- ATP + H2O = ADP + H+ + phosphate
- ATP + H2O = ADP + H+ + phosphate
- uridylyl-uridylyl-ribonucleotide-RNA = a 3'-end uridylyl-2',3'-cyclophospho-uridine-RNA + a 5'-end dephospho-ribonucleoside-RNA
Cofactor
Features
Showing features for active site, binding site.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Active site | 5714 | |||||
Sequence: D | ||||||
Active site | 5716 | |||||
Sequence: E | ||||||
Active site | 5815 | |||||
Sequence: E | ||||||
Active site | 5891 | |||||
Sequence: H | ||||||
Active site | 5896 | |||||
Sequence: D | ||||||
Binding site | 5954-5960 | S-adenosyl-L-methionine (UniProtKB | ChEBI) | ||||
Sequence: DIGNPKG | ||||||
Active site | 6367 | |||||
Sequence: H | ||||||
Active site | 6382 | |||||
Sequence: H | ||||||
Active site | 6423 | |||||
Sequence: K |
GO annotations
Keywords
- Molecular function
- Biological process
- Ligand
Names & Taxonomy
Protein names
- Recommended nameORF1ab polyprotein
Gene names
Organism names
- Organism
- Taxonomic lineageViruses > Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales > Cornidovirineae > Coronaviridae > Orthocoronavirinae > Alphacoronavirus > Pedacovirus
Accessions
- Primary accessionA0A109NT92
Proteomes
Subcellular Location
UniProt Annotation
GO Annotation
Host membrane ; Multi-pass membrane protein
Membrane ; Multi-pass membrane protein
Features
Showing features for transmembrane.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Transmembrane | 1964-1982 | Helical | ||||
Sequence: LITVFLYILSILGLCFRAF | ||||||
Transmembrane | 2025-2044 | Helical | ||||
Sequence: VLGKFSLGIYALYALLFMTI | ||||||
Transmembrane | 2104-2125 | Helical | ||||
Sequence: LIGNVMPFFYLAFLAIFGGVYV | ||||||
Transmembrane | 2132-2153 | Helical | ||||
Sequence: FIFHYLNILGVFLGLQQSIWFL | ||||||
Transmembrane | 2165-2185 | Helical | ||||
Sequence: IVVFFIVTRVLMFLKHVFLGC | ||||||
Transmembrane | 2528-2546 | Helical | ||||
Sequence: FFWFLCLFIVAVFFALSFF | ||||||
Transmembrane | 2787-2804 | Helical | ||||
Sequence: ILFNCIIAFAAVAVCFLF | ||||||
Transmembrane | 2859-2887 | Helical | ||||
Sequence: WIWHLGFLISYILIAPWWVLMVYAFSAIF | ||||||
Transmembrane | 3337-3354 | Helical | ||||
Sequence: YVTPMFACLSLLSSLLMF | ||||||
Transmembrane | 3361-3381 | Helical | ||||
Sequence: LFFQVFLIPALIVTSCINLAF | ||||||
Transmembrane | 3401-3419 | Helical | ||||
Sequence: GFNAQGLVNIFVCFVVTIL | ||||||
Transmembrane | 3431-3449 | Helical | ||||
Sequence: PVSSVTYVVALLTAAYNYF | ||||||
Transmembrane | 3469-3493 | Helical | ||||
Sequence: WFVGAVCYKAAVYMALRFPTFVAIF | ||||||
Transmembrane | 3500-3520 | Helical | ||||
Sequence: MFCYLVLGYFTCCFYGILYWF |
Keywords
- Cellular component
PTM/Processing
Keywords
- PTM
Structure
Family & Domains
Features
Showing features for domain, region.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Domain | 2-109 | CoV Nsp1 globular | ||||
Sequence: ASNHVTLAFANDAEISAFGFCTASEAVSYYSEAAASGFMQCRFVSFDLADTVEGLLPEDYVMVVVGTTKLSAYVDTFGSRPKNICGWLLFSNCNYFLEELELTFGRRG | ||||||
Domain | 112-364 | CoV Nsp2 N-terminal | ||||
Sequence: IVPVDQYMCGADGKPVLQESEWEYTDFFADSEDGQLNIAGITYVKAWVVERSDVSYASQNLTSIKSITYCSTYEHTFPDGTAMKVARTPKIKKTVVLSEPLATIYREIGSPFVDNGSDARSIIKRPVFLHAFVKCKCGSYHWTVGDWTSYVSTCCGFKCKPVLVASCSATPGSVVVTRAGAGTGVKYYNNMFLRHVADIDGLAFWRILKVQSKDDLACSGKFLEHHEEGFTDPCYFLNDSSIATKLKFDILSG | ||||||
Domain | 383-776 | CoV Nsp2 middle | ||||
Sequence: SALVDIVDDALGQPWFIRKLGDLASAAWEQLKAVVRGLNLLSDEVVLFGKRLSCATLSIVNGVFEFIAEVPEKLAAAVTVFVNFLNELFESACDCLKVGGKTFNKVGSYVLFDNALVKLVKAKVRGPRQAGVCEVRYTSLVIGSTTKVVSKRVENANVNLVVVDEDVTLNTTGRTVVVDGLAFFESDGFYRHLADADVVIEHPVYKSACELKPVFECDPIPDFPMPVAASVAELCVQTDLLLKNYNTPYKTYSCVVRGDKCCITCTLHITAPSYMEDAANFVDLCTKNIGTAGFHEFYITAHEQQDLQGFVTTCCTMSGFECFMPIIPQCPAVLEEIDGGSIWRSFITGLNTMWDFCKHLKVSFGLDGIVVTVARKFKRLGALLAEMYNTYLST | ||||||
Domain | 778-895 | CoV Nsp2 C-terminal | ||||
Sequence: VENLVLAGVSFKYYATSVPKIVLGCCFHSVKSVLASAFQIPVQEGIEKFKVFLNCVHPVVPRVIETSFVELEETTFKPPALNGSIAIVDGFAFYYDGTLYYPTDGNSVVPICFKKKGG | ||||||
Domain | 896-991 | Ubiquitin-like | ||||
Sequence: GDVKFSDEVSVRTIDPVYKVSLEFEFESETIMAVLNKAVGNRIKVTGGWDDVVEYINVAIEVLKDHIDVPKYYIYDEEGGTDPNLPVMVSQWPLND | ||||||
Region | 1011-1045 | Disordered | ||||
Sequence: FEGDEVDSSDPDKVADVANSEPEDDGPNVAPETNV | ||||||
Domain | 1057-1296 | Peptidase C16 | ||||
Sequence: SFIKDTPSTVTKDPFAFDFASYGGLKVLRQSHNNCWVTSTLVQLQLLGIVDDPAMELFSAGRVGPMVRKCYESQKAILGSLGDVSACLESLTKDLHTLKITCSVVCGCGTGERIYEGCAFRMTPTLEPFPYGACAQCAQVLMHTFKSIVGTGIFCRDTTALSLDSLVVKPLCAAAFIGKDSGHYVTNFYDAAMAIDGYGRHQIKYDTLNTICVKDVNWTAPFVPDVEPVLEPVVKPFYSY | ||||||
Domain | 1286-1465 | Macro | ||||
Sequence: LEPVVKPFYSYKNVDFYQGDFSDLVKLPCDFVVNAANENLSHGGGIAKAIDVYTKGMLQKCSNDYIKAHGPIKVGRGVMLEALGLKVFNVVGPRKGKHAPELLVKAYKSVFANSGVALTPLISVGIFSVPLEESLSAFLACVGGRHCKCFCYSDKEREAIINYMDGLVDAIFKDALVDTT | ||||||
Domain | 1630-1685 | Ubiquitin-like | ||||
Sequence: NKSVVIKVTEDTRSVKTVKVESTVTYGQQIGPCLVNDTVVTDNKPVVADVVAKVVP | ||||||
Domain | 1691-1951 | Peptidase C16 | ||||
Sequence: SHYGFDKAGEFHMLDHTGFAFPSEVVNGRRVLKTTDNNCWVNVTCLQLQFARFRFKSAGLQAMWESYCTGDVAMFVHWLYWLTGVDKGQPSDSENALNMLSKYIVPAGSVTIERVTHDGCCCSKRVVTAPVVNASVLKLGVEDGLCPHGLNYIDKVVVVKGTTIVVNVGKPVVAPSHLFLKGVSYTTFLDNGNGVAGHYTVFDHDTGMVHDGDVFVPGDLNVSPVTNVVVSEQTAVVIKDPVKKVELDATKLLDTMNYASE | ||||||
Domain | 2038-2102 | 3Ecto | ||||
Sequence: ALLFMTIRFTPIGGPVCDDVVAGYANSSFDKNEYCNSVICKVCLYGYQELSDFSHTQVVWQHLRD | ||||||
Region | 2176-2266 | Y1 | ||||
Sequence: MFLKHVFLGCDKASCVACSKSARLKRVPVQTIFQGTSKSFYVHANGGSKFCKKHNFFCLNCDSYGPGCTFINDVIATEVGNVVKLNVQSTG | ||||||
Domain | 2176-2516 | CoV Nsp3 Y | ||||
Sequence: MFLKHVFLGCDKASCVACSKSARLKRVPVQTIFQGTSKSFYVHANGGSKFCKKHNFFCLNCDSYGPGCTFINDVIATEVGNVVKLNVQSTGPATILIDKVEFSNGFYYLYSGDTFWKYNFDITDNKYTCKESLKNCSIITDFIVFNNNGSNVNQVKNACVYFSQMLCKPVKLVDSALLASLSVDFGASLHSAFVSVLSNSFGKDLSSCNDMQDCKSTLGFDDVPLDTFNAAVAEAHRYDVLLTDMSFNNFTTSYAKPEEKLPVHDIATCMRVGAKIVNHNVLVKDSIPVVWLVRDFIALSEETRKYIIRTTKVKGITFMLTFNDCRMHTTIPTVCIANKKG | ||||||
Region | 2180-2193 | ZF1 | ||||
Sequence: HVFLGCDKASCVAC | ||||||
Region | 2226-2236 | ZF2 | ||||
Sequence: CKKHNFFCLNC | ||||||
Region | 2267-2516 | CoV-Y | ||||
Sequence: PATILIDKVEFSNGFYYLYSGDTFWKYNFDITDNKYTCKESLKNCSIITDFIVFNNNGSNVNQVKNACVYFSQMLCKPVKLVDSALLASLSVDFGASLHSAFVSVLSNSFGKDLSSCNDMQDCKSTLGFDDVPLDTFNAAVAEAHRYDVLLTDMSFNNFTTSYAKPEEKLPVHDIATCMRVGAKIVNHNVLVKDSIPVVWLVRDFIALSEETRKYIIRTTKVKGITFMLTFNDCRMHTTIPTVCIANKKG | ||||||
Region | 2415-2516 | Y4 | ||||
Sequence: VLLTDMSFNNFTTSYAKPEEKLPVHDIATCMRVGAKIVNHNVLVKDSIPVVWLVRDFIALSEETRKYIIRTTKVKGITFMLTFNDCRMHTTIPTVCIANKKG | ||||||
Domain | 2902-2997 | Nsp4C | ||||
Sequence: LFEGDKFVGSFENAAAGTFVLDMHAYERLANSISTEKLRQYASTYNKYKYYSGSASEADYRLACFAHLAKAMMDYASNHNDTLYTPPTVSYNSTLQ | ||||||
Domain | 2998-3299 | Peptidase C30 | ||||
Sequence: AGLRKMAQPSGVVEKCIVRVCYGNMALNGLWLGDTVICPRHVIASSTTSTIDYDYALSVLRLHNFSISSGNVFLGVVGVTMRGALLQIKVNQNNVHTPKYTYRTVRPGESFNILACYDGSAAGVYGVNMRSNYTIRGSFINGACGSPGYNINNGTVEFCYLHQLELGSGCHVGSDLDGVMYGGYEDQPTLQVEGASSLFTENVLAFLYAALINGSTWWLSSSRIAVDRFNEWAVHNGMTTVVNTDCFSILAAKTGVDVQRLLASIQSLHKNFGGKQILGYTSLTDEFTTGEVIRQMYGVNLQ | ||||||
Domain | 3580-3662 | RdRp Nsp7 cofactor | ||||
Sequence: SKLTDIKCSNVVLLGCLSSMNVSANSTEWAYCVDLHNKINLCNDPEKAQEMLLALLAFFLSKNSAFGLDDLLESYFNDNSMLQ | ||||||
Domain | 3663-3857 | RdRp Nsp8 cofactor | ||||
Sequence: SVASTYVGLPSYVIYENARQQYEDAVNNGSPPQLVKQLRHAMNVAKSEFDREASTQRKLDRMAEQAAAQMYKEARAVNRKSKVVSAMHSLLFGMLRRLDMSSVDTILNLAKDGVVPLSVIPAVSATKLNIVTSDIDSYNRIQREGCVHYAGTIWNIIDIKDNDGKVVHVKEVTAQNAESLSWPLVLGCERIVKLQ | ||||||
Domain | 3858-3965 | Nsp9 ssRNA-binding | ||||
Sequence: NNEIIPGKLKQRSIKAEGDGIVGEGKALYNNEGGRTFMYAFISDKPDLRVVKWEFDGGCNTIELEPPRKFLVDSPNGAQIKYLYFVRNLNTLRRGAVLGYIGATVRLQ | ||||||
Domain | 3966-4103 | ExoN/MTase coactivator | ||||
Sequence: AGKQTEQAINSSLLTLCAFAVDPAKTYIDAVKSGHKPVGNCVKMLANGSGNGQAVTNGVEASTNQDSYGGASVCLYCRAHVEHPSMDGFCRLKGKYVQVPLGTVDPIRFVLENDVCKVCGCWLANGCTCDRSIMQSTD | ||||||
Domain | 4106-4355 | NiRAN | ||||
Sequence: YLNRVRGSSAARLEPCNGTDTQHVYRAFDIYNKDVACLGKFLKVNCVRLKNLDKHDAFYVVKRCTKSAMEHEQSIYSRLEKCGAVAEHDFFTWKDGRAIYGNVCRKDLTEYTMMDLCYALRNFDENNCDVLKSILIKVGACEESYFNNKVWFDPVENEDIHRVYALLGTIVSRAMLKCVKFCDAMVEQGIVGVVTLDNQDLNGDFYDFGDFTCSIKGMGIPICTSYYSYMMPVMGMTNCLASECFVKSDI | ||||||
Domain | 4361-4459 | Nsp12 Interface | ||||
Sequence: KSYDLLEYDFTEHKTALFNKYFKYWGLQYHPNCVDCSDEQCIVHCANFNTLFSTTIPITAFGPLCRKCWIDGVPLVTTAGYHFKQLGIVWNNDLNLHSS | ||||||
Domain | 4460-5027 | Nsp12 RNA-dependent RNA polymerase | ||||
Sequence: RLSINELLQFCSDPALLIASSPALVDQRTVCFSVAALGTGMTNQTVKPGHFNKEFYDFLLEQGFFSEGSELTLKHFFFAQKGDAAVKDFDYYRYNRPTVLDICQARVVYQIVQRYFDIYEGGCITAKEVVVTNLNKSAGYPLNKFGKAGLYYESLSYEEQDELYAYTKRNILPTMTQLNLKYAISGKERARTVGGVSLLSTMTTRQYHQKHLKSIVNTRGASVVIGTTKFYGGWDNMLKNLIDGVENPCLMGWDYPKCDRALPNMIRMISAMILGSKHTTCCSSTDRFFRLCNELAQVLTEVVYSNGGFYLKPGGTTSGDATTAYANSVFNIFQAVSANVNKLLSVDSNVCHNLEVKQLQRKLYECCYRSTTVDDQFVVEYYGYLRKHFSMMILSDDGVVCYNNDYASLGYVADLNAFKAVLYYQNNVFMSASKCWIEPDINKGPHEFCSQHTMQIVDKDGTYYLPYPDPSRILSAGVFVDDVVKTDAVVLLERYVSLAIDAYPLSKHENPEYKKVFYVLLDWVKHLYKTLNAGVLESFSVTLLEDSTAKFWDESFYANMYEKSAVLQ | ||||||
Domain | 5028-5111 | CV ZBD | ||||
Sequence: SAGLCVVCGSQTVLRCGDCLRRPMLCTKCAYDHVIGTTHKFILAITPYVCCASDCGVNDVTKLYLGGLSYWCHDHKPRLAFPLC | ||||||
Domain | 5275-5636 | +RNA virus helicase C-terminal | ||||
Sequence: STIHKLHPAFNIPEAYSSLVPYYQLIGKQKITTIQGPPGSGKSHCVIGLGLYYPGARIVFTACSHAAVDSLCVKASTAYSNDKCSRIIPQRARVECYDGFKSNNTSAQYLFSTVNALPECNADIVVVDEVSMCTNYDLSVINQRISYRHVVYVGDPQQLPAPRVMISRGTLEPKDYNVVTQRMCALKPDVFLHKCYRCPAEIVRTVSEMVYENQFIPVHPDSKQCFKIFCKGNVQVDNGSSINRRQLDVVRMFLAKNPRWSKAVFISPYNSQNYVASRMLGLQIQTVDSSQGSEYDYVIYTQTSDTAHACNVNRFNVAITRAKKGILCIMCDRSLFDVLKFFELKLSDLQANEGCGLFKDCS | ||||||
Domain | 5696-5910 | ExoN | ||||
Sequence: LFCTRDFAMRNVRGWLGFDVEGAHVVGSNVGTNVPLQLGFSNGVDFVVRPEGCVVTESGDYIKPVRARAPPGEQFAHLLPLLKRGQPWDVVRKRIVQMCSDYLANLSDILIFVLWAGGLELTTMRYFVKIGPSKSCDCGKVATCYNSALHTYCCFKHALGCDYLYNPYCIDIQQWGYKGSLSLNHHEHCNVHRNEHVASGDAIMTRCLAIHDCFV | ||||||
Domain | 5919-6140 | N7-MTase | ||||
Sequence: YPFIGNEAVINKSGRIVQSHTMRSVLKLYNPKAIYDIGNPKGIRCAVTDAKWFCFDKNPTNSNVKTLEYDYITHGQFDGLCLFWNCNVDMYPEFSVVCRFDTRCRSPLNLEGCNGGSLYVNNHAFHTPAFDKRAFAKLKPMPFFFYDDTECDKLQDSINYVPLRASNCITKCNVGGAVCSKHCAMYHSYVNAYNTFTSAGFTIWVPTSFDTYNLWQTFSNNL | ||||||
Region | 6031-6045 | GpppA-binding | ||||
Sequence: CNGGSLYVNNHAFHT | ||||||
Domain | 6142-6202 | Nsp15 N-terminal oligomerization | ||||
Sequence: GLENIAFNVVKKGSFVGAEGELPVAVVNDKVLVRDGTVDTLVFTNKTSLPTNVAFELYAKR | ||||||
Domain | 6203-6320 | AV-Nsp11N/CoV-Nsp15M | ||||
Sequence: KVGLTPPITILRNLGVVCTSKCVIWDYEAERPLTTFTKDVCKYTDFDGDVCTLFDNSIVGSLERFSMTQNAVLMSLTAVKKLTGIKLTYGYLNGVPVNTHEDKPFTWYIYTRKNGKFE | ||||||
Domain | 6337-6477 | NendoU | ||||
Sequence: SPRSDMEKDFLSMDMGLFINKYGLEDYGFEHVVYGDVSKTTLGGLHLLISQVRLACMGVLKIDEFVSSNDSTLKSCTVTYADNPSSKMVCTYMDLLLDDFVSILKSLDLGVVSKVHEVMVDCKMWRWMLWCKDHKLQTFYP | ||||||
Domain | 6481-6777 | Nidovirus-type SAM-dependent 2'-O-MTase | ||||
Sequence: ASEWKCGYSMPSIYKIQRMCLEPCNLYNYGAGIKLPDGIMFNVVKYTQLCQYLNSTTMCVPHHMRVLHLGAGSDKGVAPGTAVLRRWLPLDAIIVDNDSVDYVSDADYSVTGDCSTLYLSDKFDLVISDMYDGKIKSCDGENVSKEGFFPYINGVITEKLALGGTVAIKVTEFSWNKKLYELIQRFEYWTMFCTSVNTSSSEAFLIGVHYLGDFASGAVIDGNTMHANYIFWRNSTIMTMSYNSVLDLSKFNCKHKATVVINLKDSSISDVVLGLLKNGKLLVRNNDAICGFSNHLV |
Sequence similarities
Belongs to the coronaviruses polyprotein 1ab family.
Keywords
- Domain
Family and domain databases
Sequence
- Sequence statusComplete
- Length6,781
- Mass (Da)753,409
- Last updated2016-04-13 v1
- Checksum29FE0132153A0D52
Keywords
- Coding sequence diversity