A0A7S9VN56 · A0A7S9VN56_SARS2
- ProteinORF1ab polyprotein
- GeneORF1ab
- StatusUniProtKB unreviewed (TrEMBL)
- Amino acids7096 (go to sequence)
- Protein existenceInferred from homology
- Annotation score5/5
Function
function
Forms a primer, NSP9-pU, which is utilized by the polymerase for the initiation of RNA chains. Interacts with ribosome signal recognition particle RNA (SRP). Together with NSP8, suppress protein integration into the cell membrane, thereby disrupting host immune defenses.
Plays a role in viral RNA synthesis through two distinct activities. The N7-guanine methyltransferase activity plays a role in the formation of the cap structure GpppA-RNA. The proofreading exoribonuclease reduces the sensitivity of the virus to RNA mutagens during replication. This activity acts on both ssRNA and dsRNA in a 3'-5' direction.
RNA-directed RNA polymerase that catalyzes the transcription of viral genomic and subgenomic RNAs. Acts in complex with nsp7 and nsp8 to transcribe both the minus and positive strands of genomic RNA. The kinase-like NiRAN domain of NSP12 attaches one or more nucleotides to the amino terminus of NSP9, forming a covalent RNA-protein intermediate that serves as transcription/replication primer. Subgenomic RNAs (sgRNAs) are formed by discontinuous transcription: The polymerase has the ability to pause at transcription-regulating sequences (TRS) and jump to the leader TRS, resulting in a major deletion. This creates a series of subgenomic RNAs that are replicated, transcribed and translated. In addition, Nsp12 is a subunit of the viral RNA capping enzyme that catalyzes the RNA guanylyltransferase reaction for genomic and sub-genomic RNAs. Subsequently, the NiRAN domain transfers RNA to GDP, and forms the core cap structure GpppA-RNA.
Catalytic activity
- ATP + H2O = ADP + phosphate + H+
- ATP + H2O = ADP + phosphate + H+
- uridylyl-uridylyl-ribonucleotide-RNA = a 3'-end uridylyl-2',3'-cyclophospho-uridine-RNA + a 5'-end dephospho-ribonucleoside-RNA
Cofactor
Protein has several cofactor binding sites:
Features
Showing features for active site, binding site.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Active site | 6015 | |||||
Sequence: D | ||||||
Active site | 6017 | |||||
Sequence: E | ||||||
Active site | 6116 | |||||
Sequence: E | ||||||
Active site | 6193 | |||||
Sequence: H | ||||||
Active site | 6198 | |||||
Sequence: D | ||||||
Binding site | 6256-6262 | S-adenosyl-L-methionine (UniProtKB | ChEBI) | ||||
Sequence: DIGNPKA | ||||||
Active site | 6686 | |||||
Sequence: H | ||||||
Active site | 6701 | |||||
Sequence: H | ||||||
Active site | 6741 | |||||
Sequence: K |
GO annotations
Keywords
- Molecular function
- Biological process
- Ligand
Names & Taxonomy
Protein names
- Recommended nameORF1ab polyprotein
Gene names
Organism names
- Strains
- Taxonomic lineageViruses > Riboviria > Orthornavirae > Pisuviricota > Pisoniviricetes > Nidovirales > Cornidovirineae > Coronaviridae > Orthocoronavirinae > Betacoronavirus > Sarbecovirus > Severe acute respiratory syndrome coronavirus
- Virus hosts
Accessions
- Primary accessionA0A7S9VN56
Subcellular Location
UniProt Annotation
GO Annotation
Host membrane ; Multi-pass membrane protein
Features
Showing features for transmembrane.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Transmembrane | 2229-2253 | Helical | ||||
Sequence: IIIWFLLLSVCLGSLIYSTAALGVL | ||||||
Transmembrane | 2328-2351 | Helical | ||||
Sequence: FLAYILFTRFFYVLGLAAIMQLFF | ||||||
Transmembrane | 2363-2387 | Helical | ||||
Sequence: WLMWLIINLVQMAPISAMVRMYIFF | ||||||
Transmembrane | 2773-2790 | Helical | ||||
Sequence: LIKVTLVFLFVAAIFYLI | ||||||
Transmembrane | 3043-3065 | Helical | ||||
Sequence: ISASIVAGGIVAIVVTCLAYYFM | ||||||
Transmembrane | 3077-3098 | Helical | ||||
Sequence: VVAFNTLLFLMSFTVLCLTPVY | ||||||
Transmembrane | 3104-3124 | Helical | ||||
Sequence: VYSVIYLYLTFYLTNDVSFLA | ||||||
Transmembrane | 3136-3154 | Helical | ||||
Sequence: VPFWITIAYIICISTKHFY | ||||||
Transmembrane | 3582-3605 | Helical | ||||
Sequence: WLLLTILTSLLVLVQSTQWSLFFF | ||||||
Transmembrane | 3611-3629 | Helical | ||||
Sequence: FLPFAMGIIAMSAFAMMFV | ||||||
Transmembrane | 3636-3657 | Helical | ||||
Sequence: LCLFLLPSLATVAYFNMVYMPA | ||||||
Transmembrane | 3683-3701 | Helical | ||||
Sequence: VMYASAVVLLILMTARTVY | ||||||
Transmembrane | 3731-3755 | Helical | ||||
Sequence: ISMWALIISVTSNYSGVVTTVMFLA | ||||||
Transmembrane | 3767-3789 | Helical | ||||
Sequence: PIFFITGNTLQCIMLVYCFLGYF |
Keywords
- Cellular component
PTM/Processing
Features
Showing features for disulfide bond.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Disulfide bond | 2282↔2288 | |||||
Sequence: CTGSIPC |
Keywords
- PTM
Interaction
Subunit
Interacts with nsp7 and nsp8 to form the replication-transcription complex (RTC): nsp12, nsp7, two subunits of nsp8, and up to two subunits of nsp13. Interacts with nsp9.
Structure
Family & Domains
Features
Showing features for domain, region, compositional bias.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Domain | 12-127 | CoV Nsp1 globular | ||||
Sequence: THVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQHLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGETLGVLVPHVGEIPVAYRKVLLRKNG | ||||||
Domain | 148-179 | BetaCoV Nsp1 C-terminal | ||||
Sequence: ELGTDPYEDFQENWNTKHSSGVTRELMRELNG | ||||||
Domain | 183-456 | CoV Nsp2 N-terminal | ||||
Sequence: TRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQLDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFPLNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTGDFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESGLKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNLLEILQK | ||||||
Domain | 458-688 | CoV Nsp2 middle | ||||
Sequence: KVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGNFKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAAITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKLKPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTXAKEIKESVQTFFKLVNKFLAL | ||||||
Domain | 690-818 | CoV Nsp2 C-terminal | ||||
Sequence: ADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEIIFLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEKYCALAPNMMVTNNTFTLKGG | ||||||
Domain | 821-929 | Ubiquitin-like | ||||
Sequence: TKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEKCSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEFKLASHMYCSFYPPDE | ||||||
Region | 926-999 | Disordered | ||||
Sequence: PPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPEEEQEEDWLDDDSQQTVGQQDGSEDNQT | ||||||
Compositional bias | 927-947 | Acidic residues | ||||
Sequence: PDEDEEEGDCEEEEFEPSTQY | ||||||
Compositional bias | 984-999 | Polar residues | ||||
Sequence: SQQTVGQQDGSEDNQT | ||||||
Domain | 1025-1194 | Macro | ||||
Sequence: VNSFSGYLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESDDYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLAPLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMK | ||||||
Domain | 1231-1359 | Macro | ||||
Sequence: KIKACVEEVTTTLEETKFLTENLLLYIDINGNLHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKVPTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSI | ||||||
Domain | 1367-1494 | Macro | ||||
Sequence: ILGTVSWNLREMLAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLINTLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSS | ||||||
Domain | 1496-1561 | DPUP | ||||
Sequence: TPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVITFDNLKTLLS | ||||||
Domain | 1565-1620 | Ubiquitin-like | ||||
Sequence: VRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPHNSHEGKTFYV | ||||||
Domain | 1634-1898 | Peptidase C16 | ||||
Sequence: YYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIKWADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELGDVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQIPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSKETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDN | ||||||
Domain | 1911-2021 | Nucleic acid-binding | ||||
Sequence: PIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVTFFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWSTKPVET | ||||||
Domain | 2046-2155 | G2M | ||||
Sequence: PVSEEVVENPTIQKDVLECNVKTTEVVGDIILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNSVPWDTIANYAKPFLNKVVSTTTN | ||||||
Domain | 2247-2317 | 3Ecto | ||||
Sequence: TAALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLETIQITISSFKWD | ||||||
Region | 2395-2485 | Y1 | ||||
Sequence: KSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTD | ||||||
Domain | 2395-2763 | CoV Nsp3 Y | ||||
Sequence: KSYVHVVDGCNSSTCMMCYKRNRATRVECTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGG | ||||||
Region | 2399-2412 | ZF1 | ||||
Sequence: HVVDGCNSSTCMMC | ||||||
Region | 2445-2455 | ZF2 | ||||
Sequence: CKLHNWNCVNC | ||||||
Region | 2486-2580 | Y2 | ||||
Sequence: QSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDV | ||||||
Region | 2486-2763 | CoV-Y | ||||
Sequence: QSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGG | ||||||
Region | 2581-2662 | Y3 | ||||
Sequence: GDSAEVAVKMFDAYVNTFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVECLKLSHQSD | ||||||
Region | 2663-2763 | Y4 | ||||
Sequence: IEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALIWNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGG | ||||||
Domain | 3165-3263 | Nsp4C | ||||
Sequence: VVFNGVSFSTFEEAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHLAKALNDFSNSGSDVLYQPPQTSITSAVLQ | ||||||
Domain | 3264-3569 | Peptidase C30 | ||||
Sequence: SGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVLKLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVNVLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQ | ||||||
Domain | 3860-3942 | RdRp Nsp7 cofactor | ||||
Sequence: SKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKLCEEMLDNRATLQ | ||||||
Domain | 3943-4140 | RdRp Nsp8 cofactor | ||||
Sequence: AIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDADSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQ | ||||||
Domain | 4141-4253 | Nsp9 ssRNA-binding | ||||
Sequence: NNELSPVALRQMSCAAGTTQTACTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQ | ||||||
Domain | 4254-4392 | ExoN/MTase coactivator | ||||
Sequence: AGNATEVPANSTVLSFCAFAVDAAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQ | ||||||
Domain | 4399-4653 | NiRAN | ||||
Sequence: FLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKDEDDNLIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQALLKTVQFCDAMRNAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVVDSYYSLLMPILTLTRALTAESHVDTDL | ||||||
Domain | 4658-4756 | Nsp12 Interface | ||||
Sequence: IKWDLLKYDFTEERLKLFDRYFKYWDQTYHPNCVNCLDDRCILHCANFNVLFSTVFPLTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSS | ||||||
Domain | 4757-5324 | Nsp12 RNA-dependent RNA polymerase | ||||
Sequence: RLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQ | ||||||
Domain | 5004-5166 | RdRp catalytic | ||||
Sequence: PHLMGWDYPKCDRAMPNMLRIMASLVLARKHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMMILSDDAVVCFNSTYASQG | ||||||
Domain | 5325-5408 | CV ZBD | ||||
Sequence: AVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLC | ||||||
Domain | 5581-5932 | +RNA virus helicase C-terminal | ||||
Sequence: NISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKVGILCIMSDRDLYDKLQFTSLEIPRRNVATLQAENVTGL | ||||||
Domain | 5997-6212 | ExoN | ||||
Sequence: MFITREEAIRHVRAWIGFDVEGCHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNAHVASCDAIMTRCLAVHECFV | ||||||
Domain | 6221-6452 | N7-MTase | ||||
Sequence: YPIIGDELKINAACRKVQHMVVKAALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDGVCLFWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFVNLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYLDAYNMMISAGFSLWVYKQFDTYNLWNTFTRLQ | ||||||
Region | 6339-6353 | GpppA-binding | ||||
Sequence: CDGGSLYVNKHAFHT | ||||||
Domain | 6453-6513 | Nsp15 N-terminal oligomerization | ||||
Sequence: SLENVAFNVVNKGHFDGQQGEVPVSIINNTVYTKVDGVDVELFENKTTLPVNVAFELWAKR | ||||||
Domain | 6514-6639 | AV-Nsp11N/CoV-Nsp15M | ||||
Sequence: NIKPVPEVKILNNLGVDIAANTVIWDYKRDAPAHISTIGVCSMTDIAKKPTETICAPLTVFFDGRVDGQVDLFRNARNGVLITEGSVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQ | ||||||
Domain | 6656-6795 | NendoU | ||||
Sequence: KPRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESPFELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKVTIDYTEISFMLWCKDGHVETFYP | ||||||
Domain | 6800-7094 | Nidovirus-type SAM-dependent 2'-O-MTase | ||||
Sequence: SQAWQPGVAMPNLYKMQRMLLEKCDLQNYGDSATLPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEGFFTYICGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFLIGCNYLGKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKEGQINDMILSLLSKGRLIIRENNRVVISSDVLV |
Sequence similarities
Belongs to the coronaviruses polyprotein 1ab family.
Keywords
- Domain
Family and domain databases
Sequence
- Sequence statusComplete
- Length7,096
- Mass (Da)794,116
- Last updated2021-06-02 v1
- Checksum605F258790AC8FE4
Features
Showing features for compositional bias.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Compositional bias | 927-947 | Acidic residues | ||||
Sequence: PDEDEEEGDCEEEEFEPSTQY | ||||||
Compositional bias | 984-999 | Polar residues | ||||
Sequence: SQQTVGQQDGSEDNQT |
Keywords
- Coding sequence diversity