Q01206 · POL1_BAYMJ
- Protein: Genome polyprotein 1
- Status: UniProtKB reviewed (Swiss-Prot)
- Amino acids: 2410
- Protein existence: Inferred from homology
- Annotation score: 5/5
Function
6 kDa protein 1
Indispensable for virus replication.
6 kDa protein 2
Indispensable for virus replication.
Viral genome-linked protein
Mediates the cap-independent, EIF4E-dependent translation of viral genomic RNAs (By similarity).
Binds to the cap-binding site of host EIF4E and thus interferes with the host EIF4E-dependent mRNA export and translation (By similarity).
VPg-RNA directly binds EIF4E and is a template for transcription (By similarity).
Also forms trimeric complexes with EIF4E-EIF4G, which are templates for translation (By similarity).
Nuclear inclusion protein A
Has RNA-binding and proteolytic activities.
Nuclear inclusion protein B
An RNA-dependent RNA polymerase that plays an essential role in virus replication.
Catalytic activity
- a ribonucleoside 5'-triphosphate + RNA(n) = diphosphate + RNA(n+1)
- Hydrolyzes glutaminyl bonds, and activity is further restricted by preferences for the amino acids in P6 - P1' that vary with the species of potyvirus, e.g. Glu-Xaa-Xaa-Tyr-Xaa-Gln-|-(Ser or Gly) for the enzyme from tobacco etch virus. The natural substrate is the viral polyprotein, but other proteins and oligopeptides containing the appropriate consensus sequence are also cleaved.
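As an illustration of the tobacco etch virus consensus quoted above, the following minimal Python sketch scans a sequence for Glu-Xaa-Xaa-Tyr-Xaa-Gln followed by Ser or Gly and reports the P1/P1' positions. The function name `find_cleavage_sites` and the demo sequence are hypothetical and not part of this entry. The sites annotated for this polyprotein under Features (for example Q-A and E-G) reflect a different, species-specific preference, so the pattern would have to be adapted before being applied to this sequence.

```python
import re

# TEV consensus from the text: Glu-Xaa-Xaa-Tyr-Xaa-Gln-|-(Ser or Gly).
# A lookahead is used so that overlapping candidate sites are all reported.
TEV_SITE = re.compile(r"(?=(E..Y.Q)([SG]))")


def find_cleavage_sites(seq, pattern=TEV_SITE):
    """Return 1-based (P1, P1') position pairs for every motif match."""
    sites = []
    for match in pattern.finditer(seq):
        # Group 1 covers P6..P1 (six residues); its last residue is P1.
        p1 = match.start(1) + 6  # 0-based start + 6 = 1-based index of P1
        sites.append((p1, p1 + 1))
    return sites


# Hypothetical demo sequence containing two TEV-style sites.
demo = "MGSSENLYFQSGHMAEAAYAQGKK"
print(find_cleavage_sites(demo))
```

Positions are reported in the same 1-based P1-P1' form that the feature table uses for its cleavage sites.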
Features
Type | Position(s) | Description | Sequence
---|---|---|---
Site | 328-329 | Cleavage | QA
Site | 394-395 | Cleavage | QA
Binding site | 487-494 | ATP | GHTGSGKS
Site | 1053-1054 | Cleavage | QA
Site | 1175-1176 | Cleavage | EG
Site | 1338-1339 | Cleavage | EG
Active site | 1404 | For nuclear inclusion protein A activity | H
Active site | 1440 | For nuclear inclusion protein A activity | D
Active site | 1507 | For nuclear inclusion protein A activity | C
Site | 1586-1587 | Cleavage | QA
Site | 2114-2115 | Cleavage | AA
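The ATP-binding site at 487-494 (GHTGSGKS) fits the pattern commonly described as the Walker A or P-loop motif, G-x(4)-G-K-[S/T]. The sketch below is an illustrative check only: it searches the first 30 residues of the Helicase ATP-binding domain (which starts at residue 474 according to the Family & Domains section) and converts the match back to polyprotein coordinates. The helper name `locate_motif` is an assumption introduced for this example.

```python
import re

# Walker A (P-loop) consensus: G, any four residues, G, K, then S or T.
WALKER_A = re.compile(r"G.{4}GK[ST]")

# First 30 residues of the Helicase ATP-binding domain (474-632) in this entry.
fragment = "KMADANNCWSMVVGHTGSGKSTYLPVQYSN"
fragment_start = 474  # polyprotein position of the first fragment residue


def locate_motif(fragment, fragment_start, pattern=WALKER_A):
    """Return (start, end, matched_sequence) in polyprotein coordinates, or None."""
    match = pattern.search(fragment)
    if match is None:
        return None
    start = fragment_start + match.start()
    end = start + len(match.group()) - 1
    return start, end, match.group()


print(locate_motif(fragment, fragment_start))
```

For this fragment the match falls on 487-494, the same span that the feature table annotates as the ATP-binding site.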
GO annotations
Aspect | Term
---|---
Cellular Component | helical viral capsid
Cellular Component | host cell cytoplasmic vesicle
Molecular Function | ATP binding
Molecular Function | cysteine-type peptidase activity
Molecular Function | helicase activity
Molecular Function | hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides
Molecular Function | RNA binding
Molecular Function | RNA-dependent RNA polymerase activity
Molecular Function | structural molecule activity
Biological Process | DNA-templated transcription
Biological Process | proteolysis
Biological Process | viral RNA genome replication
Keywords
- Molecular function
- Biological process
- Ligand
Names & Taxonomy
Protein names
- Recommended name: Genome polyprotein 1
- Cleaved into 8 chains
Organism names
- Taxonomic lineage: Viruses > Riboviria > Orthornavirae > Pisuviricota > Stelpaviricetes > Patatavirales > Potyviridae > Bymovirus > Barley yellow mosaic virus
- Virus hosts
Accessions
- Primary accession: Q01206
Proteomes
Subcellular Location
6 kDa protein 1
Note: Probably colocalizes with 6K2-induced vesicles associated with host chloroplasts.
6 kDa protein 2
Note: 6K-induced vesicles associate with host chloroplasts.
Coat protein
Keywords
- Cellular component
PTM/Processing
Features
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Chain | PRO_0000040546 | 1-328 | Protein P3 | |||
Sequence: MEQTLAQAVSRKSKTDTPMAEERKHFSPMNFSANFVAPELFYSANVRKIKNIFRERSTTRFLDAISSDFELVAFLTLSPAHLMQLETVLRHEMRSCVVPIVTSDASFETVAVIKTALDGMRFHFGYTTLEKGWISMMRHAESCLQESSSSAVNDLQTQIKRVGSLLLSGKNRVEGCELSVLNLTARRFRIEYGLNGTYFGEHVAMLRGLKRYIYGTVPKEFLWAKTKKHSLFTIPAWIKRTPIDCFLFCLRVIPILHRCGVAVSLIYWSCAAALNFPAFMSFLFKRQFAKYLAHSFAKHSIYFMFLTIVAILWSFRTFTSQKPKIVLQ | ||||||
Chain | PRO_0000456244 | 1-2410 | Genome polyprotein 1 | |||
Sequence: MEQTLAQAVSRKSKTDTPMAEERKHFSPMNFSANFVAPELFYSANVRKIKNIFRERSTTRFLDAISSDFELVAFLTLSPAHLMQLETVLRHEMRSCVVPIVTSDASFETVAVIKTALDGMRFHFGYTTLEKGWISMMRHAESCLQESSSSAVNDLQTQIKRVGSLLLSGKNRVEGCELSVLNLTARRFRIEYGLNGTYFGEHVAMLRGLKRYIYGTVPKEFLWAKTKKHSLFTIPAWIKRTPIDCFLFCLRVIPILHRCGVAVSLIYWSCAAALNFPAFMSFLFKRQFAKYLAHSFAKHSIYFMFLTIVAILWSFRTFTSQKPKIVLQARSTAEKEKKLMMILASVVGITYLFDYDIAEALGNCLHKISRLSSYLLDDHQGIASRMFGASYGLQAGDSAEDAVTTIISDLLSVTFKIVDEDASQGTVEDASETTFHSWVGVNTLAGRNMSRPLQYSVNETYALTPQNVQLQARKMADANNCWSMVVGHTGSGKSTYLPVQYSNYLSTKSDRRQQILICEPTQAATENVCAGVAANLGRAVYGRHEGWSRMGDHCIQVMTYGSALQCHAMDPSFISTFDAIFLDEAHDVKEHSLVFESICDTFKSVRKFYVSATPRDGSVCPEAARKYPLHVETSVCDSYRKFIAAQGGGDLLDISKHDTALVFLAGRPECIKAANAWNASVTGEKRAFSLSSDNFATDFSMLTERLKTHKTIIFTTNIIETGVTLSVDCVVDFGHTMRPCLDLNQKALRLDRRRVTRNERQQRIGRAGRLKDGYAIVCGDVDRAVNVISPDVLYGAALLSFKHNVPFYMNETFESSWLEGITKAQADTMTIFKLPIFLTRDLINADGSVAKEFLDVLKKHQFTTSDIKQAPSVTAKHIFPTWASYFSLHQALHYGDDKDEVPHELRYARVPFSVTTLSKFDWPALALACEKHRASMSNVFAGIEEPARVVTLQTNPANIQASITHLTHMSKNYKTLIENNQHVRQSMVTNVMYKWFSSTRITKDLDRNLRRCTDNLAVVEATLSSLRQILAGNTQVHATPHMQSTLEDIIGLQASDTLTEESLASALGIFVPKSNLFLLLATKGFKLVYVVCILLLVNLVYLGLRKWREHLKQKGSNEILTNTMPVSEGGEILAEVMKMEPKMRKNIKKDMDAAVESKLCGFTFVFPDDDKIGLEGKGNKYRPREDARLMYSTREDATLDAWNEKAKERRKKVTDKSEPELRRAYEKRPYFNFYDLQTDSNILEAIFYTTEGDEFFRTADPNKDMNLVADKLRSFLDTKLVVGHHQRQMLEETAKVVIKDTKGTAHHMDISQHDPDHLKQNGSGKIGYPEHRGQFRQEGPAKTADYDLGVEFGTDTDDITLEASTGILLSQVGVDVATRVGRIFIGTFNMNCYFYSDWILVPGHLQDRSGNVTIQFPDQTVQTTTDALNANGVKRFYGLDVIAIRRPAILRPRTKLVKAFAIEEPVIAQMVFVDAQGVRKFTQSVRARKEENSGRWSHKISTVLGMCGCQFWTLERQIDGIHVATNYTKKRNEFQPFTQEVVDFINGPGTKIPYCPWVFDRPACGYASHTALFEKPTTLTDIIHMQASDGLHNINNAIEGFGSSLRGQLVSPPTESTRQRFDKLFGSGSFELIGQMNKGLIDKHVIVGENDDVHDFMREHPTFTWLKDFMNEYAPSVLSYSAYYKDLCKYNRAKHVLTYNPEELHYATKGLIKMLEDAGLTQGSVRTPQQVISDIQWNTSAGPSYQGKKRDLCAHLSDDEVLHLAEVCRQQFLEGKSTGVWNGSLKAELRTIEKVEAEKTRVFTASPITSLFAMKFYVDDFNKKFYATNLKAPHTVGINKFGRGWEKLHDKLNRPGWLHGSGDGSRFDSSIDPLFFDVVKTIRKHFLPSEHHKAIDLIYDEILNTTICLANGMVIKKNVGTQRQPSTVVDNTLVLMTAFLYAYIHKTGDRELALLNERFIFVCNGDDNKFAISPQFDEEFGHDFSPELVE
LGLTYEFDDITSDICENPYMSLTMVKTPFGVGFSLPVERIIAIMQWSKKGGVLHSYLAGISAIYESFNTPKLFKSIYAYLLWLTEEHEAEILAAMTQSSTALPIPSMLDVYRLHYGDDEIWLQAADPLTDAQKEDARIAAADGARFELADADRRRKVEADRVEAARVKKAADAALKPVNLTATRTPTEDDGKLKTPSGARIPSSAADGNWSVPATKQVNAGLTLKIPLNKLKSVPKSVMEHNNSVALESELKAWTDAVRTSLGITTDEAWIDALIPFIGWCCNNGTSDKHAENQVMQIDSGKGAVTEMSLSPFIVHARMNGGLRRIMRNYSDETVLLITNNKLVAHWSMKHGASANAKYAFDFFVPRSWMNPQDIEVSKQARLAALGTGTYNTMLTSDTTNLRKTTNHRVLDSDGHPELT | ||||||
Chain | PRO_0000040547 | 329-394 | 6 kDa protein 1 | |||
Sequence: ARSTAEKEKKLMMILASVVGITYLFDYDIAEALGNCLHKISRLSSYLLDDHQGIASRMFGASYGLQ | ||||||
Chain | PRO_0000040548 | 395-1053 | Cytoplasmic inclusion protein | |||
Sequence: AGDSAEDAVTTIISDLLSVTFKIVDEDASQGTVEDASETTFHSWVGVNTLAGRNMSRPLQYSVNETYALTPQNVQLQARKMADANNCWSMVVGHTGSGKSTYLPVQYSNYLSTKSDRRQQILICEPTQAATENVCAGVAANLGRAVYGRHEGWSRMGDHCIQVMTYGSALQCHAMDPSFISTFDAIFLDEAHDVKEHSLVFESICDTFKSVRKFYVSATPRDGSVCPEAARKYPLHVETSVCDSYRKFIAAQGGGDLLDISKHDTALVFLAGRPECIKAANAWNASVTGEKRAFSLSSDNFATDFSMLTERLKTHKTIIFTTNIIETGVTLSVDCVVDFGHTMRPCLDLNQKALRLDRRRVTRNERQQRIGRAGRLKDGYAIVCGDVDRAVNVISPDVLYGAALLSFKHNVPFYMNETFESSWLEGITKAQADTMTIFKLPIFLTRDLINADGSVAKEFLDVLKKHQFTTSDIKQAPSVTAKHIFPTWASYFSLHQALHYGDDKDEVPHELRYARVPFSVTTLSKFDWPALALACEKHRASMSNVFAGIEEPARVVTLQTNPANIQASITHLTHMSKNYKTLIENNQHVRQSMVTNVMYKWFSSTRITKDLDRNLRRCTDNLAVVEATLSSLRQILAGNTQVHATPHMQSTLEDIIGLQ | ||||||
Chain | PRO_0000040549 | 1054-1175 | 6 kDa protein 2 | |||
Sequence: ASDTLTEESLASALGIFVPKSNLFLLLATKGFKLVYVVCILLLVNLVYLGLRKWREHLKQKGSNEILTNTMPVSEGGEILAEVMKMEPKMRKNIKKDMDAAVESKLCGFTFVFPDDDKIGLE | ||||||
Chain | PRO_0000040550 | 1176-1338 | Viral genome-linked protein | |||
Sequence: GKGNKYRPREDARLMYSTREDATLDAWNEKAKERRKKVTDKSEPELRRAYEKRPYFNFYDLQTDSNILEAIFYTTEGDEFFRTADPNKDMNLVADKLRSFLDTKLVVGHHQRQMLEETAKVVIKDTKGTAHHMDISQHDPDHLKQNGSGKIGYPEHRGQFRQE | ||||||
Modified residue | 1234 | O-(5'-phospho-RNA)-tyrosine | ||||
Sequence: Y | ||||||
Chain | PRO_0000040551 | 1339-1586 | Nuclear inclusion protein A | |||
Sequence: GPAKTADYDLGVEFGTDTDDITLEASTGILLSQVGVDVATRVGRIFIGTFNMNCYFYSDWILVPGHLQDRSGNVTIQFPDQTVQTTTDALNANGVKRFYGLDVIAIRRPAILRPRTKLVKAFAIEEPVIAQMVFVDAQGVRKFTQSVRARKEENSGRWSHKISTVLGMCGCQFWTLERQIDGIHVATNYTKKRNEFQPFTQEVVDFINGPGTKIPYCPWVFDRPACGYASHTALFEKPTTLTDIIHMQ | ||||||
Chain | PRO_0000040552 | 1587-2113 | Nuclear inclusion protein B | |||
Sequence: ASDGLHNINNAIEGFGSSLRGQLVSPPTESTRQRFDKLFGSGSFELIGQMNKGLIDKHVIVGENDDVHDFMREHPTFTWLKDFMNEYAPSVLSYSAYYKDLCKYNRAKHVLTYNPEELHYATKGLIKMLEDAGLTQGSVRTPQQVISDIQWNTSAGPSYQGKKRDLCAHLSDDEVLHLAEVCRQQFLEGKSTGVWNGSLKAELRTIEKVEAEKTRVFTASPITSLFAMKFYVDDFNKKFYATNLKAPHTVGINKFGRGWEKLHDKLNRPGWLHGSGDGSRFDSSIDPLFFDVVKTIRKHFLPSEHHKAIDLIYDEILNTTICLANGMVIKKNVGTQRQPSTVVDNTLVLMTAFLYAYIHKTGDRELALLNERFIFVCNGDDNKFAISPQFDEEFGHDFSPELVELGLTYEFDDITSDICENPYMSLTMVKTPFGVGFSLPVERIIAIMQWSKKGGVLHSYLAGISAIYESFNTPKLFKSIYAYLLWLTEEHEAEILAAMTQSSTALPIPSMLDVYRLHYGDDEIWLQ | ||||||
Chain | PRO_0000040553 | 2114-2410 | Coat protein | |||
Sequence: AADPLTDAQKEDARIAAADGARFELADADRRRKVEADRVEAARVKKAADAALKPVNLTATRTPTEDDGKLKTPSGARIPSSAADGNWSVPATKQVNAGLTLKIPLNKLKSVPKSVMEHNNSVALESELKAWTDAVRTSLGITTDEAWIDALIPFIGWCCNNGTSDKHAENQVMQIDSGKGAVTEMSLSPFIVHARMNGGLRRIMRNYSDETVLLITNNKLVAHWSMKHGASANAKYAFDFFVPRSWMNPQDIEVSKQARLAALGTGTYNTMLTSDTTNLRKTTNHRVLDSDGHPELT |
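The chain boundaries listed above can be cross-checked against the cleavage sites from the Function features: each site is a P1-P1' pair, so P1 should end one chain and P1' should start the next. The following Python sketch is a consistency check under that assumption; the names `chains_from_sites` and `compare` are hypothetical helpers, and the coordinates are copied from the two tables. With the values as listed, the last site (2114-2115) is offset by one residue from the annotated boundary between nuclear inclusion protein B (ending at 2113) and the coat protein (starting at 2114). The sketch only reports that discrepancy and does not decide which annotation is correct.

```python
# Cleavage sites (P1, P1') as listed in the site feature table.
CLEAVAGE_SITES = [(328, 329), (394, 395), (1053, 1054), (1175, 1176),
                  (1338, 1339), (1586, 1587), (2114, 2115)]

# Mature chains (name, start, end) as listed in the chain feature table.
CHAINS = [
    ("Protein P3", 1, 328),
    ("6 kDa protein 1", 329, 394),
    ("Cytoplasmic inclusion protein", 395, 1053),
    ("6 kDa protein 2", 1054, 1175),
    ("Viral genome-linked protein", 1176, 1338),
    ("Nuclear inclusion protein A", 1339, 1586),
    ("Nuclear inclusion protein B", 1587, 2113),
    ("Coat protein", 2114, 2410),
]
POLYPROTEIN_LENGTH = 2410


def chains_from_sites(sites, length):
    """Split 1..length at each (P1, P1') pair: P1 ends a chain, P1' starts the next."""
    bounds = []
    start = 1
    for p1, p1_prime in sorted(sites):
        bounds.append((start, p1))
        start = p1_prime
    bounds.append((start, length))
    return bounds


def compare(chains, sites, length):
    """Return (name, annotated, derived) for chains whose boundaries disagree."""
    derived = chains_from_sites(sites, length)
    return [(name, (start, end), d)
            for (name, start, end), d in zip(chains, derived)
            if (start, end) != d]


for name, annotated, derived in compare(CHAINS, CLEAVAGE_SITES, POLYPROTEIN_LENGTH):
    print(f"{name}: annotated {annotated}, derived from sites {derived}")
```

Seven cleavage sites give eight segments, which agrees with the eight chains stated for this entry.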
Post-translational modification
Viral genome-linked protein
VPg is uridylylated by the polymerase and is covalently attached to the 5'-end of the genomic RNA. This uridylylated form acts as a nucleotide-peptide primer for the polymerase (By similarity).
Genome polyprotein 1
The viral RNA1 of bymoviruses is expressed as a single polyprotein, which undergoes post-translational proteolytic processing by the main proteinase NIa-pro, yielding at least eight individual proteins.
Keywords
- PTM
Family & Domains
Features
Type | Position(s) | Description | Sequence
---|---|---|---
Region | 1-22 | Disordered | MEQTLAQAVSRKSKTDTPMAEE
Domain | 474-632 | Helicase ATP-binding | KMADANNCWSMVVGHTGSGKSTYLPVQYSNYLSTKSDRRQQILICEPTQAATENVCAGVAANLGRAVYGRHEGWSRMGDHCIQVMTYGSALQCHAMDPSFISTFDAIFLDEAHDVKEHSLVFESICDTFKSVRKFYVSATPRDGSVCPEAARKYPLHVE
Domain | 647-813 | Helicase C-terminal | GGGDLLDISKHDTALVFLAGRPECIKAANAWNASVTGEKRAFSLSSDNFATDFSMLTERLKTHKTIIFTTNIIETGVTLSVDCVVDFGHTMRPCLDLNQKALRLDRRRVTRNERQQRIGRAGRLKDGYAIVCGDVDRAVNVISPDVLYGAALLSFKHNVPFYMNETF
Domain | 1359-1573 | Peptidase C4 | ITLEASTGILLSQVGVDVATRVGRIFIGTFNMNCYFYSDWILVPGHLQDRSGNVTIQFPDQTVQTTTDALNANGVKRFYGLDVIAIRRPAILRPRTKLVKAFAIEEPVIAQMVFVDAQGVRKFTQSVRARKEENSGRWSHKISTVLGMCGCQFWTLERQIDGIHVATNYTKKRNEFQPFTQEVVDFINGPGTKIPYCPWVFDRPACGYASHTALF
Domain | 1857-1980 | RdRp catalytic | WLHGSGDGSRFDSSIDPLFFDVVKTIRKHFLPSEHHKAIDLIYDEILNTTICLANGMVIKKNVGTQRQPSTVVDNTLVLMTAFLYAYIHKTGDRELALLNERFIFVCNGDDNKFAISPQFDEEF
Region | 2173-2200 | Disordered | TRTPTEDDGKLKTPSGARIPSSAADGNW
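Domain and active-site positions in this entry are given in polyprotein coordinates. To see which mature chain carries each feature, and where it sits within that chain, the positions can be mapped onto the chain ranges from the PTM/Processing section. The sketch below shows one way to do this; the helper name `locate` is an assumption, and the coordinates are copied from the tables in this entry.

```python
# Mature chains (name, start, end) in polyprotein coordinates.
CHAINS = [
    ("Protein P3", 1, 328),
    ("6 kDa protein 1", 329, 394),
    ("Cytoplasmic inclusion protein", 395, 1053),
    ("6 kDa protein 2", 1054, 1175),
    ("Viral genome-linked protein", 1176, 1338),
    ("Nuclear inclusion protein A", 1339, 1586),
    ("Nuclear inclusion protein B", 1587, 2113),
    ("Coat protein", 2114, 2410),
]

# Features (name, start, end); single residues use start == end.
FEATURES = [
    ("Helicase ATP-binding", 474, 632),
    ("Helicase C-terminal", 647, 813),
    ("Peptidase C4", 1359, 1573),
    ("RdRp catalytic", 1857, 1980),
    ("Active site H", 1404, 1404),
    ("Active site D", 1440, 1440),
    ("Active site C", 1507, 1507),
]


def locate(start, end, chains=CHAINS):
    """Return (chain_name, chain_relative_start, chain_relative_end) or None.

    A feature is assigned to a chain only if it lies entirely inside it.
    """
    for name, c_start, c_end in chains:
        if c_start <= start and end <= c_end:
            offset = c_start - 1
            return name, start - offset, end - offset
    return None


for feature, start, end in FEATURES:
    print(feature, "->", locate(start, end))
```

Under this mapping both helicase domains fall in the cytoplasmic inclusion protein, the Peptidase C4 domain and the three active-site residues fall in nuclear inclusion protein A, and the RdRp catalytic domain falls in nuclear inclusion protein B.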
Sequence similarities
Belongs to the bymoviruses polyprotein 1 family.
Family and domain databases
Sequence
- Sequence status: Complete
- Length: 2,410
- Mass (Da): 270,770
- Last updated: 1994-02-01 v1
- Checksum: 6CFCF5D7045044B5