P03361 · POL_BLVJ
- Protein: Gag-Pro-Pol polyprotein
- Gene: pol
- Status: UniProtKB reviewed (Swiss-Prot)
- Amino acids: 1416
- Protein existence: Inferred from homology
- Annotation score: 5/5
Function
Matrix protein p15
Capsid protein p24
Nucleocapsid protein p12-pro
Protease
Reverse transcriptase/ribonuclease H
Integrase
Miscellaneous
Reverse transcriptase/ribonuclease H
Catalytic activity
- a 2'-deoxyribonucleoside 5'-triphosphate + DNA(n) = diphosphate + DNA(n+1)
- a 2'-deoxyribonucleoside 5'-triphosphate + DNA(n) = diphosphate + DNA(n+1)
Cofactor
Features
Showing features for site, active site, binding site, DNA binding.
Type | Position(s) | Description | Sequence
---|---|---|---
Site | 109-110 | Cleavage; by viral protease | LP
Site | 322-323 | Cleavage; by viral protease | LV
Site | 419-420 | Cleavage; by viral protease | LL
Active site | 452 | Protease; shared with dimeric partner | D
Site | 544-545 | Cleavage; by viral protease | VG
Binding site | 652 | Mg2+ 1; catalytic | D
Binding site | 727 | Mg2+ 1; catalytic | D
Binding site | 728 | Mg2+ 1; catalytic | D
Binding site | 1005 | Mg2+ 2 | D
Binding site | 1036 | Mg2+ 2 | E
Binding site | 1057 | Mg2+ 2 | D
Binding site | 1118 | Mg2+ 2 | D
Site | 1120-1121 | Cleavage; by viral protease | LL
Binding site | 1190 | Mg2+ 3; catalytic | D
Binding site | 1247 | Mg2+ 3; catalytic | D
DNA binding | 1352-1400 | Integrase-type | KLFLYLLPGQNNRRWLGPLPALVEASGGALLATDPPVWVPWRLLKAFKC
GO annotations
Aspect | Term
---|---
Cellular Component | viral nucleocapsid
Molecular Function | aspartic-type endopeptidase activity
Molecular Function | DNA binding
Molecular Function | DNA-directed DNA polymerase activity
Molecular Function | RNA stem-loop binding
Molecular Function | RNA-directed DNA polymerase activity
Molecular Function | RNA-DNA hybrid ribonuclease activity
Molecular Function | structural constituent of virion
Molecular Function | zinc ion binding
Biological Process | DNA integration
Biological Process | DNA recombination
Biological Process | establishment of integrated proviral latency
Biological Process | proteolysis
Biological Process | symbiont entry into host cell
Biological Process | viral budding via host ESCRT complex
Biological Process | viral genome integration into host DNA
Biological Process | viral translational frameshifting
Keywords
- Molecular function
- Biological process
- Ligand
Names & Taxonomy
Protein names
- Recommended name: Gag-Pro-Pol polyprotein
- Cleaved into 6 chains
Gene names
Organism names
- Taxonomic lineage: Viruses > Riboviria > Pararnavirae > Artverviricota > Revtraviricetes > Ortervirales > Retroviridae > Orthoretrovirinae > Deltaretrovirus > Bovine leukemia virus
- Virus hosts
Accessions
- Primary accession: P03361
PTM/Processing
Features
Showing features for initiator methionine, lipidation, chain.
Type | ID | Position(s) | Description | Sequence
---|---|---|---|---
Initiator methionine | | 1 | Removed; by host | M
Lipidation | | 2 | N-myristoyl glycine; by host | G
Chain | PRO_0000442558 | 2-109 | Matrix protein p15 | GNSPSYNPPAGISPSDWLNLLQSAQRLNPRPSPSDFTDLKNYIHWFHKTQKKPWTFTSGGPTSCPPGRFGRVPLVLATLNEVLSNEGGAPGASAPEEQPPPYDPPAIL
Chain | PRO_0000125482 | 2-1416 | Gag-Pro-Pol polyprotein | GNSPSYNPPAGISPSDWLNLLQSAQRLNPRPSPSDFTDLKNYIHWFHKTQKKPWTFTSGGPTSCPPGRFGRVPLVLATLNEVLSNEGGAPGASAPEEQPPPYDPPAILPIISEGNRNRHRAWALRELQDIKKEIENKAPGSQVWIQTLRLAILQADPTPADLEQLCQYIASPVDQTAHMTSLTAAIAAAEAANTLQGFNPKTGTLTQQSAQPNAGDLRSQYQNLWLQAGKNLPTRPSAPWSTIVQGPAESSVEFVNRLQISLADNLPDGVPKEPIIDSLSYANANRECQQILQGRGPVAAVGQKLQACAQWAPKNKQPALLVHTPGPKMPGPRQPAPKRPPPGPCYRCLKEGHWARDCPTKATGPPPGPCPICKDPSHWKRDCPTLKSKNKLIEGGLSAPQTITPITDSLSEAELECLLSIPLARSRPSVAVYLSGPWLQPSQNQALMLVDTGAENTVLPQNWLVRDYPRIPAAVLGAGGVSRNRYNWLQGPLTLALKPEGPFITIPKILVDTSDKWQILGRDVPSRLQASISIPEEVRPPVVGVLDTPPSHIGLEHLPPPPEVPQFPLNLERLQALQDLVHRSLEAGYISPWDGPGNNPVFPVRKPNGAWRFVHDLRATNALTKPIPALSPGPPDLTAIPTHPPHIICLDLKDAFFQIPVEDRFRFYLSFTLPSPGGLQPHRRFAWRVLPQGFINSPALFERALQEPLRQVSAAFSQSLLVSYMDDILYASPTEEQRSQCYQALAARLRDLGFQVASEKTSQTPSPVPFLGQMVHEQIVTYQSLPTLQISSPISLHQLQAVLGDLQWVSRGTPTTRRPLQLLYSSLKRHHDPRAIIQLSPEQLQGIAELRQALSHNARSRYNEQEPLLAYVHLTRAGSTLVLFQKGAQFPLAYFQTPLTDNQASPWGLLLLLGCQYLQTQALSSYAKPILKYYHNLPKTSLDNWIQSSEDPRVQELLQLWPQISSQGIQPPGPWKTLITRAEVFLTPQFSPDPIPAALCLFSDGATGRGAYCLWKDHLLDFQAVPAPESAQKGELAGLLAGLAAAPPEPVNIWVDSKYLYSLLRTLVLGAWLQPDPVPSYALLYKSLLRHPAIVVGHVRSHSSASHPIASLNNYVDQLLPLETPEQWHKLTHCNSRALSRWPNPRISAWDPRSPATLCETCQKLNPTGGGKMRTIQRGWAPNHIWQADITHYKYKQFTYALHVFVDTYSGATHASAKRGLTTQTTIEGLLEAIVHLGRPKKLNTDQGANYTSKTFVRFCQQFGVSLSHHVPYNPTSSGLDERTNGLLKLLLSKYHLDEPHLPMTQALSRALWTHNQINLLPILKTRWELHHSPPLAVISEGGETPKGSDKLFLYLLPGQNNRRWLGPLPALVEASGGALLATDPPVWVPWRLLKAFKCLKNDGPEDAHNRSSDG
Chain | PRO_0000442559 | 110-321 | Capsid protein p24 | PIISEGNRNRHRAWALRELQDIKKEIENKAPGSQVWIQTLRLAILQADPTPADLEQLCQYIASPVDQTAHMTSLTAAIAAAEAANTLQGFNPKTGTLTQQSAQPNAGDLRSQYQNLWLQAGKNLPTRPSAPWSTIVQGPAESSVEFVNRLQISLADNLPDGVPKEPIIDSLSYANANRECQQILQGRGPVAAVGQKLQACAQWAPKNKQPAL
Chain | PRO_0000442560 | 322-419 | Nucleocapsid protein p12-pro | LVHTPGPKMPGPRQPAPKRPPPGPCYRCLKEGHWARDCPTKATGPPPGPCPICKDPSHWKRDCPTLKSKNKLIEGGLSAPQTITPITDSLSEAELECL
Chain | PRO_0000442561 | 420-545 | Protease | LSIPLARSRPSVAVYLSGPWLQPSQNQALMLVDTGAENTVLPQNWLVRDYPRIPAAVLGAGGVSRNRYNWLQGPLTLALKPEGPFITIPKILVDTSDKWQILGRDVPSRLQASISIPEEVRPPVVG
Chain | PRO_0000442562 | 546-1120 | Reverse transcriptase/ribonuclease H | VLDTPPSHIGLEHLPPPPEVPQFPLNLERLQALQDLVHRSLEAGYISPWDGPGNNPVFPVRKPNGAWRFVHDLRATNALTKPIPALSPGPPDLTAIPTHPPHIICLDLKDAFFQIPVEDRFRFYLSFTLPSPGGLQPHRRFAWRVLPQGFINSPALFERALQEPLRQVSAAFSQSLLVSYMDDILYASPTEEQRSQCYQALAARLRDLGFQVASEKTSQTPSPVPFLGQMVHEQIVTYQSLPTLQISSPISLHQLQAVLGDLQWVSRGTPTTRRPLQLLYSSLKRHHDPRAIIQLSPEQLQGIAELRQALSHNARSRYNEQEPLLAYVHLTRAGSTLVLFQKGAQFPLAYFQTPLTDNQASPWGLLLLLGCQYLQTQALSSYAKPILKYYHNLPKTSLDNWIQSSEDPRVQELLQLWPQISSQGIQPPGPWKTLITRAEVFLTPQFSPDPIPAALCLFSDGATGRGAYCLWKDHLLDFQAVPAPESAQKGELAGLLAGLAAAPPEPVNIWVDSKYLYSLLRTLVLGAWLQPDPVPSYALLYKSLLRHPAIVVGHVRSHSSASHPIASLNNYVDQL
Chain | PRO_0000442563 | 1121-1416 | Integrase | LPLETPEQWHKLTHCNSRALSRWPNPRISAWDPRSPATLCETCQKLNPTGGGKMRTIQRGWAPNHIWQADITHYKYKQFTYALHVFVDTYSGATHASAKRGLTTQTTIEGLLEAIVHLGRPKKLNTDQGANYTSKTFVRFCQQFGVSLSHHVPYNPTSSGLDERTNGLLKLLLSKYHLDEPHLPMTQALSRALWTHNQINLLPILKTRWELHHSPPLAVISEGGETPKGSDKLFLYLLPGQNNRRWLGPLPALVEASGGALLATDPPVWVPWRLLKAFKCLKNDGPEDAHNRSSDG
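The chain coordinates in the table above are 1-based and inclusive, so a chain's length is end - start + 1. The following is a minimal Python sketch, assuming only the positions listed in this entry, that checks whether the six mature chains tile the processed polyprotein (2-1416) without gaps or overlaps. The names `CHAINS`, `chain_length`, and `tiles_exactly` are illustrative and not part of any UniProt tooling.

```python
# Chain coordinates copied from the Chain features above (1-based, inclusive).
CHAINS = [
    ("Matrix protein p15", 2, 109),
    ("Capsid protein p24", 110, 321),
    ("Nucleocapsid protein p12-pro", 322, 419),
    ("Protease", 420, 545),
    ("Reverse transcriptase/ribonuclease H", 546, 1120),
    ("Integrase", 1121, 1416),
]
# Span of the processed polyprotein (initiator methionine removed).
POLYPROTEIN = (2, 1416)


def chain_length(start, end):
    """Length of a 1-based inclusive interval."""
    return end - start + 1


def tiles_exactly(chains, span):
    """True if the chains cover span end to end with no gap or overlap."""
    ordered = sorted(chains, key=lambda c: c[1])
    if ordered[0][1] != span[0] or ordered[-1][2] != span[1]:
        return False
    return all(prev[2] + 1 == nxt[1] for prev, nxt in zip(ordered, ordered[1:]))


lengths = {name: chain_length(start, end) for name, start, end in CHAINS}
total = sum(lengths.values())

for name, n in lengths.items():
    print(f"{name}: {n} aa")
print("total:", total, "tiles polyprotein:", tiles_exactly(CHAINS, POLYPROTEIN))
```

The summed chain lengths should equal the polyprotein length minus the removed initiator methionine.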
Post-translational modification
Matrix protein p15
Gag-Pro-Pol polyprotein
Gag-Pro-Pol polyprotein
Keywords
- PTM
Interaction
Subunit
Gag-Pro-Pol polyprotein
Matrix protein p15
Family & Domains
Features
Showing features for motif, zinc finger, domain.
Type | Position(s) | Description | Sequence
---|---|---|---
Motif | 100-103 | PPXY motif | PPPY
Zinc finger | 345-362 | CCHC-type 1 | PCYRCLKEGHWARDCPTK
Zinc finger | 370-387 | CCHC-type 2 | PCPICKDPSHWKRDCPTL
Domain | 447-525 | Peptidase A2 | ALMLVDTGAENTVLPQNWLVRDYPRIPAAVLGAGGVSRNRYNWLQGPLTLALKPEGPFITIPKILVDTSDKWQILGRDV
Domain | 586-776 | Reverse transcriptase | LEAGYISPWDGPGNNPVFPVRKPNGAWRFVHDLRATNALTKPIPALSPGPPDLTAIPTHPPHIICLDLKDAFFQIPVEDRFRFYLSFTLPSPGGLQPHRRFAWRVLPQGFINSPALFERALQEPLRQVSAAFSQSLLVSYMDDILYASPTEEQRSQCYQALAARLRDLGFQVASEKTSQTPSPVPFLGQMV
Domain | 996-1126 | RNase H type-1 | IPAALCLFSDGATGRGAYCLWKDHLLDFQAVPAPESAQKGELAGLLAGLAAAPPEPVNIWVDSKYLYSLLRTLVLGAWLQPDPVPSYALLYKSLLRHPAIVVGHVRSHSSASHPIASLNNYVDQLLPLETP
Domain | 1179-1343 | Integrase catalytic | RGWAPNHIWQADITHYKYKQFTYALHVFVDTYSGATHASAKRGLTTQTTIEGLLEAIVHLGRPKKLNTDQGANYTSKTFVRFCQQFGVSLSHHVPYNPTSSGLDERTNGLLKLLLSKYHLDEPHLPMTQALSRALWTHNQINLLPILKTRWELHHSPPLAVISEG
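To relate the catalytic residues listed under Function to the domains in the table above, a simple interval lookup is enough. The sketch below is an illustration only: it hard-codes the domain ranges and the active-site and Mg2+ binding positions from this entry, and `DOMAINS`, `SITES`, and `domain_of` are hypothetical names introduced here.

```python
# Domain ranges from the Family & Domains table (1-based, inclusive).
DOMAINS = [
    ("Peptidase A2", 447, 525),
    ("Reverse transcriptase", 586, 776),
    ("RNase H type-1", 996, 1126),
    ("Integrase catalytic", 1179, 1343),
]

# Active-site and Mg2+ binding positions from the Function feature table.
SITES = [
    ("Active site (protease)", 452),
    ("Mg2+ 1", 652), ("Mg2+ 1", 727), ("Mg2+ 1", 728),
    ("Mg2+ 2", 1005), ("Mg2+ 2", 1036), ("Mg2+ 2", 1057), ("Mg2+ 2", 1118),
    ("Mg2+ 3", 1190), ("Mg2+ 3", 1247),
]


def domain_of(position):
    """Return the name of the domain containing position, or None."""
    for name, start, end in DOMAINS:
        if start <= position <= end:
            return name
    return None


for label, pos in SITES:
    print(f"{label} at {pos}: {domain_of(pos)}")
```

With these coordinates, each Mg2+ group falls inside a single domain: Mg2+ 1 in the reverse transcriptase domain, Mg2+ 2 in RNase H, and Mg2+ 3 in the integrase catalytic domain.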
Domain
Sequence similarities
Keywords
- Domain
Family and domain databases
Sequence & Isoforms
- Sequence status: Complete
This entry describes 3 isoforms produced by ribosomal frameshifting.
P03361-1
This isoform has been chosen as the canonical sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
- Name: Gag-Pro-Pol polyprotein
- Note: Produced by -1 ribosomal frameshifting events between gag-pro and pro-pol.
- Length: 1,416
- Mass (Da): 156,397
- Last updated: 2017-12-20 v2
- Checksum: 4840617B29D4A098
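Because all positional information refers to the canonical sequence, a feature's residues can be recovered by converting its 1-based inclusive coordinates to a 0-based slice. The sketch below assumes the matrix protein p15 chain sequence (positions 2-109) as given under PTM/Processing and extracts the PPXY motif at 100-103. The helper `feature_sequence` is a hypothetical name used for illustration.

```python
# Matrix protein p15 chain as listed in this entry (polyprotein positions 2-109).
MATRIX_P15 = (
    "GNSPSYNPPAGISPSDWLNLLQSAQRLNPRPSPSDFTDLKNYIHWFHKTQKKPWTFTSGG"
    "PTSCPPGRFGRVPLVLATLNEVLSNEGGAPGASAPEEQPPPYDPPAIL"
)
MATRIX_START = 2  # polyprotein position of the first residue of the chain


def feature_sequence(chain_seq, chain_start, start, end):
    """Slice a chain using 1-based inclusive polyprotein coordinates."""
    lo = start - chain_start          # 0-based index of the first residue
    hi = end - chain_start + 1        # exclusive upper bound for slicing
    if lo < 0 or hi > len(chain_seq) or lo >= hi:
        raise ValueError("feature lies outside the chain")
    return chain_seq[lo:hi]


# PPXY motif annotated at positions 100-103.
print(feature_sequence(MATRIX_P15, MATRIX_START, 100, 103))
```

The same offset arithmetic applies to any chain, as long as the chain's own start position is used as `chain_start`.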
P03344-1
The sequence of this isoform can be found in the external entry linked below. Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.
- Name: Gag polyprotein
P0DOI0-1
The sequence of this isoform can be found in the external entry linked below. Isoforms of the same protein are often annotated in two different entries if their sequences differ significantly.
- Name: Gag-Pro polyprotein
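The three isoforms above differ by -1 ribosomal frameshifting. As a rough illustration of what a -1 shift does to codon reading, the sketch below uses a made-up RNA string (not BLV sequence) and an arbitrarily chosen slip position. `read_codons` and `read_with_minus1_shift` are hypothetical helpers, and the model ignores everything about the real mechanism except the change of reading frame.

```python
def read_codons(rna, start=0):
    """Split rna into complete codons, reading from index start."""
    return [rna[i:i + 3] for i in range(start, len(rna) - 2, 3)]


def read_with_minus1_shift(rna, shift_at):
    """Read in frame 0 up to shift_at, then slip back one nucleotide.

    shift_at is a 0-based index that must be a multiple of 3 (a codon
    boundary in frame 0). The nucleotide at shift_at - 1 is read twice.
    """
    if shift_at % 3 != 0 or shift_at < 3:
        raise ValueError("shift_at must be a codon boundary after the first codon")
    before = read_codons(rna[:shift_at])
    after = read_codons(rna, shift_at - 1)
    return before + after


# Toy sequence, purely illustrative.
TOY_RNA = "AUGAAAAAACGGUUU"

print("frame 0:      ", read_codons(TOY_RNA))
print("with -1 shift:", read_with_minus1_shift(TOY_RNA, 9))
```

After the slip, every downstream codon is read in the -1 frame, which is how a single mRNA can encode a shorter product in one frame and a longer fusion product in another.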
Sequence caution
Keywords
- Coding sequence diversity