P04577 · ENV_HV2RO
- ProteinEnvelope glycoprotein gp160
- Geneenv
- StatusUniProtKB reviewed (Swiss-Prot)
- Amino acids858 (go to sequence)
- Protein existenceEvidence at protein level
- Annotation score5/5
Function
function
The surface protein gp120 (SU) attaches the virus to the host lymphoid cell by binding to the primary receptor CD4. This interaction induces a structural rearrangement creating a high affinity binding site for a chemokine coreceptor like CXCR4 and/or CCR5. This peculiar 2 stage receptor-interaction strategy allows gp120 to maintain the highly conserved coreceptor-binding site in a cryptic conformation, protected from neutralizing antibodies. Since CD4 also displays a binding site for the disulfide-isomerase P4HB/PDI, a P4HB/PDI-CD4-CXCR4-gp120 complex may form. In that complex, P4HB/PDI could reach and reduce gp120 disulfide bonds, causing major conformational changes in gp120. TXN, another PDI family member could also be involved in disulfide rearrangements in Env during fusion. These changes are transmitted to the transmembrane protein gp41 and are thought to activate its fusogenic potential by unmasking its fusion peptide (By similarity).
The surface protein gp120 is a ligand for CD209/DC-SIGN and CLEC4M/DC-SIGNR, which are respectively found on dendritic cells (DCs), and on endothelial cells of liver sinusoids and lymph node sinuses. These interactions allow capture of viral particles at mucosal surfaces by these cells and subsequent transmission to permissive cells. DCs are professional antigen presenting cells, critical for host immunity by inducing specific immune responses against a broad variety of pathogens. They act as sentinels in various tissues where they take up antigen, process it, and present it to T-cells following migration to lymphoid organs. HIV subverts the migration properties of dendritic cells to gain access to CD4+ T-cells in lymph nodes. Virus transmission to permissive T-cells occurs either in trans (without DCs infection, through viral capture and transmission), or in cis (following DCs productive infection, through the usual CD4-gp120 interaction), thereby inducing a robust infection. In trans infection, bound virions remain infectious over days and it is proposed that they are not degraded, but protected in non-lysosomal acidic organelles within the DCs close to the cell membrane thus contributing to the viral infectious potential during DCs' migration from the periphery to the lymphoid tissues. On arrival at lymphoid tissues, intact virions recycle back to DCs' cell surface allowing virus transmission to CD4+ T-cells. Virion capture also seems to lead to MHC-II-restricted viral antigen presentation, and probably to the activation of HIV-specific CD4+ cells (By similarity).
The transmembrane protein gp41 (TM) acts as a class I viral fusion protein. Under the current model, the protein has at least 3 conformational states: pre-fusion native state, pre-hairpin intermediate state, and post-fusion hairpin state. During fusion of viral and target intracellular membranes, the coiled coil regions (heptad repeats) assume a trimer-of-hairpins structure, positioning the fusion peptide in close proximity to the C-terminal region of the ectodomain. The formation of this structure appears to drive apposition and subsequent fusion of viral and target cell membranes. Complete fusion occurs in host cell endosomes and is dynamin-dependent, however some lipid transfer might occur at the plasma membrane. The virus undergoes clathrin-dependent internalization long before endosomal fusion, thus minimizing the surface exposure of conserved viral epitopes during fusion and reducing the efficacy of inhibitors targeting these epitopes. Membranes fusion leads to delivery of the nucleocapsid into the cytoplasm (By similarity).
The envelope glycoprotein gp160 precursor down-modulates cell surface CD4 antigen by interacting with it in the endoplasmic reticulum and blocking its transport to the cell surface.
The gp120-gp41 heterodimer seems to contribute to T-cell depletion during HIV-1 infection. The envelope glycoproteins expressed on the surface of infected cells induce apoptosis through an interaction with uninfected cells expressing the receptor (CD4) and the coreceptors CXCR4 or CCR5. This type of bystander killing may be obtained by at least three distinct mechanisms. First, the interaction between the 2 cells can induce cellular fusion followed by nuclear fusion within the syncytium. Syncytia are condemned to die from apoptosis. Second, the 2 interacting cells may not fuse entirely and simply exchange plasma membrane lipids, after a sort of hemifusion process, followed by rapid death. Third, it is possible that virus-infected cells, on the point of undergoing apoptosis, fuse with CD4-expressing cells, in which case apoptosis is rapidly transmitted from one cell to the other and thus occurs in a sort of contagious fashion (By similarity).
The gp120-gp41 heterodimer allows rapid transcytosis of the virus through CD4 negative cells such as simple epithelial monolayers of the intestinal, rectal and endocervical epithelial barriers. Both gp120 and gp41 specifically recognize glycosphingolipids galactosyl-ceramide (GalCer) or 3' sulfo-galactosyl-ceramide (GalS) present in the lipid rafts structures of epithelial cells. Binding to these alternative receptors allows the rapid transcytosis of the virus through the epithelial cells. This transcytotic vesicle-mediated transport of virions from the apical side to the basolateral side of the epithelial cells does not involve infection of the cells themselves (By similarity).
Miscellaneous
Some HIV-2 isolates have been described that can infect cells independently of CD4, using CXCR4 as primary receptor. These isolates may have an exposed coreceptor binding site.
Features
Showing features for site.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Site | ?511-?512 | Cleavage; by host furin | ||||
Sequence: RG |
GO annotations
Aspect | Term | |
---|---|---|
Cellular Component | host cell endosome membrane | |
Cellular Component | host cell plasma membrane | |
Cellular Component | membrane | |
Cellular Component | viral envelope | |
Cellular Component | virion membrane | |
Molecular Function | structural molecule activity | |
Biological Process | apoptotic process | |
Biological Process | clathrin-dependent endocytosis of virus by host cell | |
Biological Process | fusion of virus membrane with host endosome membrane | |
Biological Process | suppression by virus of host tetherin activity | |
Biological Process | symbiont-mediated suppression of host innate immune response | |
Biological Process | virion attachment to host cell |
Keywords
- Biological process
Names & Taxonomy
Protein names
- Recommended nameEnvelope glycoprotein gp160
- Alternative names
- Cleaved into 2 chains
Gene names
Organism names
- Taxonomic lineageViruses > Riboviria > Pararnavirae > Artverviricota > Revtraviricetes > Ortervirales > Retroviridae > Orthoretrovirinae > Lentivirus > Human immunodeficiency virus 2
- Virus hosts
Accessions
- Primary accessionP04577
Proteomes
Subcellular Location
UniProt Annotation
GO Annotation
Transmembrane protein gp41
Virion membrane ; Single-pass type I membrane protein
Host cell membrane ; Single-pass type I membrane protein
Host endosome membrane ; Single-pass type I membrane protein
Note: It is probably concentrated at the site of budding and incorporated into the virions possibly by contacts between the cytoplasmic tail of Env and the N-terminus of Gag.
Surface protein gp120
Virion membrane ; Peripheral membrane protein
Host cell membrane ; Peripheral membrane protein
Host endosome membrane ; Peripheral membrane protein
Note: The surface protein is not anchored to the viral envelope, but associates with the extravirion surface through its binding to TM. It is probably concentrated at the site of budding and incorporated into the virions possibly by contacts between the cytoplasmic tail of Env and the N-terminus of Gag (By similarity).
Features
Showing features for topological domain, transmembrane.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Topological domain | 20-679 | Extracellular | ||||
Sequence: TQYVTVFYGVPTWKNATIPLFCATRNRDTWGTIQCLPDNDDYQEITLNVTEAFDAWNNTVTEQAIEDVWHLFETSIKPCVKLTPLCVAMKCSSTESSTGNNTTSKSTSTTTTTPTDQEQEISEDTPCARADNCSGLGEEETINCQFNMTGLERDKKKQYNETWYSKDVVCETNNSTNQTQCYMNHCNTSVITESCDKHYWDAIRFRYCAPPGYALLRCNDTNYSGFAPNCSKVVASTCTRMMETQTSTWFGFNGTRAENRTYIYWHGRDNRTIISLNKYYNLSLHCKRPGNKTVKQIMLMSGHVFHSHYQPINKRPRQAWCWFKGKWKDAMQEVKETLAKHPRYRGTNDTRNISFAAPGKGSDPEVAYMWTNCRGEFLYCNMTWFLNWIENKTHRNYAPCHIKQIINTWHKVGRNVYLPPREGELSCNSTVTSIIANIDWQNNNQTNITFSAEVAELYRLELGDYKLVEITPIGFAPTKEKRYSSAHGRHTRGVFVLGFLGFLATAGSAMGAASLTVSAQSRTLLAGIVQQQQQLLDVVKRQQELLRLTVWGTKNLQARVTAIEKYLQDQARLNSWGCAFRQVCHTTVPWVNDSLAPDWDNMTWQEWEKQVRYLEANISKSLEQAQIQQEKNMYELQKLNSWDIFGNWFDLTSWVKYIQY | ||||||
Transmembrane | 680-700 | Helical | ||||
Sequence: GVLIIVAVIALRIVIYVVQML | ||||||
Topological domain | 701-858 | Cytoplasmic | ||||
Sequence: SRLRKGYRPVFSSPPGYIQQIHIHKDRGQPANEETEEDGGSNGGDRYWPWPIAYIHFLIRQLIRLLTRLYSICRDLLSRSFLTLQLIYQNLRDWLRLRTAFLQYGCEWIQEAFQAAARATRETLAGACRGLWRVLERIGRGILAVPRRIRQGAEIALL |
Keywords
- Cellular component
Phenotypes & Variants
Keywords
- Disease
PTM/Processing
Features
Showing features for signal, chain, glycosylation, disulfide bond, lipidation.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Signal | 1-19 | |||||
Sequence: MMNQLLIAILLASACLVYC | ||||||
Chain | PRO_0000038449 | 20-511 | Surface protein gp120 | |||
Sequence: TQYVTVFYGVPTWKNATIPLFCATRNRDTWGTIQCLPDNDDYQEITLNVTEAFDAWNNTVTEQAIEDVWHLFETSIKPCVKLTPLCVAMKCSSTESSTGNNTTSKSTSTTTTTPTDQEQEISEDTPCARADNCSGLGEEETINCQFNMTGLERDKKKQYNETWYSKDVVCETNNSTNQTQCYMNHCNTSVITESCDKHYWDAIRFRYCAPPGYALLRCNDTNYSGFAPNCSKVVASTCTRMMETQTSTWFGFNGTRAENRTYIYWHGRDNRTIISLNKYYNLSLHCKRPGNKTVKQIMLMSGHVFHSHYQPINKRPRQAWCWFKGKWKDAMQEVKETLAKHPRYRGTNDTRNISFAAPGKGSDPEVAYMWTNCRGEFLYCNMTWFLNWIENKTHRNYAPCHIKQIINTWHKVGRNVYLPPREGELSCNSTVTSIIANIDWQNNNQTNITFSAEVAELYRLELGDYKLVEITPIGFAPTKEKRYSSAHGRHTR | ||||||
Chain | PRO_0000239503 | 20-858 | Envelope glycoprotein gp160 | |||
Sequence: TQYVTVFYGVPTWKNATIPLFCATRNRDTWGTIQCLPDNDDYQEITLNVTEAFDAWNNTVTEQAIEDVWHLFETSIKPCVKLTPLCVAMKCSSTESSTGNNTTSKSTSTTTTTPTDQEQEISEDTPCARADNCSGLGEEETINCQFNMTGLERDKKKQYNETWYSKDVVCETNNSTNQTQCYMNHCNTSVITESCDKHYWDAIRFRYCAPPGYALLRCNDTNYSGFAPNCSKVVASTCTRMMETQTSTWFGFNGTRAENRTYIYWHGRDNRTIISLNKYYNLSLHCKRPGNKTVKQIMLMSGHVFHSHYQPINKRPRQAWCWFKGKWKDAMQEVKETLAKHPRYRGTNDTRNISFAAPGKGSDPEVAYMWTNCRGEFLYCNMTWFLNWIENKTHRNYAPCHIKQIINTWHKVGRNVYLPPREGELSCNSTVTSIIANIDWQNNNQTNITFSAEVAELYRLELGDYKLVEITPIGFAPTKEKRYSSAHGRHTRGVFVLGFLGFLATAGSAMGAASLTVSAQSRTLLAGIVQQQQQLLDVVKRQQELLRLTVWGTKNLQARVTAIEKYLQDQARLNSWGCAFRQVCHTTVPWVNDSLAPDWDNMTWQEWEKQVRYLEANISKSLEQAQIQQEKNMYELQKLNSWDIFGNWFDLTSWVKYIQYGVLIIVAVIALRIVIYVVQMLSRLRKGYRPVFSSPPGYIQQIHIHKDRGQPANEETEEDGGSNGGDRYWPWPIAYIHFLIRQLIRLLTRLYSICRDLLSRSFLTLQLIYQNLRDWLRLRTAFLQYGCEWIQEAFQAAARATRETLAGACRGLWRVLERIGRGILAVPRRIRQGAEIALL | ||||||
Glycosylation | 34 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Disulfide bond | 41↔54 | |||||
Sequence: CATRNRDTWGTIQC | ||||||
Glycosylation | 67 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 76 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Disulfide bond | 98↔214 | |||||
Sequence: CVKLTPLCVAMKCSSTESSTGNNTTSKSTSTTTTTPTDQEQEISEDTPCARADNCSGLGEEETINCQFNMTGLERDKKKQYNETWYSKDVVCETNNSTNQTQCYMNHCNTSVITESC | ||||||
Disulfide bond | 105↔205 | |||||
Sequence: CVAMKCSSTESSTGNNTTSKSTSTTTTTPTDQEQEISEDTPCARADNCSGLGEEETINCQFNMTGLERDKKKQYNETWYSKDVVCETNNSTNQTQCYMNHC | ||||||
Disulfide bond | 110↔163 | |||||
Sequence: CSSTESSTGNNTTSKSTSTTTTTPTDQEQEISEDTPCARADNCSGLGEEETINC | ||||||
Glycosylation | 119 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 120 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 151 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 166 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 179 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 192 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 193 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 196 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 206 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Disulfide bond | 227↔257 | |||||
Sequence: CAPPGYALLRCNDTNYSGFAPNCSKVVASTC | ||||||
Disulfide bond | 237↔249 | |||||
Sequence: CNDTNYSGFAPNC | ||||||
Glycosylation | 238 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 241 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 248 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 272 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 278 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 289 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 300 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Disulfide bond | 305↔340 | |||||
Sequence: CKRPGNKTVKQIMLMSGHVFHSHYQPINKRPRQAWC | ||||||
Glycosylation | 310 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 367 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 371 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Disulfide bond | 392↔446 | |||||
Sequence: CRGEFLYCNMTWFLNWIENKTHRNYAPCHIKQIINTWHKVGRNVYLPPREGELSC | ||||||
Disulfide bond | 399↔419 | |||||
Sequence: CNMTWFLNWIENKTHRNYAPC | ||||||
Glycosylation | 400 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 410 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 447 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 463 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 466 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Chain | PRO_0000038450 | 512-858 | Transmembrane protein gp41 | |||
Sequence: GVFVLGFLGFLATAGSAMGAASLTVSAQSRTLLAGIVQQQQQLLDVVKRQQELLRLTVWGTKNLQARVTAIEKYLQDQARLNSWGCAFRQVCHTTVPWVNDSLAPDWDNMTWQEWEKQVRYLEANISKSLEQAQIQQEKNMYELQKLNSWDIFGNWFDLTSWVKYIQYGVLIIVAVIALRIVIYVVQMLSRLRKGYRPVFSSPPGYIQQIHIHKDRGQPANEETEEDGGSNGGDRYWPWPIAYIHFLIRQLIRLLTRLYSICRDLLSRSFLTLQLIYQNLRDWLRLRTAFLQYGCEWIQEAFQAAARATRETLAGACRGLWRVLERIGRGILAVPRRIRQGAEIALL | ||||||
Glycosylation | 611 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 620 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Glycosylation | 636 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Lipidation | 773 | S-palmitoyl cysteine; by host | ||||
Sequence: C |
Post-translational modification
Specific enzymatic cleavages in vivo yield mature proteins. Envelope glycoproteins are synthesized as an inactive precursor that is heavily N-glycosylated and processed likely by host cell furin in the Golgi to yield the mature SU and TM proteins. The cleavage site between SU and TM requires the minimal sequence [KR]-X-[KR]-R (By similarity).
Palmitoylation of the transmembrane protein and of Env polyprotein (prior to its proteolytic cleavage) is essential for their association with host cell membrane lipid rafts. Palmitoylation is therefore required for envelope trafficking to classical lipid rafts, but not for viral replication (By similarity).
Keywords
- PTM
PTM databases
Interaction
Subunit
Surface protein gp120
The mature envelope protein (Env) consists of a homotrimer of non-covalently associated gp120-gp41 heterodimers. The resulting complex protrudes from the virus surface as a spike. There seems to be as few as 10 spikes on the average virion. Interacts with human CD4, CCR5 and CXCR4, to form a P4HB/PDI-CD4-CXCR4-gp120 complex. Gp120 also interacts with the C-type lectins CD209/DC-SIGN and CLEC4M/DC-SIGNR (collectively referred to as DC-SIGN(R)). Gp120 and gp41 interact with GalCer (By similarity).
Transmembrane protein gp41
The mature envelope protein (Env) consists of a homotrimer of non-covalently associated gp120-gp41 heterodimers. The resulting complex protrudes from the virus surface as a spike. There seems to be as few as 10 spikes on the average virion.
Family & Domains
Features
Showing features for region, compositional bias, coiled coil, motif.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Region | 110-162 | V1 | ||||
Sequence: CSSTESSTGNNTTSKSTSTTTTTPTDQEQEISEDTPCARADNCSGLGEEETIN | ||||||
Compositional bias | 111-140 | Polar residues | ||||
Sequence: SSTESSTGNNTTSKSTSTTTTTPTDQEQEI | ||||||
Region | 111-142 | Disordered | ||||
Sequence: SSTESSTGNNTTSKSTSTTTTTPTDQEQEISE | ||||||
Region | 163-205 | V2 | ||||
Sequence: CQFNMTGLERDKKKQYNETWYSKDVVCETNNSTNQTQCYMNHC | ||||||
Region | 305-339 | V3 | ||||
Sequence: CKRPGNKTVKQIMLMSGHVFHSHYQPINKRPRQAW | ||||||
Region | 399-419 | V4 | ||||
Sequence: CNMTWFLNWIENKTHRNYAPC | ||||||
Region | 462-469 | V5 | ||||
Sequence: NNQTNITF | ||||||
Region | 512-532 | Fusion peptide | ||||
Sequence: GVFVLGFLGFLATAGSAMGAA | ||||||
Region | 575-591 | Immunosuppression | ||||
Sequence: LQARVTAIEKYLQDQAR | ||||||
Coiled coil | 624-645 | |||||
Sequence: QEWEKQVRYLEANISKSLEQAQ | ||||||
Region | 657-678 | MPER; binding to GalCer | ||||
Sequence: KLNSWDIFGNWFDLTSWVKYIQ | ||||||
Motif | 707-710 | YXXV motif; contains endocytosis signal | ||||
Sequence: YRPV | ||||||
Motif | 857-858 | Di-leucine internalization motif | ||||
Sequence: LL |
Domain
Some of the most genetically diverse regions of the viral genome are present in Env. They are called variable regions 1 through 5 (V1 through V5). Coreceptor usage of gp120 is determined mainly by the primary structure of the third variable region (V3) in the outer domain of gp120. Binding to CCR5 involves a region adjacent in addition to V3 (By similarity).
The 17 amino acids long immunosuppressive region is present in many retroviral envelope proteins. Synthetic peptides derived from this relatively conserved sequence inhibit immune function in vitro and in vivo (By similarity).
Keywords
- Domain
Family and domain databases
Sequence
- Sequence statusComplete
- Sequence processingThe displayed sequence is further processed into a mature form.
- Length858
- Mass (Da)98,824
- Last updated1987-08-13 v1
- ChecksumC7266AF1F5C5B9A7
Features
Showing features for compositional bias, sequence conflict.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Compositional bias | 111-140 | Polar residues | ||||
Sequence: SSTESSTGNNTTSKSTSTTTTTPTDQEQEI | ||||||
Sequence conflict | 312 | in Ref. 1; AAB00770 | ||||
Sequence: T → I |