P03518 · GP_RVFV
- Protein: Envelopment polyprotein
- Gene: GP
- Status: UniProtKB reviewed (Swiss-Prot)
- Organism
- Amino acids: 1206
- Protein existence: Evidence at protein level
- Annotation score: 5/5
Function
Glycoprotein N
Structural component of the virion that interacts with glycoprotein C (By similarity).
It shields the hydrophobic fusion loops of the glycoprotein C, preventing premature fusion (By similarity).
The glycoprotein protrusions are arranged on an icosahedral lattice, with T=12 triangulation (PubMed:19193794, PubMed:23319635).
They are able to attach the virion to the host cell receptor CD209/DC-SIGN and to promote fusion of membranes with the late endosome after endocytosis of the virion (By similarity).
Plays a role in the packaging of ribonucleoproteins and polymerase during virus assembly (By similarity).
Glycoprotein C
Structural component of the virion that interacts with glycoprotein N (By similarity).
Acts as a class II fusion protein that is activated upon acidification and subsequent repositioning of the glycoprotein N (PubMed:23319635, PubMed:29097548).
The glycoprotein protrusions are arranged on an icosahedral lattice, with T=12 triangulation (PubMed:19193794, PubMed:23319635).
They are able to attach the virion to the host cell receptor CD209/DC-SIGN and to promote fusion of membranes with the late endosome after endocytosis of the virion (By similarity).
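The T=12 triangulation quoted above can be turned into lattice counts with the standard Caspar-Klug relations. The sketch below is only an illustration of that arithmetic: the function names are hypothetical, and reading each of the 60T lattice positions as one Gn-Gc heterodimer is an assumption that this entry does not state.

```python
def triangulation_number(h: int, k: int) -> int:
    """Caspar-Klug triangulation number T = h^2 + h*k + k^2."""
    return h * h + h * k + k * k


def lattice_counts(t: int) -> dict:
    """Counts implied by an icosahedral lattice with triangulation number t."""
    return {
        "subunits": 60 * t,        # quasi-equivalent positions on the lattice
        "capsomers": 10 * t + 2,   # pentamers plus hexamers
        "pentamers": 12,           # always 12, one per icosahedral vertex
        "hexamers": 10 * (t - 1),
    }


# (h, k) = (2, 2) is the index pair that gives T = 12.
T = triangulation_number(2, 2)
counts = lattice_counts(T)
print(T, counts)
```

Under the assumption above, the 720 positions would correspond to 720 heterodimers arranged in 122 capsomers.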
Isoform NSm protein
Plays a role in virus dissemination in mosquitoes.
NSm-Gn protein
Plays a role in virus dissemination in mosquitoes.
Features
Showing features for site.
Type | ID | Position(s) | Description | Sequence
---|---|---|---|---
Site | | 153-154 | Cleavage; by host signal peptidase | AE
Site | | 690-691 | Cleavage; by host signal peptidase | AC
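The two signal-peptidase sites can be related to the chain coordinates listed under PTM/Processing. The following is a minimal sketch, assuming each site is a pair of flanking residues (last residue before the cut, first residue after it) and that the 16-residue signal peptide has already been removed; the helper name `chains_from_sites` is hypothetical.

```python
PRECURSOR_LENGTH = 1206
SIGNAL_END = 16                           # signal peptide spans 1-16
CLEAVAGE_SITES = [(153, 154), (690, 691)]  # residues flanking each cut


def chains_from_sites(start, end, sites):
    """Split the 1-based inclusive range start..end at each cleavage site."""
    chains = []
    for left, right in sorted(sites):
        chains.append((start, left))
        start = right
    chains.append((start, end))
    return chains


segments = chains_from_sites(SIGNAL_END + 1, PRECURSOR_LENGTH, CLEAVAGE_SITES)
lengths = [stop - begin + 1 for begin, stop in segments]
for (begin, stop), size in zip(segments, lengths):
    print(f"{begin}-{stop}: {size} aa")
```

The second and third segments match the Glycoprotein N (154-690) and Glycoprotein C (691-1206) chains. The NSm-Gn chain (17-690) corresponds to the first two segments left joined, that is, to the 153-154 site not being cut.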
GO annotations
Aspect | Term
---|---
Cellular Component | host cell endoplasmic reticulum membrane
Cellular Component | host cell Golgi membrane
Cellular Component | host cell mitochondrial outer membrane
Cellular Component | membrane
Cellular Component | virion membrane
Biological Process | entry receptor-mediated virion attachment to host cell
Biological Process | fusion of virus membrane with host endosome membrane
Biological Process | symbiont entry into host cell
Keywords
- Biological process
Names & Taxonomy
Protein names
- Recommended name: Envelopment polyprotein
- Alternative names
- Cleaved into 3 chains
Gene names
Organism names
- Organism
- Taxonomic lineage: Viruses > Riboviria > Orthornavirae > Negarnaviricota > Polyploviricotina > Ellioviricetes > Bunyavirales > Phenuiviridae > Phlebovirus > Phlebovirus riftense
- Virus hosts
Accessions
- Primary accession: P03518
- Secondary accessions
Subcellular Location
Glycoprotein N
Virion membrane; Single-pass type I membrane protein
Host Golgi apparatus membrane; Single-pass type I membrane protein
Host endoplasmic reticulum membrane; Single-pass type I membrane protein
Note: Interaction between Glycoprotein N and Glycoprotein C is essential for proper targeting of Glycoprotein C to the Golgi complex, where virion budding occurs.
Glycoprotein C
Virion membrane; Single-pass type I membrane protein
Host Golgi apparatus membrane; Single-pass type I membrane protein
Note: Interaction between Glycoprotein N and Glycoprotein C is essential for proper targeting of Glycoprotein C to the Golgi complex, where virion budding occurs.
Isoform NSm protein
Host mitochondrion outer membrane; Single-pass type II membrane protein
Features
Showing features for topological domain, transmembrane.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Topological domain | 17-130 | Cytoplasmic | ||||
Sequence: VIRVSLSSTREETCFGDSTNPEMIEGAWDSLREEEMPEELSCSISGIREVKTSSQELYRALKAIIAADGLNNITCHGKDPEDKISLIKGPPHKKRVGIVRCERRRDAKQIGRET | ||||||
Topological domain | 154-582 | Lumenal | ||||
Sequence: EDPHLRNRPGKGHNYIDGMTQEDATCKPVTYAGACSSFDVLLEKGKFPLFQSYAHHRTLLEAVHDTIIAKADPPSCDLQSAHGNPCMKEKLVMKTHCPNDYQSAHYLNNDGKMASVKCPPKYGLTEDCNFCRQMTGASLKKGSYPLQDLFCQSSEDDGSKLKTKMKGVCEVGVQAHKKCDGQLSTAHEVVPFAVFKNSKKVYLDKLDLKTEENLLPDSFVCFEHKGQYKGTMDSGQTKRELKSFDISQCPKIGGHGSKKCTGDAAFCSAYECTAQYANAYCSHANGSGIVQIQVSGVWKKPLCVGYERVVVKRELSAKPIQRVEPCTTCITKCEPHGLVVRSTGFKISSAVACASGVCVTGSQSPSTEITLKYPGISQSSGGDIGVHMAHDDQSVSSKIVAHCPPQDPCLVHGCIVCAHGLINYQCHTA | ||||||
Transmembrane | 583-603 | Helical | ||||
Sequence: LSAFVVVFVFSSIAIICLAVL | ||||||
Topological domain | 604-673 | Cytoplasmic | ||||
Sequence: YRVLKCLKIAPRKVLNPLMWITAFIRWIYKKMVARVAHNINQVNREIGWMEGGQLVLGNPAPIPRHAPIP | ||||||
Topological domain | 691-1159 | Lumenal | ||||
Sequence: CSELIQASSRITTCSTEGVNTKCRLSGTALIRAGSVGAEACLMLKGVKEDQTKFLKIKTVSSELSCREGQSYWTGSISPKCLSSRRCHLVGECHVNRCLSWRDNETSAEFSFVGESTTMRENKCFEQCGGWGCGCFNVNPSCLFVHTYLQSVRKEALRVFNCIDWVHKLTLEITDFDGSVSTIDLGASSSRFTNWGSVSLSLDAEGISGSNSFSFIESPSKGYAIVDEPFSEIPRQGFLGEIRCNSESSVLSAHESCLRAPNLISYKPMIDQLECTTNLIDPFVVFERGSLPQTRNDKTFAASKGNRGVQAFSKGSVQADLTLMFDNFEVDFVGAAVSCDAAFLNLTGCYSCNAGARVCLSITSTGTGSLSAHNKDGSLHIVLPSENGTKDQCQILHFTVPEVEEEFMYSCDGDERPLLVKGTLIAIDPFDDRREAGGESTVVNPKSGSWNFFDWFSGLMSWFGGPLKL | ||||||
Transmembrane | 1160-1180 | Helical | ||||
Sequence: YSSFACMLHYQLGSFSSLYIL | ||||||
Topological domain | 1181-1206 | Cytoplasmic | ||||
Sequence: EEQASLKCGLLPLRRPHRSVRVKVIC |
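The topology table does not cover every residue of the processed precursor. A short sketch, using only the coordinates in the table above, finds the unannotated stretches and the lengths of the transmembrane helices; the helper name `uncovered` is hypothetical.

```python
SEGMENTS = [
    (17, 130, "Cytoplasmic"),
    (154, 582, "Lumenal"),
    (583, 603, "Helical TM"),
    (604, 673, "Cytoplasmic"),
    (691, 1159, "Lumenal"),
    (1160, 1180, "Helical TM"),
    (1181, 1206, "Cytoplasmic"),
]


def uncovered(segments, first, last):
    """Return 1-based inclusive ranges in first..last not covered by any segment."""
    gaps = []
    cursor = first
    for start, end, _ in sorted(segments):
        if start > cursor:
            gaps.append((cursor, start - 1))
        cursor = max(cursor, end + 1)
    if cursor <= last:
        gaps.append((cursor, last))
    return gaps


gaps = uncovered(SEGMENTS, 17, 1206)
tm_lengths = [end - start + 1 for start, end, kind in SEGMENTS if kind == "Helical TM"]
print(gaps, tm_lengths)
```

The two gaps fall on the internal signal sequences annotated under Family & Domains (131-153 for glycoprotein N, 675-690 for glycoprotein C), and both helices come out at 21 residues.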
Keywords
- Cellular component
PTM/Processing
Features
Showing features for signal, chain, disulfide bond, glycosylation.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Signal | 1-16 | |||||
Sequence: MYVLLTILISVLVCEA | ||||||
Chain | PRO_0000036847 | 17-690 | NSm-Gn protein | |||
Sequence: VIRVSLSSTREETCFGDSTNPEMIEGAWDSLREEEMPEELSCSISGIREVKTSSQELYRALKAIIAADGLNNITCHGKDPEDKISLIKGPPHKKRVGIVRCERRRDAKQIGRETMAGIAMTVLPALAVFALAPVVFAEDPHLRNRPGKGHNYIDGMTQEDATCKPVTYAGACSSFDVLLEKGKFPLFQSYAHHRTLLEAVHDTIIAKADPPSCDLQSAHGNPCMKEKLVMKTHCPNDYQSAHYLNNDGKMASVKCPPKYGLTEDCNFCRQMTGASLKKGSYPLQDLFCQSSEDDGSKLKTKMKGVCEVGVQAHKKCDGQLSTAHEVVPFAVFKNSKKVYLDKLDLKTEENLLPDSFVCFEHKGQYKGTMDSGQTKRELKSFDISQCPKIGGHGSKKCTGDAAFCSAYECTAQYANAYCSHANGSGIVQIQVSGVWKKPLCVGYERVVVKRELSAKPIQRVEPCTTCITKCEPHGLVVRSTGFKISSAVACASGVCVTGSQSPSTEITLKYPGISQSSGGDIGVHMAHDDQSVSSKIVAHCPPQDPCLVHGCIVCAHGLINYQCHTALSAFVVVFVFSSIAIICLAVLYRVLKCLKIAPRKVLNPLMWITAFIRWIYKKMVARVAHNINQVNREIGWMEGGQLVLGNPAPIPRHAPIPRYSTYLMLLLIVSYASA | ||||||
Chain | PRO_0000247009 | 17-1206 | Envelopment polyprotein | |||
Sequence: VIRVSLSSTREETCFGDSTNPEMIEGAWDSLREEEMPEELSCSISGIREVKTSSQELYRALKAIIAADGLNNITCHGKDPEDKISLIKGPPHKKRVGIVRCERRRDAKQIGRETMAGIAMTVLPALAVFALAPVVFAEDPHLRNRPGKGHNYIDGMTQEDATCKPVTYAGACSSFDVLLEKGKFPLFQSYAHHRTLLEAVHDTIIAKADPPSCDLQSAHGNPCMKEKLVMKTHCPNDYQSAHYLNNDGKMASVKCPPKYGLTEDCNFCRQMTGASLKKGSYPLQDLFCQSSEDDGSKLKTKMKGVCEVGVQAHKKCDGQLSTAHEVVPFAVFKNSKKVYLDKLDLKTEENLLPDSFVCFEHKGQYKGTMDSGQTKRELKSFDISQCPKIGGHGSKKCTGDAAFCSAYECTAQYANAYCSHANGSGIVQIQVSGVWKKPLCVGYERVVVKRELSAKPIQRVEPCTTCITKCEPHGLVVRSTGFKISSAVACASGVCVTGSQSPSTEITLKYPGISQSSGGDIGVHMAHDDQSVSSKIVAHCPPQDPCLVHGCIVCAHGLINYQCHTALSAFVVVFVFSSIAIICLAVLYRVLKCLKIAPRKVLNPLMWITAFIRWIYKKMVARVAHNINQVNREIGWMEGGQLVLGNPAPIPRHAPIPRYSTYLMLLLIVSYASACSELIQASSRITTCSTEGVNTKCRLSGTALIRAGSVGAEACLMLKGVKEDQTKFLKIKTVSSELSCREGQSYWTGSISPKCLSSRRCHLVGECHVNRCLSWRDNETSAEFSFVGESTTMRENKCFEQCGGWGCGCFNVNPSCLFVHTYLQSVRKEALRVFNCIDWVHKLTLEITDFDGSVSTIDLGASSSRFTNWGSVSLSLDAEGISGSNSFSFIESPSKGYAIVDEPFSEIPRQGFLGEIRCNSESSVLSAHESCLRAPNLISYKPMIDQLECTTNLIDPFVVFERGSLPQTRNDKTFAASKGNRGVQAFSKGSVQADLTLMFDNFEVDFVGAAVSCDAAFLNLTGCYSCNAGARVCLSITSTGTGSLSAHNKDGSLHIVLPSENGTKDQCQILHFTVPEVEEEFMYSCDGDERPLLVKGTLIAIDPFDDRREAGGESTVVNPKSGSWNFFDWFSGLMSWFGGPLKLYSSFACMLHYQLGSFSSLYILEEQASLKCGLLPLRRPHRSVRVKVIC | ||||||
Chain | PRO_0000036848 | 154-690 | Glycoprotein N | |||
Sequence: EDPHLRNRPGKGHNYIDGMTQEDATCKPVTYAGACSSFDVLLEKGKFPLFQSYAHHRTLLEAVHDTIIAKADPPSCDLQSAHGNPCMKEKLVMKTHCPNDYQSAHYLNNDGKMASVKCPPKYGLTEDCNFCRQMTGASLKKGSYPLQDLFCQSSEDDGSKLKTKMKGVCEVGVQAHKKCDGQLSTAHEVVPFAVFKNSKKVYLDKLDLKTEENLLPDSFVCFEHKGQYKGTMDSGQTKRELKSFDISQCPKIGGHGSKKCTGDAAFCSAYECTAQYANAYCSHANGSGIVQIQVSGVWKKPLCVGYERVVVKRELSAKPIQRVEPCTTCITKCEPHGLVVRSTGFKISSAVACASGVCVTGSQSPSTEITLKYPGISQSSGGDIGVHMAHDDQSVSSKIVAHCPPQDPCLVHGCIVCAHGLINYQCHTALSAFVVVFVFSSIAIICLAVLYRVLKCLKIAPRKVLNPLMWITAFIRWIYKKMVARVAHNINQVNREIGWMEGGQLVLGNPAPIPRHAPIPRYSTYLMLLLIVSYASA | ||||||
Disulfide bond | 179↔188 | |||||
Sequence: CKPVTYAGAC | ||||||
Disulfide bond | 229↔239 | |||||
Sequence: CDLQSAHGNPC | ||||||
Disulfide bond | 250↔281 | |||||
Sequence: CPNDYQSAHYLNNDGKMASVKCPPKYGLTEDC | ||||||
Disulfide bond | 271↔284 | |||||
Sequence: CPPKYGLTEDCNFC | ||||||
Disulfide bond | 304↔456 | |||||
Sequence: CQSSEDDGSKLKTKMKGVCEVGVQAHKKCDGQLSTAHEVVPFAVFKNSKKVYLDKLDLKTEENLLPDSFVCFEHKGQYKGTMDSGQTKRELKSFDISQCPKIGGHGSKKCTGDAAFCSAYECTAQYANAYCSHANGSGIVQIQVSGVWKKPLC | ||||||
Disulfide bond | 322↔332 | |||||
Sequence: CEVGVQAHKKC | ||||||
Disulfide bond | 374↔434 | |||||
Sequence: CFEHKGQYKGTMDSGQTKRELKSFDISQCPKIGGHGSKKCTGDAAFCSAYECTAQYANAYC | ||||||
Disulfide bond | 402↔413 | |||||
Sequence: CPKIGGHGSKKC | ||||||
Disulfide bond | 420↔425 | |||||
Sequence: CSAYEC | ||||||
Disulfide bond | 479↔482 | |||||
Sequence: CTTC | ||||||
Disulfide bond | 486↔556 | |||||
Sequence: CEPHGLVVRSTGFKISSAVACASGVCVTGSQSPSTEITLKYPGISQSSGGDIGVHMAHDDQSVSSKIVAHC | ||||||
Disulfide bond | 506↔511 | |||||
Sequence: CASGVC | ||||||
Disulfide bond | 691↔731 | |||||
Sequence: CSELIQASSRITTCSTEGVNTKCRLSGTALIRAGSVGAEAC | ||||||
Chain | PRO_0000036849 | 691-1206 | Glycoprotein C | |||
Sequence: CSELIQASSRITTCSTEGVNTKCRLSGTALIRAGSVGAEACLMLKGVKEDQTKFLKIKTVSSELSCREGQSYWTGSISPKCLSSRRCHLVGECHVNRCLSWRDNETSAEFSFVGESTTMRENKCFEQCGGWGCGCFNVNPSCLFVHTYLQSVRKEALRVFNCIDWVHKLTLEITDFDGSVSTIDLGASSSRFTNWGSVSLSLDAEGISGSNSFSFIESPSKGYAIVDEPFSEIPRQGFLGEIRCNSESSVLSAHESCLRAPNLISYKPMIDQLECTTNLIDPFVVFERGSLPQTRNDKTFAASKGNRGVQAFSKGSVQADLTLMFDNFEVDFVGAAVSCDAAFLNLTGCYSCNAGARVCLSITSTGTGSLSAHNKDGSLHIVLPSENGTKDQCQILHFTVPEVEEEFMYSCDGDERPLLVKGTLIAIDPFDDRREAGGESTVVNPKSGSWNFFDWFSGLMSWFGGPLKLYSSFACMLHYQLGSFSSLYILEEQASLKCGLLPLRRPHRSVRVKVIC | ||||||
Disulfide bond | 704↔713 | |||||
Sequence: CSTEGVNTKC | ||||||
Disulfide bond | 756↔852 | |||||
Sequence: CREGQSYWTGSISPKCLSSRRCHLVGECHVNRCLSWRDNETSAEFSFVGESTTMRENKCFEQCGGWGCGCFNVNPSCLFVHTYLQSVRKEALRVFNC | ||||||
Disulfide bond | 771↔965 | |||||
Sequence: CLSSRRCHLVGECHVNRCLSWRDNETSAEFSFVGESTTMRENKCFEQCGGWGCGCFNVNPSCLFVHTYLQSVRKEALRVFNCIDWVHKLTLEITDFDGSVSTIDLGASSSRFTNWGSVSLSLDAEGISGSNSFSFIESPSKGYAIVDEPFSEIPRQGFLGEIRCNSESSVLSAHESCLRAPNLISYKPMIDQLEC | ||||||
Disulfide bond | 777↔825 | |||||
Sequence: CHLVGECHVNRCLSWRDNETSAEFSFVGESTTMRENKCFEQCGGWGCGC | ||||||
Disulfide bond | 783↔832 | |||||
Sequence: CHVNRCLSWRDNETSAEFSFVGESTTMRENKCFEQCGGWGCGCFNVNPSC | ||||||
Disulfide bond | 788↔814 | |||||
Sequence: CLSWRDNETSAEFSFVGESTTMRENKC | ||||||
Glycosylation | 794 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Disulfide bond | 818↔823 | |||||
Sequence: CGGWGC | ||||||
Disulfide bond | 934↔947 | |||||
Sequence: CNSESSVLSAHESC | ||||||
Disulfide bond | 1029↔1101 | |||||
Sequence: CDAAFLNLTGCYSCNAGARVCLSITSTGTGSLSAHNKDGSLHIVLPSENGTKDQCQILHFTVPEVEEEFMYSC | ||||||
Glycosylation | 1035 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N | ||||||
Disulfide bond | 1039↔1042 | |||||
Sequence: CYSC | ||||||
Disulfide bond | 1049↔1083 | |||||
Sequence: CLSITSTGTGSLSAHNKDGSLHIVLPSENGTKDQC | ||||||
Glycosylation | 1077 | N-linked (GlcNAc...) asparagine; by host | ||||
Sequence: N |
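Each disulfide row above pairs two positions with the sequence stretch between them, so the rows can be sanity-checked without the full sequence: the stretch should begin and end with cysteine and its length should equal the distance between the two positions plus one. The sketch below checks a handful of the shorter rows copied from the table; `check_bond` is a hypothetical helper.

```python
# (start, end) -> displayed sequence stretch, copied from the feature table
BONDS = {
    (179, 188): "CKPVTYAGAC",
    (420, 425): "CSAYEC",
    (479, 482): "CTTC",
    (818, 823): "CGGWGC",
    (1039, 1042): "CYSC",
}


def check_bond(span, fragment):
    """True if the fragment spans exactly start..end and is capped by cysteines."""
    start, end = span
    return (
        len(fragment) == end - start + 1
        and fragment[0] == "C"
        and fragment[-1] == "C"
    )


results = {span: check_bond(span, fragment) for span, fragment in BONDS.items()}
for span, ok in results.items():
    print(span, "consistent" if ok else "inconsistent")
```

The same check could be run over every row of the table once the rows are parsed programmatically.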
Post-translational modification
Envelopment polyprotein
Specific enzymatic cleavages in vivo yield mature proteins including NSm protein, Glycoprotein C, and Glycoprotein N.
Glycoprotein N
Glycosylated (By similarity).
The glycans can attach to host CD209/DC-SIGN, and may play a role in virus entry into dendritic cells (By similarity).
Glycoprotein C
Glycosylated (By similarity).
The glycans can attach to host CD209/DC-SIGN, and may play a role in virus entry into dendritic cells (By similarity).
Glycoprotein C
Palmitoylated.
Keywords
- PTM
PTM databases
Interaction
Subunit
Glycoprotein N
Heterodimer with glycoprotein C (PubMed:19193794, PubMed:28827346).
Interacts with nucleocapsid protein N and with the polymerase L in order to package them into virus particles (By similarity).
Glycoprotein C
Heterodimer with glycoprotein N (PubMed:19193794, PubMed:28827346).
Homotrimer (postfusion) (PubMed:29097548).
Interacts with nucleocapsid protein N and with the polymerase L in order to package them into virus particles (By similarity).
Interacts with host E3 ubiquitin-protein ligase UBR4; this interaction is important for viral RNA production (By similarity).
Interacts with host LRP1; this interaction facilitates virus entry into the host cell (By similarity).
Structure
Family & Domains
Features
Showing features for region.
Type | ID | Position(s) | Description | Sequence
---|---|---|---|---
Region | | 131-153 | Internal signal sequence for glycoprotein N | MAGIAMTVLPALAVFALAPVVFA
Region | | 608-650 | Golgi retention signal | KCLKIAPRKVLNPLMWITAFIRWIYKKMVARVAHNINQVNREI
Region | | 646-650 | Important for correct targeting of the glycoproteins to the Golgi complex but not for heterodimerization | VNREI
Region | | 675-690 | Internal signal sequence for glycoprotein C | YSTYLMLLLIVSYASA
Region | | 777-783 | Fusion loop | CHLVGEC
Region | | 819-830 | Fusion loop | GGWGCGCFNVNP
Domain
Glycoprotein N
Contains a Golgi retention signal at its C-terminus (By similarity).
The cytoplasmic tail specifically interacts with the ribonucleoproteins and is critical for genome packaging (By similarity).
Sequence similarities
Belongs to the phlebovirus envelope glycoprotein family.
Keywords
- Domain
Family and domain databases
Sequence & Isoforms
- Sequence status: Complete
- Sequence processing: The displayed sequence is further processed into a mature form.
This entry describes 3 isoforms produced by alternative initiation.
P03518-1
This isoform has been chosen as the canonical sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
- Name: 1
- Length: 1,206
- Mass (Da): 132,053
- Last updated: 1995-11-01 v2
- Checksum: D2E8017179285924
P03518-2
- Name: NSm protein
- Synonyms: P14
P03518-3
- Name: NSm' protein
- Synonyms: P13
Features
Showing features for alternative sequence.
Type | ID | Position(s) | Description | Sequence
---|---|---|---|---
Alternative sequence | VSP_057986 | 1-38 | in isoform NSm protein | Missing
Alternative sequence | VSP_057985 | 1-51 | in isoform NSm' protein | Missing
Alternative sequence | VSP_057987 | 154-1197 | in isoform NSm protein and isoform NSm' protein | Missing
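Because every alternative-sequence feature here is a deletion relative to the canonical 1,206-residue sequence, the isoform lengths follow by subtraction. The sketch below is illustrative only: it assumes the missing ranges of one isoform do not overlap, and `isoform_length` is a hypothetical helper.

```python
CANONICAL_LENGTH = 1206

# 1-based inclusive ranges marked "Missing" for each isoform
MISSING = {
    "NSm protein": [(1, 38), (154, 1197)],
    "NSm' protein": [(1, 51), (154, 1197)],
}


def isoform_length(total, missing_ranges):
    """Length left after removing non-overlapping 1-based inclusive ranges."""
    removed = sum(end - start + 1 for start, end in missing_ranges)
    return total - removed


isoform_lengths = {
    name: isoform_length(CANONICAL_LENGTH, ranges)
    for name, ranges in MISSING.items()
}
for name, size in isoform_lengths.items():
    print(f"{name}: {size} aa")
```

The resulting lengths (124 and 111 residues) are in the size range suggested by the P14 and P13 synonyms, if those are read as approximate masses in kDa.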
Keywords
- Coding sequence diversity
- Technical term