P48432 · SOX2_MOUSE
- ProteinTranscription factor SOX-2
- GeneSox2
- StatusUniProtKB reviewed (Swiss-Prot)
- Organism
- Amino acids319 (go to sequence)
- Protein existenceEvidence at protein level
- Annotation score5/5
Function
function
Transcription factor that forms a trimeric complex with POU5F1 (OCT3/4) on DNA and controls the expression of a number of genes involved in embryonic development such as YES1, FGF4, UTF1 and ZFP206 (PubMed:15863505, PubMed:17097055, PubMed:19740739, PubMed:32703285).
Binds to the proximal enhancer region of NANOG (PubMed:15863505).
Critical for early embryogenesis and for embryonic stem cell pluripotency (By similarity).
Downstream SRRT target that mediates the promotion of neural stem cell self-renewal (PubMed:22198669).
Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation (By similarity).
May function as a switch in neuronal development (By similarity).
Binds to the proximal enhancer region of NANOG (PubMed:15863505).
Critical for early embryogenesis and for embryonic stem cell pluripotency (By similarity).
Downstream SRRT target that mediates the promotion of neural stem cell self-renewal (PubMed:22198669).
Keeps neural cells undifferentiated by counteracting the activity of proneural proteins and suppresses neuronal differentiation (By similarity).
May function as a switch in neuronal development (By similarity).
Biotechnology
POU5F1/OCT4, SOX2, MYC/c-Myc and KLF4 are the four Yamanaka factors. When combined, these factors are sufficient to reprogram differentiated cells to an embryonic-like state designated iPS (induced pluripotent stem) cells. iPS cells exhibit the morphology and growth properties of ES cells and express ES cell marker genes.
Features
Showing features for dna binding.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
DNA binding | 43-111 | HMG box | ||||
Sequence: VKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYK |
GO annotations
Keywords
- Molecular function
- Biological process
Enzyme and pathway databases
Names & Taxonomy
Protein names
- Recommended nameTranscription factor SOX-2
Gene names
Organism names
- Organism
- Strain
- Taxonomic lineageEukaryota > Metazoa > Chordata > Craniata > Vertebrata > Euteleostomi > Mammalia > Eutheria > Euarchontoglires > Glires > Rodentia > Myomorpha > Muroidea > Muridae > Murinae > Mus > Mus
Accessions
- Primary accessionP48432
Proteomes
Organism-specific databases
Phenotypes & Variants
Disruption phenotype
Embryonically lethal.
Features
Showing features for mutagenesis.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Mutagenesis | 247 | Absence of sumoylation. Increased FGF4 activation. No effect on nuclear localization. | ||||
Sequence: K → R |
PTM/Processing
Features
Showing features for chain, modified residue, cross-link.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Chain | PRO_0000048716 | 1-319 | Transcription factor SOX-2 | |||
Sequence: MYNMMETELKPPGPQQASGGGGGGGNATAAATGGNQKNSPDRVKRPMNAFMVWSRGQRRKMAQENPKMHNSEISKRLGAEWKLLSETEKRPFIDEAKRLRALHMKEHPDYKYRPRRKTKTLMKKDKYTLPGGLLAPGGNSMASGVGVGAGLGGGLNQRMDSYAHMNGWSNGSYSMMQEQLGYPQHPGLNAHGAAQMQPMHRYVVSALQYNSMTSSQTYMNGSPTYSMSYSQQGTPGMALGSMGSVVKSEASSSPPVVTSSSHSRAPCQAGDLRDMISMYLPGAEVPEPAAPSRLHMAQHYQSGPVPGTAKYGTLPLSHM | ||||||
Modified residue | 44 | N6-methyllysine | ||||
Sequence: K | ||||||
Modified residue | 119 | N6-methyllysine | ||||
Sequence: K | ||||||
Cross-link | 247 | Glycyl lysine isopeptide (Lys-Gly) (interchain with G-Cter in SUMO) | ||||
Sequence: K | ||||||
Modified residue | 253 | Phosphoserine | ||||
Sequence: S |
Post-translational modification
Sumoylation inhibits binding on DNA and negatively regulates the FGF4 transactivation.
Methylation at Lys-44 and Lys-119 is necessary for the regulation of SOX2 proteasomal degradation.
Ubiquitinated by WWP2, leading to proteasomal degradation.
Keywords
- PTM
Proteomic databases
PTM databases
Expression
Tissue specificity
Expressed in the cochlea (at protein level) (PubMed:32127020).
Expressed in the brain and retina (PubMed:15863505, PubMed:7590241).
A very low level of expression is seen in the stomach and lung (PubMed:15863505, PubMed:7590241).
Expressed in the kidney (PubMed:15863505).
Expressed in the brain and retina (PubMed:15863505, PubMed:7590241).
A very low level of expression is seen in the stomach and lung (PubMed:15863505, PubMed:7590241).
Expressed in the kidney (PubMed:15863505).
Induction
By SRRT.
Developmental stage
Interaction
Subunit
Interacts with ZSCAN10 (PubMed:19740739).
Interacts with SOX3 and FGFR1 (PubMed:17728342).
Interacts with GLIS1 (By similarity).
Interacts with POU5F1; binds synergistically with POU5F1 to DNA (PubMed:15863505).
Interacts with DDX56 (PubMed:32703285).
Interacts with L3MBTL3 and DCAF5 (PubMed:30442713).
The interaction with L3MBTL3 and DCAF5 requires methylation at Lys-44 and is necessary to target SOX2 for ubiquitination by the CRL4-DCAF5 E3 ubiquitin ligase complex (By similarity).
Interacts with RCOR1/CoREST (PubMed:30442713).
Interacts with PHF20L1 (PubMed:29358331).
The interaction with PHF20L1 requires methylation at Lys-44 and Lys-119 and protects SOX2 from degradation (By similarity).
Interacts with TRIM26; this interaction prevents ubiquitination by WWP2 (By similarity).
Interacts with SOX3 and FGFR1 (PubMed:17728342).
Interacts with GLIS1 (By similarity).
Interacts with POU5F1; binds synergistically with POU5F1 to DNA (PubMed:15863505).
Interacts with DDX56 (PubMed:32703285).
Interacts with L3MBTL3 and DCAF5 (PubMed:30442713).
The interaction with L3MBTL3 and DCAF5 requires methylation at Lys-44 and is necessary to target SOX2 for ubiquitination by the CRL4-DCAF5 E3 ubiquitin ligase complex (By similarity).
Interacts with RCOR1/CoREST (PubMed:30442713).
Interacts with PHF20L1 (PubMed:29358331).
The interaction with PHF20L1 requires methylation at Lys-44 and Lys-119 and protects SOX2 from degradation (By similarity).
Interacts with TRIM26; this interaction prevents ubiquitination by WWP2 (By similarity).
Binary interactions
Type | Entry 1 | Entry 2 | Number of experiments | Intact | |
---|---|---|---|---|---|
BINARY | P48432 | Nanog Q80Z64 | 10 | EBI-2313612, EBI-2312517 | |
BINARY | P48432 | Pou5f1 P20263 | 4 | EBI-2313612, EBI-1606219 | |
BINARY | P48432 | Sin3a Q60520 | 3 | EBI-2313612, EBI-349034 | |
BINARY | P48432 | Yy1 Q00899 | 2 | EBI-2313612, EBI-6921536 |
Protein-protein interaction databases
Miscellaneous
Structure
Family & Domains
Features
Showing features for region, compositional bias, motif.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Region | 1-43 | Disordered | ||||
Sequence: MYNMMETELKPPGPQQASGGGGGGGNATAAATGGNQKNSPDRV | ||||||
Compositional bias | 247-266 | Polar residues | ||||
Sequence: KSEASSSPPVVTSSSHSRAP | ||||||
Region | 247-267 | Disordered | ||||
Sequence: KSEASSSPPVVTSSSHSRAPC | ||||||
Motif | 274-282 | 9aaTAD | ||||
Sequence: DMISMYLPG | ||||||
Region | 299-319 | Disordered | ||||
Sequence: HYQSGPVPGTAKYGTLPLSHM |
Domain
The 9aaTAD motif is a transactivation domain present in a large number of yeast and animal transcription factors.
Phylogenomic databases
Family and domain databases
Sequence
- Sequence statusComplete
- Length319
- Mass (Da)34,454
- Last updated1999-07-15 v2
- ChecksumC40DE49E524C49F1
Computationally mapped potential isoform sequences
There is 1 potential isoform mapped to this entry
Entry | Entry name | Gene name | Length | ||
---|---|---|---|---|---|
Q60I23 | Q60I23_MOUSE | Sox2 | 319 |
Features
Showing features for sequence conflict, compositional bias.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Sequence conflict | 153-155 | in Ref. 3; CAA63847 | ||||
Sequence: GGL → AGV | ||||||
Sequence conflict | 203 | in Ref. 3; CAA63847 | ||||
Sequence: V → D | ||||||
Compositional bias | 247-266 | Polar residues | ||||
Sequence: KSEASSSPPVVTSSSHSRAP | ||||||
Sequence conflict | 310-311 | in Ref. 3; CAA63847 | ||||
Sequence: KY → IN |
Keywords
- Technical term
Sequence databases
Nucleotide Sequence | Protein Sequence | Molecule Type | Status | |
---|---|---|---|---|
U31967 EMBL· GenBank· DDBJ | AAC31791.1 EMBL· GenBank· DDBJ | mRNA | ||
X94127 EMBL· GenBank· DDBJ | CAA63847.1 EMBL· GenBank· DDBJ | Genomic DNA |