Q9VWC6 · ZELDA_DROME
- ProteinTranscription factor Zelda
- Genezld
- StatusUniProtKB reviewed (Swiss-Prot)
- Amino acids1596 (go to sequence)
- Protein existenceEvidence at protein level
- Annotation score5/5
Function
function
Transcription factor required for zygotic genome activation (ZGA), a critical event in early embryonic development during which the developmental control passes from maternally provided mRNAs to the expression of the zygotic genome after fertilization (PubMed:18931655, PubMed:20599892, PubMed:22028662, PubMed:22028675, PubMed:24637116, PubMed:24909324, PubMed:25538246, PubMed:26335633, PubMed:26335634, PubMed:29261646, PubMed:30518940, PubMed:30797686, PubMed:30982648, PubMed:34342574, PubMed:35363606, PubMed:37557175).
Binds to regulatory DNA sequences containing a 5'-CAGGTAG-3' sequence motif, which are highly enriched among developmental enhancers (PubMed:18931655, PubMed:22028662, PubMed:22028675, PubMed:35363606, PubMed:37557175).
Within 1 hour into development, or by the embryo's 8th nuclear cycle, binds the majority of its motifs genome-wide (PubMed:22028662, PubMed:22028675).
Zelda-binding promotes nucleosome depletion and chromatin accessibility, thereby facilitating the binding of patterning transcription factors, including the binding of the dorsoventral patterning transcription factors dorsal (dl) and twist (twi), and the anteroposterior patterning transcription factors bicoid (bcd) and caudal (cad) (PubMed:24637116, PubMed:24909324, PubMed:26335633, PubMed:26335634, PubMed:30518940, PubMed:30797686, PubMed:30982648, PubMed:34342574, PubMed:35363606, PubMed:37557175).
Promotes the activity of patterning transcription factors, such as bcd and dl, by lowering the concentration threshold required for transcriptional activation (PubMed:24637116, PubMed:35363606).
Required both for the earliest (minor) and major waves of transcription during ZGA (PubMed:22028662).
Also involved in maternal mRNA clearance during the maternal-to-zygote transition by promoting expression of microRNAs (miRNAs), such as miR-1, miR-9a and miR-309, which mediate degradation of maternally-loaded RNAs (PubMed:18931655, PubMed:24764079).
Also involved in post-blastoderm development: nvolved in nervous system development by maintaining neuroblasts in an undifferentiated state and equired for wing disk development (PubMed:23891688, PubMed:34887421).
Binds to regulatory DNA sequences containing a 5'-CAGGTAG-3' sequence motif, which are highly enriched among developmental enhancers (PubMed:18931655, PubMed:22028662, PubMed:22028675, PubMed:35363606, PubMed:37557175).
Within 1 hour into development, or by the embryo's 8th nuclear cycle, binds the majority of its motifs genome-wide (PubMed:22028662, PubMed:22028675).
Zelda-binding promotes nucleosome depletion and chromatin accessibility, thereby facilitating the binding of patterning transcription factors, including the binding of the dorsoventral patterning transcription factors dorsal (dl) and twist (twi), and the anteroposterior patterning transcription factors bicoid (bcd) and caudal (cad) (PubMed:24637116, PubMed:24909324, PubMed:26335633, PubMed:26335634, PubMed:30518940, PubMed:30797686, PubMed:30982648, PubMed:34342574, PubMed:35363606, PubMed:37557175).
Promotes the activity of patterning transcription factors, such as bcd and dl, by lowering the concentration threshold required for transcriptional activation (PubMed:24637116, PubMed:35363606).
Required both for the earliest (minor) and major waves of transcription during ZGA (PubMed:22028662).
Also involved in maternal mRNA clearance during the maternal-to-zygote transition by promoting expression of microRNAs (miRNAs), such as miR-1, miR-9a and miR-309, which mediate degradation of maternally-loaded RNAs (PubMed:18931655, PubMed:24764079).
Also involved in post-blastoderm development: nvolved in nervous system development by maintaining neuroblasts in an undifferentiated state and equired for wing disk development (PubMed:23891688, PubMed:34887421).
Isoform A
Constitutes the main isoform expressed throughout development (PubMed:29261646).
Transcription factor required for zygotic genome activation (ZGA) (PubMed:29261646).
Transcription factor required for zygotic genome activation (ZGA) (PubMed:29261646).
Isoform D
Acts as a dominant negative inhibitor of transcription factor activity of isoform A.
Isoform F
Acts as a dominant negative inhibitor of transcription factor activity of isoform A.
Miscellaneous
'Vielfaeltig' means 'versatile' or 'manifold' in German.
GO annotations
Keywords
- Molecular function
- Biological process
- Ligand
Enzyme and pathway databases
Names & Taxonomy
Protein names
- Recommended nameTranscription factor Zelda
- Alternative names
Gene names
Organism names
- Strain
- Taxonomic lineageEukaryota > Metazoa > Ecdysozoa > Arthropoda > Hexapoda > Insecta > Pterygota > Neoptera > Endopterygota > Diptera > Brachycera > Muscomorpha > Ephydroidea > Drosophilidae > Drosophila > Sophophora
Accessions
- Primary accessionQ9VWC6
- Secondary accessions
Proteomes
Organism-specific databases
Subcellular Location
Phenotypes & Variants
Disruption phenotype
Embryonic lethality: embryos are defective in cellular blastoderm formation and fail to activate genes essential for cellularization, sex determination and pattern formation (PubMed:18931655).
Cells display nucleosome occupancy at dorsal (dl)-bound regions (PubMed:26335633).
Cells display nucleosome occupancy at dorsal (dl)-bound regions (PubMed:26335633).
Features
Showing features for mutagenesis.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Mutagenesis | 554 | Hyperactive mutant allele, leading to increased expression of target genes. | ||||
Sequence: C → S | ||||||
Mutagenesis | 554-557 | Impaired transcription factor activity due to reduced DNA-binding. | ||||
Sequence: CTAC → STAS | ||||||
Mutagenesis | 561-578 | In Zld(JAZ) allele; female knockin flies are sterile due to embryonic lethality. | ||||
Sequence: FTSSGHLKRHYNTTLHKN → ATASGHLKRHANTTLHKA | ||||||
Mutagenesis | 1328-1331 | Impaired transcription factor activity due to reduced DNA-binding. | ||||
Sequence: CLEC → SLES | ||||||
Mutagenesis | 1386-1389 | Impaired transcription factor activity due to reduced DNA-binding. | ||||
Sequence: CELC → SELS | ||||||
Mutagenesis | 1415-1418 | Impaired transcription factor activity due to reduced DNA-binding. | ||||
Sequence: CDIC → SDIS |
PTM/Processing
Features
Showing features for chain.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Chain | PRO_0000459219 | 1-1596 | Transcription factor Zelda | |||
Sequence: MTSIKTEMPPLHAAEALASSSATDSGGGGAGGGGGGGGGGSGGPGAGGTGGVGSAPATPNATISAAADSSDNQPGTPQPTQQQQSTQQQLQQPQSQQQQQQAMGGGDPQQQQQQQQVTGITHQPYATHHMYSASGGQQQQQQQIYGGLYGGDMQQQQGYASSYINSYEQFYQQQQQQQQGTDYAFGTVGVDYGKSAGVRYHPYLQTPTSGLGGIPTASSAQEEAGSAPSAVSTTTAVAMSPRVVSSSSPTSTSSHLQLGSSSGQTPGSPGGAAGSAGCKLQCKKCGILTTNESELQEHIANVHGESPYGSSGYASSPYIKEEMPTPQPPGVGSATANPGELLDLDSQKMVYHQQLLQQQQQQHDVVAGLPLGALPDPLHSMQSMQQRALHSWEQQPQQTVASVEGLPPYMQQGLGVGLGVGVDKSPYYSPKQSPYHQSTGVGGATLIKQEYGGHGLIKSEYPDSQHYVDKSFDPTAGGAGGAELCASVATSPAEFPSTTTGGPGQEGAAGAAPGGGYRGFEPPSSSSVLPANSLTAKAATWKSNEARRPKTYNCTACNKWFTSSGHLKRHYNTTLHKNAVKSSGQPDPATLPISAHHHPARDPVTKPSRRGNAAAAAAAAAAAASASGQGQQQQPPIPPPPANVPPPEPPRSPPDYGGGGGLGVGAMGGAAMSQYSASPSPTQQQQHHLNHHQQQANGYANGTANGYGYMQQQVQSTTNASPQHASNNNSNNQQQQQQQQQHHQQQMPQHHNSVLNGHPNGLAGPSAPHNNNTTQMPSSQMRGLLNETTTTPPPTTTTRAPQITTTATTTTTAATTVAIKSEQMEDSNHTHTHTHTHPNHSHSQDRSHSSSSSSSMATEEAEEQELRDQEQADDHLHQHQQASQQYLLSARHYHSSTPNTLSSSNTNPSTPSSNSPHTIYRQEQQGTDFSRTTPPPQPLPPMGMLPPMAMDYNMLALDMPMPMPTLMHSNMLQCSSTSTTPLATTITTSMPDTMQPPQQQLVHHYQAVLHPLHQQLGEQHQRQEADHHQQQRELHQLDQQQQQQQALILADSLPHSSSSPTSSSPPPTMPMPLTTITAPQLLPLQPPPPHITSTMPMPPTMHMPIMPPPPQCYQQLQPLDPTMSYHTIIGSGPEAHTGTAGGGYSNQITTSDGQILQLMPTSLFAPYAPLSPYSVAAQRSPQEGDLPPVHTLTTALHAHQQGGQQEAQTPTLTVLSTPYSPTVSSSRATPALEMDMATLMQHHQDYEMEQYQMQHQQLDQMQQHQQQLDHQQQQQILADQTQTMAQQPLAKKRRGGNATPSTTKRRRNSSVGSTSPHSTTLPSGRIKCLECDKEFTKNCYLTQHNKSFHSGEYPFRCQKCGKRFQSEDVYTTHLGRHRTQDKPHKCELCPKQFHHKTDLRRHVEAIHTGLKQHMCDICEKGFCRKDHLRKHLETHNRPRVVGKKSAAAAAAAAAAAAVTATAGLGGAPAAGSIKAAFARSLTVTAVSSEPGAGVAIPSAPATATCAPVSVSASALVQAVNPSLTSGLSLKRHIDDVAVDDDSLLEEDEDEMEMEMEMDMLEEEDEEEEELGDHHMIIKSEYVQEEFQMIEKSIELY |
Proteomic databases
Expression
Tissue specificity
Isoform A
Zygotically expressed in the developing embryonic germ layers, nervous system, imaginal disk primordia and in larval wing and eye disks.
Detected in the germline cells of the ovary, in unfertilized eggs and throughout early development (PubMed:16525017, PubMed:18931655).
Later, it becomes mostly restricted to the nervous system and specific head regions (PubMed:18931655).
Also expressed in imaginal wing disks in third instar larvae (PubMed:23891688).
Later, it becomes mostly restricted to the nervous system and specific head regions (PubMed:18931655).
Also expressed in imaginal wing disks in third instar larvae (PubMed:23891688).
Developmental stage
Expressed both maternally and zygotically.
Gene expression databases
Structure
Family & Domains
Features
Showing features for region, compositional bias, zinc finger.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Region | 1-143 | Disordered | ||||
Sequence: MTSIKTEMPPLHAAEALASSSATDSGGGGAGGGGGGGGGGSGGPGAGGTGGVGSAPATPNATISAAADSSDNQPGTPQPTQQQQSTQQQLQQPQSQQQQQQAMGGGDPQQQQQQQQVTGITHQPYATHHMYSASGGQQQQQQQ | ||||||
Compositional bias | 55-143 | Polar residues | ||||
Sequence: APATPNATISAAADSSDNQPGTPQPTQQQQSTQQQLQQPQSQQQQQQAMGGGDPQQQQQQQQVTGITHQPYATHHMYSASGGQQQQQQQ | ||||||
Compositional bias | 209-269 | Polar residues | ||||
Sequence: SGLGGIPTASSAQEEAGSAPSAVSTTTAVAMSPRVVSSSSPTSTSSHLQLGSSSGQTPGSP | ||||||
Region | 209-273 | Disordered | ||||
Sequence: SGLGGIPTASSAQEEAGSAPSAVSTTTAVAMSPRVVSSSSPTSTSSHLQLGSSSGQTPGSPGGAA | ||||||
Region | 490-530 | Disordered | ||||
Sequence: TSPAEFPSTTTGGPGQEGAAGAAPGGGYRGFEPPSSSSVLP | ||||||
Zinc finger | 552-576 | C2H2-type 1 | ||||
Sequence: YNCTACNKWFTSSGHLKRHYNTTLH | ||||||
Region | 578-813 | Disordered | ||||
Sequence: NAVKSSGQPDPATLPISAHHHPARDPVTKPSRRGNAAAAAAAAAAAASASGQGQQQQPPIPPPPANVPPPEPPRSPPDYGGGGGLGVGAMGGAAMSQYSASPSPTQQQQHHLNHHQQQANGYANGTANGYGYMQQQVQSTTNASPQHASNNNSNNQQQQQQQQQHHQQQMPQHHNSVLNGHPNGLAGPSAPHNNNTTQMPSSQMRGLLNETTTTPPPTTTTRAPQITTTATTTTTA | ||||||
Compositional bias | 633-654 | Pro residues | ||||
Sequence: QQPPIPPPPANVPPPEPPRSPP | ||||||
Compositional bias | 673-756 | Polar residues | ||||
Sequence: SQYSASPSPTQQQQHHLNHHQQQANGYANGTANGYGYMQQQVQSTTNASPQHASNNNSNNQQQQQQQQQHHQQQMPQHHNSVLN | ||||||
Compositional bias | 765-813 | Polar residues | ||||
Sequence: PSAPHNNNTTQMPSSQMRGLLNETTTTPPPTTTTRAPQITTTATTTTTA | ||||||
Compositional bias | 825-844 | Basic and acidic residues | ||||
Sequence: EDSNHTHTHTHTHPNHSHSQ | ||||||
Region | 825-945 | Disordered | ||||
Sequence: EDSNHTHTHTHTHPNHSHSQDRSHSSSSSSSMATEEAEEQELRDQEQADDHLHQHQQASQQYLLSARHYHSSTPNTLSSSNTNPSTPSSNSPHTIYRQEQQGTDFSRTTPPPQPLPPMGML | ||||||
Compositional bias | 845-859 | Polar residues | ||||
Sequence: DRSHSSSSSSSMATE | ||||||
Compositional bias | 878-932 | Polar residues | ||||
Sequence: QHQQASQQYLLSARHYHSSTPNTLSSSNTNPSTPSSNSPHTIYRQEQQGTDFSRT | ||||||
Region | 904-1297 | Transactivation activation domain (TAD) | ||||
Sequence: SNTNPSTPSSNSPHTIYRQEQQGTDFSRTTPPPQPLPPMGMLPPMAMDYNMLALDMPMPMPTLMHSNMLQCSSTSTTPLATTITTSMPDTMQPPQQQLVHHYQAVLHPLHQQLGEQHQRQEADHHQQQRELHQLDQQQQQQQALILADSLPHSSSSPTSSSPPPTMPMPLTTITAPQLLPLQPPPPHITSTMPMPPTMHMPIMPPPPQCYQQLQPLDPTMSYHTIIGSGPEAHTGTAGGGYSNQITTSDGQILQLMPTSLFAPYAPLSPYSVAAQRSPQEGDLPPVHTLTTALHAHQQGGQQEAQTPTLTVLSTPYSPTVSSSRATPALEMDMATLMQHHQDYEMEQYQMQHQQLDQMQQHQQQLDHQQQQQILADQTQTMAQQPLAKKRRGGN | ||||||
Compositional bias | 1017-1034 | Basic and acidic residues | ||||
Sequence: GEQHQRQEADHHQQQREL | ||||||
Region | 1017-1074 | Disordered | ||||
Sequence: GEQHQRQEADHHQQQRELHQLDQQQQQQQALILADSLPHSSSSPTSSSPPPTMPMPLT | ||||||
Compositional bias | 1035-1059 | Polar residues | ||||
Sequence: HQLDQQQQQQQALILADSLPHSSSS | ||||||
Compositional bias | 1252-1288 | Polar residues | ||||
Sequence: QMQHQQLDQMQQHQQQLDHQQQQQILADQTQTMAQQP | ||||||
Region | 1252-1322 | Disordered | ||||
Sequence: QMQHQQLDQMQQHQQQLDHQQQQQILADQTQTMAQQPLAKKRRGGNATPSTTKRRRNSSVGSTSPHSTTLP | ||||||
Compositional bias | 1301-1322 | Polar residues | ||||
Sequence: STTKRRRNSSVGSTSPHSTTLP | ||||||
Zinc finger | 1326-1349 | C2H2-type 2 | ||||
Sequence: IKCLECDKEFTKNCYLTQHNKSFH | ||||||
Zinc finger | 1355-1378 | C2H2-type 3; degenerate | ||||
Sequence: FRCQKCGKRFQSEDVYTTHLGRHR | ||||||
Zinc finger | 1384-1407 | C2H2-type 4 | ||||
Sequence: HKCELCPKQFHHKTDLRRHVEAIH | ||||||
Zinc finger | 1413-1435 | C2H2-type 5 | ||||
Sequence: HMCDICEKGFCRKDHLRKHLETH |
Domain
C2H2-type zinc fingers mediate DNA-binding (PubMed:25538246).
C2H2-type zinc finger 2 serves as an inhibitory domain, and is essential for reprogramming the early embryonic genome, but not for its functions later in development (PubMed:29261646).
C2H2-type zinc finger 2 serves as an inhibitory domain, and is essential for reprogramming the early embryonic genome, but not for its functions later in development (PubMed:29261646).
Keywords
- Domain
Phylogenomic databases
Family and domain databases
Sequence & Isoforms
- Sequence statusComplete
This entry describes 3 isoforms produced by Alternative splicing.
Q9VWC6-1
This isoform has been chosen as the canonical sequence. All positional information in this entry refers to it. This is also the sequence that appears in the downloadable versions of the entry.
- NameA
- SynonymsB, G, ZLD-PA
- Length1,596
- Mass (Da)170,323
- Last updated2000-05-01 v1
- Checksum30C50B924B02CC8D
Q9VWC6-2
- NameD
- SynonymsZLD-PD
- Differences from canonical
- 1351-1373: GEYPFRCQKCGKRFQSEDVYTTH → ASRRQNKSNPSMYPANAQRRIKI
- 1374-1596: Missing
Q9VWC6-3
- NameF
- SynonymsZLD-PF
- Differences from canonical
- 1351-1367: GEYPFRCQKCGKRFQSE → VSLPYVCVKRLKDSQNS
- 1368-1596: Missing
Features
Showing features for compositional bias, alternative sequence.
Type | ID | Position(s) | Description | |||
---|---|---|---|---|---|---|
Compositional bias | 55-143 | Polar residues | ||||
Sequence: APATPNATISAAADSSDNQPGTPQPTQQQQSTQQQLQQPQSQQQQQQAMGGGDPQQQQQQQQVTGITHQPYATHHMYSASGGQQQQQQQ | ||||||
Compositional bias | 209-269 | Polar residues | ||||
Sequence: SGLGGIPTASSAQEEAGSAPSAVSTTTAVAMSPRVVSSSSPTSTSSHLQLGSSSGQTPGSP | ||||||
Compositional bias | 633-654 | Pro residues | ||||
Sequence: QQPPIPPPPANVPPPEPPRSPP | ||||||
Compositional bias | 673-756 | Polar residues | ||||
Sequence: SQYSASPSPTQQQQHHLNHHQQQANGYANGTANGYGYMQQQVQSTTNASPQHASNNNSNNQQQQQQQQQHHQQQMPQHHNSVLN | ||||||
Compositional bias | 765-813 | Polar residues | ||||
Sequence: PSAPHNNNTTQMPSSQMRGLLNETTTTPPPTTTTRAPQITTTATTTTTA | ||||||
Compositional bias | 825-844 | Basic and acidic residues | ||||
Sequence: EDSNHTHTHTHTHPNHSHSQ | ||||||
Compositional bias | 845-859 | Polar residues | ||||
Sequence: DRSHSSSSSSSMATE | ||||||
Compositional bias | 878-932 | Polar residues | ||||
Sequence: QHQQASQQYLLSARHYHSSTPNTLSSSNTNPSTPSSNSPHTIYRQEQQGTDFSRT | ||||||
Compositional bias | 1017-1034 | Basic and acidic residues | ||||
Sequence: GEQHQRQEADHHQQQREL | ||||||
Compositional bias | 1035-1059 | Polar residues | ||||
Sequence: HQLDQQQQQQQALILADSLPHSSSS | ||||||
Compositional bias | 1252-1288 | Polar residues | ||||
Sequence: QMQHQQLDQMQQHQQQLDHQQQQQILADQTQTMAQQP | ||||||
Compositional bias | 1301-1322 | Polar residues | ||||
Sequence: STTKRRRNSSVGSTSPHSTTLP | ||||||
Alternative sequence | VSP_062075 | 1351-1367 | in isoform F | |||
Sequence: GEYPFRCQKCGKRFQSE → VSLPYVCVKRLKDSQNS | ||||||
Alternative sequence | VSP_062074 | 1351-1373 | in isoform D | |||
Sequence: GEYPFRCQKCGKRFQSEDVYTTH → ASRRQNKSNPSMYPANAQRRIKI | ||||||
Alternative sequence | VSP_062076 | 1368-1596 | in isoform F | |||
Sequence: Missing | ||||||
Alternative sequence | VSP_062077 | 1374-1596 | in isoform D | |||
Sequence: Missing |
Keywords
- Coding sequence diversity
- Technical term
Sequence databases
Nucleotide Sequence | Protein Sequence | Molecule Type | Status | |
---|---|---|---|---|
AE014298 EMBL· GenBank· DDBJ | AAF49020.1 EMBL· GenBank· DDBJ | Genomic DNA | ||
AE014298 EMBL· GenBank· DDBJ | AAN09519.1 EMBL· GenBank· DDBJ | Genomic DNA | ||
AE014298 EMBL· GenBank· DDBJ | AFH07480.1 EMBL· GenBank· DDBJ | Genomic DNA | ||
AE014298 EMBL· GenBank· DDBJ | AFH07481.1 EMBL· GenBank· DDBJ | Genomic DNA | ||
AE014298 EMBL· GenBank· DDBJ | AHN59936.1 EMBL· GenBank· DDBJ | Genomic DNA | ||
BT132946 EMBL· GenBank· DDBJ | AEW31265.1 EMBL· GenBank· DDBJ | mRNA |