Plant protein annotation project
Statistics
UniProt release 2019_10 - Nov-13, 2019 contains a total of 561,356 reviewed entries, which includes 40,171 entries from 2,013 species of Viridiplantae.
Arabidospis thaliana - 15,911 reviewed entries.
Number of canonical and isoform protein sequences: organism:3702 reviewed:yes (download data in FASTA format)
Evidence for the existence of protein | Percentage of entries |
---|---|
at protein level | 43.6% |
at transcript level | 42.5% |
inferred from homology | 10.2% |
predicted | 3.1% |
uncertain | 0.7% |
Annotation categories | Entries with | Number of annotations | Coverage |
---|---|---|---|
PubMed citations | 15,911 | 13,296 unique | 100% |
Alternative products | 3,161 | 0 | 19.9% |
General annotation | 15,336 | 71,705 | 96.4% |
Function | 11,143 | 11,257 | 70% |
Catalytic activity | 4,326 | 5,742 | 27.2% |
Subcellular location | 11,990 | 25,261 | N/A |
Sequence annotation | 15,309 | 144,982 | 96.2% |
Amino acid modifications | 4,984 | 18,971 | 31.3% |
Natural Variant | 129 | 817 | 0.8% |
Cross-references | 15,911 | 0 | 100% |
EMBL | 15,881 | 0 | 99.8% |
InterPro | 15,406 | 0 | 96.8% |
PDB | 668 | 0 | 4.2% |
RefSeq | 15,589 | 0 | 98% |
TAIR | 14,659 | 0 | 92.1% |
100% of reviewed Arabidopsis thaliana entries are annotated with at least one keyword .
94.6% of reviewed Arabidopsis thaliana entries are annotated with at least one GO (Gene Ontology ) term.
Oryza sativa subsp. japonica (Rice) - 4,067 reviewed entries.
Number of canonical and isoform protein sequences: organism:39947 reviewed:yes (download data in FASTA format)
Evidence for the existence of protein | Percentage of entries |
---|---|
at protein level | 19.5% |
at transcript level | 63.6% |
inferred from homology | 16.3% |
predicted | 0.3% |
uncertain | 0.3% |
Annotation categories | Entries with | Number of annotations | Coverage |
---|---|---|---|
PubMed citations | 4,067 | 1,875 unique | 100% |
Alternative products | 295 | 0 | 7.3% |
General annotation | 3,993 | 17,889 | 98.2% |
Function | 3,225 | 3,252 | 79.3% |
Catalytic activity | 1,399 | 1,877 | 34.4% |
Subcellular location | 2,893 | uniprot:(organism:39947 reviewed:yes) | 71.1% |
Sequence annotation | 3,830 | 31,865 | 94.2% |
Amino acid modifications | 820 | 2,628 | 20.2% |
Natural Variant | 9 | 24 | 0.2% |
Cross-references | 4,067 | 0 | 100% |
EMBL | 4,067 | 0 | 100% |
InterPro | 4,038 | 0 | 99.3% |
PDB | 73 | 0 | 1.8% |
RefSeq | 3,538 | 0 | 87% |
100% of reviewed Oryza sativa subsp. japonica entries are annotated with at least one keyword .
98.6% of reviewed Oryza sativa subsp. japonica entries are annotated with at least one GO (Gene Ontology ) term.
Oryza sativa subsp. indica (Rice) - 844 reviewed entries.
Number of canonical and isoform protein sequences: organism:39946 reviewed:yes (download data in FASTA format)
Evidence for the existence of protein | Percentage of entries |
---|---|
at protein level | 8.9% |
at transcript level | 27.3% |
inferred from homology | 62.4% |
predicted | 1.3% |
uncertain | 0.1% |
Annotation categories | Entries with | Number of annotations | Coverage |
---|---|---|---|
PubMed citations | 844 | 261 unique | 100% |
Alternative products | 8 | 0 | 0.9% |
General annotation | 839 | 3,095 | 99.4% |
Function | 717 | 728 | 85% |
Catalytic activity | 256 | 338 | 30.3% |
Subcellular location | 677 | 1,317 | 80.2% |
Sequence annotation | 786 | 5,893 | 93.1% |
Amino acid modifications | 151 | 448 | 17.9% |
Natural Variant | 5 | 8 | 0.6% |
Cross-references | 844 | 0 | 100% |
EMBL | 844 | 0 | 100% |
InterPro | 828 | 0 | 98.1% |
PDB | 4 | 0 | 0.5% |
RefSeq | 71 | 0 | 8.4% |
99.9% of reviewed Oryza sativa subsp. indica entries are annotated with at least one keyword .
97.7% of reviewed Oryza sativa subsp. indica entries are annotated with at least one GO (Gene Ontology ) term.