Skip to main content

Table 5. Core proteins added to RefSeq genomes1

From: Solving the Problem: Genome Annotation Standards before the Data Deluge

Protein2 Number of additions3 Avg. Length4
30S ribosomal protein S8 1 131.4+/−2.1
30S ribosomal protein S11 1 130.1+/−5.8
30S ribosomal protein S14 10 84.1+/−19.3
30S ribosomal protein S15 3 94.1+/−17.1
30S ribosomal protein S19 9 96.1+/−15.0
50S ribosomal protein L2 1 273.8+/−10.2
50S ribosomal protein L11 1 144.4+/7.0
50S ribosomal protein L23 2 99.2+/−10.3
50S ribosomal protein L29 7 68.2+/−9.8
elongation factor P 1 185.4+/−16.9
flap-1 endonuclease 2 832.6+/−204.1
translation initiation factor IF-1 4 77.3+/−11.1
  1. 1. Search for protein and nucleotide against RefSeq genomes (Aug. 10, 2010) identified cases where gene/protein were not present as either normal or non-functional. In those cases, a new gene/CDS/protein was added to the RefSeq record.
  2. 2. Protein name/functional name.
  3. 3. Number of proteins added for each category, in some cases multiple additions to the same genome.
  4. 4. The average protein length and standard deviation of lengths for all proteins from all clusters for each functional group. In some cases there are multiple protein clusters for one functional group.