Skip to main content

Table 1 Eight select cases of similarity-based mis-assignment

From: Annotation inconsistencies beyond sequence similarity-based function prediction – phylogeny and genome structure

#

GI #

Accession #

Description

Species

1

19698819

gb|AAL91145.1

putative protein {Nup85}

Arabidopsis thaliana

2

7573329

emb|CAB87799.1

putative protein {Sec16}

Arabidopsis thaliana

3

296819643

ref|XP_002849880.1

protein kinase domain-containing protein {+Nic96}

Arthroderma otae CBS 113480

4

557867390

gb|ESS70565.1

unspecified product {Sec16}

Trypanosoma cruzi Dm28c

5

316978722

gb|EFV61666.1

putative ATP synthase F1, delta subunit {Nup98-96}

Trichinella spiralis

6

308809856

ref|XP_003082237.1

ATP-dependent RNA helicase (ISS) {Sec16}

Ostreococcus tauri

7

255574074

ref|XP_002527953.1

nucleotide binding protein, putative {Sec16}

Ricinus communis

8

443916862

gb|ELU37796.1

DUF1479 domain-containing protein {+Nup85}

Rhizoctonia solani AG-1 IA

  1. Column names: #: case number, GI#: gene identifier number, Accession#: database and accession number, Description: description line, Species: species name (and strain type where available). In curly brackets within the Description field, we list the corresponding protein domains (Nup85, Nup98-96, Nic96 nucleoporins – and ancestral coatomer element 1 Sec16 (ACE1-Sec16-like); + sign: partially correct annotation, missing the domain indicated, two cases)