Annotation

Testing the performance of automated annotation of ESTs with the Kegg Orthology (KO) database demonstrates lack of completeness of clusters

G. R. Fernandes, Mudado, M. A., and Ortega, J. M., Testing the performance of automated annotation of ESTs with the Kegg Orthology (KO) database demonstrates lack of completeness of clusters, vol. 7, pp. 948-957, 2008.

The KEGG Orthology (KO) database was tested as a source for automated annotation of expressed sequence tags (ESTs). We used a control experiment where every EST was assigned to its cognate protein, and an annotation experiment where the ESTs were annotated by proteins from other organisms. Analyzing the results, we could assign classes to the annotation: correct, changed and speculated. The correct annotation ranged from 57 (Caenorhabditis elegans) to 81% (Homo sapiens). In spite of the changed annotation being low (1 in H.

A picture of gene sampling/expression in model organisms using ESTs and KOG proteins

Mde Alvaren Mudado and Ortega, J. Miguel, A picture of gene sampling/expression in model organisms using ESTs and KOG proteins, vol. 5, pp. 242-253, 2006.

The expressed sequence tag (EST) is an instrument of gene discovery. When available in large numbers, ESTs may be used to estimate gene expression. We analyzed gene expression by EST sampling, using the KOG database, which includes 24,154 proteins from Arabidopsis thaliana (Ath), 17,101 from Caenorhabditis elegans (Cel), 10,517 from Drosophila melanogaster (Dme), and 26,324 from Homo sapiens (Hsa), and 178,538 ESTs for Ath, 215,200 for Cel, 261,404 for Dme, and 1,941,556 for Hsa.

Subscribe to Annotation