BEACON: automated tool for Bacterial GEnome Annotation ComparisON
Type
ArticleKAUST Department
Computational Bioscience Research Center (CBRC)Computer Science Program
Applied Mathematics and Computational Science Program
Computer, Electrical and Mathematical Sciences and Engineering (CEMSE) Division
Date
2015-08-18Online Publication Date
2015-08-18Print Publication Date
2015-12Permanent link to this record
http://hdl.handle.net/10754/575255
Metadata
Show full item recordAbstract
Background Genome annotation is one way of summarizing the existing knowledge about genomic characteristics of an organism. There has been an increased interest during the last several decades in computer-based structural and functional genome annotation. Many methods for this purpose have been developed for eukaryotes and prokaryotes. Our study focuses on comparison of functional annotations of prokaryotic genomes. To the best of our knowledge there is no fully automated system for detailed comparison of functional genome annotations generated by different annotation methods (AMs). Results The presence of many AMs and development of new ones introduce needs to: a/ compare different annotations for a single genome, and b/ generate annotation by combining individual ones. To address these issues we developed an Automated Tool for Bacterial GEnome Annotation ComparisON (BEACON) that benefits both AM developers and annotation analysers. BEACON provides detailed comparison of gene function annotations of prokaryotic genomes obtained by different AMs and generates extended annotations through combination of individual ones. For the illustration of BEACON’s utility, we provide a comparison analysis of multiple different annotations generated for four genomes and show on these examples that the extended annotation can increase the number of genes annotated by putative functions up to 27 %, while the number of genes without any function assignment is reduced. Conclusions We developed BEACON, a fast tool for an automated and a systematic comparison of different annotations of single genomes. The extended annotation assigns putative functions to many genes with unknown functions. BEACON is available under GNU General Public License version 3.0 and is accessible at: http://www.cbrc.kaust.edu.sa/BEACON/Citation
BEACON: automated tool for Bacterial GEnome Annotation ComparisON 2015, 16 (1) BMC GenomicsPublisher
Springer NatureJournal
BMC GenomicsPubMed ID
26283419Additional Links
http://www.biomedcentral.com/1471-2164/16/616Relations
Is Supplemented By:- [Dataset]
Kalkatawi, M., Intikhab Alam, & Bajic, V. (2015). BEACON: automated tool for Bacterial GEnome Annotation ComparisON. Figshare. https://doi.org/10.6084/m9.figshare.c.3616301. DOI: 10.6084/m9.figshare.c.3616301 HANDLE: 10754/624132
ae974a485f413a2113503eed53cd6c53
10.1186/s12864-015-1826-4
Scopus Count
Related articles
- GASS: genome structural annotation for Eukaryotes based on species similarity.
- Authors: Wang Y, Chen L, Song N, Lei X
- Issue date: 2015 Mar 4
- PANNOTATOR: an automated tool for annotation of pan-genomes.
- Authors: Santos AR, Barbosa E, Fiaux K, Zurita-Turk M, Chaitankar V, Kamapantula B, Abdelzaher A, Ghosh P, Tiwari S, Barve N, Jain N, Barh D, Silva A, Miyoshi A, Azevedo V
- Issue date: 2013 Aug 16
- Genome Annotation Transfer Utility (GATU): rapid annotation of viral genomes using a closely related reference genome.
- Authors: Tcherepanov V, Ehlers A, Upton C
- Issue date: 2006 Jun 13
- ParsEval: parallel comparison and analysis of gene structure annotations.
- Authors: Standage DS, Brendel VP
- Issue date: 2012 Aug 1
- MyPro: A seamless pipeline for automated prokaryotic genome assembly and annotation.
- Authors: Liao YC, Lin HH, Sabharwal A, Haase EM, Scannapieco FA
- Issue date: 2015 Jun
Related items
Showing items related by title, author, creator and subject.
-
This is a genome assembly third party annotation of Toxoplasma gondii VEG strain based on strand-specific RNA-sequencing and manual re-annotation identifying novel features of UTRs and non-coding transcripts.Ramaprasad, Abhinay; Mourier, Tobias; Naeem, Raeece; Moussa, Ehab; Vermont, Sarah J.; Otto, Thomas D.; Wastling, Jonathan; Pain, Arnab; Malas, Tareq Majed Yasin; Panigrahi, Aswini Kumar (NCBI, 2015-02-26) [Bioproject, Dataset]Toxoplasma gondii is an important protozoan parasite that infects all warm-blooded animals and causes opportunistic infections in immuno-compromised humans. Its closest relative, Neospora caninum, is an important veterinary pathogen that causes spontaneous abortion in livestock. Comparative genomics of these two closely related coccidians has been of particular interest to identify genes that contribute to varied host cell specificity and disease. Automated gene prediction tools that were used for gene annotation can lead to inaccurate gene models and lack information on untranslated regions and non-coding transcripts. Here, we describe a manual re-annotation of these genomes based on strand-specific RNA sequencing and shotgun proteomics. We have corrected the structures of over one third of the gene models and have annotated the complete set of untranslated regions (UTRs). We observe distinctly long UTRs in both the ?organisms??, almost four times longer than other eukaryotes?. We have also identified a putative set of cis-natural antisense transcripts (cis-NATs) and long intergenic non-coding RNAs (lincRNAs). With these, we have significantly improved the quality of annotation in the genomes to serve as a manually curated base for future research on these organisms.
-
This is a genome assembly third party annotation of Neospora caninum LIV strain based on strand-specific RNA-sequencing and manual re-annotation identifying novel features of UTRs and non-coding transcripts.Ramaprasad, Abhinay; Mourier, Tobias; Naeem, Raeece; Moussa, Ehab; Vermont, Sarah J.; Otto, Thomas D.; Wastling, Jonathan; Pain, Arnab; Malas, Tareq Majed Yasin; Panigrahi, Aswini Kumar (NCBI, 2015-02-26) [Bioproject, Dataset]Toxoplasma gondii is an important protozoan parasite that infects all warm-blooded animals and causes opportunistic infections in immuno-compromised humans. Its closest relative, Neospora caninum, is an important veterinary pathogen that causes spontaneous abortion in livestock. Comparative genomics of these two closely related coccidians has been of particular interest to identify genes that contribute to varied host cell specificity and disease. Automated gene prediction tools that were used for gene annotation can lead to inaccurate gene models and lack information on untranslated regions and non-coding transcripts. Here, we describe a manual re-annotation of these genomes based on strand-specific RNA sequencing and shotgun proteomics. We have corrected the structures of over one third of the gene models and have annotated the complete set of untranslated regions (UTRs). We observe distinctly long UTRs in both the ?organisms??, almost four times longer than other eukaryotes?. We have also identified a putative set of cis-natural antisense transcripts (cis-NATs) and long intergenic non-coding RNAs (lincRNAs). With these, we have significantly improved the quality of annotation in the genomes to serve as a manually curated base for future research on these organisms.
-
The NLR-Annotator tool enables annotation of the intracellular immune receptor repertoireSteuernagel, Burkhard; Witek, Kamil; Krattinger, Simon G.; Ramirez-Gonzalez, Ricardo H.; Schoonbeek, Henk-jan; Yu, Guotai; Baggs, Erin; Witek, Agnieszka; Yadav, Inderjit; Krasileva, Ksenia V; Jones, Jonathan D; Uauy, Cristobal; Keller, Beat; Ridout, Christopher James; Wulff, Brande B (Plant Physiology, American Society of Plant Biologists (ASPB), 2020-03-17) [Article]Disease resistance genes encoding nucleotide-binding and leucine-rich repeat (NLR) intracellular immune receptor proteins detect pathogens by the presence of pathogen effectors. Plant genomes typically contain hundreds of NLR-encoding genes. The availability of the hexaploid wheat (Triticum aestivum) cultivar Chinese Spring reference genome allows a detailed study of its NLR complement. However, low NLR expression and high intra-family sequence homology hinders their accurate annotation. Here we developed NLR-Annotator, a software tool for in silico NLR identification independent of transcript support. Although developed for wheat, we demonstrate the universal applicability of NLR-Annotator across diverse plant taxa. We applied our tool to wheat and combined it with a transcript-validated subset of genes from the reference gene annotation to characterize the structure, phylogeny and expression profile of the NLR gene family. We detected 3,400 full-length NLR loci of which 1,560 were confirmed as expressed genes with intact open reading frames. NLRs with integrated domains mostly group in specific subclades. Members of another subclade predominantly locate in close physical proximity to NLRs carrying integrated domains, suggesting a paired helper-function. Most NLRs (88%) display low basal expression (in the lower 10 percentile of transcripts). In young leaves subjected to biotic stress we found upregulation of 266 of the NLRs. To illustrate the utility of our tool for the positional cloning of resistance genes, we estimated the number of NLR genes within the intervals of mapped rust resistance genes. Our study will support the identification of functional resistance genes in wheat to accelerate the breeding and engineering of disease-resistant varieties.