TY - GEN
T1 - Development and implementation of CARAS algorithm for automatic annotation, visualization, and GenBank submission of chloroplast genome sequences
AU - Li, Yankai
AU - Li, Huan
AU - Zhu, Yingjie
AU - Li, Zhoujun
AU - Yin, Chuntao
AU - Lin, Xiaohan
AU - Liu, Chang
PY - 2012
Y1 - 2012
N2 - We present CARAS, a web server that allows the automatic annotation of a chloroplast genome sequence, and the visualization and editing of the annotation results interactively and in real-time. CARAS accepts a complete chloroplast genome sequence as input. First, it accurately predicts protein-coding sequences and exon-intron structures by combining the results from two types of annotation approaches: ab initio prediction algorithms and similarity based methods. Second, tRNA genes and inverted repeats are identified using tRNAscan and vmatch. Using 220 chloroplast genome sequences as test, we show that CARAS outperforms a similar application DOGMA overall. Third, the annotation results are presented as a circular map that includes the name, location, orientation, and length of the various features. A Flex-based module is implemented, allowing the users to add, delete, and edit the features on-line. The results are stored on the server and can be retrieved from a given URL. Users also have options to download the annotation results in GFF3 format for further analyses in other third party software tools or in JPEG format for publication. Finally, CARAS can be used to create a Sequin file for GenBank submission of the annotated genome sequence. CARAS is freely available at http://caras.bicoup.com, and it will significantly facilitate research work involving chloroplast genomes.
AB - We present CARAS, a web server that allows the automatic annotation of a chloroplast genome sequence, and the visualization and editing of the annotation results interactively and in real-time. CARAS accepts a complete chloroplast genome sequence as input. First, it accurately predicts protein-coding sequences and exon-intron structures by combining the results from two types of annotation approaches: ab initio prediction algorithms and similarity based methods. Second, tRNA genes and inverted repeats are identified using tRNAscan and vmatch. Using 220 chloroplast genome sequences as test, we show that CARAS outperforms a similar application DOGMA overall. Third, the annotation results are presented as a circular map that includes the name, location, orientation, and length of the various features. A Flex-based module is implemented, allowing the users to add, delete, and edit the features on-line. The results are stored on the server and can be retrieved from a given URL. Users also have options to download the annotation results in GFF3 format for further analyses in other third party software tools or in JPEG format for publication. Finally, CARAS can be used to create a Sequin file for GenBank submission of the annotated genome sequence. CARAS is freely available at http://caras.bicoup.com, and it will significantly facilitate research work involving chloroplast genomes.
KW - annotation
KW - chloroplast genome
KW - editing
KW - visualization
KW - web server
UR - http://www.scopus.com/inward/record.url?scp=84860471413&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84860471413&partnerID=8YFLogxK
U2 - 10.1109/ComComAp.2012.6154863
DO - 10.1109/ComComAp.2012.6154863
M3 - Conference contribution
AN - SCOPUS:84860471413
SN - 9781457717178
T3 - 2012 Computing, Communications and Applications Conference, ComComAp 2012
SP - 310
EP - 315
BT - 2012 Computing, Communications and Applications Conference, ComComAp 2012
T2 - 2012 Computing, Communications and Applications Conference, ComComAp 2012
Y2 - 11 January 2012 through 13 January 2012
ER -