Similar protocols

Protocol publication

[…] ds dataset has been submitted to the NCBI (, In this study, a total of 91,487 T. saginata unigenes were obtained. Based on a sequence similarity with known proteins, a total of 59,262 unigenes were annotated. Up to 57,607 of which were annotated against the NCBI non-redundant (Nr) protein database, 24,860 were assigned to the protein database Clusters of Orthologous Groups (COG), 26,476 were assigned to the term annotation database of Gene Ontology (GO), and 43,575 were assigned to 200 pathways in the database of Kyoto Encyclopedia of Genes and Genomes (KEGG). Among the annotated unigenes, 61,941 coding sequences (CDS) were obtained by the BLASTx algorithm []. All CDSs were analyzed using the FrameDP software [], which has the ability to self-train directly on EST clusters instead of requiring curated cDNA sets to train the underlying ESTScan and DECODER software []., To minimise the sampling error, only CDS sequences longer than 300 bp were used for this study. The final sequence collection containing 11,399 CDSs was used for our analyses., Codon usage in these genes was assessed using the program codonW 1.4.4 (J Peden, Relative synonymous codon usage (RSCU) is the observed frequency of a codon divided by the frequency expected, if all synonyms for that amino acid were used equally []. Thus, RSCU values close to 1.0 indicate lack of bias whereas values more than 1 indicates that a codon was used more frequently than expected, while the converse is true for RSCU values less than 1. The effe […]

Pipeline specifications

Software tools BLASTX, FrameDP, ESTScan
Organisms Taenia saginata, Homo sapiens
Diseases Ataxia Telangiectasia, Dyskinesias, Brain Diseases, Telangiectasis, DNA Repair-Deficiency Disorders