An RNA motif identification program that takes an RNA sequence alignment as an input and identifies related sequences using a profile-based dynamic programming algorithm. ERPIN differs from other RNA motif search programs in its ability to capture subtle biases in the training set and produce highly specific and sensitive searches, while keeping CPU requirements at a practical level.
A program for detection of human polyadenylation signals. To avoid training on possibly flawed data, the development of polyadq began with a de novo characterization of human mRNA 3' processing signals. This information was used in training two quadratic discriminant functions that polyadq uses to evaluate potential polyA signals.
A web-based software toolbox to recognize functional sites in nucleic acid sequences. Currently in this toolbox, two software tools are provided: TIS Miner and Poly(A) Signal Miner. The TIS Miner can be used to predict translation initiation sites in vertebrate DNA/mRNA/cDNA sequences, and the Poly(A) Signal Miner can be used to predict polyadenylation [poly(A)] signals in human DNA sequences.
A method using support vector machine for poly(A) site prediction. This program takes a file containing DNA/RNA sequences in the FASTA format as input, and 1) makes prediction for putative mRNA polyadenylation sites [or poly(A) sites] and/or 2) generates results indicating the occurrences of different cis-elements. The program is implemented in PERL and runs under UNIX/LINUX systems.
A poly(A) motif prediction method based on properties of human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics.
A machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). The program is able to predict the 12 main variants of human poly(A) motifs, i.e., AATAAA, ATTAAA, AAAAAG, AAGAAA, TATAAA, AATACA, AGTAAA, ACTAAA, GATAAA, CATAAA, AATATA, and AATAGA.
Predicts potential PAS-strong, PAS-weak and PAS-less cleavage/poly(A) sites in human sequences by linear discriminant function (LDF) combining characteristics describing functional motifs (polyadenylation signal [PAS]; cleavage site [CS], motif; GU/U motif) and oligonucleotide composition upstream and/or downstream of these sites. In tests, POLYAR shows high accuracy of prediction of the PAS-strong poly(A) sites, though this program's efficiency in searching for PAS-weak and PAS-less poly(A) sites is not very high but is comparable to other available programs.