ble of gene neighbors centered on a query gene. The BLASTCLUST program is then used to cluster the protein products in the neighborhood and to identify conserved co-occurring genes. These conserved gene neighborhoods are then sorted according to a ranking scheme based on three criteria: occurrence in at least one other phylogenetically distinct lineage ("phylum" in the NCBI Taxonomy database), complete conservation within a particular lineage ("phylum"), and physical closeness on the chromosome, which indicates sharing of the regulatory -10 and -35 elements. Profile searches were conducted using the PSI-BLAST program [] with a default profile inclusion expectation (E) value threshold of 0.01. Profile-profile comparisons were performed using the HHpred program []. HMM searches were conducted using the newly released HMMER3 program []. Multiple alignments were constructed using Kalign [] and then manually adjusted based on the PSI-BLAST results. Protein secondary structure was predicted with the JPRED program [], using a multiple alignment as the input.
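The neighborhood ranking described above can be illustrated with a minimal Python sketch. It is not the script used in this study: the `Neighborhood` record, its field names, the lexicographic ordering of the three criteria, and the `gap_cutoff` value used as a proxy for "physical closeness" are all assumptions made for illustration, since the text does not specify how the criteria are weighted or how closeness is measured.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Neighborhood:
    """Hypothetical record for one conserved gene neighborhood."""
    name: str
    phyla_with_cluster: frozenset      # phyla in which the gene cluster is observed
    fully_conserved_phyla: frozenset   # phyla in which every sampled genome retains it
    max_intergenic_gap: int            # largest gap (bp) between adjacent genes


def rank_key(n, query_phylum, gap_cutoff=70):
    """Return a tuple of the three criteria; True sorts above False.

    gap_cutoff is an assumed, illustrative threshold, not a value from the text.
    """
    in_other_phylum = bool(n.phyla_with_cluster - {query_phylum})
    fully_conserved = bool(n.fully_conserved_phyla)
    physically_close = n.max_intergenic_gap <= gap_cutoff
    return (in_other_phylum, fully_conserved, physically_close)


def rank_neighborhoods(neighborhoods, query_phylum, gap_cutoff=70):
    """Sort neighborhoods so those meeting more of the (ordered) criteria come first."""
    return sorted(
        neighborhoods,
        key=lambda n: rank_key(n, query_phylum, gap_cutoff),
        reverse=True,
    )
```

Treating the criteria as an ordered tuple keeps the sketch simple; a weighted score would be an equally plausible reading of the ranking scheme.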

Software tools HHpred, HMMER, Kalign
Databases NCBI Taxonomy Database