rnazRandomizeAln.pl
- Randomize alignments by shuffling the columns.
rnazRandomizeAln.pl [options] [file]
Long alignment blocks should be shuffled locally in order to maintain local characteristics of the alignment. Therefore alignments can be shuffled in windows. You can specify here the size of a window and the offset. Defaults are window=120 and slide=120, i.e. the alignments are shuffled in non-overlapping windows of 120 columns.
The shuffling algorithm tries to mantain local conservation patterns, i.e. it shuffles only columns of the same degree of conservation. This becomes limiting if you have many sequences in your alignment. Therfore you can choose the level of ``coarse graining" with this option.
To decide which columns have the same degree of conservation, the mean pairwise identity (MPI) of each column is calculated and finally only columns of the same value are shuffled. You can adjust the rounding of the MPI and thus the ``coarse graining" level with this option. If you have two columns with say 0.52 and 0.48 MPI you get:
level 0: 1 and 0
level 1: 50 and 50
level 2: 52 and 48
So on level 0 you only have ``conserved" (MPI > 0.5) and ``non-conserved" (MPI < 0.5) columns while on level 2 you need almost exactly the same MPI to shuffle two columns.
Default value is 2.
Prints version information and exits.
Prints a brief help message and exits.
Prints the manual page and exits.
rnazRandomizeAln.pl
reads a multiple sequence alignment in Clustal
W or MAF format and returns a randomized version in the same
format. The program uses the algorithm described in Washietl &
Hofacker, J. Mol. Biol. 342(1):19 (2004). It generates alignments of
the same length, the same base composition, the same gap pattern, the
same overall conservation and the same local conservation patterns
(see also option --level).
# rnazRandomizeAln.pl -l 1 some.maf > random.maf
Randomizes the file some.maf
using a less stringent parameter for
maintaining conservation patterns.
Stefan Washietl <wash@tbi.univie.ac.at>