RNADISTANCE
Vienna RNA (l) Return to Main Contents
NAME
RNAdistance - calculate distances between RNA secondary structures
SYNOPSIS
RNAdistance [-D[fhwcFHWCP] [-X[p|m|f|c]] [-S] [-B
[<file>]]
DESCRIPTION
RNAdistance reads RNA secondary structures from stdin and
calculates one or more measures for their dissimilarity, based on
tree or string editing (alignment). In addition it calculates a
"base pair distance" given by the number of base pairs present in
one structure, but not the other. For structures of different
length base pair distance is not recommended.
RNAdistance accepts structures in bracket format, where matching
brackets symbolize base pairs and unpaired bases are represented by
a dot ".", or coarse grained representations where hairpins,
interior loops, bulges, multiloops, stacks and external bases are
represented by (H), (I), (B), (M), (S), and (E), respectively.
These can be optionally weighted. Full structures can be
represented in the same fashion using the identifiers (U) and (P)
for unpaired and paired bases, respectively. We call this the HIT
representation (you don't want to know what this means). For
example the following structure consists of 2 hairpins joined by a
multiloop:
.((..(((...)))..((..)))). full structure (usual format);
(U)((U2)((U3)P3)(U2)((U2)P2)P2) HIT structure;
((H)(H)M) or
((((H)S)((H)S)M)S) coarse grained structure;
(((((H3)S3)((H2)S2)M4)S2)E2) weighted coarse grained.
The program will continue to read new structures until a line
consisting of the single character @ or an end of file condition is
encountered. Input lines neither containing a valid structure nor
starting with > are ignored.
OPTIONS
- -D[fhwcFHWCP]
- use the full, HIT, weighted coarse, or coarse representation to
calculate the distance. Capital letters indicate string alignment
otherwise tree editing is used. Any combination of distances can be
specified. -DP selects the base pair distance. The default
if "f".
- -Xp
- compare the structures pairwise (p), that is first with 2nd,
third with 4th etc. This is the default.
- -Xm
- calculate the distance matrix between all structures. The
output is formatted as a lower triangle matrix.
- -Xf
- compare each structure to the first one.
- -Xc
- compare continuously, that is i-th with (i+1)th structure.
- -S
- Use the Bruce Shapiro's cost matrix for comparing coarse
structures.
- -B [file]
- Print an "alignment" with gaps of the structures, to show
matching substructures. The aligned structures are written to
file, if specified. Otherwise output is written to stdout,
unless the -Xm option is set in which case "backtrack.file" is
used.
REFERENCES
Shapiro B A, (1988) An algorithm for comparing multiple RNA
secondary structures, CABIOS 4, 381-393
Shapiro B A, Zhang K (1990) Comparing multiple RNA secondary
structures using tree comparison, CABIOS 6, 309-318
Fontana W, Konings D A M, Stadler P F, Schuster P, (1993)
Statistics of RNA secondary structures, Biopolymers 33,
1389-1404
I.L. Hofacker, W. Fontana, P.F. Stadler, S. Bonhoeffer, M. Tacker,
P. Schuster (1994) Fast Folding and Comparison of RNA Secondary
Structures. Monatshefte f. Chemie 125, 167-188
VERSION
This man page documents version 1.8.5 Vienna RNA Package
AUTHORS
Walter Fontana, Ivo L Hofacker, Peter F Stadler.
BUGS
We hoped you wouldn't find them. Comments should be sent to
rna@tbi.univie.ac.at
This document was created by man2html,
using the manual pages.
Time: 07:19:16 GMT, February 23, 2011