RNAINVERSE

Vienna RNA (l) Return to Main Contents

NAME

RNAINVERSE - find RNA sequences with given secondary structure

SYNOPSIS

RNAinverse [-F[mp]] [-R repeats]] [-v] [-a alphabet] [-f final] [-T temp] [-d[0|1|2|3] [-4] [-noGU] [-noCloseGU] [-P paramfile] [-e 1|2] [-nsp pairs]

DESCRIPTION

RNAinverse searches for sequences folding into a predefined structure, thereby inverting the folding algorithm. Target structures (in bracket notation) and starting sequences for the search are read alternately from stdin. Characters in the start sequence other than "AUGC" (or the alphabet specified with -a) will be treated as wild cards and replaced by a random character. Any lower case characters in the start sequence will be kept fixed during the search. If necessary, the sequence will be elongated to the length of the structure. Thus a string of "N"s as well as a blank line specify a random start sequence.
For each search the best sequence found and its Hamming distance to the start sequence are printed to stdout. If the the search was unsuccessful, a structure distance to the target is appended. The -Fp and -R options can modify the output format, see below.
The program will continue to read new structures and sequences until a line consisting of the single character "@" or an end of file condition is encountered.

OPTIONS

-F[mp]
Use minimum energy (-Fm), partition function folding (-Fp) or both (-Fmp). In partition function mode, the probability of the target structure exp(-E(S)/kT)/Q is maximized. This probability is written in brackets after the found sequence and Hamming distance. In most cases you'll want to use the -f option in conjunction with -Fp, see below. The default is -Fm.
-f final
In combination with -Fp stop search when sequence is found with E(s)-F is smaller than final, where F=-kT*ln(Q).
-R repeats
Search repeatedly for the same structure. If repeats is negative search until -repeats exact solutions are found, no output is done for unsuccessful searches. Be aware, that the program will not terminate if the target structure can not be found.
-v
in conjunction with -R and a negative repeats, print the last subsequence and substructure for each unsuccessful search.
-a alphabet
Find sequences using only bases from alphabet.

The -T, -d, -4, -noGU, -noCloseGU, -e, -P, -nsp, options work as in RNAfold

EXAMPLE

To search 5 times for sequences forming a simple hairpin structure interrupted by one GA mismatch call
RNAinverse -R 5
and enter the lines
(((.(((....))).)))
NNNgNNNNNNNNNNaNNN

REFERENCES

The calculation of minimum free energy structures is based on dynamic programming algorithm originally developed by M. Zuker and P. Stiegler. The partition function algorithm is based on work by J.S. McCaskill.
If this Program proves useful for your work please cite:

I.L. Hofacker, W. Fontana, P.F. Stadler, S. Bonhoeffer, M. Tacker, P. Schuster (1994) Fast Folding and Comparison of RNA Secondary Structures. Monatshefte f. Chemie 125: 167-188
Other useful references are:
M. Zuker, P. Stiegler (1981) Optimal computer folding of large RNA sequences using thermodynamic and auxiliary information, Nucl Acid Res 9: 133-148
J.S. McCaskill (1990) The equilibrium partition function and base pair binding probabilities for RNA secondary structures, Biopolymers 29: 1105-1119
D.H. Turner N. Sugimoto and S.M. Freier (1988) RNA structure prediction, Ann Rev Biophys Biophys Chem 17: 167-192
D. Adams (1979) The hitchhiker's guide to the galaxy, Pan Books, London

VERSION

This man page documents version 1.8.5 Vienna RNA Package.

AUTHOR

Ivo L Hofacker.

BUGS

If in doubt our program is right, nature is at fault. Comments should be sent to rna@tbi.univie.ac.at.

This document was created by man2html, using the manual pages.
Time: 07:19:16 GMT, February 23, 2011