TANDEM

FUNCTION

Tandem looks for multiple tandem repeats of a given size in a nucleotide sequence.

DESCRIPTION

Tandem looks for tandem repeats in a sequence. For repeated motifs with a gap between, use ERepeat.

AUTHOR

This program was originally written by Richard Durbin and Jean Thierry-Mieg as part of the ACEDB genome database package.

This version was modified for inclusion in EGCG by Peter Rice (E-mail: pmr@sanger.ac.uk Post: Informatics Division, The Sanger Centre, Hinxton Hall, Cambridge, CB10 1RQ, UK).

All EGCG programs are supported by the EGCG Support Team, who can be contacted by E-mail (egcg@embnet.org).

EXAMPLE

Here is a sample session with Tandem.


% tandem

 TANDEM uses nucleotide sequence data

 TANDEM of what sequence ?  GenEMBL:hhtetra

                   Start (* 1 *) ?
                   End (* 1272 *) ?

 Minimum repeat size (* 10 *) ?  6

 Maximum repeat size (* 6 *) ?

 What threshold score (* 20 *) ?

 Allow N as a mismatch (* No *) ?

 Allow uniform consensus (* No *) ?

 What should I call the output file (* hhtetra.tan *) ?

%

OUTPUT

The output from Tandem is a simple report of hits in the sequence.


TANDEM of em_vi:hhtetra check: 7108

 Score      Start        End Length Copies Percent ..

   120        793        936      6     24    93.8 ACCCTA
    90        283        420      6     23    84.8 TAACCC
    38        432        485      6      9    90.7 CCCTAA
    26        494        529      6      6    94.4 CCCTAA
    24        568        597      6      5   100.0 AACCCT

ALGORITHM

INPUT FILE

The input file for Tandem is a GCG nucleotide sequence file.

COMMAND-LINE SUMMARY

All parameters for this program may be put on the command line. Use the option -CHEck to see the summary below and to have a chance to add things to the command line before the program executes. In the summary below, the capitalized letters in the qualifier names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose qualifiers or parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Basic Concepts: Using Programs in the GCG User's Guide.


Minimum Syntax: % tandem [-INfile=]GenEmbl:hhtetra -Default

Prompted Parameters:

-BEGin=1 -END=100           Range of interest
[-OUTfile=]hhtetra.tan   Output file
-THRESHold=20            Score threshold
-MISmatch                Treat ambiguous positions as mismatches
-UNIform                 Allow a uniform consensus
-MINREPeat=10            Minimum repeat length
-MAXREPeat=10            Maximum repeat length

Local Data Files: None

Optional Parameters:

-ORIGinal=stdout         Produce original mimimal output format

OPTIONAL PARAMETERS

The parameters and switches listed below can be set from the command line. For more information, see "Using Program Parameters" in Chapter 3, Basic Concepts: Using Programs in the GCG User's Guide.

-ORIGinal=stdout

requests the output format of the original tandem program, optionally with an output file name (the default is to write to the terminal).

Printed: April 22, 1996 15:55 (1162)