Ereverse

Go back to top

EREVERSE


FUNCTION

EReverse reverses and/or complements a sequence. EReverse is a version of GCG's Reverse with command line control.


DESCRIPTION

EReverse complements and/or reverses the symbols in a sequence. The complements of all of the supported IUB nucleic acid symbols are listed in Appendix III. The output is written into a new sequence file.


AUTHOR

This GCG program was modified by Jaakko Hattula (Tampere University of Technology, Finland) and Peter Rice (E-mail: pmr@sanger.ac.uk Post: Informatics Division, The Sanger Centre, Hinxton Hall, Cambridge, CB10 1RQ, UK).

All EGCG programs are supported by the EGCG Support Team, who can be contacted by E-mail (egcg@embnet.org).


EXAMPLE

The file test.seq contains all of the legitimate GCG sequence characters. Here is a session using EReverse to show how they would look on the opposite strand:

  
  
  % ereverse
  
   EREVERSE uses nucleotide sequence data
  
   EREVERSE of what sequence ?  gamma.seq
  
                 Start (* 1 *) ?
               End (* 11375 *) ?
  
   Do you want to:
  
    1) reverse only
    2) complement only
    3) reverse and complement
  
   Please choose one (* 3 *):
  
   What should I call the output file (* gamma.rev *)
  
  %
  
  


OUTPUT

Here is the output file test.rev:

  
  
    REVERSE-COMPLEMENT of: gamma.seq  check: 6474  from: 1  to: 11375
  
  Human fetal beta globins G and A gamma
  from Shen, Slightom and Smithies,  Cell 26; 191-203.
  Analyzed by Smithies et al. Cell 26; 345-353.
  
  gamma.rev  Length: 11375  March 19, 1996 14:12  Type: N  Check: 3374  ..
  
    1  GAATTCGGCA GTTACTGCAA CTTCCACTTT TCTCTCACCC GCTCCAGGAA
  
   51  AAGTGACCTG CAGTCACTTT CCTGGTAGTA TTGATTCTTT CTTGTTTGTG
  
  101  GCTGTTCCCC ATTTCCAATT GTTTTCCATG ATTATTGCTT CTACTGTGAT
  
  
            /////////////////////////////////////
  
11251  CCTCAGGTGA TCTGCCTGCC TTGGCCTCCC AAAATTCTGG GATTACAGGC
  
11301  GTGAGCCACC ACTCCCAGCC TCTAAACAAG TGAATCTTAA TTGCTCCTCC
  
11351  TCAGACTAAG GAATATCTAG GATCC
  
  


RESTRICTIONS

A reversed sequence is renumbered so that the first base corresponds to the last base of the range you chose. It only makes sense to complement nucleotide sequences! If you don't reverse and complement a sequence, you are in danger of having the sequence in 3' to 5' orientation. All GCG programs and all databases assume that nucleotide sequences are in 5' to 3' order, so be careful. Peptide sequences are generally kept in amino-to-carboxyl orientation. Many legitimate sequence symbols are not IUB-supported nucleic acid symbols, so they have no sensible complement (see Appendix III) .

Embedded comments are lost.


INPUT FILE

Here is the input file test.seq used in the example above:

  
  
  This sequence contains every symbol in the alphabet of
  legitimate GCG sequence characters (Appendix III).
  
  Test.Seq  Length: 389  July 19, 1994 15:05  Type: N  Check: 8468  ..
  
    1
       >starts with the codons from appendix iii>
       GCTGCCGCAG CGGCXGATGA CAATAACRAY TGTTGCTGYG ATGACGAYGA
  
   51  AGAGGARTTT TTCTTYGGTG GCGGAGGGGG XCATCACCAY ATTATCATAA
  
  101  THAAAAAGAA RTTGTTACTT CTCCTACTGT TRCTXYTAYT GYTRYTXATG
  
  151  AATAACAAYC CTCCCCCACC GCCXCAACAG CARCGTCGCC GACGGCGGAG
  
  201  AAGGCGXAGR MGAMGGMGRM GXTCTTCCTC ATCGAGTAGC TCXAGYWSXA
  
  251  CTACCACAAC GACXGTTGTC GTAGTGGTXT GGXXXTATTA CTAYGAAGAG
  
  301  CAACAGSART AATAGTGATA RTRATRR
                                    >continues with all
       uppercase sequence characters>
                                    ABC DEFGHIJKLM NOPQRSTUVW
  
  351  XYZ.+@&*ab cdefghijkl mnopqrstuv wxyz*@&+.
  
                                                 
 


COMMAND-LINE SUMMARY

All parameters for this program may be put on the command line. Use the option -CHEck to see the summary below and to have a chance to add things to the command line before the program executes. In the summary below, the capitalized letters in the qualifier names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose qualifiers or parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Basic Concepts: Using Programs in the GCG User's Guide.

  
  
  Minimal Syntax: % ereverse [-INfile=]ggamma.seq -Default
  
  Prompted Parameters:
  
  -BEGin=1 -END=1700      range of interest
  -REVerse                reverse the strand
  -COMPlement             complement the strand
  [-OUTfile=]ggamma.rev   output file name
  
  Local Data Files: None
  
  Optional Parameters: None
  


LOCAL DATA FILES

None.


OPTIONAL PARAMETERS

None.

Printed: April 22, 1996 15:53 (1162)