EAssemble is a version of GCG's old Assemble program with command line control added.
EAssemble lets you choose as many segments as you please from existing sequences. The ranges chosen can be from either strand. EAssemble puts all of the segments together into a new sequence file.
EAssemble supports the assembly of fragments from circular molecules by letting you define a range in the sequence that extends across the end and into the beginning of a molecule. The terminal bell rings when a circular range is chosen.
This GCG program was modified by Jaakko Hattula (Tampere University of Technology, Finland) and Peter Rice (E-mail: pmr@sanger.ac.uk Post: Informatics Division, The Sanger Centre, Hinxton Hall, Cambridge, CB10 1RQ, UK).
All EGCG programs are supported by the EGCG Support Team, who can be contacted by E-mail (egcg@embnet.org).
Here is part of a session using EAssemble to assemble a coding sequence from the file gamma.seq:
% eassemble EASSEMBLE uses any sequence data EASSEMBLE of what sequence ? gamma.seq What should I call the output file (* gamma.seg *) ? Start (* 1 *) ? 2179 End (* 11375 *) ? 2270 Reverse (* No *) ? That range begins ATGGG and ends GGAAG. Is this correct (* Yes *) ? That is done, now would you like to: A)dd another segment from this sequence G)et segments from another sequence W)rite out this assembly into a file Please choose one (* W *): a Start (* 1 *) ? 2393 End (* 11375 *) ? 2615 Reverse (* No *) ? That range begins GCTCC and ends TCAAG. Is this correct (* Yes *) ? That is done, now would you like to: A)dd another segment from this sequence G)et segments from another sequence W)rite out this assembly to file Please choose one (* W *): a Start (* 1 *) ? 3502 End (* 11375 *) ? 3630 Reverse (* No *) ? That range begins CTCCT and ends ACTGA. Is this correct (* Yes *) ? That is done, now would you like to: A)dd another segment from this sequence G)et segments from another sequence W)rite out this assembly to file Please choose one (* W *): %
Here is some of the output file:
EASSEMBLE September 26, 1995 16:30 Symbols: 1 to: 92 from: gamma.seq ck: 6474, 2179 to: 2270 Symbols: 93 to: 315 from: gamma.seq ck: 6474, 2393 to: 2615 Symbols: 316 to: 444 from: gamma.seq ck: 6474, 3502 to: 3630 Length: 444 September 26, 1995 16:31 Type: N Check: 2906 .. 1 ATGGGTCATT TCACAGAGGA GGACAAGGCT ACTATCACAA GCCTGTGGGG 51 CAAGGTGAAT GTGGAAGATG CTGGAGGAGA AACCCTGGGA AGGCTCCTGG /////////////////////////////////////////////////////// 351 CCATTTCGGC AAAGAATTCA CCCCTGAGGT GCAGGCTTCC TGGCAGAAGA 401 TGGTGACTGG AGTGGCCAGT GCCCTGTCCT CCAGATACCA CTGA
SeqEd is a general purpose, screen-oriented sequence editor. We believe that SeqEd' s Include command is a much better way to assemble sequences than by using EAssemble Reformat puts a sequence file that has been modified with a text editor into GCG sequence file format.
Unknown. EAssemble allows you to concatenate peptide and nucleotide sequences without complaint; however, that is an odd thing to do. Embedded comments from the input segments are lost in the output sequence.
SeqEd' s Include command renders EAssemble obsolete. Please contact us if you disagree.
All parameters for this program may be put on the command line. Use the option -CHEck to see the summary below and to have a chance to add things to the command line before the program executes. In the summary below, the capitalized letters in the qualifier names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose qualifiers or parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Basic Concepts: Using Programs in the GCG User's Guide.
Minimal Syntax: % eassemble -[INfile=]gamma.seq -Default Prompted Parameters: [-OUTfile=]gamma.seg output file name -MENU1=w menu option Local Data Files: None Optional Parameters: -BEGin1=1 -END1=100 limits for each range -REVerse1 direction for each range [-INfile2=]gamma.seq second input file
None.
The parameters and switches listed below can be set from the command line. For more information, see "Using Program Parameters" in Chapter 3, Basic Concepts: Using Programs in the GCG User's Guide.
specifies the start of a fragment, defaults to the start of the sequence. -BEGin2=1 refers to the second input range, and so on.
specifies the end of a fragment, defaults to the end of the sequence. -END2=1201 refers to the second input range, and so on.
specifies the direction of assembly, defaults to forward. -REVerse2 refers to the second input range, and so on.
names the output file for the assembly.
gives the answer to the menu prompt.
names a second input sequence for the assembly.
Printed: April 22, 1996 15:52 (1162)