DbStats counts the number of entries and the total lengths of sequence entries in a GCG formatted database.
DbStats reports simple statistics for a set of sequence files in a directory, for example all individual sequence files in the Embl or GenBank directories.
This program was written by Peter Rice (E-mail: pmr@sanger.ac.uk Post: Informatics Division, The Sanger Centre, Hinxton Hall, Cambridge, CB10 1RQ, UK).
All EGCG programs are supported by the EGCG Support Team, who can be contacted by E-mail (egcg@embnet.org).
Here is a sample session with DbStats
% dbstats DBSTATS uses any sequences DBSTATS of what sequence(s) ? Sw:*_ecoli What should I call the output file (* db.stats *) ? ec.stats DbStats complete: Categories: 0 Entries: 3,466 Total Length: 1,043,874 CPU: 18.96 %
The output from DbStats is a simple report of the number and total length of entries in a set of selected database sequences. from GCG formatted sequence databases.
All parameters for this program may be put on the command line. Use the option -CHEck to see the summary below and to have a chance to add things to the command line before the program executes. In the summary below, the capitalized letters in the qualifier names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose qualifiers or parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Basic Concepts: Using Programs in the GCG User's Guide.
Minimum Syntax: % dbstats [-INfile=]Sw:*_Ecoli -Default Prompted Parameters: [-OUTfile=]db.stats Output file Optional Parameters: -MONitor Show progress through the selected sequences -NOSUMmary Do not write summary statistics at end of run -LISTsize=10 List the n longest entry names selected
The parameters and switches listed below can be set from the command line. For more information, see "Using Program Parameters" in Chapter 3, Basic Concepts: Using Programs in the GCG User's Guide.
reports each entry name.
requests a listing of the 10 longest entries selected in GCG list file format.
Printed: April 22, 1996 15:52 (1162)