Dbstats

DBSTATS


FUNCTION

DbStats reports the number of sequence entries in a GCG-formatted database and their total length.


DESCRIPTION

DbStats reports simple statistics for a set of sequence files in a directory, for example all individual sequence files in the Embl or GenBank directories.


AUTHOR

This program was written by Peter Rice (E-mail: pmr@sanger.ac.uk Post: Informatics Division, The Sanger Centre, Hinxton Hall, Cambridge, CB10 1RQ, UK).

All EGCG programs are supported by the EGCG Support Team, who can be contacted by E-mail (egcg@embnet.org).


EXAMPLE

Here is a sample session with DbStats:

  
  
  % dbstats
  
   DBSTATS uses any sequences
  
   DBSTATS of what sequence(s)  ? Sw:*_ecoli
  
   What should I call the output file (* db.stats *) ?  ec.stats
  
   DbStats complete:
  
   Categories: 0
   Entries: 3,466
   Total Length: 1,043,874
       CPU: 18.96
  
  %
  


OUTPUT

The output from DbStats is a simple report of the number and total length of the entries in a set of sequences selected from GCG-formatted sequence databases.
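
The report amounts to a count and a sum over the selected entries. The following is a minimal Python sketch of that arithmetic, not the DbStats implementation. It assumes the entries are already available as (name, length) pairs; the function name summarize and the sample entries are hypothetical.

```python
def summarize(entries):
    """Return (entry count, total length) for an iterable of (name, length) pairs."""
    count = 0
    total_length = 0
    for _name, length in entries:
        count += 1
        total_length += length
    return count, total_length


# Hypothetical entries; real names and lengths would come from the database.
sample = [("aaa_ecoli", 120), ("bbb_ecoli", 305), ("ccc_ecoli", 98)]
count, total_length = summarize(sample)
print(f"Entries: {count:,}")
print(f"Total Length: {total_length:,}")
```

The thousands separator in the format strings mirrors the comma-grouped figures shown in the sample session.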


COMMAND-LINE SUMMARY

All parameters for this program may be put on the command line. Use the option -CHEck to see the summary below and to have a chance to add things to the command line before the program executes. In the summary below, the capitalized letters in the qualifier names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose qualifiers or parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Basic Concepts: Using Programs in the GCG User's Guide.
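
The abbreviation rule described above can be sketched in Python. This is an illustration only: the function name matches_qualifier is hypothetical, and both the case-insensitive comparison and the requirement that extra typed letters follow the full qualifier name are assumptions, not documented GCG behavior. Values such as =10 are not handled.

```python
from itertools import takewhile


def matches_qualifier(typed, qualifier):
    """Check whether typed text is an acceptable abbreviation of a qualifier.

    qualifier is written as in the summary, e.g. "-MONitor": the leading
    capitalized letters are required, the remaining letters are optional.
    """
    name = qualifier.lstrip("-")
    # The required part is the leading run of capital letters.
    required = "".join(takewhile(str.isupper, name))
    word = typed.lstrip("-").upper()
    # Assumption: matching ignores case, and any letters typed beyond the
    # required ones must continue the full qualifier name.
    return len(word) >= len(required) and name.upper().startswith(word)


print(matches_qualifier("-mon", "-MONitor"))
print(matches_qualifier("-mo", "-MONitor"))
```

Under these assumptions the first call accepts -mon because it supplies all three required letters, and the second rejects -mo because it is one letter short.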

  
  
  Minimum Syntax: % dbstats [-INfile=]Sw:*_Ecoli -Default
  
  Prompted Parameters:
  [-OUTfile=]db.stats   Output file
  
  Optional Parameters:
  
  -MONitor              Show progress through the selected sequences
  -NOSUMmary            Do not write summary statistics at end of run
  -LISTsize=10          List the n longest entry names selected
  


OPTIONAL PARAMETERS

The parameters and switches listed below can be set from the command line. For more information, see "Using Program Parameters" in Chapter 3, Basic Concepts: Using Programs in the GCG User's Guide.

-MONitor

reports each entry name as the program works through the selected sequences.

-LISTsize=10

requests a listing, in GCG list file format, of the n longest entries selected (here n is 10).
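
Selecting the longest entries can be sketched in Python as follows. This is a hedged illustration rather than the DbStats implementation: it assumes entries are (name, length) pairs, the function name longest_entries and the sample data are hypothetical, and the output is plain text, not GCG list file format.

```python
import heapq


def longest_entries(entries, list_size=10):
    """Return the list_size longest (name, length) pairs, longest first."""
    return heapq.nlargest(list_size, entries, key=lambda entry: entry[1])


# Hypothetical entries; real names and lengths would come from the database.
sample = [
    ("aaa_ecoli", 120),
    ("bbb_ecoli", 305),
    ("ccc_ecoli", 98),
    ("ddd_ecoli", 410),
]
for name, length in longest_entries(sample, list_size=2):
    print(name, length)
```

heapq.nlargest keeps only list_size candidates in memory at a time, which suits a single pass over a large set of entries.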

Printed: April 22, 1996 15:52 (1162)