FETCH

[ Program Manual | User's Guide | Data Files | Databases ]

Table of Contents
FUNCTION
DESCRIPTION
EXAMPLE
INPUT FILES
COMMAND-LINE SUMMARY
LOCAL DATA FILES
PARAMETER REFERENCE

FUNCTION

[ Top | Next ]

Fetch copies GCG sequences or data files from the GCG database into your directory or displays them on your terminal screen.

DESCRIPTION

[ Previous | Top | Next ]

The expression % fetch *bov* will retrieve every GCG data file or sequence entry whose name contains the string bov. Sequence specification is described in detail in Chapter 2, Using Sequence Files and Databases of the User's Guide.

When copying a sequence from a database, Fetch creates a file in GCG format whose name is the entry name and whose extension is the database logical name. For example, % fetch GenBank:Hsrep2 copies the requested sequence into a file called hsrep2.gb_pr1. The filename extension is taken from the logical name for the database. In this example, the extension .gb_pr1 indicates that the sequence was copied from the Primate division of the GenBank nucleotide sequence database. (See "Using Database Sequences" in Chapter 2, Using Sequence Files and Databases of the User's Guide for a complete listing of logical names for all GCG databases.) If the file being copied is not from a sequence database, for example enzyme.dat, then its name is not changed.

If your sequence specification contains no logical name, Fetch looks in all the databases and in all the GCG data directories to find all possible entry names. For example, % fetch hum* would do almost the same thing as % fetch GenBank:hum*, except that if any sequences beginning with hum were present in databases other than GenBank or in any GCG data directories, they would also be retrieved.

Special Considerations for Searching

Keep in mind that filenames are case sensitive and database entry names are case insensitive. Because this program searches for both filenames and database entry names, you must take care when you enter the character pattern that makes up your specification.

For example, if you entered Gamma* as a file specification, this program would find all entries in the databases whose names begin with Gamma but no GCG-supplied files would be found. This is because all the files in the Wisconsin Package are named using lowercase letters. Conversely, if you entered gamma*, this program would find all of the entries in the databases and all the GCG-supplied files whose names begin with gamma.

(Note that it is often convenient to add -OUTfile=term to the command line so the data are displayed on your terminal screen.)

EXAMPLE

[ Previous | Top | Next ]

Here is a session using Fetch to retrieve local copies of most of the GenEMBL human beta globin sequences:


% fetch

 FETCH what sequence(s) ?  GenEMBL:HumHb*

 humhb16aa.gb_pr1
 humhb1az.gb_pr1
 humhb24.gb_pr1

 ////////////////

 humhbp68.gb_pr2

%

INPUT FILES

[ Previous | Top | Next ]

Except for the exceptions listed below, Fetch accepts valid single file specifications. You can also specify multiple files by using a list file, for example @project.list, or by using an ambiguous specification with an asterisk (*) wildcard, for example GenEMBL:*. You can use Fetch to obtain copies of multiple sequence format (MSF) or rich sequence format (RSF) files, for example GenDocData:*.msf, but you cannot use specifications such as hsp70.msf{hsp_yeast} to extract sequences from such files into individual sequence files. (To accomplish this task, use the Reformat program.)

COMMAND-LINE SUMMARY

[ Previous | Top | Next ]

All parameters for this program may be added to the command line. Use -CHEck to view the summary below and to specify parameters before the program executes. In the summary below, the capitalized letters in the parameter names are the letters that you must type in order to use the parameter. Square brackets ([ and ]) enclose parameter values that are optional. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.


Minimal Syntax: % fetch [-INfile=]genembl:humhb*

Prompted Parameters: None

Local Data Files: None

Optional Parameters:

-DOCLines=6         copies only the first 6 lines of documentation
-OUTfile=filename   copies file(s)/sequence(s) into one file
-REFerence          copies only the documentation
-NOMONitor          suppresses the screen monitor

LOCAL DATA FILES

[ Previous | Top | Next ]

None.

PARAMETER REFERENCE

[ Previous | Top | Next ]

You can set the parameters listed below from the command line. For more information, see "Using Program Parameters" in Chapter 3, Using Programs in the User's Guide.

-DOCLines=6

sets an individual GCG program to copy only six non-blank lines of documentation from input data files into the output files. Use the % doclines global switch to set this value for your whole session. Usually, Fetch copies all of the documentation from each sequence entry into your new files exactly as it appeared in the original entry.

-OUTfile=filename

copies the sequence(s) and/or data file(s) into one file which you can name. If you leave out the name of the file, Fetch prompts you for one. (Wisconsin Package(TM) programs will not read files containing more than one sequence unless they are in an MSF (multiple sequence format) or RSF (rich sequence format) file.)

It is often useful to use term for the filename so that the data are displayed on your terminal screen.

-MONitor

This program normally monitors its progress on your screen. However, when you use -Default to suppress all program interaction, you also suppress the monitor. You can turn it back on with this parameter. If you are running the program in batch, the monitor will appear in the log file.

-REFerence

copies only the documentation for the sequence or data file. Unless specified, the name of the output file is the entry name concatenated with _ref, followed by the database logical name as the extension.

Printed: December 9, 1998 16:24 (1162)

[ Program Manual | User's Guide | Data Files | Databases ]


Documentation Comments: doc-comments@gcg.com
Technical Support: help@gcg.com

Copyright (c) 1982-2001 Genetics Computer Group, Inc. A subsidiary of Pharmacopeia, Inc. All rights reserved.

Licenses and Trademarks Wisconsin Package is a trademark of Genetics Computer Group, Inc. GCG and the GCG logo are registered trademarks of Genetics Computer Group, Inc.

All other product names mentioned in this documentation may be trademarks, and if so, are trademarks or registered trademarks of their respective holders and are used in this documentation for identification purposes only.

Genetics Computer Group

www.gcg.com