|
"Power Users" To conduct bioinformatic analyses on a larger than one-gene-at-a-time basis, one must begin to really automate and choreograph analyses. UNIX and GCG are just an example of programs that one might wish to control. When you really begin to choreograph these analyses, you are ready to be christened a "power user".
Redirecting input and output UNIX programs have three read/write aspects:
GCG programs offer input and output "flags" (those little -plo=mygraphic.gif "thingies") that also redirect input and output. We will use some of these directions in an example. Controlling execution One can work on multiple files in at least two ways. Probably the simplest manner is to use a wildcard character. sort *.txt will sort all files in the current directory that are "anything".txt to the terminal. UNIX uses a set of other wildcards-to be provided later. The GCG-specific method is to use listfiles, which are lists of sequence IDs. This is a powerful method of both creating lists of files and of manipulating them. Listfiles are generated in two ways: GCG programs may generate lists and YOU can use a text editor to make a listfile. GCG programs that generate listfiles are:
To edit a list file, use your favorite word processor to make a file like the following: Type in a description of the file. This is arbitrary.
You can have single sequences of your own, database sequences, and other lists in
your list. Be sure to save the file and upload it. You can then use this in GCG programs with command like: map @mylist.seq The @ character tells GCG this is a list file. The other way of controlling execution is to use the "at" function of UNIX to execute commands only when certain conditions (time, etc) are true. This can be very valuable for a number of reasons. A subset of this is a "batch" file, which is a simple set of commands. We will write a batch file in class.
|