Documentation read from 01/25/2012 11:58:32 version of /scratch/FIGdisk.server/FIG/bin/svr_aliases_of.



Return all identifiers for genes in the database that are protein-sequence-equivalent to the specified identifiers. In this case, the identifiers are assumed to be in their natural form (without prefixes). For each identifier, the identified protein sequences will be found and then for each protein sequence, all identifiers for that protein sequence or for genes that produce that protein sequence will be returned.

Alternatively, you can ask for identifiers that are precisely equivalent, that is, that identify the same location on the same genome.

------ Example: svr_all_features 3702.1 peg | svr_aliases_of

would produce a 2-column table. The first column would contain PEG IDs for genes occurring in genome 3702.1, and the second would contain the aliases (comma-seprated) of those genes.

The aliases are IDs of genes that have precisely the same protein sequence, but may or may not be from the same genome. ------

The standard input should be a tab-separated table (i.e., each line is a tab-separated set of fields). Normally, the last field in each line would contain the PEG for which aliases are being requested. If some other column contains the PEGs, use

    -c N

where N is the column (from 1) that contains the PEG in each case.

This is a pipe command. The input is taken from the standard input, and the output is to the standard output.

Command-Line Options

-c Column

This is used only if the column containing PEGs is not the last.

-r regexp

This is used to restrict the aliases being returned. Only aliases matching the regexp are returned.


Only identifiers that refer to the same location on the same genome will be returned. If this option is specified, identifiers that refer to proteins rather than features will return no result.

Output Format

The standard output is a tab-delimited file. It consists of the input file with an extra column added (a comma-separated list of aliases).