Converting Tabular Data for CANOCO and TWINSPAN using toccf
24-Mar-1998
The latest version of this document is at
http://www2.biology.ualberta.ca/jbrzusto/ftp/trees/toccfdoc.html
Bugs and changes are listed at the end.
Get the program:
Command line version only:
Send your web browser to:
http://www2.biology.ualberta.ca/jbrzusto/ftp/trees/dos32/toccf.exe or
ftp (with userid anonymous) to gause.biology.ualberta.ca and
change directory to /pub/jbrzusto/trees/dos32 and then
get toccf.exe (making sure to use binary mode).
If you want to run these programs in plain DOS, (ie. not within a Windows DOS box), you must also download cwsdpmi.exe and put it in the same directory, unless you are already running a DPMI server. This driver provides programs with access to extended memory.
The directory http://www2.biology.ualberta.ca/jbrzusto/ftp/trees/source.zip has ANSI-C source code and makefile, if you want to recompile the program on your machine.
Krebs/win:
Get this from
http://www2.biology.ualberta.ca/jbrzusto/ftp/krebs/index.html
Installation instructions are available at
http://www2.biology.ualberta.ca/jbrzusto/ftp/krebs/README.TXT
Potemkin:
Get this from
http://www2.biology.ualberta.ca/jbrzusto/ftp/potemkin/potzip.exe
Installation instructions are available at
http://www2.biology.ualberta.ca/jbrzusto/ftp/potemkin/potemkin.html
This program requires Windows '95.
Put your data into a text file:
toccf works on text files. To put your data into
this format, do one of the following:
Run the program (command line version):
toccf is run from the DOS command prompt,
so if you are using Windows, you must first open a DOS window, and change
to the directory into which you downloaded the programs.
toccf 210 180 -fc test.txt > canotest.txt
The options mean:
210: the number of rows of data in the input file (ie. the number of lines in the file)
180: the number of columns of data in the input file (e.g. a line like "123 34 5.2" has 3 columns)
-fc: format output for CANOCO
test.txt: name of file containing your data
> canotest.txt: output Cornel condensed format to the file canotest.txt
Here is a more complicated example of the command:
toccf 210 180 -nr -nc -ft -r -t -s5F6.2 test1.txt test2.txt > twintest.txt
The options mean:210: the number of rows in the input file, not including column names340: the number of columns in the input files combined, not including row names-nr: names of rows are present. The first item in each row is the name of that row.-nc: names of columns are present. The first line in the file is a list of column names.-ft: format output for TWINSPAN-r: each sample is a row in the input, rather than a column-t: the input file uses TAB characters to separate values, some of which can be blank (treated as 0).
If there are blank entries in your data matrix, your file must use TABs and you must use this option for toccf-s5F6.2: the format for output. In this example, 5 couplets are printed per line
and each character value occupies 6 spaces with 2 digits after the decimal point.test1.txt: name of first data filetest2.txt: name of second data file. Multiple input files are pasted together side by side, so that
the first row in the input consists of the first row in file1 joined to the first row in file 2, and so on. This option is included because
some spreadsheets can't export very wide lines as text files, so you might have to manually select consecutive groups of (for example) 100 columns,
one group at a time, and export each to a separate file.> twintest.txt: output Cornel condensed format to the file twintest.txt| Date reported | Date Fixed/Made | Change/Bug/Fix |
| 21-Oct-1997 | added the -s option for specifying couplet output format |
Bugs, comments, etc. to John Brzustowski junkjbrzusto@ualberta.cacaca but removing the obvious parts
There are more free programs at http://www2.biology.ualberta.ca/jbrzusto/index.html