IDC Contents1. Getting started 2. Running IDC 3. Running multiple data sets and trees 4. Reading the output 5. References and further reading 6. Contact information 7. Appendix - new features - possible bugs Getting StartedIntroduction
IDC is a program designed to calculate independent contrasts from a multivariate quantitative data set and a fully bifurcating phylogeny in Newick format. The calculation of contrasts follows Felsenstein (1985). IDC also calculates the variance-covariance matrix and correlation matrix of independent contrasts. These are calculated following Garland et al. (1992). Installation - Linux/UNIX To install in Linux/UNIX, download the tarball gzip archive file idc_program.tar.gz to your programs directory or other appropriate directory, e.g., /home/your_user_name/programs/idc_program.tar.gz. It is not necessary to create a directory structure for the program. To decompress, execute the following command:
To compile, navigate to linux/ and type the following command: Installation - Windows Unfortunately, IDC is not a real Windows program. However it has been compiled for Windows and can be executed from the Command Prompt in Windows or by double-clicking on the executable. Most Windows installations do not come packaged with a compiler, so I'm distributing the executable and hopefully that will work on your system (it should). The link to that executable is here. If you'd like to download a compiler to compile IDC yourself in Windows, I recommend MinGW. You should also be able to unzip the archive file idc_program.tar.gz using a program such as WinZip or WinAce. Running IDCInput file formatAny run of IDC requires two input files. Input files should be created as plain text which can be accomplished in Linux by using a text editor such as gedit or in Windows by using WordPad and saving in Save as Type: Text Document Format. The first input file is your quantitative trait data file, formatted as follows: The second input file is your tree file, formatted as follows: For ease of use the input files should be in the same directory as the executable! NOTE ON FILE FORMAT- This program will probably be finicky about file format, so be careful to follow the format presented herein and the format of the example data files available with IDC. Running IDC
Running IDC is easy. In Windows XP, you should be able to run the executable idc.exe simply by double clicking on it. If you'd prefer, you can also run it from the command prompt. The easiest way to get a command prompt in XP is to open a RUN window and enter:
To run in Linux/UNIX navigate to the appropriate directory and type: From here on out execution in Linux/UNIX and Windows are identical, so I will follow execution in Linux/UNIX. The entry screen should appear more or less as follows:
The output file will appear as follows: Running multiple data sets and treesWhy run multiple data sets or trees? Many simulation studies generate large numbers of trees and data sets. Also, confidence intervals can be generated around a hypothesis from the comparative method by calculating independent contrasts from many resampled data sets and/or many bootstrap phylogenies or phylogenies sampled from the posterior distribution of a Bayesian analysis. Luckily, in IDC it is very easy to simultaneously run many datasets and/or many phylogenies. File format for multiple data sets or trees The file format for multiple datasets and trees is extremely similar to that used for a single data set/tree. The format for multiple data sets is as follows: The format for the tree file is even simpler and is as follows: Running multiple data sets and trees First, make appropriate selections from the options screen:
Example contrasts file: Reading the output from IDCThe general output from IDC looks as follows:
In the next section is the variance-covariance matrix of independent contrasts. Diagonal elements are mean-squares and off-diagonal elements are mean cross-products. Finally, in the last section is the correlation matrix of independent contrasts, in which the correlation is calculated through the origin following Garland et al. (1992). When multiple data sets or phylogenies are used, by default IDC creates separate output files for the variance-covariance and correlation matrices, e.g., References1. Felsenstein, J. 1985. Phylogenies and the comparative method. Am. Nat. 125:1-15 Contact informationPlease contact me by email with any questions, or if you find the program useful. My email is lrevell@fas.harvard.edu, and my other contact information is listed below. Although I have thoroughly tested the program, I encourage users to do the same and I would be happy to hear about any bugs you might find. Liam J. Revell AppendixStandardization to unit variance - An updated IDC can also print, instead of contrasts standardized to have the expected variance of a contrast = to the tree length, contrasts standardized to have unit variance following Felsenstein (1985). This is performed by selecting U from the modified options menu: The output for the same analysis as shown above would now look like the following: Minimalist output - An updated version of IDC can also return a minimalist output. This might be useful for batch processing (which is what the program is designed to do in the first place). This is performed by selecting M from the modified options menu: Minimalist output looks as follows: Sept. 13, 2006 - It's possible that contrasts will be calculated incorrectly if the taxa order in the data file does not correspond to numerical order. Until I confirm and fix this problem, users should just be careful to order the entries in the data file numerically. Sept. 18, 2006 - One user problem reported today is that if extra tabs are included in the input file (as MS excel tends to do), then the file will be read improperly and an incorrect result will be produced. This problem is particularly devious because it can result in the file being misread without a program error, thus the user might be inclined to believe that the analysis was completed correctly. Just be careful to delete all extraneous tabs from the input file (those at the ends of each line, particularly after the first line). Content copyright. Last updated 11 Oct. 2006. |