Mercurial > repos > bebatut > qiime
comparison compare_categories.xml @ 0:c1bd0c560018 draft default tip
planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/qiime commit bcbe76277f3e60303faf826f8ce7f018bc663a9a-dirty
author | bebatut |
---|---|
date | Tue, 02 Feb 2016 05:50:37 -0500 |
parents | |
children |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:c1bd0c560018 |
---|---|
1 <tool id="qiime_compare_categories" name="compare categories" version="1.9.1galaxy1"> | |
2 | |
3 <description>Analyzes statistical significance of sample groupings using | |
4 distance matrices</description> | |
5 | |
6 <macros> | |
7 <import>macros.xml</import> | |
8 </macros> | |
9 | |
10 <expand macro="requirements" /> | |
11 | |
12 <command> | |
13 <![CDATA[ | |
14 compare_categories.py | |
15 --method=$method | |
16 -i $input_dm | |
17 -m $mapping_file | |
18 -c $categories | |
19 -o compare_categories_output | |
20 #if $num_permutations: | |
21 -n $num_permutations | |
22 #end if | |
23 ]]> | |
24 </command> | |
25 | |
26 <inputs> | |
27 <param label="--method: the statistical method to use. Valid options: | |
28 adonis, anosim, bioenv, morans_i, mrpp, permanova, permdisp, | |
29 dbrda" name="method" optional="False" type="select"> | |
30 <option value="adonis">adonis</option> | |
31 <option value="anosim">anosim</option> | |
32 <option value="bioenv">bioenv</option> | |
33 <option value="morans_i">morans_i</option> | |
34 <option value="mrpp">mrpp</option> | |
35 <option value="permanova">permanova</option> | |
36 <option value="permdisp">permdisp</option> | |
37 <option value="dbrda">dbrda</option> | |
38 </param> | |
39 <param label="-i/--input_dm: the input distance matrix. WARNING: Only | |
40 symmetric, hollow distance matrices may be used as input. Asymmetric | |
41 distance matrices, such as those obtained by the UniFrac Gain metric | |
42 (i.e. beta_diversity.py -m unifrac_g), should not be used as input" | |
43 name="input_dm" optional="False" type="data"/> | |
44 <param label="-m/--mapping_file: the metadata mapping file" | |
45 name="mapping_file" optional="False" type="data"/> | |
46 <param label="-c/--categories: a comma-delimited list of categories from | |
47 the mapping file. Note: all methods except for BIO-ENV accept just a | |
48 single category. If multiple categories are provided, only the first | |
49 will be used" name="categories" optional="False" type="text"/> | |
50 <param default="999" label="-n/--num_permutations: the number of permutations | |
51 to use when calculating statistical significance. Only applies to | |
52 adonis, ANOSIM, MRPP, PERMANOVA, PERMDISP, and db-RDA. Must be greater | |
53 than or equal to zero [default: 999]" name="num_permutations" | |
54 optional="True" type="integer"/> | |
55 </inputs> | |
56 | |
57 <outputs> | |
58 <data format="txt" from_work_dir="compare_categories_output/*.txt" | |
59 name="output_dir" label="Compare_categories.txt"/> | |
60 </outputs> | |
61 | |
62 <tests> | |
63 <test> | |
64 </test> | |
65 </tests> | |
66 | |
67 <help><![CDATA[ | |
68 **What it does** | |
69 | |
70 This script allows for the analysis of the strength and statistical | |
71 significance of sample groupings using a distance matrix as the primary input. | |
72 Several statistical methods are available: adonis, ANOSIM, BIO-ENV, Moran's I, | |
73 MRPP, PERMANOVA, PERMDISP, and db-RDA. | |
74 | |
75 Note: R's vegan and ape packages are used to compute many of these methods, and | |
76 for the ones that are not, their implementations are based on the | |
77 implementations found in those packages. It is recommended to read through the | |
78 detailed descriptions provided by the authors (they are not reproduced here) | |
79 and to refer to the primary literature for complete details, including the | |
80 methods' assumptions. To view the documentation of a method in R, prepend a | |
81 question mark before the method name. For example: | |
82 | |
83 ?vegan::adonis | |
84 | |
85 The following are brief descriptions of the available methods: | |
86 | |
87 adonis - Partitions a distance matrix among sources of variation in order to | |
88 describe the strength and significance that a categorical or continuous | |
89 variable has in determining variation of distances. This is a nonparametric | |
90 method and is nearly equivalent to db-RDA (see below) except when distance | |
91 matrices constructed with semi-metric or non-metric dissimilarities are | |
92 provided, which may result in negative eigenvalues. adonis is very similar to | |
93 PERMANOVA, though it is more robust in that it can accept either categorical or | |
94 continuous variables in the metadata mapping file, while PERMANOVA can only | |
95 accept categorical variables. See vegan::adonis for more details. | |
96 | |
97 ANOSIM - Tests whether two or more groups of samples are significantly | |
98 different based on a categorical variable found in the metadata mapping file. | |
99 You can specify a category in the metadata mapping file to separate | |
100 samples into groups and then test whether there are significant differences | |
101 between those groups. For example, you might test whether 'Control' samples are | |
102 significantly different from 'Fast' samples. Since ANOSIM is nonparametric, | |
103 significance is determined through permutations. See vegan::anosim for more | |
104 details. | |
105 | |
106 BIO-ENV - Finds subsets of variables whose Euclidean distances (after scaling | |
107 the variables) are maximally rank-correlated with the distance matrix. For | |
108 example, the distance matrix might contain UniFrac distances between | |
109 communities, and the variables might be numeric environmental variables (e.g., | |
110 pH and latitude). Correlation between the community distance matrix and | |
111 Euclidean environmental distance matrix is computed using Spearman's rank | |
112 correlation coefficient (rho). This method will only accept categories that are | |
113 numerical (continuous or discrete). This is currently the only method in the | |
114 script that accepts more than one category (via -c). See vegan::bioenv for more | |
115 details. This method is also known as BEST (previously called BIO-ENV) in the | |
116 PRIMER-E software package. | |
117 | |
118 Moran's I - This method uses the numerical (e.g. geographical) data supplied to | |
119 identify what type of spatial configuration occurs in the samples. For example, | |
120 are they dispersed, clustered, or of no distinctly noticeable configuration | |
121 when compared to each other? This method will only accept a category that is | |
122 numerical. See ape::Moran.I for more details. | |
123 | |
124 MRPP - This method tests whether two or more groups of samples are | |
125 significantly different based on a categorical variable found in the metadata | |
126 mapping file. You can specify a category in the metadata mapping file to | |
127 separate samples into groups and then test whether there are significant | |
128 differences between those groups. For example, you might test whether 'Control' | |
129 samples are significantly different from 'Fast' samples. Since MRPP is | |
130 nonparametric, significance is determined through permutations. See | |
131 vegan::mrpp for more details. | |
132 | |
133 PERMANOVA - This method is very similar to adonis except that it only accepts a | |
134 categorical variable in the metadata mapping file. It uses an ANOVA | |
135 experimental design and returns a pseudo-F value and a p-value. Since PERMANOVA | |
136 is nonparametric, significance is determined through permutations. | |
137 | |
138 PERMDISP - This method analyzes the multivariate homogeneity of group | |
139 dispersions (variances). In essence, it determines whether the variances of | |
140 groups of samples are significantly different. The results of both parametric | |
141 and nonparametric significance tests are provided in the output. This method is | |
142 generally used as a companion to PERMANOVA. See vegan::betadisper for more | |
143 details. | |
144 | |
145 db-RDA - This method is very similar to adonis and will only differ if certain | |
146 non-Euclidean semi- or non-metrics are used to generate the input distance | |
147 matrix, and negative eigenvalues are encountered. The only difference then will | |
148 be in the p-values, not the R^2 values. As part of the output, an ordination | |
149 plot is also generated that shows grouping/clustering of samples based on a | |
150 category in the metadata mapping file. This category is used to explain the | |
151 variability between samples. Thus, the ordination output of db-RDA is similar | |
152 to PCoA except that it is constrained, while PCoA is unconstrained (i.e. with | |
153 db-RDA, you must specify which category should be used to explain the | |
154 variability in your data). See vegan::capscale for more details. | |
155 | |
156 For more information and examples pertaining to this script, please refer to | |
157 the accompanying tutorial, which can be found at | |
158 http://qiime.org/tutorials/category_comparison.html. | |
159 | |
160 | |
161 At least one file will be created in the output directory specified by -o. For | |
162 most methods, a single output file containing the results of the test (e.g. the | |
163 effect size statistic and p-value) will be created. The format of the output | |
164 files will vary between methods as some are generated by native QIIME code, | |
165 while others are generated by R's vegan or ape packages. Please refer to the | |
166 script description for details on how to access additional information for | |
167 these methods, including what information is included in the output files. | |
168 | |
169 db-RDA is the only exception in that two output files are created: a results | |
170 text file and a PDF of the ordination plot. | |
171 ]]> | |
172 </help> | |
173 | |
174 <citations> | |
175 <expand macro="citations" /> | |
176 </citations> | |
177 </tool> |