comparison kaks_analysis.xml @ 12:7945134d3956 draft

Uploaded
author greg
date Wed, 01 Mar 2017 15:01:15 -0500
parents a0b1f599becc
children 09ef5cc67c78
comparison
equal deleted inserted replaced
11:a0b1f599becc 12:7945134d3956
182 182
183 **Options** 183 **Options**
184 184
185 * **Required options** 185 * **Required options**
186 186
187 - **Select gene family clusters** - Sequences classified into gene family clusters, optionally including corresponding coding sequences. 187 - **Coding sequences (CDS) fasta file for the species** - Coding sequences (CDS) fasta file for the first species.
188 - **Orthogroups or gene families proteins scaffold** - PlantTribes scaffolds data. 188 - **Aamino acids (proteins) sequences fasta file for the species** - Aamino acids (proteins) sequences fasta file for the first species
189 - **Protein clustering method** - One of GFam (domain architecture based clustering), OrthoFinder (broadly defined clusters) or OrthoMCL (narrowly defined clusters). 189 - **Select method for pairwise sequence comparison to determine homolgous pairs** - Pairwise sequence comparison to determine homolgous pairs (cross species comparison requires selection of inputs for second species).
190 190 - **Orthogroups or gene families proteins scaffold** - PlantTribes scaffolds data installed into Galaxy by the PlantTribes Scaffolds Download Data Manager tool.
191 * **Multiple sequence alignments options** 191
192 192 * **Other (optional) options**
193 - **Select method for multiple sequence alignments** - Method used for setting multiple sequence alignments. 193
194 - **Input sequences include corresponding coding sequences?** - Selecting 'Yes' for this option requires that the selected input data format is 'ptorthocs'. 194 - **Minimum sequence pairwise coverage length between homologous pairs** - Minimum sequence pairwise coverage length between homologous pairs (e.g., 0.5 results in 50% coverage. Legal values lie between 0.3 and 1.0.
195 - **Construct orthogroup multiple codon alignments?** - Construct orthogroup multiple codon alignments. 195 - **Evolutionary rate for recalibrating synonymous subsitutions (ks) of species** - (applies to paralogous ks analysis) Recalibrate synonymous subsitutions (ks) of species using a predetermined evoutionary rate that can be determined from a species tree inferred from a collection single copy genes from taxa of interest (Cui et al., 2006).
196 - **Sequence type used in the phylogenetic inference** - Sequence type (dna or amino acid) used in the phylogenetic inference. 196 - **Select PAML codeml control file?** - Select PAML's codeml control file from your history. This file is used to to perfom ML analysis of protein-coding DNA sequences using codon substitution models. Selecting No uses the default file which does not include input (seqfile, treefile) and output (outfile) parameters of codeml.
197 - **Use corresponding coding sequences?** - Selecting 'Yes' for this option requires that the selected input data format is 'ptorthocs' or this tool will produce an error. 197 - **Fit a mixture model of multivariate normal components to synonymous (ks) distribution?** - Fit a mixture model of multivariate normal components to synonymous (ks) distribution to identify significant duplication event(s) in a genome.
198 198 - **Number components to fit to synonymous subsitutions (ks) distribution** - Number components to fit to synonymous subsitutions (ks) distribution.
199 * **Phylogenetic trees options** 199 - **Lower limit of synonymous subsitutions (ks)** - Lower limit of synonymous subsitutions (ks) - necessary if fitting components to the distribution to reduce background noise from young paralogous pairs due to normal gene births and deaths in a genome.
200 200 - **Upper limit of synonymous subsitutions (ks)** - Upper limit of synonymous subsitutions (ks) - necessary if fitting components to the distribution to exclude likey ancient paralogous pairs.
201 - **Phylogenetic trees inference method** - Phylogenetic trees inference method.
202 - **Select rooting order configuration for rooting trees??** - If 'No' is selected, trees will be rooted using the most distant taxon present in the orthogroup.
203 - **Number of replicates for rapid bootstrap analysis and search for the best-scoring ML tree** - Number of replicates for rapid bootstrap analysis and search for the best-scoring ML tree.
204 - **Maximum number of sequences in orthogroup alignments** - Maximum number of sequences in orthogroup alignments.
205 - **Minimum number of sequences in orthogroup alignments** - Minimum number of sequences in orthogroup alignments.
206
207 * **MSA quality control options**
208
209 - **Remove sequences with gaps of** - Removes gappy sequences in alignments (i.e., 0.5 removes sequences with 50% gaps).
210 - **Select process used for gap trimming** - Either nucleotide based trimming or alignments are trimed using using trimAl's ML heuristic trimming approach.
211 - **Remove sites in alignments with gaps of** - If the process used for gap trimming is nucleotide based, this is the gap value used when removing gappy sites in alignments (i.e., 0.1 removes sites with 90% gaps).
212 201
213 </help> 202 </help>
214 <citations> 203 <citations>
215 <citation type="bibtex"> 204 <citation type="bibtex">
216 @unpublished{None, 205 @unpublished{None,
218 title = {None}, 207 title = {None},
219 year = {None}, 208 year = {None},
220 url = {https://github.com/dePamphilis/PlantTribes} 209 url = {https://github.com/dePamphilis/PlantTribes}
221 } 210 }
222 </citation> 211 </citation>
212 <citation type="doi">10.1093/bioinformatics/btw412</citation>
223 <citation type="doi">10.1186/1471-2105-10-421</citation> 213 <citation type="doi">10.1186/1471-2105-10-421</citation>
224 <citation type="doi">10.1093/molbev/msm088</citation> 214 <citation type="doi">10.1093/molbev/msm088</citation>
225 <citation type="doi">10.18637/jss.v004.i02</citation> 215 <citation type="doi">10.18637/jss.v004.i02</citation>
226 </citations> 216 </citations>
227 </tool> 217 </tool>