Mercurial > repos > greg > gene_family_classifier
comparison gene_family_classifier.xml @ 28:a63b610ec5bd draft
Uploaded
author | greg |
---|---|
date | Fri, 27 Jan 2017 15:03:15 -0500 |
parents | f041f7778540 |
children | 59d1ec149c64 |
comparison
equal
deleted
inserted
replaced
27:f041f7778540 | 28:a63b610ec5bd |
---|---|
1 <tool id="plant_tribes_gene_family_classifier" name="GeneFamilyClassifier" version="1.0.0"> | 1 <tool id="plant_tribes_gene_family_classifier" name="Classify gene sequences" version="0.3"> |
2 <description>pipeline</description> | 2 <description>into precomputed orthologous gene family clusters</description> |
3 <requirements> | 3 <requirements> |
4 <requirement type="package" version="0.2">plant_tribes_gene_family_classifier</requirement> | 4 <requirement type="package" version="0.3">plant_tribes_gene_family_classifier</requirement> |
5 </requirements> | 5 </requirements> |
6 <stdio> | 6 <stdio> |
7 <!-- Anything other than zero is an error --> | 7 <!-- Anything other than zero is an error --> |
8 <exit_code range="1:" /> | 8 <exit_code range="1:" /> |
9 <!-- In case the return code has not been set properly check stderr too --> | 9 <!-- In case the return code has not been set properly check stderr too --> |
69 #end if | 69 #end if |
70 #end if | 70 #end if |
71 | 71 |
72 #if $create_ortho_sequences: | 72 #if $create_ortho_sequences: |
73 #if $create_corresponding_coding_sequences: | 73 #if $create_corresponding_coding_sequences: |
74 && echo "Sequences classified into pre-computed orthologous plant gene family clusters with corresponding coding sequences: `ls $orthogroups_fasta_src_dir | grep f | wc -l` files" > $output | 74 && echo "Sequences classified into precomputed orthologous plant gene family clusters with corresponding coding sequences: `ls $orthogroups_fasta_src_dir | grep f | wc -l` files" > $output |
75 #else: | 75 #else: |
76 && echo "Sequences classified into pre-computed orthologous plant gene family clusters: `ls $orthogroups_fasta_src_dir | grep f | wc -l` files" > $output | 76 && echo "Sequences classified into precomputed orthologous plant gene family clusters: `ls $orthogroups_fasta_src_dir | grep f | wc -l` files" > $output |
77 #end if | 77 #end if |
78 && ls -al $orthogroups_fasta_src_dir | grep f >> $output | 78 && ls -al $orthogroups_fasta_src_dir | grep f >> $output |
79 && mv $orthogroups_fasta_src_dir/* $dest_dir || true | 79 && mv $orthogroups_fasta_src_dir/* $dest_dir || true |
80 #end if | 80 #end if |
81 | 81 |
117 <option value="basic" selected="true">Basic</option> | 117 <option value="basic" selected="true">Basic</option> |
118 <option value="advanced">Advanced</option> | 118 <option value="advanced">Advanced</option> |
119 </param> | 119 </param> |
120 <when value="basic" /> | 120 <when value="basic" /> |
121 <when value="advanced"> | 121 <when value="advanced"> |
122 <param name="super_orthogroups" type="select" label="SuperOrthogroups MCL clustering" help="blastp e-value matrix between all pairs of orthogroups"> | 122 <param name="super_orthogroups" type="select" label="Super Orthogroups" help="Secondary MCL clusters of orthogroups"> |
123 <option value="min_evalue" selected="true">Minimum e-value</option> | 123 <option value="min_evalue" selected="true">Minimum e-value</option> |
124 <option value="avg_evalue">Average e-value</option> | 124 <option value="avg_evalue">Average e-value</option> |
125 </param> | 125 </param> |
126 <conditional name="single_copy_cond"> | 126 <conditional name="single_copy_cond"> |
127 <param name="single_copy" type="select" label="Select single copy configuration"> | 127 <param name="single_copy" type="select" label="Select single copy configuration"> |
180 </output_collection> | 180 </output_collection> |
181 </test> | 181 </test> |
182 </tests> | 182 </tests> |
183 <help> | 183 <help> |
184 This tool is one of the PlantTribes' collection of automated modular analysis pipelines that utilize objective classifications of | 184 This tool is one of the PlantTribes' collection of automated modular analysis pipelines that utilize objective classifications of |
185 complete protein sequences from sequenced plant genomes to perform comparative evolutionary studies. It performs gene family | 185 complete protein sequences from sequenced plant genomes to perform comparative evolutionary studies. This tool classifies gene |
186 classification of the post processed de novo transcripts using either blastp (faster), HMMScan (slower but more sensitive to remote | 186 sequences into precomputed orthologous gene family clusters using either blastp (faster), HMMScan (slower but more sensitive |
187 homologs) or both (more exhaustive). | 187 to remote homologs) or both (more exhaustive). |
188 | |
189 This tool accepts any of the following as input. | |
190 | |
191 * the postprocessed assemblies produced by the **Postprocess de novo assembly transcripts into putative coding sequences** tool | |
192 * externally predicted coding sequences and their corresponding amino acid translations derived from a transcriptome assembly | |
193 * gene predictions from a sequenced genome | |
188 | 194 |
189 ----- | 195 ----- |
190 | 196 |
191 **Options** | 197 **Options** |
192 | 198 |
193 * **Orthogroups or gene families proteins scaffold** - PlantTribes scaffolds data. | 199 * **Orthogroups or gene families proteins scaffold** - PlantTribes scaffolds data. |
194 * **Protein clustering method** - One of GFam, OrthoFinder or OrthoMCL. | 200 * **Protein clustering method** - One of GFam (domain architecture based clustering), OrthoFinder (broadly defined clusters) or OrthoMCL (narrowly defined clusters). |
195 * **Protein classification method** - One of blastp, HMMScan or both. | 201 * **Protein classification method** - blastp (faster), HMMScan (slower but more sensative to the remote homologs) or both (more exhaustive). |
196 * **SuperOrthogroups MCL clustering** - blastp e-value matrix between all pairs of orthogroups. | 202 * **Super Orthogroups** - Secondary MCL clusters of orthogroups. |
197 * **Minumum single copy taxa required in orthogroup** - Used with "Minumum single copy taxa required in orthogroup" configuration only. | 203 * **Minumum single copy taxa required in orthogroup** - Used with "Minumum single copy taxa required in orthogroup" configuration only. |
198 * **Minumum taxa required in single copy orthogroup** - Used with "Minumum single copy taxa required in orthogroup" configuration only. | 204 * **Minumum taxa required in single copy orthogroup** - Used with "Minumum single copy taxa required in orthogroup" configuration only. |
199 * **Corresponding coding sequences (CDS) fasta file** - Used only when selecting "Create orthogroup fasta files?". | 205 * **Corresponding coding sequences (CDS) fasta file** - Used only when selecting "Create orthogroup fasta files?". |
200 | 206 |
201 </help> | 207 </help> |
202 <citations> | 208 <citations> |
203 <citation type="bibtex"> | 209 <citation type="bibtex"> |
204 @unpublished{None, | 210 @unpublished{None, |
205 author = {None}, | 211 author = {Eric Wafula}, |
206 title = {None}, | 212 title = {None}, |
207 year = {None}, | 213 year = {None}, |
208 eprint = {None}, | 214 eprint = {None}, |
209 url = {None} | 215 url = {None} |
210 }</citation> | 216 }</citation> |