Mercurial > repos > pjbriggs > motif_tools
comparison Scan_IUPAC_output_each_match.xml @ 0:b42da9dc4507 draft
Uploaded initial version 1.0.1.
author | pjbriggs |
---|---|
date | Wed, 21 Mar 2018 05:44:12 -0400 |
parents | |
children | 2f34d5e91bc7 |
comparison
equal
deleted
inserted
replaced
-1:000000000000 | 0:b42da9dc4507 |
---|---|
1 <?xml version="1.0" encoding="utf-8"?> | |
2 <tool id="fasta_scan_iupac_each" name="IUPAC scan and output each match" version="@VERSION@"> | |
3 <description>Returns all matches to a given IUPAC in GFF format</description> | |
4 <macros> | |
5 <import>motif_tools_macros.xml</import> | |
6 </macros> | |
7 <expand macro="requirements" /> | |
8 <command><![CDATA[ | |
9 perl $__tool_directory__/Scan_IUPAC_output_each_match.pl $iupac $fasta $output $label $strand | |
10 ]]></command> | |
11 <inputs> | |
12 <param name="iupac" type="text" label="IUPAC string" value="e.g. WGATAR" help="Enter an IUPAC string." size="20"/> | |
13 <param format="fasta" name="fasta" type="data" label="FASTA file" help="Select a FASTA file containing the sequences to be scanned."/> | |
14 <param name="label" type="text" label="Attribute in GFF output" value="IUPAC_or_name" help="The label will be included at the end (attibute) section of each GFF line. This could be the IUPAC string used or the name of the motif." size="20"/> | |
15 <param name="strand" type="select" label="Select sequence strands to scan" help="Scan either both strands or only the forward strand."> | |
16 <option value="0">Scan both strands</option> | |
17 <option value="1">Only scan forward strand</option> | |
18 </param> | |
19 </inputs> | |
20 <outputs> | |
21 <data format="gff" name="output" /> | |
22 </outputs> | |
23 | |
24 <help> | |
25 .. class:: infomark | |
26 | |
27 **What it does** | |
28 | |
29 This tool will find all matches to a DNA pattern in the input DNA sequence, represented by an IUPAC string. The matches are non-overlapping, so searching with 'TTTT' in 'TTTTTTTT' will find two hits to the IUPAC. The output is in GFF format and the last 'attribute' field can be specified using the 'Label' option. | |
30 | |
31 IUPAC = Nucleotide(s): | |
32 | |
33 A = A | |
34 | |
35 C = C | |
36 | |
37 G = G | |
38 | |
39 T = T | |
40 | |
41 M = A/C | |
42 | |
43 R = A/G | |
44 | |
45 W = A/T | |
46 | |
47 S = C/G | |
48 | |
49 Y = C/T | |
50 | |
51 K = G/T | |
52 | |
53 V = A/C/G | |
54 | |
55 H = A/C/T | |
56 | |
57 D = A/G/T | |
58 | |
59 B = C/G/T | |
60 | |
61 N = A/C/G/T | |
62 | |
63 ---- | |
64 | |
65 .. class:: infomark | |
66 | |
67 **Options** | |
68 | |
69 'IUPAC string' - can be entered as upper- or lower-case as the tool will force them to become upper-case, but will only accept the IUPAC codes listed above. | |
70 | |
71 'Attribute in GFF output' - the last field of each GFF line 'attribute' can be specified using the 'Label' option, this should only include letters/numbers, but without spaces. | |
72 | |
73 'Select sequence strands to scan' - Only scanning the forward strand of the input sequence is useful if the IUPAC is a palindrome (e.g. CANNTG). | |
74 | |
75 ---- | |
76 | |
77 .. class:: infomark | |
78 | |
79 **Credits** | |
80 | |
81 This Galaxy tool has been developed within the Bioinformatics Core Facility at the University of Manchester. It runs the Scan_IUPAC_output_each_match.pl Perl script that was written by Ian Donaldson. | |
82 | |
83 Please kindly acknowledge both this Galaxy tool and Scan_IUPAC_output_each_match.pl if you use it. | |
84 </help> | |
85 | |
86 </tool> | |
87 |