Mercurial > repos > devteam > fasta_clipping_histogram
comparison fasta_clipping_histogram.xml @ 3:d0969fa24eb1 draft
planemo upload commit 33927a87ba2eee9bf0ecdd376a66241b17b3d734
| author | devteam |
|---|---|
| date | Tue, 13 Oct 2015 12:38:50 -0400 |
| parents | de44f4045b05 |
| children | 40ec4170291e |
comparison
equal
deleted
inserted
replaced
| 2:20e471a2fdc6 | 3:d0969fa24eb1 |
|---|---|
| 1 <tool id="cshl_fasta_clipping_histogram" name="Length Distribution" version="1.0.0"> | 1 <tool id="cshl_fasta_clipping_histogram" name="Length Distribution" version="1.0.0"> |
| 2 <description>chart</description> | 2 <description>chart</description> |
| 3 <requirements> | 3 <requirements> |
| 4 <requirement type="package" version="0.0.13">fastx_toolkit</requirement> | 4 <requirement type="package" version="0.0.13">fastx_toolkit</requirement> |
| 5 </requirements> | 5 </requirements> |
| 6 <command>fasta_clipping_histogram.pl $input $outfile</command> | 6 <command>fasta_clipping_histogram.pl $input $outfile</command> |
| 7 | |
| 8 <inputs> | |
| 9 <param format="fasta" name="input" type="data" label="Library to analyze" /> | |
| 10 </inputs> | |
| 11 | 7 |
| 12 <outputs> | 8 <inputs> |
| 13 <data format="png" name="outfile" metadata_source="input" /> | 9 <param format="fasta" name="input" type="data" label="Library to analyze" /> |
| 14 </outputs> | 10 </inputs> |
| 15 <help> | |
| 16 | 11 |
| 12 <outputs> | |
| 13 <data format="png" name="outfile" metadata_source="input" /> | |
| 14 </outputs> | |
| 15 <tests> | |
| 16 </tests> | |
| 17 <help> | |
| 17 **What it does** | 18 **What it does** |
| 18 | 19 |
| 19 This tool creates a histogram image of sequence lengths distribution in a given fasta dataset file. | 20 This tool creates a histogram image of sequence lengths distribution in a given fasta dataset file. |
| 20 | 21 |
| 21 **TIP:** Use this tool after clipping your library (with **FASTX Clipper tool**), to visualize the clipping results. | 22 **TIP:** Use this tool after clipping your library (with **FASTX Clipper tool**), to visualize the clipping results. |
| 22 | 23 |
| 23 ----- | 24 ----- |
| 24 | 25 |
| 25 **Output Examples** | 26 **Output Examples** |
| 26 | 27 |
| 27 In the following library, most sequences are 24-mers to 27-mers. | 28 In the following library, most sequences are 24-mers to 27-mers. |
| 28 This could indicate an abundance of endo-siRNAs (depending of course of what you've tried to sequence in the first place). | 29 This could indicate an abundance of endo-siRNAs (depending of course of what you've tried to sequence in the first place). |
| 29 | 30 |
| 30 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_1.png | 31 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_1.png |
| 31 | 32 |
| 32 | 33 In the following library, most sequences are 19,22 or 23-mers. |
| 33 In the following library, most sequences are 19,22 or 23-mers. | |
| 34 This could indicate an abundance of miRNAs (depending of course of what you've tried to sequence in the first place). | 34 This could indicate an abundance of miRNAs (depending of course of what you've tried to sequence in the first place). |
| 35 | 35 |
| 36 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_2.png | 36 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_2.png |
| 37 | 37 |
| 38 | |
| 39 ----- | 38 ----- |
| 40 | |
| 41 | 39 |
| 42 **Input Formats** | 40 **Input Formats** |
| 43 | 41 |
| 44 This tool accepts short-reads FASTA files. The reads don't have to be short, but they do have to be on a single line, like so:: | 42 This tool accepts short-reads FASTA files. The reads don't have to be short, but they do have to be on a single line, like so:: |
| 45 | 43 |
| 47 AGTAGTAGGTGATGTAGAGAGAGAGAGAGTAG | 45 AGTAGTAGGTGATGTAGAGAGAGAGAGAGTAG |
| 48 >sequence2 | 46 >sequence2 |
| 49 GTGTGTGTGGGAAGTTGACACAGTA | 47 GTGTGTGTGGGAAGTTGACACAGTA |
| 50 >sequence3 | 48 >sequence3 |
| 51 CCTTGAGATTAACGCTAATCAAGTAAAC | 49 CCTTGAGATTAACGCTAATCAAGTAAAC |
| 52 | |
| 53 | 50 |
| 54 If the sequences span over multiple lines:: | 51 If the sequences span over multiple lines:: |
| 55 | 52 |
| 56 >sequence1 | 53 >sequence1 |
| 57 CAGCATCTACATAATATGATCGCTATTAAACTTAAATCTCCTTGACGGAG | 54 CAGCATCTACATAATATGATCGCTATTAAACTTAAATCTCCTTGACGGAG |
| 61 Use the **FASTA Width Formatter** tool to re-format the FASTA into a single-lined sequences:: | 58 Use the **FASTA Width Formatter** tool to re-format the FASTA into a single-lined sequences:: |
| 62 | 59 |
| 63 >sequence1 | 60 >sequence1 |
| 64 CAGCATCTACATAATATGATCGCTATTAAACTTAAATCTCCTTGACGGAGTCTTCGGTCATAACACAAACCCAGACCTACGTATATGACAAAGCTAATAGaactggtctttacctTTAAGTTG | 61 CAGCATCTACATAATATGATCGCTATTAAACTTAAATCTCCTTGACGGAGTCTTCGGTCATAACACAAACCCAGACCTACGTATATGACAAAGCTAATAGaactggtctttacctTTAAGTTG |
| 65 | 62 |
| 66 | |
| 67 ----- | 63 ----- |
| 68 | |
| 69 | |
| 70 | 64 |
| 71 **Multiplicity counts (a.k.a reads-count)** | 65 **Multiplicity counts (a.k.a reads-count)** |
| 72 | 66 |
| 73 If the sequence identifier (the text after the '>') contains a dash and a number, it is treated as a multiplicity count value (i.e. how many times that individual sequence repeated in the original FASTA file, before collapsing). | 67 If the sequence identifier (the text after the '>') contains a dash and a number, it is treated as a multiplicity count value (i.e. how many times that individual sequence repeated in the original FASTA file, before collapsing). |
| 74 | 68 |
| 82 GGGATATATCCCCACACACACACAC | 76 GGGATATATCCCCACACACACACAC |
| 83 | 77 |
| 84 Each sequence is counts as one, to produce the following chart: | 78 Each sequence is counts as one, to produce the following chart: |
| 85 | 79 |
| 86 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_3.png | 80 .. image:: ${static_path}/fastx_icons/fasta_clipping_histogram_3.png |
| 87 | |
| 88 | 81 |
| 89 Example 2 - The following FASTA file have multiplicity counts:: | 82 Example 2 - The following FASTA file have multiplicity counts:: |
| 90 | 83 |
| 91 >seq1-2 | 84 >seq1-2 |
| 92 GGATCC | 85 GGATCC |
| 104 ------ | 97 ------ |
| 105 | 98 |
| 106 This tool is based on `FASTX-toolkit`__ by Assaf Gordon. | 99 This tool is based on `FASTX-toolkit`__ by Assaf Gordon. |
| 107 | 100 |
| 108 .. __: http://hannonlab.cshl.edu/fastx_toolkit/ | 101 .. __: http://hannonlab.cshl.edu/fastx_toolkit/ |
| 109 | 102 </help> |
| 110 </help> | |
| 111 <!-- FASTA-Clipping-Histogram is part of the FASTX-toolkit, by A.Gordon (gordon@cshl.edu) --> | |
| 112 </tool> | 103 </tool> |
