Mercurial > repos > pjbriggs > trimmomatic
annotate README.rst @ 14:0fb869e9dee6 draft
Update to Trimmomatic 0.38.
| author | pjbriggs |
|---|---|
| date | Mon, 08 Jul 2019 06:07:07 -0400 |
| parents | 361f703e4094 |
| children | ed7f4b065bb0 |
| rev | line source |
|---|---|
| 1 | 1 Trimmomatic: flexible read trimming tool for Illumina NGS data |
| 2 ============================================================== | |
| 3 | |
| 4 Galaxy tool wrapper for the Trimmomatic program, which provides various functions for | |
| 5 manipluating Illumina FASTQ files (both single and paired-end). | |
| 6 | |
| 7 Trimmomatic has been developed within Bjorn Usadel's group at RWTH Aachen university | |
| 8 http://www.usadellab.org/cms/index.php?page=trimmomatic | |
| 9 | |
| 10 The reference for Trimmomatic is: | |
| 11 | |
| 12 - Bolger, A.M., Lohse, M., & Usadel, B. (2014). Trimmomatic: A flexible trimmer | |
| 13 for Illumina Sequence Data. Bioinformatics, btu170. | |
| 14 | |
| 15 Automated installation | |
| 16 ====================== | |
| 17 | |
| 2 | 18 Installation via the Galaxy Tool Shed will take care of installing the tool wrapper |
| 19 and the trimmomatic program and data, and setting the appropriate environment | |
| 20 variables. | |
| 1 | 21 |
|
8
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
22 Controlling the available memory |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
23 ================================ |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
24 |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
25 The default amount of memory avilable to trimmomatic is set to 8GB. |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
26 To change the default amount of memory you can set the environment variable |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
27 ``_JAVA_OPTIONS`` to ``-Xmx<amount_of_memory_in_GB>G``. The recommended way to |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
28 set this is in the job_conf.xml file. To change the available memory to 6GB, a |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
29 line like the below should be added: |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
30 |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
31 ``<env id="_JAVA_OPTIONS">-Xmx6G</env>`` |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
32 |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
33 This will set the environment variable ``_JAVA_OPTIONS`` to ``-Xmx6G``. |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
34 |
| 1 | 35 Manual Installation |
| 36 =================== | |
| 37 | |
| 38 There are two files to install: | |
| 39 | |
| 40 - ``trimmomatic.xml`` (the Galaxy tool definition) | |
| 41 - ``trimmomatic.sh`` (the shell script wrapper) | |
| 42 | |
| 43 The suggested location is in a ``tools/trimmomatic/`` folder. You will then | |
| 44 need to modify the ``tools_conf.xml`` file to tell Galaxy to offer the tool | |
| 45 by adding the line: | |
| 46 | |
| 47 <tool file="trimmomatic/trimmomatic.xml" /> | |
| 48 | |
| 4 | 49 You will also need to install trimmomatic 0.36: |
| 1 | 50 |
| 4 | 51 - http://www.usadellab.org/cms/uploads/supplementary/Trimmomatic/Trimmomatic-0.36.zip |
| 1 | 52 |
| 53 The tool wrapper uses the following environment variables in order to find the | |
| 54 appropriate files: | |
| 55 | |
| 56 - ``TRIMMOMATIC_DIR`` should point to the directory holding the | |
| 4 | 57 ``trimmomatic-0.36.jar`` file |
| 1 | 58 - ``TRIMMOMATIC_ADAPTERS_DIR`` should point to the directory holding the adapter |
| 59 sequence files (used by the ``ILLUMINACLIP`` option). | |
| 60 | |
| 61 If you want to run the functional tests, copy the sample test files under | |
| 62 sample test files under Galaxy's ``test-data/`` directory. Then: | |
| 63 | |
| 64 ./run_tests.sh -id trimmomatic | |
| 65 | |
| 66 You will need to have set the environment variables above. | |
| 67 | |
| 68 History | |
| 69 ======= | |
| 70 | |
| 71 ========== ====================================================================== | |
| 72 Version Changes | |
| 73 ---------- ---------------------------------------------------------------------- | |
| 12 | 74 0.36.6 - Added trimlog and log outputs; add support for ``fastqillumina`` |
| 75 and ``fastqsolexa`` input types | |
|
11
86bedbd3c5c2
Uploaded version 0.36.5 (use conda to resolve dependencies)
pjbriggs
parents:
9
diff
changeset
|
76 0.36.5 - Remove tool_dependencies.xml and always use conda to resolve tool |
|
86bedbd3c5c2
Uploaded version 0.36.5 (use conda to resolve dependencies)
pjbriggs
parents:
9
diff
changeset
|
77 dependencies |
| 9 | 78 0.36.4 - Add option to provide custom adapter sequences for ILLUMINACLIP |
| 79 - Add options ``minAdapterLength`` and ``keepBothReads`` for ILLUMINACLIP | |
| 80 in palindrome mode | |
|
8
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
81 0.36.3 - Fix naming of output collections. Instead of all outputs being called |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
82 "Trimmomatic on collection NN" these will now be called "Trimmomatic |
|
a923b799c77c
Version 0.36.3: fix the naming of output collections to differentiate btwn paired/unpaired; document the _JAVA_OPTIONS env var (thanks Marius van den Beek).
pjbriggs
parents:
7
diff
changeset
|
83 on collection NN: paired" or "Trimmomatic on collection NN: unpaired". |
| 7 | 84 0.36.2 - Support fastqsanger.gz datatype. If fastqsanger.gz is used as input |
| 85 the output will also be fastqsanger.gz. | |
| 86 - Use $_JAVA_OPTIONS to customize memory requirements. | |
|
6
b9415df5fc32
Updated to 0.36.1: Reimplement to work with bioconda Trimmomatic 0.36 (toolshed version is still supported for now).
pjbriggs
parents:
4
diff
changeset
|
87 0.36.1 - Reimplement to work with bioconda Trimmomatic 0.36 (toolshed version |
|
b9415df5fc32
Updated to 0.36.1: Reimplement to work with bioconda Trimmomatic 0.36 (toolshed version is still supported for now).
pjbriggs
parents:
4
diff
changeset
|
88 is still supported for now). |
| 4 | 89 0.36.0 - Update to Trimmomatic 0.36. |
| 90 0.32.4 - Add support for ``AVGQUAL`` and ``MAXINFO`` operations. | |
|
3
a7139c612c45
Updated to version 0.32.3: add support for FASTQ pairs (dataset collections)
pjbriggs
parents:
2
diff
changeset
|
91 0.32.3 - Add support for FASTQ R1/R2 pairs using dataset collections (input |
|
a7139c612c45
Updated to version 0.32.3: add support for FASTQ pairs (dataset collections)
pjbriggs
parents:
2
diff
changeset
|
92 can be dataset collection, in which case tool also outputs dataset |
|
a7139c612c45
Updated to version 0.32.3: add support for FASTQ pairs (dataset collections)
pjbriggs
parents:
2
diff
changeset
|
93 collections) and improve order and naming of output files. |
| 2 | 94 0.32.2 - Use ``GALAXY_SLOTS`` to set the appropriate number of threads to use |
| 95 at runtime (default is 6). | |
| 1 | 96 0.32.1 - Remove ``trimmomatic_adapters.loc.sample`` and hard-code adapter files |
| 97 into the XML wrapper. | |
| 98 0.32.0 - Add tool_dependencies.xml to install Trimmomatic 0.32 automatically and | |
| 99 set the environment. | |
| 100 - Update tool versioning to use Trimmomatic version number (i.e. ``0.32``) | |
| 101 with tool iteration appended (i.e. ``.1``). | |
| 102 0.0.4 - Specify '-threads 6' in <command> section. | |
| 103 0.0.3 - Added MINLEN, LEADING, TRAILING, CROP and HEADCROP options of trimmomatic. | |
| 104 0.0.2 - Updated ILLUMINACLIP option to use standard adapter sequences (requires | |
| 105 the trimmomatic_adapters.loc file; sample version is supplied) plus | |
| 106 cosmetic updates to wording and help text for some options. | |
| 107 0.0.1 - Initial version | |
| 108 ========== ====================================================================== | |
| 109 | |
| 110 | |
| 7 | 111 Credits |
| 112 ======= | |
| 113 | |
| 114 This wrapper has been developed and is maintained by Peter Briggs (@pjbriggs). | |
| 12 | 115 Peter van Heusden (@pvanheus) and Marius van den Beek (@mvdbeek) contributed |
| 9 | 116 support for gz compressed FastQ files. Charles Girardot (@cgirardot) and |
| 117 Jelle Scholtalbers (@scholtalbers) contributed additional options to ILLUMINACLIP. | |
| 12 | 118 Matthias Bernt (@bernt-matthias) added log and trimlog output. |
| 7 | 119 |
| 1 | 120 Developers |
| 121 ========== | |
| 122 | |
| 123 This tool is developed on the following GitHub repository: | |
| 124 https://github.com/fls-bioinformatics-core/galaxy-tools/tree/master/trimmomatic | |
| 125 | |
| 126 For making the "Galaxy Tool Shed" http://toolshed.g2.bx.psu.edu/ tarball I use | |
| 127 the ``package_trimmomatic.sh`` script. | |
| 128 | |
| 129 | |
| 130 Licence (MIT) | |
| 131 ============= | |
| 132 | |
| 133 Permission is hereby granted, free of charge, to any person obtaining a copy | |
| 134 of this software and associated documentation files (the "Software"), to deal | |
| 135 in the Software without restriction, including without limitation the rights | |
| 136 to use, copy, modify, merge, publish, distribute, sublicense, and/or sell | |
| 137 copies of the Software, and to permit persons to whom the Software is | |
| 138 furnished to do so, subject to the following conditions: | |
| 139 | |
| 140 The above copyright notice and this permission notice shall be included in | |
| 141 all copies or substantial portions of the Software. | |
| 142 | |
| 143 THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR | |
| 144 IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, | |
| 145 FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE | |
| 146 AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER | |
| 147 LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, | |
| 148 OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN | |
| 149 THE SOFTWARE. |
