Mercurial > repos > bgruening > interproscan5
comparison interproscan.xml @ 1:2d56d3ae499c draft
Uploaded
| author | bgruening |
|---|---|
| date | Tue, 08 Oct 2013 09:14:43 -0400 |
| parents | 98c6e970cbc1 |
| children |
comparison
equal
deleted
inserted
replaced
| 0:98c6e970cbc1 | 1:2d56d3ae499c |
|---|---|
| 5 <requirement type="package">phobius</requirement> | 5 <requirement type="package">phobius</requirement> |
| 6 <requirement type="package">tmhmm</requirement> | 6 <requirement type="package">tmhmm</requirement> |
| 7 <requirement type="set_environment">INTERPROSCAN_SCRIPT_PATH</requirement> | 7 <requirement type="set_environment">INTERPROSCAN_SCRIPT_PATH</requirement> |
| 8 </requirements> | 8 </requirements> |
| 9 <command> | 9 <command> |
| 10 | |
| 10 #import os | 11 #import os |
| 11 interproscan.sh -dp --fasta $infile --seqtype $seqtype -f $oformat --applications $appl --tempdir \$TEMP | 12 interproscan.sh |
| 13 ## disables the precalculated lookup service, all calculation will be run locally | |
| 14 -dp | |
| 15 --input $infile | |
| 16 --seqtype $seqtype | |
| 17 -f $oformat | |
| 18 --applications $appl | |
| 19 --tempdir \$TEMP | |
| 12 | 20 |
| 13 $pathways | 21 $pathways |
| 14 $goterms | 22 $goterms |
| 15 $iprlookup | 23 $iprlookup |
| 16 | 24 |
| 55 <option value="SignalP-EUK" selected="true">SignalP Eukaryotic Bacteria</option> | 63 <option value="SignalP-EUK" selected="true">SignalP Eukaryotic Bacteria</option> |
| 56 <option value="Phobius" selected="true">Phobius: combined transmembrane topology and signal peptide predictor</option> | 64 <option value="Phobius" selected="true">Phobius: combined transmembrane topology and signal peptide predictor</option> |
| 57 <option value="TMHMM" selected="true">TMHMM: Prediction of transmembrane helices in proteins</option> | 65 <option value="TMHMM" selected="true">TMHMM: Prediction of transmembrane helices in proteins</option> |
| 58 </param> | 66 </param> |
| 59 | 67 |
| 60 <param name="pathways" truevalue="--pathways" falsevalue="" checked="True" type="boolean" label="Include pathway information" help="Option that provides mappings from matches to pathway information, which is based on the matched manually curated InterPro entries."/> | 68 <param name="pathways" truevalue="--pathways" falsevalue="" checked="True" type="boolean" label="Include pathway information" |
| 61 <param name="goterms" truevalue="--goterms" falsevalue="" checked="True" type="boolean" label="Include Gene Ontology (GO) mappings" /> | 69 help="Option that provides mappings from matches to pathway information, which is based on the matched manually curated InterPro entries. (--pathways)"/> |
| 62 <param name="iprlookup" truevalue="--iprlookup" falsevalue="" checked="False" type="boolean" label="Provide mappings from matched member database signatures to the InterPro entries that they are integrated into" /> | 70 <param name="goterms" truevalue="--goterms" falsevalue="" checked="True" type="boolean" label="Include Gene Ontology (GO) mappings" |
| 71 help="Look up of corresponding Gene Ontology annotation. Implies -iprlookup option. (--goterms)"/> | |
| 72 <param name="iprlookup" truevalue="--iprlookup" falsevalue="" checked="False" type="boolean" | |
| 73 label="Provide additional mappings" help="Provide mappings from matched member database signatures to the InterPro entries that they are integrated into (--iprlookup)"/> | |
| 63 | 74 |
| 64 <param name="oformat" type="select" label="Output format" help="Please select a output format."> | 75 <param name="oformat" type="select" label="Output format" help="Please select a output format."> |
| 65 <option value="TSV" selected="true">Tab-separated values format (TSV)</option> | 76 <option value="TSV" selected="true">Tab-separated values format (TSV)</option> |
| 66 <option value="GFF3">GFF3</option> | 77 <option value="GFF3">GFF3</option> |
| 67 <option value="SVG">SVG</option> | 78 <option value="SVG">SVG</option> |
| 109 * GFF: The `GFF 3.0`_ format | 120 * GFF: The `GFF 3.0`_ format |
| 110 * HTML: An HTML representation of the protein matches | 121 * HTML: An HTML representation of the protein matches |
| 111 * SVG: An Scalable Vector Graphics representation of the protein matches | 122 * SVG: An Scalable Vector Graphics representation of the protein matches |
| 112 | 123 |
| 113 | 124 |
| 114 .. _`GFF3 3.0`: http://gmod.org/wiki/GFF#GFF3_Format | 125 .. _`GFF 3.0`: http://gmod.org/wiki/GFF#GFF3_Format |
| 115 .. _here: http://www.ebi.ac.uk/interpro/resources/schemas/interproscan5 | 126 .. _here: http://www.ebi.ac.uk/interpro/resources/schemas/interproscan5 |
| 116 | 127 |
| 117 | 128 |
| 118 Previous versions of InterProScan_ 5 (I5RC1 - I5RC6) could only output data for nucleotide sequences in XML and GFF3. This has now changed and InterProScan_ 5 can output results for nucleotide sequences in all formats. *Please note* you can only trace protein match positions to the original nucleotide sequence with GFF3 and XML. | |
| 119 | |
| 120 You can override the default output formats using the *-f* option, e.g.:: | |
| 121 | |
| 122 ./interproscan.sh -f XML -f HTML -i /path/to/sequences.fasta -b /path/to/output_file | |
| 123 | |
| 124 or | |
| 125 | |
| 126 :: | |
| 127 | |
| 128 ./interproscan.sh -f XML, HTML -i /path/to/sequences.fasta -b /path/to/output_file | |
| 129 | |
| 130 | |
| 131 These two equivalent commands will output the results in XML and HTML format. | |
| 132 | 129 |
| 133 Tab-separated values format (TSV) | 130 Tab-separated values format (TSV) |
| 134 ================================= | 131 ================================= |
| 135 | 132 |
| 136 Basic tab delimited format. | 133 Basic tab delimited format. |
| 171 | 168 |
| 172 XML representation of the matches - this is the richest form of the data. The XML Schema Definition (XSD) is available [http://www.ebi.ac.uk/interpro/resources/schemas/interproscan5 here]. | 169 XML representation of the matches - this is the richest form of the data. The XML Schema Definition (XSD) is available [http://www.ebi.ac.uk/interpro/resources/schemas/interproscan5 here]. |
| 173 | 170 |
| 174 Example Output | 171 Example Output |
| 175 -------------- | 172 -------------- |
| 176 | |
| 177 :: | |
| 178 | 173 |
| 179 .. image:: $PATH_TO_IMAGES/example_xml_output.png | 174 .. image:: $PATH_TO_IMAGES/example_xml_output.png |
| 180 | 175 |
| 181 | 176 |
| 182 | 177 |
| 218 | 213 |
| 219 | 214 |
| 220 Scalable Vector Graphics (SVG) and HyperText Markup Language (HTML) | 215 Scalable Vector Graphics (SVG) and HyperText Markup Language (HTML) |
| 221 ==================================================================== | 216 ==================================================================== |
| 222 | 217 |
| 223 InterProScan_ 5 outputs a single HTML/SVG file for each protein sequence analysed. The HTML/SVG file(s) are compressed into a single gzipped tar archive (or "tarball") that includes the resources (images, Javascript, style etc) to render the pages/images in a browser or image viewer. (Note that from version 5RC4, the SVG format has no external dependencies.) | 218 InterProScan_ 5 outputs a single HTML/SVG file for each protein sequence analysed. |
| 224 | 219 |
| 225 The tarball will be named something similar to:: | |
| 226 | |
| 227 base_output_file_name.html.tar.gz OR base_output_file_name.svg.tar.gz | |
| 228 | |
| 229 | |
| 230 To access the HTML pages/SVG images, unzip the tarball using a command like: | |
| 231 | |
| 232 tar -xvzf base_output_file_name.html(svg).tar.gz | |
| 233 | |
| 234 You can then open the unzipped HTML/SVG files in any browser or image viewer (for SVG). | |
| 235 | 220 |
| 236 Example Output | 221 Example Output |
| 237 -------------- | 222 -------------- |
| 238 | 223 |
| 239 .. image:: $PATH_TO_IMAGES/P51587.svg.png | 224 .. image:: $PATH_TO_IMAGES/P51587.svg.png |
| 242 | 227 |
| 243 | 228 |
| 244 ---------- | 229 ---------- |
| 245 References | 230 References |
| 246 ---------- | 231 ---------- |
| 232 | |
| 233 | |
| 234 If you use this Galaxy tool in work leading to a scientific publication please | |
| 235 cite the following papers: | |
| 236 | |
| 237 Peter J.A. Cock, Björn A. Grüning, Konrad Paszkiewicz and Leighton Pritchard (2013). | |
| 238 Galaxy tools and workflows for sequence analysis with applications | |
| 239 in molecular plant pathology. PeerJ 1:e167 | |
| 240 http://dx.doi.org/10.7717/peerj.167 | |
| 247 | 241 |
| 248 Zdobnov EM, Apweiler R (2001) | 242 Zdobnov EM, Apweiler R (2001) |
| 249 InterProScan an integration platform for the signature-recognition methods in InterPro. | 243 InterProScan an integration platform for the signature-recognition methods in InterPro. |
| 250 Bioinformatics 17, 847-848. | 244 Bioinformatics 17, 847-848. |
| 251 http://dx.doi.org/10.1093/bioinformatics/17.9.847 | 245 http://dx.doi.org/10.1093/bioinformatics/17.9.847 |
| 260 Nucleic Acids Research 37 (Database Issue), D224-228. | 254 Nucleic Acids Research 37 (Database Issue), D224-228. |
| 261 http://dx.doi.org/10.1093/nar/gkn785 | 255 http://dx.doi.org/10.1093/nar/gkn785 |
| 262 | 256 |
| 263 | 257 |
| 264 This wrapper is available to install into other Galaxy Instances via the Galaxy Tool Shed at | 258 This wrapper is available to install into other Galaxy Instances via the Galaxy Tool Shed at |
| 265 http://toolshed.g2.bx.psu.edu/view/bgruening/interproscan | 259 http://toolshed.g2.bx.psu.edu/view/bgruening/interproscan5 |
| 266 | 260 |
| 267 | 261 |
| 268 **Galaxy Wrapper Author**:: | 262 **Galaxy Wrapper Author**:: |
| 269 | 263 |
| 270 * Bjoern Gruening, Pharmaceutical Bioinformatics, University of Freiburg | 264 * Bjoern Gruening, University of Freiburg |
| 271 * Konrad Paszkiewicz, Exeter Sequencing Service, University of Exeter | 265 * Konrad Paszkiewicz, University of Exeter |
| 272 | 266 |
| 273 </help> | 267 </help> |
| 274 </tool> | 268 </tool> |
