Mercurial > repos > jjohnson > snpeff_with_dep
annotate snpEff.xml @ 25:7c1c337fb21c default tip
Update tool_dependencies.xml for testtoolshed
| author | Jim Johnson <jj@umn.edu> |
|---|---|
| date | Tue, 07 May 2013 15:05:54 -0500 |
| parents | f22f28cd881e |
| children |
| rev | line source |
|---|---|
| 24 | 1 <tool id="snpEff" name="SnpEff" version="3.2"> |
| 0 | 2 <description>Variant effect and annotation</description> |
| 3 <!-- | |
| 4 You will need to change the path to wherever your installation is. | |
| 5 You can change the amount of memory used by snpEff, just change the -Xmx parameter (e.g. use -Xmx2G for 2Gb of memory) | |
| 6 <command>java -Xmx6G -jar /path/to/your/snpEff/snpEff.jar eff -c /path/to/your/snpEff/snpEff/snpEff.config $inputFormat $offset -upDownStreamLen $udLength $filterIn $filterHomHet -no $filterOut -stats $statsFile $genomeVersion $input > $output </command> | |
| 24 | 7 Options: |
| 8 -a , -around : Show N codons and amino acids around change (only in coding regions). Default is 0 codons. | |
| 9 -i <format> : Input format [ vcf, txt, pileup, bed ]. Default: VCF. | |
| 10 -o <format> : Ouput format [ txt, vcf, gatk, bed, bedAnn ]. Default: VCF. | |
| 11 -interval : Use a custom interval file (you may use this option many times) | |
| 12 -chr <string> : Prepend 'string' to chromosome name (e.g. 'chr1' instead of '1'). Only on TXT output. | |
| 13 -s, -stats : Name of stats file (summary). Default is 'snpEff_summary.html' | |
| 14 -t : Use multiple threads (implies '-noStats'). Default 'off' | |
| 15 | |
| 16 Sequence change filter options: | |
| 17 -del : Analyze deletions only | |
| 18 -ins : Analyze insertions only | |
| 19 -hom : Analyze homozygous variants only | |
| 20 -het : Analyze heterozygous variants only | |
| 21 -minQ X, -minQuality X : Filter out variants with quality lower than X | |
| 22 -maxQ X, -maxQuality X : Filter out variants with quality higher than X | |
| 23 -minC X, -minCoverage X : Filter out variants with coverage lower than X | |
| 24 -maxC X, -maxCoverage X : Filter out variants with coverage higher than X | |
| 25 -nmp : Only MNPs (multiple nucleotide polymorphisms) | |
| 26 -snp : Only SNPs (single nucleotide polymorphisms) | |
| 27 | |
| 28 Results filter options: | |
| 29 -fi <bedFile> : Only analyze changes that intersect with the intervals specified in this file (you may use this option many times) | |
| 30 -no-downstream : Do not show DOWNSTREAM changes | |
| 31 -no-intergenic : Do not show INTERGENIC changes | |
| 32 -no-intron : Do not show INTRON changes | |
| 33 -no-upstream : Do not show UPSTREAM changes | |
| 34 -no-utr : Do not show 5_PRIME_UTR or 3_PRIME_UTR changes | |
| 35 | |
| 36 Annotations options: | |
| 37 -cancer : Perform 'cancer' comparissons (Somatic vs Germline). Default: false | |
| 38 -canon : Only use canonical transcripts. | |
| 39 -geneId : Use gene ID instead of gene name (VCF output). Default: false | |
| 40 -hgvs : Use HGVS annotations for amino acid sub-field. Default: false | |
| 41 -lof : Add loss of function (LOF) and Nonsense mediated decay (NMD) tags. | |
| 42 -reg <name> : Regulation track to use (this option can be used add several times). | |
| 43 -oicr : Add OICR tag in VCF file. Default: false | |
| 44 -onlyReg : Only use regulation tracks. | |
| 45 -onlyTr <file.txt> : Only use the transcripts in this file. Format: One transcript ID per line. | |
| 46 -sequenceOntolgy : Use Sequence Ontolgy terms. Default: false | |
| 47 -ss, -spliceSiteSize <int> : Set size for splice sites (donor and acceptor) in bases. Default: 2 | |
| 48 -ud, -upDownStreamLen <int> : Set upstream downstream interval length (in bases) | |
| 49 | |
| 50 Generic options: | |
| 51 -0 : File positions are zero-based (same as '-inOffset 0 -outOffset 0') | |
| 52 -1 : File positions are one-based (same as '-inOffset 1 -outOffset 1') | |
| 53 -c , -config : Specify config file | |
| 54 -h , -help : Show this help and exit | |
| 55 -if, -inOffset : Offset input by a number of bases. E.g. '-inOffset 1' for one-based input files | |
| 56 -of, -outOffset : Offset output by a number of bases. E.g. '-outOffset 1' for one-based output files | |
| 57 -noLog : Do not report usage statistics to server | |
| 58 -noStats : Do not create stats (summary) file | |
| 59 -q , -quiet : Quiet mode (do not show any messages or errors) | |
| 60 -v , -verbose : Verbose mode | |
| 61 | |
| 0 | 62 --> |
| 63 <requirements> | |
| 24 | 64 <requirement type="package" version="3.2">snpEff</requirement> |
| 0 | 65 </requirements> |
|
22
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
66 <command> |
| 24 | 67 SNPEFF_DATA_DIR=`grep '^data_dir' \$JAVA_JAR_PATH/snpEff.config | sed 's/.*data_dir.*[=:]//'`; |
| 68 eval "if [ ! -e \$SNPEFF_DATA_DIR/$genomeVersion ] ; | |
|
22
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
69 then java -Xmx6G -jar \$JAVA_JAR_PATH/snpEff.jar download -c \$JAVA_JAR_PATH/snpEff.config $genomeVersion ; |
| 24 | 70 fi"; |
| 71 java -Xmx6G -jar \$JAVA_JAR_PATH/snpEff.jar eff -c \$JAVA_JAR_PATH/snpEff.config -i $inputFormat -o $outputFormat -upDownStreamLen $udLength | |
| 72 #if $spliceSiteSize and $spliceSiteSize.__str__ != '': | |
| 73 -spliceSiteSize $spliceSiteSize | |
| 74 #end if | |
| 75 #if $filterIn and $filterIn.__str__ != 'no_filter': | |
| 76 -$filterIn | |
| 77 #end if | |
| 78 #if $filterHomHet and $filterHomHet.__str__ != 'no_filter': | |
| 79 -$filterHomHet | |
| 80 #end if | |
| 81 #if $annotations and $annotations.__str__ != '': | |
| 82 -#slurp | |
| 83 #echo ' -'.join($annotations.__str__.split(',')) | |
| 84 #end if | |
| 85 #if $filterOut and $filterOut.__str__ != '': | |
| 86 -#slurp | |
| 87 #echo ' -'.join($filterOut.__str__.split(',')) | |
| 88 #end if | |
| 89 #if str( $transcripts ) != 'None': | |
| 90 -onlyTr $transcripts | |
| 91 #end if | |
| 92 #if str( $intervals ) != 'None': ### fix this for multiple dataset input | |
| 93 -interval $intervals | |
| 94 #end if | |
| 95 #if $statsFile: | |
| 96 -stats $statsFile | |
| 97 #end if | |
| 98 #if $offset.__str__ != '': | |
| 99 -${offset} | |
| 100 #end if | |
| 101 #if $chr.__str__.strip() != '': | |
| 102 -chr "$chr" | |
| 103 #end if | |
| 104 $noLog $genomeVersion $input > $snpeff_output | |
| 105 </command> | |
| 0 | 106 <inputs> |
| 107 <param format="vcf,tabular,pileup,bed" name="input" type="data" label="Sequence changes (SNPs, MNPs, InDels)"/> | |
| 108 | |
| 109 <param name="inputFormat" type="select" label="Input format"> | |
| 24 | 110 <option value="vcf" selected="true">VCF</option> |
| 111 <option value="txt">Tabular (Deprecated)</option> | |
| 112 <option value="pileup">Pileup (Deprecated)</option> | |
| 113 <option value="bed">BED (Deprecated)</option> | |
| 0 | 114 </param> |
| 115 | |
| 116 <param name="outputFormat" type="select" label="Output format"> | |
| 24 | 117 <option value="vcf" selected="true">VCF (only if input is VCF)</option> |
| 0 | 118 <option value="txt">Tabular</option> |
| 119 <option value="bed">BED</option> | |
| 120 <option value="bedAnn">BED Annotations</option> | |
| 121 </param> | |
| 122 | |
| 123 <param name="genomeVersion" type="select" label="Genome"> | |
| 24 | 124 <!--GENOME DESCRIPTION--> |
|
22
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
125 <options from_file="snpeffect_genomedb.loc"> |
|
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
126 <column name="name" index="1"/> |
|
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
127 <column name="value" index="0"/> |
|
4430626bd994
Use snpeffect_genomedb.loc for genomeVersions, have snpEff attempt to download genomeVersions that are not already locally cached
Jim Johnson <jj@umn.edu>
parents:
20
diff
changeset
|
128 </options> |
| 0 | 129 </param> |
| 130 | |
| 131 <param name="udLength" type="select" label="Upstream / Downstream length"> | |
| 132 <option value="0">No upstream / downstream intervals (0 bases)</option> | |
| 133 <option value="200">200 bases</option> | |
| 134 <option value="500">500 bases</option> | |
| 135 <option value="1000">1000 bases</option> | |
| 136 <option value="2000">2000 bases</option> | |
| 137 <option value="5000" selected="true">5000 bases</option> | |
| 138 <option value="10000">10000 bases</option> | |
| 139 <option value="20000">20000 bases</option> | |
| 140 </param> | |
| 141 | |
| 24 | 142 <param name="spliceSiteSize" type="select" optional="true" label="Set size for splice sites (donor and acceptor) in bases. Default: 2"> |
| 143 <option value="1">1 base</option> | |
| 144 <option value="2">2 bases</option> | |
| 145 <option value="3">3 bases</option> | |
| 146 <option value="4">4 bases</option> | |
| 147 <option value="5">5 bases</option> | |
| 148 <option value="6">6 bases</option> | |
| 149 <option value="7">7 bases</option> | |
| 150 <option value="8">8 bases</option> | |
| 151 <option value="9">9 bases</option> | |
| 152 </param> | |
| 0 | 153 |
| 24 | 154 <param name="filterHomHet" type="select" display="radio" label="Filter homozygous / heterozygous changes"> |
| 155 <option value="no_filter" selected="true">No filter (analyze everything)</option> | |
| 156 <option value="hom">Analyze homozygous sequence changes only </option> | |
| 157 <option value="het">Analyze heterozygous sequence changes only </option> | |
| 158 </param> | |
| 159 | |
| 160 <!-- The tool testing code can not handle select,radio,checkbox values that start with '-', so the '-' is added in the command generation --> | |
| 161 <param name="filterIn" type="select" display="radio" label="Filter sequence changes"> | |
| 162 <option value="no_filter" selected="true">No filter (analyze everything)</option> | |
| 163 <option value="del">Analyze deletions only </option> | |
| 164 <option value="ins">Analyze insertions only </option> | |
| 165 <option value="mnp">Only MNPs (multiple nucleotide polymorphisms) </option> | |
| 166 <option value="snp">Only SNPs (single nucleotide polymorphisms) </option> | |
| 167 </param> | |
| 0 | 168 |
| 24 | 169 <param name="annotations" type="select" display="checkboxes" multiple="true" optional="true" label="Annotation options"> |
| 170 <option value="cancer">Perform 'cancer' comparissons (Somatic vs Germline). Default: false</option> | |
| 171 <option value="canon">Only use canonical transcripts.</option> | |
| 172 <option value="geneId">Use gene ID instead of gene name (VCF output). Default: false</option> | |
| 173 <option value="hgvs">Use HGVS annotations for amino acid sub-field. Default: false</option> | |
| 174 <option value="lof">Add loss of function (LOF) and Nonsense mediated decay (NMD) tags.</option> | |
| 175 <option value="oicr">Add OICR tag in VCF file. Default: false</option> | |
| 176 <option value="onlyReg">Only use regulation tracks.</option> | |
| 177 <option value="sequenceOntolgy">Use Sequence Ontolgy terms. Default: false</option> | |
| 178 </param> | |
| 179 | |
| 180 <param name="regulation" type="select" display="checkboxes" multiple="true" optional="true" label="Non-coding and regulatory Annotation"> | |
| 181 <help>These are available for only a few genomes</help> | |
| 182 <!--GENOME REG_NAME --> | |
| 183 <options from_file="snpeffect_regulationdb.loc"> | |
| 184 <column name="name" index="1"/> | |
| 185 <column name="value" index="0"/> | |
| 186 <filter type="param_value" ref="genomeVersion" key="name" column="1" /> | |
| 187 </options> | |
| 188 </param> | |
| 0 | 189 |
| 24 | 190 <param name="intervals" format="bed" type="data" optional="true" label="Use custom interval file for annotation"/> |
| 191 <param name="transcripts" format="tabular" type="data" optional="true" label="Only use the transcripts in this file. Format: One transcript ID per line."/> | |
| 192 | |
| 193 <param name="filterOut" type="select" display="checkboxes" multiple="true" optional="true" label="Filter output"> | |
| 194 <option value="no-downstream">Do not show DOWNSTREAM changes </option> | |
| 195 <option value="no-intergenic">Do not show INTERGENIC changes </option> | |
| 196 <option value="no-intron">Do not show INTRON changes </option> | |
| 197 <option value="no-upstream">Do not show UPSTREAM changes </option> | |
| 198 <option value="no-utr">Do not show 5_PRIME_UTR or 3_PRIME_UTR changes </option> | |
| 199 </param> | |
| 200 | |
| 201 <param name="offset" type="select" display="radio" optional="true" label="Chromosomal position"> | |
| 202 <option value="" selected="true">Use default (based on input type)</option> | |
| 203 <option value="0">Force zero-based positions (both input and output)</option> | |
| 204 <option value="1">Force one-based positions (both input and output)</option> | |
| 0 | 205 </param> |
| 24 | 206 <param name="chr" type="text" optionl="true" label="Text to prepend to chromosome name" help="By default SnpEff simplifies all chromosome names. For instance 'chr1' is just '1'. You can prepend any string you want to the chromosome name."> |
| 207 <validator type="regex" message="No whitespace allows">^\S*$</validator> | |
| 208 | |
| 209 </param> | |
| 210 <param name="generate_stats" type="boolean" truevalue="" falsevalue="-noStats" checked="true" label="Produce Summary Stats"/> | |
| 211 <param name="noLog" type="boolean" truevalue="-noLog" falsevalue="" checked="true" label="Do not report usage statistics to server"/> | |
| 0 | 212 </inputs> |
| 213 <outputs> | |
| 24 | 214 <data format="vcf" name="snpeff_output" > |
| 0 | 215 <change_format> |
| 24 | 216 <when input="outputFormat" value="vcf" format="vcf" /> |
| 0 | 217 <when input="outputFormat" value="txt" format="tabular" /> |
| 218 <when input="outputFormat" value="bed" format="bed" /> | |
| 219 <when input="outputFormat" value="bedAnn" format="bed" /> | |
| 220 </change_format> | |
| 221 </data> | |
| 24 | 222 |
| 223 <data format="html" name="statsFile"> | |
| 224 <filter>generate_stats == True</filter> | |
| 225 </data> | |
| 0 | 226 </outputs> |
| 5 | 227 <stdio> |
| 228 <exit_code range="1:" level="fatal" description="Error" /> | |
| 24 | 229 <exit_code range="-1" level="fatal" description="Error: Cannot open file" /> |
| 5 | 230 </stdio> |
| 24 | 231 <tests> |
| 232 <test> | |
| 233 <param name="input" ftype="vcf" value="vcf_homhet.vcf"/> | |
| 234 <param name="inputFormat" value="vcf"/> | |
| 235 <param name="outputFormat" value="vcf"/> | |
| 236 <param name="genomeVersion" value="testCase"/> | |
| 237 <param name="udLength" value="0"/> | |
| 238 <param name="filterHomHet" value="no_filter"/> | |
| 239 <param name="filterIn" value="no_filter"/> | |
| 240 <param name="generate_stats" value="False"/> | |
| 241 <!-- | |
| 242 <param name="filterOut" value="no-upstream"/> | |
| 243 --> | |
| 244 <output name="snpeff_output"> | |
| 245 <assert_contents> | |
| 246 <!-- Check that an effect was added --> | |
| 247 <has_text text="EFF=" /> | |
| 248 </assert_contents> | |
| 249 </output> | |
| 250 <!-- Check for a HTML header indicating that this was successful --> | |
| 251 <!-- | |
| 252 <output name="statsFile"> | |
| 253 <assert_contents> | |
| 254 <has_text text="SnpEff: Variant analysis" /> | |
| 255 </assert_contents> | |
| 256 </output> | |
| 257 --> | |
| 258 </test> | |
| 259 | |
| 260 <test> | |
| 261 <param name="input" ftype="vcf" value="vcf_homhet.vcf"/> | |
| 262 <param name="inputFormat" value="vcf"/> | |
| 263 <param name="outputFormat" value="vcf"/> | |
| 264 <param name="genomeVersion" value="testCase"/> | |
| 265 <param name="udLength" value="0"/> | |
| 266 <param name="filterHomHet" value="het"/> | |
| 267 <param name="filterIn" value="no_filter"/> | |
| 268 <!-- | |
| 269 <param name="filterOut" value=""/> | |
| 270 --> | |
| 271 <param name="generate_stats" value="False"/> | |
| 272 <output name="snpeff_output"> | |
| 273 <assert_contents> | |
| 274 <!-- Check that NO effects were added since -het is set --> | |
| 275 <not_has_text text="EFF=NON_SYNONYMOUS_CODING" /> | |
| 276 </assert_contents> | |
| 277 </output> | |
| 278 </test> | |
| 279 <test> | |
| 280 <param name="input" ftype="vcf" value="vcf_homhet.vcf"/> | |
| 281 <param name="inputFormat" value="vcf"/> | |
| 282 <param name="outputFormat" value="vcf"/> | |
| 283 <param name="genomeVersion" value="testCase"/> | |
| 284 <param name="udLength" value="0"/> | |
| 285 <param name="filterHomHet" value="no_filter"/> | |
| 286 <param name="filterIn" value="del"/> | |
| 287 <!-- | |
| 288 <param name="filterOut" value=""/> | |
| 289 --> | |
| 290 <param name="generate_stats" value="False"/> | |
| 291 <output name="snpeff_output"> | |
| 292 <assert_contents> | |
| 293 <!-- Check that deleletions were evaluated --> | |
| 294 <has_text_matching expression="Y\t59030478\t.*EFF=INTERGENIC" /> | |
| 295 <!-- Check that insertion on last line was NOT evaluated --> | |
| 296 <has_text_matching expression="Y\t59032947\t.*SF=5\tGT" /> | |
| 297 </assert_contents> | |
| 298 </output> | |
| 299 </test> | |
| 300 <test> | |
| 301 <param name="input" ftype="vcf" value="vcf_homhet.vcf"/> | |
| 302 <param name="inputFormat" value="vcf"/> | |
| 303 <param name="outputFormat" value="vcf"/> | |
| 304 <param name="genomeVersion" value="testCase"/> | |
| 305 <param name="udLength" value="0"/> | |
| 306 <param name="filterHomHet" value="no_filter"/> | |
| 307 <param name="filterIn" value="no_filter"/> | |
| 308 <param name="filterOut" value="no-upstream"/> | |
| 309 <param name="generate_stats" value="False"/> | |
| 310 <output name="snpeff_output"> | |
| 311 <assert_contents> | |
| 312 <!-- Check that NO UPSTREAM effect was added --> | |
| 313 <not_has_text text="UPSTREAM" /> | |
| 314 </assert_contents> | |
| 315 </output> | |
| 316 </test> | |
| 317 | |
| 318 </tests> | |
| 0 | 319 <help> |
| 320 | |
| 321 This tool calculate the effect of variants (SNPs/MNPs/Insertions) and deletions. | |
| 322 | |
| 323 For details about this tool, please go to http://snpEff.sourceforge.net | |
| 324 | |
| 325 </help> | |
| 326 </tool> | |
| 327 |
