Mercurial > repos > jjohnson > snpeff
annotate snpSift_caseControl.xml @ 5:8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
| author | Jim Johnson <jj@umn.edu> |
|---|---|
| date | Wed, 27 Nov 2013 09:11:32 -0600 |
| parents | 6ad9205c1307 |
| children | 0ad9733e22a4 |
| rev | line source |
|---|---|
|
5
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
1 <tool id="snpSift_caseControl" name="SnpSift CaseControl" version="3.4"> |
| 0 | 2 <description>Count samples are in 'case' and 'control' groups.</description> |
| 3 <!-- | |
| 4 You will need to change the path to wherever your installation is. | |
| 5 You can change the amount of memory used, just change the -Xmx parameter (e.g. use -Xmx2G for 2Gb of memory) | |
| 6 --> | |
| 7 <requirements> | |
|
5
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
8 <requirement type="package" version="3.4">snpEff</requirement> |
| 0 | 9 </requirements> |
| 10 <command> | |
| 2 | 11 java -Xmx1G -jar \$SNPEFF_JAR_PATH/SnpSift.jar caseControl -q |
| 0 | 12 #if $name.__str__.strip() != '': |
| 13 -name $name | |
| 14 #end if | |
| 15 #if $ctrl.ctrl_src == 'caseString': | |
| 16 '$ctrl.caseControlStr' | |
| 17 #else | |
| 18 -tfam "$ctrl.tfam" | |
| 19 #end if | |
| 20 $input > $output | |
| 21 </command> | |
| 22 <inputs> | |
| 23 <param format="vcf" name="input" type="data" label="VCF input"/> | |
| 24 <conditional name="ctrl"> | |
| 25 <param name="ctrl_src" type="select" label="Case Control defined in"> | |
| 26 <option value="caseString">Case Control String</option> | |
| 27 <option value="tfam">TFAM</option> | |
| 28 </param> | |
| 29 <when value="caseString"> | |
| 30 <param name="caseControlStr" type="text" label="Case / Control column designation" size="50"> | |
| 31 <help> | |
| 32 Case and control are defined by a string containing plus and minus symbols {'+', '-', '0'} where '+' is case, '-' is control and '0' is neutral | |
| 33 </help> | |
| 34 <validator type="regex" message="must be only plus(+), minus(-), or zero(0) characters">[+-0]+</validator> | |
| 35 </param> | |
| 36 </when> | |
| 37 <when value="tfam"> | |
| 38 <param format="tabular" name="tfam" type="data" label="PLINK TFAM file"/> | |
| 39 </when> | |
| 40 </conditional> | |
| 41 <param name="name" type="text" optional="true" label="name" help="name to append to the 'Cases' or 'Controls' tags"> | |
| 42 <validator type="regex" message="Use only valid ID characters">[_a-zA-Z0-9]+</validator> | |
| 43 </param> | |
| 44 </inputs> | |
| 45 <outputs> | |
| 46 <data format="vcf" name="output" /> | |
| 47 </outputs> | |
| 48 <stdio> | |
| 49 <exit_code range=":-1" level="fatal" description="Error: Cannot open file" /> | |
| 50 <exit_code range="1:" level="fatal" description="Error" /> | |
| 51 </stdio> | |
| 52 <tests> | |
| 53 | |
| 54 <test> | |
| 55 <param name="input" ftype="vcf" value="test.private.01.vcf"/> | |
| 56 <param name="ctrl_src" value="caseString"/> | |
| 57 <param name="caseControlStr" value="--"/> | |
| 58 <output name="output"> | |
| 59 <assert_contents> | |
| 60 <has_text text="Cases=0,0,0;" /> | |
| 61 <has_text text="Controls=0,0,0;" /> | |
| 62 </assert_contents> | |
| 63 </output> | |
| 64 </test> | |
| 65 | |
| 66 <test> | |
| 67 <param name="input" ftype="vcf" value="test.private.02.vcf"/> | |
| 68 <param name="ctrl_src" value="caseString"/> | |
| 69 <param name="caseControlStr" value="--"/> | |
| 70 <output name="output"> | |
| 71 <assert_contents> | |
| 72 <has_text text="Cases=0,0,0;" /> | |
| 73 <has_text text="Controls=2,0,4;" /> | |
| 74 </assert_contents> | |
| 75 </output> | |
| 76 </test> | |
| 77 | |
| 78 <test> | |
| 79 <param name="input" ftype="vcf" value="test.private.02.vcf"/> | |
| 80 <param name="name" value=""/> | |
| 81 <param name="ctrl_src" value="caseString"/> | |
| 82 <param name="caseControlStr" value="-+"/> | |
| 83 <output name="output"> | |
| 84 <assert_contents> | |
| 85 <has_text text="Cases=1,0,2;" /> | |
| 86 <has_text text="Controls=1,0,2;" /> | |
| 87 </assert_contents> | |
| 88 </output> | |
| 89 </test> | |
| 90 | |
| 91 </tests> | |
| 92 | |
| 93 <help> | |
| 94 | |
| 95 **SnpSift CaseControl** | |
| 96 | |
| 97 Allows you to count how many samples are in 'case' group and a 'control' group. You can count 'homozygous', 'heterozygous' or 'any' variants. | |
| 98 | |
| 99 Case and control are defined by a string containing plus and minus symbols {'+', '-', '0'} where '+' is case, '-' is control and '0' is neutral. | |
| 100 | |
| 101 This command adds two annotations to the VCF file: | |
| 102 | |
| 103 - **CaseControl**: Two comma separated numbers numbers representing the number of samples that have the variant in the case and the control group. Example: | |
| 104 | |
| 105 "CaseControl=3,4" *the variant is present in 3 cases and 4 controls.* | |
| 106 | |
| 107 | |
| 108 - **CaseControlP**: A p-value (Fisher exact test) that the number of cases is N or more. Example: | |
| 109 | |
| 110 "CaseControl=4,0;CaseControlP=3.030303e-02" *in this case the pValue of having 4 or more cases and zero controls is 0.03* | |
| 111 | |
| 112 | |
| 113 For example, if we have ten samples (which means ten genotype columns in the VCF file), the first four are 'case' and the last six are 'control', so the description string would be "++++------". Let's say we want to distinguish genotypes that are homozygous in 'case' and either homozygous or heterozygous in 'control'. We would set: | |
| 114 | |
| 115 - Hom/Het case = "hom" | |
| 116 | |
| 117 - Hom/Het control = "any" | |
| 118 | |
| 119 - Case / Control column designation = ""++++------" | |
| 120 | |
| 121 | |
| 122 For details about this tool, please go to http://snpeff.sourceforge.net/SnpSift.html#casecontrol | |
| 123 | |
|
5
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
124 SnpEff citation: |
|
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
125 "A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3.", Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, Land SJ, Lu X, Ruden DM. Fly (Austin). 2012 Apr-Jun;6(2):80-92. PMID: 22728672 [PubMed - in process] |
|
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
126 |
|
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
127 SnpSift citation: |
|
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
128 "Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift", Cingolani, P., et. al., Frontiers in Genetics, 3, 2012. |
|
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
129 |
|
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
130 |
|
8952990fcab9
Update to snpEff version 3.4 and add data managers to download snpEff genome reference databases
Jim Johnson <jj@umn.edu>
parents:
2
diff
changeset
|
131 |
| 0 | 132 </help> |
| 133 </tool> |
