annotate condel_web/condel_web.xml @ 7:67212f958e5a draft default tip

Uploaded
author saketkc
date Tue, 15 Apr 2014 13:04:57 -0400
parents 699ab41c6d76
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
3
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
1 <tool id="condel_web" name="condel">
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
2 <description>Condel web service</description>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
3 <requirements>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
4 <requirement type="package" version="2.2.1">requests</requirement>
6
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
5 <requirement type="package" version="7.19.3.1">pycurl</requirement>
7
67212f958e5a Uploaded
saketkc
parents: 6
diff changeset
6 <requirement type="package" version="4.1.0">beautifulsoup4</requirement>
3
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
7 <requirement type="python-module">requests</requirement>
6
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
8 <requirement type="python-package">pycurl</requirement>
7
67212f958e5a Uploaded
saketkc
parents: 6
diff changeset
9 <requirement type="python-package">bs4</requirement>
3
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
10 </requirements>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
11 <command interpreter="python">
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
12 condel_web.py --input $input --output $output
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
13 </command>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
14 <inputs>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
15 <param name="input" format="text" type="data" label="Input Variants" />
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
16 </inputs>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
17 <outputs>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
18 <data name="output" format="tabular"/>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
19 </outputs>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
20 <tests>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
21 <test>
6
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
22 <param name="input" value="condel_input.tsv"/>
4
328c3e782d31 Uploaded
saketkc
parents: 3
diff changeset
23 <output name="output" file="condel_output.csv"/>
3
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
24 </test>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
25 </tests>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
26 <help>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
27 **What it does**
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
28
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
29 This script calls condel web api at http://bg.upf.edu/condel/
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
30
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
31 Condel stands for CONsensus DELeteriousness score of non-synonymous single nucleotide variants (SNVs).
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
32 The idea behind it is to integrate the output of computational tools aimed at assessing the impact of non synonymous SNVs on protein function.
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
33 To do this, it computes a weighted average of the scores (WAS) of these tools : SIFT, Polyphen2, MAPP, LogR Pfam E-value
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
34 (implemented ad hoc following the instructions at Clifford RJ, Edmonson MN, Nguyen C, and Buetow KH (2004)
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
35 Large-scale analysis of non-synonymous coding region single nucleotide polymorphisms. Bioinformatics 20, 1006-1014) and MutationAssessor
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
36
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
37 **How does it work**
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
38
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
39 The scores of different methods are weighted using the complementary cumulative distributions produced by the five methods on a
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
40 dataset of approximately 20000 missense SNPs, both deleterious and neutral. The probability that a predicted deleterious mutation is not a
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
41 false positive of the method and the probability that a predicted neutral mutation is not a false negative are employed as weights.
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
42
6
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
43 **Input**
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
44
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
45 There are two main formats allowed:
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
46
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
47
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
48 SNVs may be submitted for analysis both in chromosome and protein coordinates.
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
49
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
50
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
51 The chromosome coordinates (hg19) input must follow this format:
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
52
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
53
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
54 [CHROMOSOME] [START] [END] [MUTANT_NUCLEOTIDE]
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
55
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
56
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
57
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
58 The END column is the same as the START for SNVs.
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
59 Those four columns must be separated by tabs. Also a fifth column can optionally be added with the Variant name
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
60
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
61
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
62 Ex:
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
63
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
64 9 32473058 32473058 A
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
65
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
66 7 43918688 43918688 C
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
67
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
68 Additionally, the input could be composed by two columns the strand of the SNV and an identifier:
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
69
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
70 [PROTEIN_ID][variant]
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
71
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
72 Also tab separated. Currently only Uniprot, RefSeq_Peptide and Ensembl identifiers are recognized by the webserver.
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
73
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
74 The variant column must contain the following information (in this order ): change_position, reference_aminoacid and changed_aminoacid
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
75
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
76 Ex:
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
77
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
78 EGFR_HUMAN R521K
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
79
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
80 EGFR_HUMAN R98Q
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
81
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
82 .. class:: warningmark
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
83
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
84 Note
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
85
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
86 Please, note that the Variant Effect Predictor assumes that the allele submitted is coded in the forward strand.
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
87
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
88 If you are assessing the effect of variants coded in the reverse strand, please introduce the complementary nucleotide instead.
699ab41c6d76 Uploaded
saketkc
parents: 4
diff changeset
89
3
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
90 **Citation**
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
91
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
92
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
93 If you use this tool please cite:
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
94
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
95
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
96 Improving the Assessment of the Outcome of Nonsynonymous SNVs with a Consensus Deleteriousness Score, Condel (2011) Abel González-Pérez and Nuria López-Bigas, American Journal of Human Genetics 10.1016/j.ajhg.2011.03.004
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
97
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
98
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
99 </help>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
100 </tool>
65a71127f1fa Uploaded
saketkc
parents:
diff changeset
101