Mercurial > repos > iuc > basil
comparison basil.xml @ 0:341246805ea1 draft
planemo upload commit b89c8017aeef91f940543a1cc7dadb4a85290865
| author | iuc |
|---|---|
| date | Thu, 30 May 2019 21:14:37 -0400 |
| parents | |
| children | 16eccd223c6e |
comparison
equal
deleted
inserted
replaced
| -1:000000000000 | 0:341246805ea1 |
|---|---|
| 1 <tool id="basil" name="basil" version="1.2.0"> | |
| 2 <description>Breakpoint detection, including large insertions</description> | |
| 3 <requirements> | |
| 4 <requirement type="package" version="1.2.0">anise_basil</requirement> | |
| 5 </requirements> | |
| 6 <version_command>basil --version 2>&1 | grep 'basil version' | cut -f 3 -d ' '</version_command> | |
| 7 <command detect_errors="aggressive"><![CDATA[ | |
| 8 ln -s '$ref' 'ref.fa' && | |
| 9 ln -s '$bam' 'in.bam' && | |
| 10 ln -s '$vcf' 'out.vcf' && | |
| 11 basil | |
| 12 --input-reference 'ref.fa' | |
| 13 --input-mapping 'in.bam' | |
| 14 --out-vcf 'out.vcf' | |
| 15 --oea-min-support-each-side '$min_oea_each_side' | |
| 16 ]]></command> | |
| 17 <inputs> | |
| 18 <param name="ref" argument="--input-reference" type="data" format="Fasta" label="Reference Sequence File" help="FASTA file with the reference."/> | |
| 19 <param name="bam" argument="--input-mapping" type="data" format="sam,bam" label="Alignment File" help="SAM/BAM file to use as the input."/> | |
| 20 <param name="min_oea_each_side" argument="--oea-min-support-each-side" type="integer" value="2" label="Minimum supporting reads, each side" help="Smallest number of OEA (one-end-anchor) reads on each side to support an insertion. In range [1..inf]. This is the minimum number of supporting reads (without mapped partners) on each side of an insertion breakpoint required to not be filtered." /> | |
| 21 </inputs> | |
| 22 <outputs> | |
| 23 <data name="vcf" format="vcf" /> | |
| 24 </outputs> | |
| 25 <tests> | |
| 26 <test> | |
| 27 <param name="ref" value="ref.fa"/> | |
| 28 <param name="bam" value="simulated.bam"/> | |
| 29 <param name="min_oea_each_side" value="2"/> | |
| 30 <output name="vcf" file="basil.vcf"/> | |
| 31 </test> | |
| 32 </tests> | |
| 33 <help><![CDATA[ | |
| 34 BASIL is a method to detect breakpoints for structural variants (including insertion breakpoints) from aligned paired HTS reads in BAM format. Use BASIL to analyze BAM files for tentative insertion sites. | |
| 35 | |
| 36 Note that BASIL will in general detect all kinds of breakpoints, e.g. for inversions on real-world data. | |
| 37 | |
| 38 BASIL VCF fields | |
| 39 | |
| 40 A typical line in BASIL might look as follows. | |
| 41 | |
| 42 1 5001 site_0 T <INS> . PASS IMPRECISE;SVTYPE=INS GSCORE:CLEFT:CRIGHT:OEALEFT:OEARIGHT 46.4256:10:12:35:32 | |
| 43 | |
| 44 The first seven columns are as usually in VCF files (ref name, 1-based position, reference base, abbreviation for long insertion, no assigned quality, passing all filters, imprecise insertion SV). | |
| 45 | |
| 46 The eighth column contains the names of the score values given in the ninth column: | |
| 47 | |
| 48 GSCORE Geometric mean of the sum of "1 + $score" for all of the following scores. | |
| 49 CLEFT Number of clipping signatures supporting the site from the left side. | |
| 50 CRIGHT Number of clipping signatures supporting the site from the right side. | |
| 51 OEALEFT Number of OEA alignments supporting the site from the left. | |
| 52 OEARIGHT Number of OEA alignmetns supproting the site from the right. | |
| 53 | |
| 54 Generally, one should filter for a minimum support of OEA records on each side, e.g. a value of 10 makes sense for a 30x coverage and showed good results on simulated data. | |
| 55 | |
| 56 For a ranking, GSCORE is a suitable measure but we did not develop any statistical model for BASIL matches and it is a mean of pseudocounts only. It carries no statistically precise meaning. | |
| 57 ]]></help> | |
| 58 <citations> | |
| 59 <citation type="doi">10.1093/bioinformatics/btv051</citation> | |
| 60 </citations> | |
| 61 </tool> |
