changeset 17:5a6ea187933b draft

Uploaded
author greg
date Wed, 16 Dec 2015 19:53:24 -0500
parents b40ad4bee6cb
children e1d437bd7d36
files genetrack.xml
diffstat 1 files changed, 38 insertions(+), 0 deletions(-) [+]
line wrap: on
line diff
--- a/genetrack.xml	Wed Dec 16 14:17:15 2015 -0500
+++ b/genetrack.xml	Wed Dec 16 19:53:24 2015 -0500
@@ -137,6 +137,44 @@
  * **Exclusion zone of upstream called peaks** - Defines the exclusion zone centered over peaks upstream of a peak.
  * **Exclusion zone of downstream called peaks** - Defines the exclusion zone centered over peaks downstream of a peak.
  * **Filter** - Absolute read filter, restricts output to only peaks with larger peak height.
+ 
+ -----
+
+**Output gff Columns**
+
+ 1. Chromosome
+ 2. Script
+ 3. Placeholder (no meaning)
+ 4. Start of peak exclusion zone (-e 20)
+ 5. End of peak exclusion zone
+ 6. Tag sum (not peak height or area under curve, which LionDB provides)
+ 7. Strand
+ 8. Placeholder (no meaning)
+ 9. Attributes (standard deviation of reads located within exclusion zone) = fuzziness of peak
+
+ -----
+ 
+ **Considerations**
+ 
+ In principle, the width of the exclusion zone may be as large as the DNA region occupied by the native protein
+ plus a steric exclusion zone between the protein and the exonuclease.  On the other hand the site might be considerably
+ smaller if the protein is in a denatured state during exonuclease digestion (since it is pre-treated with SDS).
+ 
+ In general, higher resolution data or smaller binding site size data should use smaller sigma values.  Large binding site
+ size data such as 147 bp nucleosomal DNA use a larger sigma value like 20 (-s 20).  For transcription factors mapped by
+ ChIP-exo, sigma may initially be set at 5, and the exclusion zone set at 20 (-s 5 –e 20).  Sigma is typically varied
+ between ~3 and ~20.  Too high of a sigma value may merge two independent nearby binding events.  This may be desirable if
+ closely bound factors are not distinguishable.  Too low of a sigma value will cause some tags that contribute to a binding
+ event to be excluded, because they may not be located sufficiently close to the main peak.  If alternative (mutually
+ exclusive) binding is expected for two overlapping sites, and these sites are to be independently recorded, then an
+ empirically determined smaller exclusion zone width is set.  Thus, the value of sigma is set empirically for each mapped
+ factor depending upon the resolution and binding site size of the binding event.
+
+It might make sense to exclude peaks that have only a single tag, where -F 1 is used, or have their tags located on only
+a single coordinate (called Singletons, where stddev=0 in the output file).  However, low coverage datasets might be
+improved by including them, if additional analysis (e.g., motif discovery) validates them.  In addition, idealized action
+of the exonuclease in ChIP-exo might place all tags for a peak on a single coordinate.
+
     </help>
     <expand macro="citations" />
 </tool>