comparison README.rst @ 5:5f3cc8229771 draft

Uploaded v0.0.1e with links to sample data in README
author peterjc
date Tue, 20 Aug 2013 07:37:19 -0400
parents b14c822a37fe
children 36b2c2b5051e
comparison
equal deleted inserted replaced
4:b14c822a37fe 5:5f3cc8229771
2 secreted proteins from a given protein FASTA file. 2 secreted proteins from a given protein FASTA file.
3 3
4 It runs SignalP v3.0 (Bendtsen et al. 2004) and selects only proteins with a 4 It runs SignalP v3.0 (Bendtsen et al. 2004) and selects only proteins with a
5 strong predicted signal peptide, and then runs TMHMM v2.0 (Krogh et al. 2001) 5 strong predicted signal peptide, and then runs TMHMM v2.0 (Krogh et al. 2001)
6 on those, and selects only proteins without a predicted trans-membrane helix. 6 on those, and selects only proteins without a predicted trans-membrane helix.
7 This workflow was used in Kikuchi et al (2001), and is a simplification of 7 This workflow was used in Kikuchi et al. (2011), and is a simplification of
8 the candidate effector protocol described in Jones et al (2009). 8 the candidate effector protocol described in Jones et al. (2009).
9 9
10 See http://www.galaxyproject.org for information about the Galaxy Project. 10 See http://www.galaxyproject.org for information about the Galaxy Project.
11
12
13 Sample Data
14 ===========
15
16 This workflow was developed and run on several nematode species. For example,
17 try the protein set for Bursaphelenchus xylophilus (Kikuchi et al. 2011):
18
19 ftp://ftp.sanger.ac.uk/pub/pathogens/Bursaphelenchus/xylophilus/Assembly-v1.2/BUX.v1.2.genedb.protein.fa.gz
20
21 You can upload this directly into Galaxy via this URL. Galaxy will handle
22 removing the gzip compression to give you the FASTA protein file which has
23 18,074 sequences. The expected result (selecting organism type Eukaryote)
24 is a FASTA protein file of 2,297 predicted secreted protein sequences.
11 25
12 26
13 Citation 27 Citation
14 ======== 28 ========
15 29