annotate tool-data/blastdb_d.loc.sample @ 2:fae4084a0bc0 draft

Uploaded v0.0.20, preview 5 Cope if cElementTree is missing in BLAST XML to tabular script.
author peterjc
date Thu, 02 May 2013 11:20:43 -0400
parents
children
Ignore whitespace changes - Everywhere: Within whitespace: At end of lines:
rev   line source
2
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
1 #This is a sample file distributed with Galaxy that is used to define a
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
2 #list of protein domain databases, using three columns tab separated
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
3 #(longer whitespace are TAB characters):
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
4 #
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
5 #<unique_id> <database_caption> <base_name_path>
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
6 #
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
7 #The captions typically contain spaces and might end with the build date.
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
8 #It is important that the actual database name does not have a space in it,
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
9 #and that there are only two tabs on each line.
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
10 #
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
11 #You can download the NCBI provided databases as tar-balls from here:
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
12 #ftp://ftp.ncbi.nih.gov/pub/mmdb/cdd/little_endian/
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
13 #
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
14 #So, for example, if your database is CDD and the path to your base name
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
15 #is /data/blastdb/Cdd, then the blastdb_d.loc entry would look like this:
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
16 #
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
17 #Cdd{tab}NCBI Conserved Domains Database (CDD){tab}/data/blastdb/Cdd
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
18 #
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
19 #and your /data/blastdb directory would contain all of the files associated
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
20 #with the database, /data/blastdb/Cdd.*.
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
21 #
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
22 #Your blastdb_d.loc file should include an entry per line for each "base name"
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
23 #you have stored. For example:
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
24 #
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
25 #Cdd NCBI CDD /data/blastdb/domains/Cdd
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
26 #Kog KOG (eukaryotes) /data/blastdb/domains/Kog
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
27 #Cog COG (prokaryotes) /data/blastdb/domains/Cog
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
28 #Pfam Pfam-A /data/blastdb/domains/Pfam
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
29 #Smart SMART /data/blastdb/domains/Smart
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
30 #Tigr TIGR /data/blastdb/domains/Tigr
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
31 #Prk Protein Clusters database /data/blastdb/domains/Prk
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
32 #...etc...
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
33 #
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
34 #See also blastdb.loc which is for any nucleotide BLAST database, and
fae4084a0bc0 Uploaded v0.0.20, preview 5
peterjc
parents:
diff changeset
35 #blastdb_p.loc which is for any protein BLAST databases.