# HG changeset patch # User galaxyp # Date 1493814737 14400 # Node ID 643d9192e3f55f7f62f1ec25ccf764ab39989c39 # Parent 0127e0bdf93795b36be49e4c00c7e6483d7acce4 planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/master/tools/dbbuilder commit bc3d2ee3b9fe72b2c11a19ac506e91cfd9146622 diff -r 0127e0bdf937 -r 643d9192e3f5 dbbuilder.xml --- a/dbbuilder.xml Tue May 02 08:51:48 2017 -0400 +++ b/dbbuilder.xml Wed May 03 08:32:17 2017 -0400 @@ -1,4 +1,4 @@ - + gnu-wget @@ -20,20 +20,46 @@ ##set $url = "ftp://ftp.thegpm.org/fasta/cRAP/crap.fasta" #set $url = "https://raw.githubusercontent.com/pravs3683/cRAP/master/cRAP_protein_database.fasta" #set $type = "direct" + #elif $source.from == "HMP" + #set $url = 'http://downloads.hmpdacc.org/data/reference_genomes/body_sites/' + str($source.site) + '.pep.fsa' + #set $type = "direct" + #elif $source.from == "HOMD" + #set $url = 'ftp://ftp.homd.org/human_oral_microbial_genomic_sequences/current/' + str($source.annotation) + #if str($source.annotation).endswith('.tar.gz'): + #set $type = "tgz" + #elif str($source.annotation).endswith('.zip'): + #set $type = "zip" + #end if + #elif $source.from == 'EBI Metagenomics' + #set $url = 'https://www.ebi.ac.uk/metagenomics/projects/' + str($source.ebi_project) + '/samples/' + str($source.ebi_sample) + '/runs/' + str($source.ebi_run) + '/results/versions/' + str($source.ebi_version) + '/sequences/' + str($source.ebi_annotation) + '/chunks/1' + #set $type = "gzip" #elif $source.from == "url" #set $url = $source.url - #set $type = "direct" + #set $type = $source.archive_type #end if #if $type =="direct" - wget -nv '$url' -O '${output_database}' + wget -nv '$url' -O '${output_database}' --no-check-certificate + #elif $type =="zip" + wget -nv '$url' -O tmp.zip --no-check-certificate && zcat -c tmp.zip > '${output_database}' + #elif $type =="gzip" + wget -nv '$url' -O tmp.gz --no-check-certificate && (if `command -v gzcat > /dev/null`; then gzcat tmp.gz; else zcat tmp.gz ; fi) > '${output_database}' + #elif $type =="bzip2" + wget -nv '$url' -O tmp.bz2 --no-check-certificate && bzcat tmp.bz2 > '${output_database}' + #elif $type =="tgz" + wget -nv '$url' -O tmp.tar.gz && tar zxfO tmp.tar.gz > '${output_database}' + #elif $type =="tbz" + wget -nv '$url' -O tmp.tar.bz && tar jxfO tmp.tar.bz > '${output_database}' #end if ]]> - + + + + @@ -67,19 +93,59 @@ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + - + + + + + + + + + - + @@ -99,11 +165,36 @@ **External Links** -_Galaxy-P 101 shows usage Protein Database Downloader tool in the creation of a workflow -.. _Galaxy-P 101: http://msi-galaxy-p.readthedocs.org/en/latest/sections/galaxyp_101.html -_UniProtKB provides additional information about the UniProt Knowledgebase + - Galaxy-P_101_ shows usage Protein Database Downloader tool in the creation of a workflow + - UniProtKB_ provides additional information about the UniProt Knowledgebase + + +.. _Galaxy-P_101: http://msi-galaxy-p.readthedocs.org/en/latest/sections/galaxyp_101.html .. _UniProtKB: http://www.uniprot.org/help/uniprotkb + + +**Additional Protein Fasta URLs** + + *HUMAN GUT METAPROTEOME:* + + * 512MB gzip ftp://public.genomics.org.cn/BGI/gutmeta/UniSet/UniGene.pep.gz + * 61MB gzip http://www.bork.embl.de/~arumugam/Qin_et_al_2010/frequent_microbe_proteins.fasta.gz + + + *MOUSE GUT MICROBIOTA:* + + * 417MB gzip ftp://climb.genomics.cn/pub/10.5524/100001_101000/100114/Genecatalog/184sample_2.6M.GeneSet.pep.gz + * See: http://gigadb.org/dataset/view/id/100114/token/mZlMYJIF04LshpgP + + ]]> + + 10.1093/nar/gkw1099 + 10.1093/nar/gkv1195 + 10.1093/database/baq013 + 10.1038/nature11209 + 10.1038/nature11234 +