Mercurial > repos > iuc > fileidentification
comparison fileidentification.xml @ 0:aae476ce9dc3 draft
planemo upload for repository https://github.com/galaxyproject/tools-iuc/tree/main/tools/fileidentification commit bb234555cae70fdd9be475415855a7a03c4139df
| author | iuc |
|---|---|
| date | Thu, 30 Oct 2025 16:52:16 +0000 |
| parents | |
| children | c08a138f5824 |
comparison
equal
deleted
inserted
replaced
| -1:000000000000 | 0:aae476ce9dc3 |
|---|---|
| 1 <tool id="fileidentification" name="File Format Identification" version="@TOOL_VERSION@+galaxy@VERSION_SUFFIX@" profile="24.1"> | |
| 2 <description>Check multimedia files if they are corrupt or duplicated</description> | |
| 3 <macros><import>macros.xml</import></macros> | |
| 4 <expand macro="requirements"/> | |
| 5 <command detect_errors="exit_code"><![CDATA[ | |
| 6 unzip '$input' -d input_dir && | |
| 7 /app/.venv/bin/python /app/identify.py ./input_dir && | |
| 8 /app/.venv/bin/python /app/identify.py ./input_dir --inspect > results.txt | |
| 9 ]]></command> | |
| 10 <inputs> | |
| 11 <param type="data" name="input" format="zip" label="ZIP containing the files to be analysed"/> | |
| 12 </inputs> | |
| 13 <outputs> | |
| 14 <data name="out" format="txt" from_work_dir="results.txt" label="${tool.name} on ${on_string}"/> | |
| 15 </outputs> | |
| 16 <tests> | |
| 17 <test expect_num_outputs="1"> | |
| 18 <param name="input" value="test-data.zip"/> | |
| 19 <output name="out" ftype="txt" file="output_results_linux_gh_action.txt"/> | |
| 20 </test> | |
| 21 </tests> | |
| 22 <help><![CDATA[ | |
| 23 Do you have a huge number of multimedia files and you don't know if they are corrupt, | |
| 24 or if they have the correct extension? | |
| 25 This tool: | |
| 26 | |
| 27 - gives you an overview of what file types there are | |
| 28 - checks if the extension of the files match their content | |
| 29 - checks if there are any duplicates | |
| 30 - checks if the content of the files is intact | |
| 31 | |
| 32 Input: A ZIP containing the files to be analysed (may be a nested folder structure) | |
| 33 Output: A report in plain text form | |
| 34 | |
| 35 A possible use case are digital preservation workflows, | |
| 36 where you want to make sure that you only preserve high-quality files for the future. | |
| 37 | |
| 38 Supported file types: A wide range of image formats (pixel and vector), videos, audios, pdf, MS Office. | |
| 39 | |
| 40 Note: The original fileidentification tool is more feature-rich. | |
| 41 In particular, it offers bulk conversion of files, which is currently not supported on Galaxy. | |
| 42 | |
| 43 Find more information in `the GitHub repo <https://github.com/dasch-swiss/fileidentification>`_. | |
| 44 ]]></help> | |
| 45 <citations> | |
| 46 <citation type="bibtex"> | |
| 47 @misc{githubfileidentification, | |
| 48 author = {Swiss National Data and Service Center for the Humanities}, | |
| 49 year = {2025}, | |
| 50 title = {Fileidentification - A CLI to Identify Multimedia File Formats and Bulk Convert Files}, | |
| 51 publisher = {GitHub}, | |
| 52 journal = {GitHub repository}, | |
| 53 url = {https://github.com/dasch-swiss/fileidentification}, | |
| 54 }</citation> | |
| 55 </citations> | |
| 56 </tool> |
