text_processing: replace_text_in_column.xml comparison

comparison replace_text_in_column.xml @ 6:8928e6d1e7ba draft

Uploaded

author	bgruening
date	Thu, 08 Jan 2015 09:07:31 -0500
parents	56e80527c482
children	d64eace4f9f3

comparison

-:3f0e0d4c15a9
+:8928e6d1e7ba
 </macros>
 <expand macro="requirements">
 <requirement type="package" version="4.1.0">gnu_awk</requirement>
 </expand>
 <version_command>awk --version | head -n 1</version_command>
-<command interpreter="sh">
+<command>
 <![CDATA[
-##adapt to awk's quirks - to pass an acutal backslash - two backslashes are required (just like in a C string)
-REPLACE_PATTERN=\${$replace_pattern//\\/\\\\};
 awk
--v OFS="\t"
+-v OFS="	"
 --re-interval
---sandbox "{ \$$column = gensub( /$find_pattern/, \"$replace_pattern\", \"g\", \$$column ) ; print \$0 ; }"
+--sandbox '{ \$$column = gensub( /$find_pattern/, "$replace_pattern", "g", \$$column ) ; print \$0 ; }'
 "$infile"
-> "$output"
+> "$outfile"
 ]]>
 </command>
 <inputs>
 <param format="tabular" name="infile" type="data" label="File to process" />
 <param name="column" label="in column" type="data_column" data_ref="infile" accept_default="true" />
 </valid>
 </sanitizer>
 </param>
 </inputs>
 <outputs>
-<data format="input" name="output" metadata_source="infile" />
+<data name="outfile" format_source="infile" metadata_source="infile" />
 </outputs>
 <tests>
 <test>
-<param name="infile" value="replace_text_in_column_in1.txt" ftype="tabular" />
+<param name="infile" value="replace_text_in_column1.txt" ftype="tabular" />
 <param name="column" value="4" />
 <param name="find_pattern" value=".+_(R.)" />
-<param name="replace_pattern" value="\1" />
+<param name="replace_pattern" value="\\1" />
-<output name="output" file="replace_text_in_column_output1.txt" />
+<output name="outfile" file="replace_text_in_column_results1.txt" />
 </test>
 </tests>
 <help>
 <![CDATA[
 **What it does**
-This tool performs find &amp; replace operation on a specified column in a given file.
+This tool performs find & replace operation on a specified column in a given file.
 .. class:: infomark
 The **pattern to find** uses the **extended regular** expression syntax (same as running 'awk --re-interval').
 **Examples of Replace Patterns**
 - **WORLD**  The word 'WORLD' will be placed whereever the find pattern was found.
-- **FOO-&amp;-BAR**  Each time the find pattern is found, it will be surrounded with 'FOO-' at the begining and '-BAR' at the end. **&amp;** (ampersand) represents the matched find pattern.
+- **FOO-&-BAR**  Each time the find pattern is found, it will be surrounded with 'FOO-' at the begining and '-BAR' at the end. **&** (ampersand) represents the matched find pattern.
 - **\\1**   The text which matched the first parenthesis in the Find Pattern.
 -----
 -----
 **Example 2**
 **Find Pattern:** ^(.{4})
-**Replace Pattern:** &amp;\\t
+**Replace Pattern:** &\\t
 Find the first four characters in each line, and replace them with the same text, followed by a tab character. In practice - this will split the first line into two columns. This operation affects only the selected column.
 -----
