data by a column and perform aggregate operation on other columns.
grouping.py
$out_file1
$input1
$groupcol
$ignorecase
#for $op in $operations
'${op.optype}
${op.opcol}
${op.opround}'
#end for
rpy
.. class:: infomark
**TIP:** If your data is not TAB delimited, use *Text Manipulation->Convert*
-----
**Syntax**
This tool allows you to group the input dataset by a particular column and perform aggregate functions like Mean, Median, Mode, Sum, Max, Min, Count, Random draw and Concatenate on other columns.
- All invalid, blank and comment lines are skipped when performing the aggregate functions. The number of skipped lines is displayed in the resulting history item.
- If multiple modes are present, all are reported.
-----
**Example**
- For the following input::
chr22 1000 1003 TTT
chr22 2000 2003 aaa
chr10 2200 2203 TTT
chr10 1200 1203 ttt
chr22 1600 1603 AAA
- **Grouping on column 4** while ignoring case, and performing operation **Count on column 1** will return::
AAA 2
TTT 3
- **Grouping on column 4** while not ignoring case, and performing operation **Count on column 1** will return::
aaa 1
AAA 1
ttt 1
TTT 2