Aggregating Data
Posted: Thu Mar 11, 2010 10:04 am
I have 4 columns in my input as below
Grp_ID
Act_ID
Agntin
Agtxid
And I want to count the number of rows that are duplicate across the 4 columns. So I am using an aggregate stage and am using the count no of rows to calculate the no of rows by grouping on all the 4 input columns.
I am outputting the count into a new column NO_OF_ROWS.
Is this the right way to do?
Grp_ID
Act_ID
Agntin
Agtxid
And I want to count the number of rows that are duplicate across the 4 columns. So I am using an aggregate stage and am using the count no of rows to calculate the no of rows by grouping on all the 4 input columns.
I am outputting the count into a new column NO_OF_ROWS.
Is this the right way to do?