I have 4 columns in my input as below
Grp_ID
Act_ID
Agntin
Agtxid
And I want to count the number of rows that are duplicate across the 4 columns. So I am using an aggregate stage and am using the count no of rows to calculate the no of rows by grouping on all the 4 input columns.
I am outputting the count into a new column NO_OF_ROWS.
Is this the right way to do?
Aggregating Data
Moderators: chulett, rschirm, roy
Re: Aggregating Data
ppp wrote: I am using an aggregate stage and am using the count no of rows to calculate the no of rows by grouping on all the 4 input columns.
Is this the right way to do?
As you are using all the four columns, you can get the output count of the Whole DUPLICATE ROWS, If you want the count of duplicate value in a particular column, then you have to mention that particular column.
RAJ
-
- Participant
- Posts: 57
- Joined: Wed Oct 21, 2009 4:46 am
- Location: India