Hi ,
I have source data sample as
msisdn,date
750500337,6/4/2009
750500337 ,6/5/2009
750500337 ,6/6/2009
750500337 ,6/7/2009
750500467,6/4/2009
750500467 ,6/5/2009
750500467,6/6/2009
750500467 ,6/7/2009
and i want output as
750500337,6/4/2009
750500467,6/4/2009
i.e. minimum of date in each msisdn group.
Please help me how to do this..??
logic for grouping
Moderators: chulett, rschirm, roy
-
- Premium Member
- Posts: 536
- Joined: Thu Oct 11, 2007 1:48 am
- Location: Bangalore
-
- Premium Member
- Posts: 783
- Joined: Mon Jan 16, 2006 10:17 pm
- Location: Sydney, Australia
-
- Premium Member
- Posts: 1735
- Joined: Thu Mar 01, 2007 5:44 am
- Location: Troy, MI
-
- Premium Member
- Posts: 536
- Joined: Thu Oct 11, 2007 1:48 am
- Location: Bangalore
Yes i am doing the same thing but target data is mismatching,due to multiple nodeskeshav0307 wrote:use a sort stage,
sort on MISDN and DATE ASC.
remove duplicate on MISDN and keep the first record.
When i am running on single node,it works fine.
So is there any way so that if i run the job on multiple node,it will give the exact result??
-
- Participant
- Posts: 3337
- Joined: Mon Jan 17, 2005 4:49 am
- Location: United Kingdom