remove duplicates

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
harryhome
Participant
Posts: 112
Joined: Wed Oct 18, 2006 7:10 am

remove duplicates

Post by harryhome »

I have 5 partitions and trying to get distinct records on a key column using remove duplicate stage. getting different number of record count each time I run job.
SURA
Premium Member
Premium Member
Posts: 1229
Joined: Sat Jul 14, 2007 5:16 am
Location: Sydney

Re: remove duplicates

Post by SURA »

SORT it!

What do you mean 5 partition?

DS User
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Is this the same question as this one?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
prakashdasika
Premium Member
Premium Member
Posts: 72
Joined: Mon Jul 06, 2009 9:34 pm
Location: Sydney

Post by prakashdasika »

Do you mean 5 nodes ? link Sort the data in hash partion on the key. ascending or descending depends on the preference.
Prakash Dasika
ETL Consultant
Sydney
Australia
Post Reply