Identify Unique and Non_Unique Records...

praburaj
Premium Member
Posts: 133
Joined: Thu Jun 30, 2005 1:26 am
Location: philippines

Identify Unique and Non_Unique Records...

Post by praburaj »

Hi,

My source is a file with a single column named ENAME, and the data looks like this:
ENAME
1
1
2
3
My problem is that I need to capture all the unique records in one file and the records that are not unique in another file. All I know is that if there are duplicate records, I need to capture them and store them in a separate file. I have no idea how to implement this logic. Any help is really appreciated.

Output:

Uniq_File1    NotUniq_File2
2             1
3             1
prabakaran.v
ds@tcs
Participant
Posts: 24
Joined: Thu Mar 17, 2011 6:26 am
Location: chennai

Re: Identify Unique and Non_Unique Records...

Post by ds@tcs »

First sort the data on ENAME.
Then use an Aggregator stage to perform a group count on ENAME, followed by a Filter stage that sends the records with count > 1 to one link and the records with count = 1 to another link.

Try this.
Regards
Sankar
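
For anyone who wants to check the expected result outside DataStage, the sort, group count, and filter steps can be sketched in plain Python. This is only an illustration of the logic, not the parallel job itself; the function name `split_unique` and the sample list are made up for the example. Note that the sketch keeps every original row of a duplicated key (both `1` rows), as in the requested output. A group count on its own returns one row per key, so in a real job the count would probably have to be joined back to the original rows before filtering.

```python
from itertools import groupby


def split_unique(values):
    """Split values into (unique, not_unique) lists.

    Mirrors the suggested job: sort, count per key, then route
    count == 1 to one output and count > 1 to the other.
    """
    unique, not_unique = [], []
    # groupby only groups adjacent equal values, hence the sort first
    for _key, group in groupby(sorted(values)):
        rows = list(group)  # all original rows sharing this key
        if len(rows) == 1:
            unique.extend(rows)       # the "count = 1" link
        else:
            not_unique.extend(rows)   # the "count > 1" link
    return unique, not_unique


# Sample ENAME column from the question (hypothetical in-memory copy)
ename = ["1", "1", "2", "3"]
uniq_file1, notuniq_file2 = split_unique(ename)
print(uniq_file1)     # ['2', '3']
print(notuniq_file2)  # ['1', '1']
```
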
praburaj
Premium Member
Posts: 133
Joined: Thu Jun 30, 2005 1:26 am
Location: philippines

Re: Identify Unique and Non_Unique Records...

Post by praburaj »

Thanks, ds. Your input really helped me a lot. I fixed my problem. Thanks once again.
prabakaran.v