Avoid Duplicates in Transformer stage

psreepa123
Participant
Posts: 5
Joined: Sat Jan 02, 2010 6:23 pm

Avoid Duplicates in Transformer stage

Post by psreepa123 »

Hello all,
I have an existing job laid out as follows:

dataset stage --> transformer stage --> 5 different dataset stages

I need to filter out the duplicate records on one of the 5 output links that run from the Transformer stage to the dataset stages. Can I do it without placing a Remove Duplicates stage between the Transformer stage and the dataset stage?

Thanks
Psrrepa
Sainath.Srinivasan
Participant
Posts: 3337
Joined: Mon Jan 17, 2005 4:49 am
Location: United Kingdom

Post by Sainath.Srinivasan »

You can include a link sort to remove duplicates, but that is not advisable.

Why do you NOT want a Remove Duplicates stage?
ArjunK
Participant
Posts: 30
Joined: Sun Apr 30, 2006 6:32 pm

Post by ArjunK »

What other way did you have in mind? You can do it either with the stage or directly on the link.
psreepa123
Participant
Posts: 5
Joined: Sat Jan 02, 2010 6:23 pm

how

Post by psreepa123 »

I know we can sort on the link, but how do we remove the duplicates in the Transformer stage?
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Stage variables and Group Change Detection techniques. An exact search here for that phrase should turn up the details.
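
For readers who have not seen the technique, here is a minimal sketch of the idea in Python rather than in Transformer derivation syntax. It assumes the input is already sorted on the key column; `sv_prev_key`, `sv_is_new_group`, and the `cust_id` column are hypothetical names chosen for illustration, not anything from the original job. The "stage variable" remembers the previous row's key, and the "constraint" on the one output link passes a row only when the key changes.

```python
from typing import Any, Dict, Iterable, Iterator, Optional

Record = Dict[str, Any]


def drop_duplicates_by_key_change(rows: Iterable[Record], key: str) -> Iterator[Record]:
    """Yield the first row of each key group; rows must arrive sorted on `key`."""
    sv_prev_key: Optional[Any] = None  # plays the role of a stage variable
    first_row = True
    for row in rows:
        # Group change detection: compare this row's key with the remembered one.
        sv_is_new_group = first_row or row[key] != sv_prev_key
        # Remember the current key for the next row (evaluated after the comparison).
        sv_prev_key = row[key]
        first_row = False
        # Plays the role of the constraint on the single deduplicated output link.
        if sv_is_new_group:
            yield row


# Hypothetical sample data, already sorted on cust_id.
rows = [
    {"cust_id": 1, "amt": 10},
    {"cust_id": 1, "amt": 10},
    {"cust_id": 2, "amt": 5},
    {"cust_id": 3, "amt": 7},
    {"cust_id": 3, "amt": 7},
]
deduped = list(drop_duplicates_by_key_change(rows, "cust_id"))
```

The order matters: the comparison has to run before the remembered key is overwritten. Because the check sits on one output link only, the other four links would still receive every row.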
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Or you can have a unique sort on the input link of the Transformer stage.

That's still "in the Transformer stage".
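
As a rough illustration of what a unique sort does, the Python sketch below sorts on a key and keeps one row per key value. The function name `unique_sort` and the `cust_id` column are assumptions made for the example. Note that in this sketch the deduplication happens before any output is produced, so it would affect every downstream link rather than just one of the five.

```python
from itertools import groupby
from operator import itemgetter


def unique_sort(rows, key):
    """Sort rows on `key` and keep the first row of each key group."""
    # sorted() is stable, so rows with equal keys keep their input order.
    ordered = sorted(rows, key=itemgetter(key))
    # groupby() clusters adjacent rows with the same key; take the first of each.
    return [next(group) for _, group in groupby(ordered, key=itemgetter(key))]


# Hypothetical unsorted sample data.
sample = [
    {"cust_id": 3, "amt": 7},
    {"cust_id": 1, "amt": 10},
    {"cust_id": 3, "amt": 9},
    {"cust_id": 1, "amt": 12},
]
result = unique_sort(sample, "cust_id")
```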
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.