REMOVE DUPLICATES IN TRANSFORMER STAGE

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
hemanth12
Participant
Posts: 6
Joined: Tue Feb 01, 2011 11:45 pm

REMOVE DUPLICATES IN TRANSFORMER STAGE

Post by hemanth12 »

Our requirement is

DB-->TR-->Seq or DB(rest of the records in target)
|
Rejected file(seq)(in this i need duplicates in rejected file rest of the records in target)

plz give me the briefly explanation.if it is posible with screen shots to send me..
my mail-id:hemanthrao.k@gmail.com

Thanks in advance...
DATASTAGE DEVELOPER
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Set up a unique sort on the input link to the Transformer stage.

But it's a silly requirement to want to do this in the Transformer stage.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
hemanth12
Participant
Posts: 6
Joined: Tue Feb 01, 2011 11:45 pm

REMOVE DUPLICATES IN TRANSFORMER STAGE

Post by hemanth12 »

thanks for reply....

but my requirement is
Don't use any of the part ions technic and sort ...unique

I need duplication records in one file and remaining records in another file .using only for transformer stage..don't use any remove duplicate .in transformer don't use any of the partion technic ....


Regards
Hemanth

ray.wurlod wrote:Set up a unique sort on the input link to the Transformer stage.

But it's a silly requirement to want to do this in the Transformer stage. ...
DATASTAGE DEVELOPER
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Resist stupid requirements. You can't do it without sorted data. And that implies properly partitioned data.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Vidyut
Participant
Posts: 24
Joined: Wed Oct 13, 2010 12:45 am

Post by Vidyut »

Ray is right without sorted data u cant remove duplicates using transformer.
What you can do is sort the data on key and then use two stage variables to remove the duplicates.
one having the last value and one current and by comparing both u can remove the duplicates
Post Reply