Mergestage issue

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
nani0907
Participant
Posts: 155
Joined: Wed Apr 18, 2007 10:30 am

Mergestage issue

Post by nani0907 »

Hi ,

we are using merge stage to identify the insert and updates.its working fine with fewer records.in case of more records it doesnot identifies the inserts ad updates correctly.we are not using any partition technique and running on two nodes.please help me.
thanks n regards
nani
singhald
Participant
Posts: 180
Joined: Tue Aug 23, 2005 2:50 am
Location: Bangalore
Contact:

Post by singhald »

for merge stage , it is always better to provide sorted data set, use sort stage and sort both input on key fields,
Regards,
Deepak Singhal
Everything is okay in the end. If it's not okay, then it's not the end.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

And if that doesn't help, give us a better idea what "it doesnot identifies the inserts ad updates correctly" means.
-craig

"You can never have too many knives" -- Logan Nine Fingers
throbinson
Charter Member
Charter Member
Posts: 299
Joined: Wed Nov 13, 2002 5:38 pm
Location: USA

Post by throbinson »

You are using two nodes. This means you are partitioning on two nodes. This means, in general, half the data goes to one partition and half the data goes to the other partition. If you have not consciously planned for this partitioning then Datastage has done it for you. Erroneously, it would seem. Meaning, you must examine the partitioning scheme used and take a look at the keys you are using to merge the two datasets. Make sure ALL keys from both datasets end up in the same partitions.
A record in one partition will not be merged with a record in the other partition. Keys determine both partitioning and merging. Partitioning and merging keys must be consistent. This may be your problem. To verify that partitioning is your problem, use a single node config file. If the data is correct, then your problem is partitioning.
Or sorting...
Sreenivasulu
Premium Member
Premium Member
Posts: 892
Joined: Thu Oct 16, 2003 5:18 am

Post by Sreenivasulu »

HiAll,

We are also facing a similar issue with the merge stage.

Regards
Sreeni
nagarjuna
Premium Member
Premium Member
Posts: 533
Joined: Fri Jun 27, 2008 9:11 pm
Location: Chicago

Post by nagarjuna »

In addition to partitioning and sorting , take care f duplicates in the master link of the merge stage
Nag
Post Reply