About Join Stage in DataStage EE Edition

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
bhargav_dd
Premium Member
Premium Member
Posts: 57
Joined: Tue Jun 30, 2009 9:38 am

About Join Stage in DataStage EE Edition

Post by bhargav_dd »

Hi,
My job has two sources for Join Stage, 1 of them is DB2 API Stage connected to MVS DB and the other is DB2 EE Stage connected to AIX DB2. The Join Stage use column 'FILE_ID' as Join Key to perform a Left Outter join. The mode of this Join Stage is Defaulf(Parallel) and the Partition Type in the input tab is 'Auto'. The problem I met is recently I found that I can not get the expecting outcome from this Join Stage anymore. I am going to show you guys the data in the Source Stage and the outcome I got below, fyi...
Data in DB2 API Source Stage:
EVENT_NAME

FILE_ID

BATCH_NBR

FILE_NAME

STATUS

LAST_UPT_ID

LAST_UPT_TIME
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

What do you mean by "incorrect"? Do you get correct results if you disable parallel execution (as a test)? Have you tried explicitly partitioning on FILE_ID using a key-based algorithm? Are both inputs sorted on FILE_ID?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply