Hash partition not working in Join stage

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
wfis
Premium Member
Premium Member
Posts: 70
Joined: Wed Feb 28, 2007 2:38 am
Location: India

Hash partition not working in Join stage

Post by wfis »

Hi All,

I have a datastage job in which the i am fetching data from a Teradata Connector and second from Dataset.

I join the data using a join stage.

When I am partitioning the data using Hash/Sort method on both the input links, the join is not provioding the correct output.

But when I change the partition method to ENTIRE, the join stage works fine. I cant go for Entire partition as the Volume of data is supposed to be very huge.

Has anyone ever faced such issue before??? If so what can be the resolution as the workaround of ENTIRE does not seem good to me.

Please suggest.

Regards,
Amit
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

If the inputs are identically hash partitioned and sorted, then it will work. Therefore I must assume that there was some difference between the inputs in your case in how the partition and/or sort keys were specified.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
wfis
Premium Member
Premium Member
Posts: 70
Joined: Wed Feb 28, 2007 2:38 am
Location: India

Post by wfis »

ray.wurlod wrote:If the inputs are identically hash partitioned and sorted, then it will work. Therefore I must assume that there was some difference between the inputs in your case in how the partition and/or sort keys were specified.
We checked it many times, we did not have any discrepancy in the Hash/Sort order of the key columns in both of the links.

Still no Luck. Ultimately we tries the Entire method, where it worked...


What can be the reasonm for this???
Mike
Premium Member
Premium Member
Posts: 1021
Joined: Sun Mar 03, 2002 6:01 pm
Location: Tampa, FL

Post by Mike »

As Ray said , if the keys are identical in both inputs, then it will work. Are the data types of the key columns exactly the same as well?

Mike
Shruthi
Participant
Posts: 74
Joined: Sun Oct 05, 2008 10:59 pm
Location: Bangalore

Re: Hash partition not working in Join stage

Post by Shruthi »

Hi Amit,

I'm also facing similar issue. How did u use entire partitioning?

My target is a DB2 table. When I give entire partition in join, it gives primary key violation.

Going mad with this... Please help!
Shruthi
shershahkhan
Participant
Posts: 64
Joined: Fri Jan 25, 2008 4:41 am

Post by shershahkhan »

If you are using the instage sort then try using a saparate sort stage before partioning, it worked in my case
Post Reply