Sorting with Hash partition

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

koolnitz
Participant
Posts: 138
Joined: Wed Sep 07, 2005 5:39 am

Post by koolnitz »

Hi,

I have a question about the Join stage with Hash partitioning.

Is there any scenario in which sorting should not be done when joining two streams with Hash partitioning? Or should I blindly check the Sort option whenever the partition type is set to Hash?

Thanks!
Nitin Jain | India

If everything seems to be going well, you have obviously overlooked something.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Read the chapter in the Parallel Job Developer's Guide on the Join stage. The Join stage requires its inputs to be sorted (on the join key columns). The partitioning algorithm is irrelevant to this requirement.

The only time you would elect not to use Sort on the input link is where you are totally confident that the data on the input links are already correctly sorted.
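To illustrate why partitioning and sorting are separate concerns, here is a minimal Python sketch. It is not DataStage code and not how the Join stage is actually implemented; it assumes a generic sort-merge join run independently on each partition. The names `hash_partition`, `merge_join`, and `partitioned_join` are hypothetical, chosen only for this example. Hash partitioning places matching keys in the same partition, but the merge step still assumes each partition arrives in key order.

```python
import zlib
from operator import itemgetter

def hash_partition(rows, key, n_partitions):
    """Send each row to a partition chosen by a hash of its key value."""
    parts = [[] for _ in range(n_partitions)]
    for row in rows:
        idx = zlib.crc32(str(row[key]).encode()) % n_partitions
        parts[idx].append(row)
    return parts

def merge_join(left, right, key):
    """Inner join that ASSUMES both inputs are already sorted on key."""
    out = []
    i = j = 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][key], right[j][key]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the run of equal keys on each side, then pair them up.
            i_end = i
            while i_end < len(left) and left[i_end][key] == lk:
                i_end += 1
            j_end = j
            while j_end < len(right) and right[j_end][key] == lk:
                j_end += 1
            for l_row in left[i:i_end]:
                for r_row in right[j:j_end]:
                    out.append({**l_row, **r_row})
            i, j = i_end, j_end
    return out

def partitioned_join(left, right, key, n_partitions=2, sort_inputs=True):
    """Hash-partition both inputs, optionally sort, then join per partition."""
    result = []
    left_parts = hash_partition(left, key, n_partitions)
    right_parts = hash_partition(right, key, n_partitions)
    for lp, rp in zip(left_parts, right_parts):
        if sort_inputs:
            lp = sorted(lp, key=itemgetter(key))
            rp = sorted(rp, key=itemgetter(key))
        result.extend(merge_join(lp, rp, key))
    return result

# Made-up sample data: the left input is NOT in key order.
left = [{"id": 3, "a": "x"}, {"id": 1, "a": "y"}, {"id": 2, "a": "z"}]
right = [{"id": 1, "b": "p"}, {"id": 2, "b": "q"}, {"id": 3, "b": "r"}]

with_sort = partitioned_join(left, right, "id", n_partitions=2, sort_inputs=True)
# A single partition keeps the unsorted case deterministic for this demo.
without_sort = partitioned_join(left, right, "id", n_partitions=1, sort_inputs=False)
print(len(with_sort), len(without_sort))
```

In this sketch the sorted run returns all three matching rows, while the unsorted run silently drops matches even though the partitioning is identical. Skipping the sort is only safe when the input is already in join-key order.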
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.