How to increase the JOIN stage performance

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
HariD
Participant
Posts: 15
Joined: Tue Feb 23, 2010 3:14 am
Location: Chennai
Contact:

How to increase the JOIN stage performance

Post by HariD »

I have so many JOIN stages in my job then i want to improve the performance of the job.All join stages partition type is Auto is there result while setting the any other partition technique..

Any answer would be appreciated.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Joins want sorted data, so it is best to put in your own explicit sort stages and, when not necessary, mark sort key columns as "don't sort, previously sorted". Partitioning and repartitioning must be done on the join keys.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Investigate your logic - see if you can eliminate any of the sorts. As Arnd noted, using a Sort stage to avoid re-sorting is often useful.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply