Performance tuning in DS for Join..

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
devesh_ssingh
Participant
Posts: 148
Joined: Thu Apr 10, 2008 12:47 am

Performance tuning in DS for Join..

Post by devesh_ssingh »

Hello Mentors,

Hope you doing well.
i have somthing to discuss relating join stage and its performance.

Its lot been dicussed but i would like to know how would optimize more in JOIN stage of DS...

lets take a scenerio
Join two tables each having 10 millions records....inner join
what i did is sorted them by qyerying to db and then again applied hash partition on 2-node system....i have flexibilty to select 4-node, 8-node
as well...

let me know if i have acheived great level of optimization or i can improve it further.

Thanks,
Devesh
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

DataStage is unnecessarily re-sorting your data. Add a Sort stage to each input link to the Join stage. In each Sort stage specify "don't sort (already sorted)" for the sort key columns.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
arun_im4u
Premium Member
Premium Member
Posts: 104
Joined: Mon Nov 08, 2004 8:42 am

Post by arun_im4u »

Ray,

Wouldn't the Stable sort option in the Join stage do the same function?
asorrell
Posts: 1707
Joined: Fri Apr 04, 2003 2:00 pm
Location: Colleyville, Texas

Post by asorrell »

The Stable option doesn't mean "no sort" - it means "if you have multiple records with the same identical keys, keep them in the same order going out that they were coming in", something that isn't guaranteed otherwise.
Andy Sorrell
Certified DataStage Consultant
IBM Analytics Champion 2009 - 2020
arun_im4u
Premium Member
Premium Member
Posts: 104
Joined: Mon Nov 08, 2004 8:42 am

Post by arun_im4u »

Thanks for the clarification.
Post Reply