Page 1 of 1

Explicit sort vs. Sort in the join stage?

Posted: Wed Mar 15, 2006 7:00 pm
by mctny
Helloo Guys/Girls,

I have a simple question, when you are joining two tables by using join stage, do you have to sort and partition the tables by using a sort stage or is it enough to click on the auto box inside the join stage. in the reference book it says it is enough to just click on the auto check box and it will automatically sort by using the key fields and partition the file.

if it is enough which one is better? a person told me it is not a good idea to use the auto click box inside the join stage, it can give wrong results, poor performance etc, but I don't completely agree with him. just wanted to make sure here.

p.s. I am new to DS PX and since I don't have a parallel extender I cannot test by myself. just trying to learn by reading only , for now.

thanks in advance

Posted: Thu Mar 16, 2006 3:32 am
by kumar_s
Ok I assume the 'click box auto' that you mean is the Auto partiton.
Yes by enabling auto partiton, orchestrate it self will introduce a sort and partiton the data based on the key specifed in the join stage.
But based on the previous experience (of all who use this this forum) it is all better and effecient to use a manual sort stage and partion the data based on the key.
If you use auto partion, it not gaurented the type of partiton used. For scalability and for better troubleshooting, follow a the harder method.

Posted: Thu Mar 16, 2006 9:12 am
by DSguru2B
Its always better to sort in a seperate stage, this will help in your performance as mentioned by kumar.