Regarding partitioning
Posted: Wed Apr 25, 2012 8:50 am
I have a question about partitioning. My job uses Sort -> Remove Duplicates -> Join.
The Sort and Remove Duplicates stages work on three keys (say key1, key2, and key3), while the Join works on two keys (key1 and key2). In this case, do I need to re-partition (hash) in the Join stage on those two keys?
These two join keys are already hash partitioned in the Sort stage.
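To make the question concrete, here is a rough Python sketch (not DataStage code) of how hash partitioning places rows. It is only an illustration under assumptions: `partition_of` and `partitions_per_join_key` are hypothetical helper names, `zlib.crc32` stands in for whatever hash function the engine really uses, and 4 partitions is an arbitrary choice. It compares hashing on the two join keys with hashing on all three sort keys.

```python
import zlib
from collections import defaultdict


def partition_of(row, hash_keys, num_partitions):
    """Pick a partition by hashing the chosen key columns.

    zlib.crc32 is only a stand-in for the engine's real hash function.
    """
    raw = "|".join(str(row[k]) for k in hash_keys).encode("utf-8")
    return zlib.crc32(raw) % num_partitions


def partitions_per_join_key(rows, hash_keys, join_keys, num_partitions):
    """Map each join-key value to the set of partitions its rows land in."""
    placement = defaultdict(set)
    for row in rows:
        join_value = tuple(row[k] for k in join_keys)
        placement[join_value].add(partition_of(row, hash_keys, num_partitions))
    return dict(placement)


# Ten rows that share key1 and key2 but differ in key3.
rows = [{"key1": 1, "key2": "A", "key3": k3} for k3 in range(10)]
join_keys = ["key1", "key2"]

# Case 1: hash on the two join keys only.
by_two = partitions_per_join_key(rows, ["key1", "key2"], join_keys, 4)

# Case 2: hash on all three sort keys.
by_three = partitions_per_join_key(rows, ["key1", "key2", "key3"], join_keys, 4)

print("hash on key1,key2      ->", by_two)
print("hash on key1,key2,key3 ->", by_three)
```

In this model, hashing on key1 and key2 keeps every row with the same join key in one partition, and rows that match on all three keys also stay together, so the three-key sort and duplicate removal still see their groups whole. Hashing on all three keys can scatter rows with the same key1 and key2 across partitions, which is the situation where a re-partition before the join would matter.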