What is difference between explicit Sort stage and sort ....
Posted: Thu Sep 20, 2007 2:57 am
Hi All,
To use a join stage the data should be Hash partitioned and sorted. In our jobs we join 2 tables. We use Sort stage for each input link, to sort data and to Hash partition, before the Join stage.
By using an explicit Sort stage is there any advantage over the in-stage sorting ? By in-stage sorting i mean the Sort option inside the Join stage.
Is explicit sort performance wise better compared to in-stage sort ?
Can anyone please clarify and provide more details on this?
Thanks in Advance !
To use a join stage the data should be Hash partitioned and sorted. In our jobs we join 2 tables. We use Sort stage for each input link, to sort data and to Hash partition, before the Join stage.
By using an explicit Sort stage is there any advantage over the in-stage sorting ? By in-stage sorting i mean the Sort option inside the Join stage.
Is explicit sort performance wise better compared to in-stage sort ?
Can anyone please clarify and provide more details on this?
Thanks in Advance !