sorted input to Join
Posted: Wed Feb 08, 2006 9:55 am
Must Join stage have sorted inputs (all or any one) ![Question :?:](./images/smilies/icon_question.gif)
![Question :?:](./images/smilies/icon_question.gif)
The data sets input to the Join stage must be key partitioned and sorted.
It is recommended to sort before join so that the join will be more efficient. If you're sorting than all sources must be sorts the same way before join.djoni wrote:Must Join stage have sorted inputs (all or any one)
Recommended or Mandatory?felixyong wrote:It is recommended to sort before join so that the join will be more efficient. If you're sorting than all sources must be sorts the same way before join.djoni wrote:Must Join stage have sorted inputs (all or any one)
It is even better to sort using the RDBMs if the join is already indexed in the RDBMs so that you can save processing time & resources in DataStage Server.
Runs well on two un-sorted sequential files, auto partitioned no sort.ray.wurlod wrote:Mandatory if the manual is to be believed. I believe it so have always key partitioned and sorted Join stage inputs. Perhaps you'd like to try without, and let us know the result?
djoni wrote:Runs well on two un-sorted sequential files, auto partitioned no sort.ray.wurlod wrote:Mandatory if the manual is to be believed. I believe it so have always key partitioned and sorted Join stage inputs. Perhaps you'd like to try without, and let us know the result?
So, is something wrong with the manual and .... EE Essential course?