Link1 - 30 million rows
Link2 - 10 million rows (Right Link)
When i use the explicit link sort (hash,sort) on both these links on my join keys , the job aborts after running for some time with this fatal error:
buffer(20),1: APT_BufferOperator: Add block to queue failed. This means that your buffer filesystems all ran out of file space, or that some other system error occurred. Please ensure that you have sufficient scratchdisks in either the default or "buffer" pools on all nodes in your configuration file.
But when I place an explicit Sort Stage on both the input links to the join stage, the job runs successfully to completion.
What exactly happens during a link sort that differs from a Sort Stage? Can someone please throw some light on the process.
Thanks in advance
![Smile :)](./images/smilies/icon_smile.gif)