In the current scenario, I have 29 Million rows in PROD for refernce dataset, with each row having 2 Varchar Columns of length 150, 1 varchar column of length 25 and 2 decimal(38,10) columns for surrogate keys.
Complicating this is the fact that its a single node system on windows 32 Bit, so im forced to use execution mode = sequential.. Apparantly there were install related issues for multi node config..
I tried using join stage(left Outer) with data presorted on both key columns on the input and ref dataset, but job failed with "DSEE-TFIO" errors like these
ora_InvHdr_Vdr,0: Write to dataset on [fd 4] failed (Insufficient system resources exist to complete the requested service. ) on node node1, hostname A300SM111
There is also an existing server job that seems to work fine for now , but due to forecasted data volumes, the 32 bit hash file would reach its storage limit, and we are trying to avoid server jobs in the hope that the single node issue will be resolved sometime .
Sparse lookup does not seem to be an option as data volumes are in the range of 500 K to 1 M rows every morning : )
Im now trying to use a lookup file set - but thats just a testing exercise , and awaiting results
![Embarassed :oops:](./images/smilies/icon_redface.gif)