How is scratch space calculated?

goriparthi
Charter Member
Posts: 57
Joined: Fri Feb 24, 2006 7:44 am

How is scratch space calculated?

Post by goriparthi »

Hi All,

I am running a parallel job with a Remove Duplicates stage on an input source of 5 columns. It failed with an error saying there was not enough space.

One of the columns is varchar2(4000). If I calculate the space needed using the column length of 4000, the error makes sense. If I use the actual length of the data in that column (around 20), it does not.

Can anyone confirm whether DataStage uses the defined length of 4000 or the length of the data actually in the column?

thanks
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany

Post by ArndW »

When DataStage allocates structures and space, it has to use the defined maximum length. When it writes data to disk, it uses the actual contents unless you specify fixed length. I think the Remove Duplicates stage is allocating space based on the metadata and doing its comparisons as if the string were a fixed-length CHAR, which explains your problem.
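
To see how far apart the two estimates can be, here is a rough back-of-the-envelope sketch in Python. It is not how DataStage itself computes scratch usage; the row count and the widths of the other four columns are made-up figures for illustration, and per-record overhead is ignored. Only the 4000 and the "around 20" come from the question above.

```python
def scratch_bytes(rows, column_widths):
    """Rough estimate: every row occupies the sum of the given column widths."""
    return rows * sum(column_widths)

# Hypothetical figures, not taken from the original job.
ROWS = 1_000_000
OTHER_COLUMNS = [10, 10, 10, 10]  # assumed widths of the four remaining columns

# Sized from the metadata: the varchar2(4000) column counted at its defined maximum.
by_metadata = scratch_bytes(ROWS, OTHER_COLUMNS + [4000])

# Sized from the content: the same column counted at its typical data length.
by_content = scratch_bytes(ROWS, OTHER_COLUMNS + [20])

print(f"sized by metadata: {by_metadata / 1024**3:.2f} GiB")
print(f"sized by content:  {by_content / 1024**3:.2f} GiB")
print(f"ratio: {by_metadata / by_content:.0f}x")
```

With these assumed numbers the metadata-based figure is dozens of times larger than the content-based one, which would match a job running out of space even though the real data is small.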