Hashfile cache - how much memory is used

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
netland
Participant
Posts: 12
Joined: Tue Apr 08, 2003 11:43 pm

Hashfile cache - how much memory is used

Post by netland »

Hi,

I'm trying to find out how much memory is used when defining cache in the administrator (tunable)

I've set read and write cache to 128MB (default)
And my job is using some 50 hashfiles of different sizes, most of them are quite small)

Does my job allocate 50*128MB, or only the total size of the 50hashfiles, but with a project maximum of 128MB ?

Is the 128MB allocated for ALL hashfiles that are cache enabled. or is it a maximum for each hashfile, each job or the project ?

br
Tom
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Hello netland,

the cache size you specify is per hash file; and is not persistent. The cache is not pre-allocated but generated at runtime and discarded once finished.
scottr
Participant
Posts: 51
Joined: Thu Dec 02, 2004 11:20 am

Post by scottr »

in 32bit env the max hash file size is 2.2 Gig
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Scottr -
in 32bit env the max hash file size is 2.2 Gig
That is not quite true. The maximum size of a file system object is limited to 2Gb; but a default dynamic (type 30) hash file is actually composed of 2 file system files, so you can get significantly more data into a hashed file depending on the key types and data - but you never know how much data you can fit in until you *bang* hit the limit and most likely have a very lengthy fixfile process ahead of you :shock:
kcbland
Participant
Posts: 5208
Joined: Wed Jan 15, 2003 8:56 am
Location: Lutz, FL
Contact:

Re: Hashfile cache - how much memory is used

Post by kcbland »

netland wrote: Does my job allocate 50*128MB
No
netland wrote: or only the total size of the 50hashfiles, but with a project maximum of 128MB ?
No
netland wrote: Is the 128MB allocated for ALL hashfiles that are cache enabled.
No
netland wrote: or is it a maximum for each hashfile
Yes
netland wrote: each job or the project ?
No
Kenneth Bland

Rank: Sempai
Belt: First degree black
Fight name: Captain Hook
Signature knockout: right upper cut followed by left hook
Signature submission: Crucifix combined with leg triangle
kcbland
Participant
Posts: 5208
Joined: Wed Jan 15, 2003 8:56 am
Location: Lutz, FL
Contact:

Post by kcbland »

Okay, enough fun. Each hash file has a maximum size that can be cached, once the file at preload time exceeds that size you get a message telling you that the file is too big and it won't preload.

As for write delay, that's the maximum that can write delay until it has to start writing to disk.

You will find that DataStage uses very little memory, but when you can, you could enable hash file sharing in a job incase the same hash file is referenced many times in the job. This allows only one footprint in memory for that file. You can also look at the hash cache daemon to manage a shared hash file across jobs so that the job doesn't incur any preload time because the file is already cached.
Kenneth Bland

Rank: Sempai
Belt: First degree black
Fight name: Captain Hook
Signature knockout: right upper cut followed by left hook
Signature submission: Crucifix combined with leg triangle
throbinson
Charter Member
Charter Member
Posts: 299
Joined: Wed Nov 13, 2002 5:38 pm
Location: USA

Post by throbinson »

Can you tell me the practical differences between Link Private, Link Public and System caching?
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Those are well described in the "manual" dsdskche.pdf. System caching allows one hashed file to be shared between multiple jobs. I haven't attempted it but it may even allow sharing between jobs in multiple projects!
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply