Hash file question

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
rafik2k
Participant
Posts: 182
Joined: Wed Nov 23, 2005 1:36 am
Location: Sydney

Hash file question

Post by rafik2k »

One basic question related to hash file:
In which scenario should we set Pre-load file to Memory-Enabled.

I mean to say, is it wise to set above property when hash file is big?

In my case, in the sequence jobs are running parallel and some of the jobs using hash files as well.

Need to know best practice on same.

Thanks in advance.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

If you 'know' it will always be too big to cache, don't set it - it will save a little startup time. Will it hurt to set it? No. It will try and you'll get a message it could not be cached.
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Did you mean "hashed" file?
:roll:
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

It will load the file to memory until it determines that it won't fit; the program isn't smart enough to detect from the file size that it cannot fit. This load causes IO and CPU and can take a long time on many systems, so it is best to do as Craig has already suggested and don't set the switch.
rameshrr3
Premium Member
Premium Member
Posts: 609
Joined: Mon May 10, 2004 3:32 am
Location: BRENTWOOD, TN

Post by rameshrr3 »

The setting you mentioned will be used when you need the link private caching facility. More details are found in the Disk Caching Guide. Problem is , you rarely can predict the size of a hashed file that you will use.
rafik2k
Participant
Posts: 182
Joined: Wed Nov 23, 2005 1:36 am
Location: Sydney

Post by rafik2k »

rameshrr3 wrote:The setting you mentioned will be used when you need the link private caching facility. More details are found in the Disk Caching Guide. Problem is , you rarely can predict the size of a hashed file that you will use.

Ramesh: Can you please explain little bit more about link private caching facility. I don't have any idea on same.
rafik2k
Participant
Posts: 182
Joined: Wed Nov 23, 2005 1:36 am
Location: Sydney

Post by rafik2k »

Thanks everyone for valuable input!

Craig: Will it give error message or warning message if data can not fit in the memory, if I set Pre-load file to Memory-Enabled ?
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

No, it will just be informational. Here, one from my logs:

Code: Select all

Project:XXX
Job name:YYYY
Event #:3187
Timestamp:5/27/2007 9:56:28 AM
Event type:Info
User:dsuser
Message:
Job.Link.HashedFileName: Read caching disabled, file size exceeds cache size
-craig

"You can never have too many knives" -- Logan Nine Fingers
rafik2k
Participant
Posts: 182
Joined: Wed Nov 23, 2005 1:36 am
Location: Sydney

Post by rafik2k »

chulett wrote:No, it will just be informational. Here, one from my logs:

Code: Select all

Project:XXX
Job name:YYYY
Event #:3187
Timestamp:5/27/2007 9:56:28 AM
Event type:Info
User:dsuser
Message:
Job.Link.HashedFileName: Read caching disabled, file size exceeds cache size
Thanks a lot Craig!
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

You might like to read the Technical Bulletin dsdskche.pdf to gain an in-depth knowledge of different possibilities for hashed file cache, such as private link cache, public link cache and shared cache.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply