Hash file as a global reference

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
paddu
Premium Member
Premium Member
Posts: 232
Joined: Tue Feb 22, 2005 11:14 am
Location: California

Hash file as a global reference

Post by paddu »

Hi there,

I have a requirement where I need to use a hash file as global reference in several jobs out those jobs I have to parallel atleast two or more.

So two or more jobs using the the same hash file as a reference(lookup) as well as I need to insert new records into that hash file in order to keep it current. How good this approach will work and is performence an issue?

Will hash file work based on some file lock protocol for reference vs update with different processes running at the same time?

If this approach is not the right one, what would be the best way to go when I have to use the same dataset as a global reference in several jobs where I also need to update the hashfile.

Thanks in advance.
Paddu
kcbland
Participant
Posts: 5208
Joined: Wed Jan 15, 2003 8:56 am
Location: Lutz, FL
Contact:

Post by kcbland »

Run the jobs that update the hash file first, then all of the other jobs can access a static hash file. Using locking mechanisms will cause a lot of headaches as jobs will stall waiting on locks, plus you probably will end up hanging your jobs when you aren't releasing locks quickly enough. This is a bad approach, take the recommendation stated.
Kenneth Bland

Rank: Sempai
Belt: First degree black
Fight name: Captain Hook
Signature knockout: right upper cut followed by left hook
Signature submission: Crucifix combined with leg triangle
susanm
Premium Member
Premium Member
Posts: 4
Joined: Wed Jan 08, 2003 4:45 pm

Post by susanm »

Hello Paddu,

You can also investigate the use of hashed file cache sharing.

There is a check box on the properties page of the job where you specify that you want to share the hashed file cache. This will allow several jobs to share the hashed file cache for both reading and writing.

Before you can use this option you need to update your uvconfig file to set some caching tunables.

For further reading refer to the DataStage Disk Cache Sharing Guide.

Susan
Post Reply