Usage of Read Cache Size and Write Cache Size

sumesh.abraham
Participant
Posts: 153
Joined: Thu May 11, 2006 1:52 am
Location: Bangalore

Post by sumesh.abraham »

Hello,

I am analyzing the performance of a set of Server jobs.
One of the jobs has the following design:
Sequential file -> Transformer (hashed file lookup) -> Transformer (hashed file lookup) -> Oracle table load (DRS stage).

916990 records are passed into the second Transformer, where the lookup is against a hashed file with 178678 records. The lookup and the subsequent load to the Oracle table (Array Size and Transaction Size are both 15000) take close to 14 minutes. I am considering raising the project default Read cache size and Write cache size from 128 MB to a higher value such as 256 MB.
Is this a good approach to improve performance?

Appreciate your inputs.
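
For reference, the figures quoted above can be turned into a throughput number, which makes it easier to compare runs before and after any tuning. This is only a minimal sketch of the arithmetic; the function name is made up for illustration, and "close to 14 minutes" is treated as exactly 14.

```python
def rows_per_second(rows: int, minutes: float) -> float:
    """Average throughput for a job that processed `rows` rows in `minutes` minutes."""
    return rows / (minutes * 60)

# Figures from the post: 916990 input rows, roughly 14 minutes elapsed.
rate = rows_per_second(916_990, 14)
print(f"{rate:.0f} rows/s")
```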
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Possibly, but we can't really say because we don't know how large your rows are. If the smaller cache size is working (all your hashed file data fit into memory) then allocating more memory will have no effect.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
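
The point about row size can be illustrated with a back-of-the-envelope check. The sketch below is not a DataStage formula: the average row widths, the 1.5x overhead factor, and the use of binary megabytes are all assumptions made for illustration. Only the 178678 row count and the 128 MB and 256 MB cache sizes come from the thread.

```python
MB = 1024 * 1024  # assumption: cache sizes are counted in binary megabytes

def estimated_cache_bytes(row_count, avg_row_bytes, overhead_factor=1.5):
    """Rough in-memory footprint. overhead_factor is a guess, not a documented value."""
    return int(row_count * avg_row_bytes * overhead_factor)

def fits_in_cache(row_count, avg_row_bytes, cache_mb, overhead_factor=1.5):
    """True if the estimated footprint is within the configured cache size."""
    return estimated_cache_bytes(row_count, avg_row_bytes, overhead_factor) <= cache_mb * MB

ROWS = 178_678  # hashed file row count quoted in the thread

for avg_row_bytes in (200, 1000):      # hypothetical average row widths in bytes
    for cache_mb in (128, 256):        # current and proposed cache sizes
        need_mb = estimated_cache_bytes(ROWS, avg_row_bytes) / MB
        verdict = "fits" if fits_in_cache(ROWS, avg_row_bytes, cache_mb) else "does not fit"
        print(f"{avg_row_bytes} B/row: ~{need_mb:.0f} MB needed, {verdict} in {cache_mb} MB")
```

Under these assumptions, narrow rows already fit in 128 MB, so a larger cache would change nothing, while wide rows would only fit after the increase. The real answer depends on the actual row size of the hashed file.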
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Keep in mind that a change at that level will affect all cached hashed files and effectively double the memory they will allocate when running (assuming you go 128 -> 256).
-craig

"You can never have too many knives" -- Logan Nine Fingers
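
The doubling effect can be put into numbers with a simple upper bound. This is a rough sketch, assuming each cached hashed file may use up to the full configured cache size; the count of ten concurrently cached hashed files is hypothetical, and actual usage may well be lower.

```python
def worst_case_cache_mb(cached_hashed_files: int, cache_mb: int) -> int:
    """Upper bound on cache memory if every cached hashed file uses the full limit."""
    return cached_hashed_files * cache_mb

CONCURRENT_FILES = 10  # hypothetical number of hashed files cached at the same time

before = worst_case_cache_mb(CONCURRENT_FILES, 128)
after = worst_case_cache_mb(CONCURRENT_FILES, 256)
print(f"128 MB setting: up to {before} MB; 256 MB setting: up to {after} MB")
```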
sumesh.abraham
Participant
Posts: 153
Joined: Thu May 11, 2006 1:52 am
Location: Bangalore

Post by sumesh.abraham »

Thanks. The lookups that we perform vary from a few hundred records to 200 or 300 thousand records. My understanding is that the Read cache and Write cache sizes are set at the project level. In that case, the other jobs that look up fewer records may not need their lookups to happen in memory. I just want to make sure that we do not impact other jobs while trying to tune a handful.

Please let me know your thoughts.