Performance Tuning.

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
mprashant
Participant
Posts: 18
Joined: Thu Apr 29, 2004 1:23 pm

Performance Tuning.

Post by mprashant »

Hello
I have a huge job that has performance issues. It takes 1 hr 25 min to move data across for 2100 rows! The size of the data is not too huge however there are 2 transformers directly connected with over 20 look ups each to universe stages. The resaon for universe stages is beacuse the has files do not allow me to look up for values between a range.
Is there a way I can increase performance of this job ? the previous consultant used the universe stage? Is therea benefit using Universe stages as opposed to say a DRS stage?
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Check this post for a very interesting way to do what it sounds like you are doing with UV stages in a hash stage instead. Takes a little bit to setup but very speedy stuff.

Or you may just need to build indexes for your hashed key columns... are they indexed?
-craig

"You can never have too many knives" -- Logan Nine Fingers
rasi
Participant
Posts: 464
Joined: Fri Oct 25, 2002 1:33 am
Location: Australia, Sydney

Post by rasi »

Try spiltting one job which does 20 lookups into many jobs with few lookups on each job. This should help.


Cheers
Rasi
mprashant
Participant
Posts: 18
Joined: Thu Apr 29, 2004 1:23 pm

Post by mprashant »

Now the look ups were against Universe databases and I changed all that to the DRS stages and it now runs in 5 min. I split the look-ups over a couple of more transformers, turned on the inter-process buffering, increased the read write cache size for the Hashed Files to 999 MB, tried playing around with a variety of Array sizes and have managed to get it to 4 min.
I was wondering if there was anything else I could do to increase the performance? I am still not happy with the 4 min and i think it takes that much time because it has to go over the network for each row for all those look-ups with the DRS stages.The reason we are using DRS stages instead of hash files is we are checking for contraints between two values
Post Reply