Lookup Performance Issue

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
paranoid
Premium Member
Premium Member
Posts: 185
Joined: Tue May 29, 2007 5:50 am

Lookup Performance Issue

Post by paranoid »

Hi,

I am construction a DS job that uses two lookups that contains 7 million records in each but my source is having only 150 records. When i run this job it is taking a lot of time as it needs to parse all those 7 million records in the lookuop. I have already indexed the columns in the WHERE clause.

Could any one of you suggest me a good way of increasing the performace?

Thanks

Sue
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

What stage type is being used for the lookup?
paranoid
Premium Member
Premium Member
Posts: 185
Joined: Tue May 29, 2007 5:50 am

Post by paranoid »

I am using a transformer stage and the output stage is XML output stage.

Thanks
paranoid
Premium Member
Premium Member
Posts: 185
Joined: Tue May 29, 2007 5:50 am

Post by paranoid »

I am using Oraclestage for lookup.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

A typical solution would involve building a hashed file from your Oracle source based on the natural keys in your data, then use the hashed file for the lookup. That way you don't pull 7M records into the hash but rather just as many of the 150 as you need.

Load the keys into a work table and join them to your source table(s).
-craig

"You can never have too many knives" -- Logan Nine Fingers
paranoid
Premium Member
Premium Member
Posts: 185
Joined: Tue May 29, 2007 5:50 am

Post by paranoid »

I think hashed file takes up lot of space in my server which i dont have. Is there any other solution?
nivas
Participant
Posts: 117
Joined: Sun Mar 21, 2004 4:40 pm

Post by nivas »

switch lookup and source positions if possible. I mean to say change the lookup to source and source to lookup as the present source has only 150 records.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

paranoid wrote:I think hashed file takes up lot of space in my server which i dont have.
No, they don't - not when handled properly.
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

paranoid wrote:I think hashed file takes up lot of space in my server which i dont have. Is there any other solution?
Get more space on your server.

If you haven't got enough space for a moderately sized hashed file you are going to run out of space somewhere, sometime in your ETL processing.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
paranoid
Premium Member
Premium Member
Posts: 185
Joined: Tue May 29, 2007 5:50 am

Post by paranoid »

Hey thank you everyone for replying to this message.I have requested to increase the space in my server.Please close this ticket.

Have a great day!!
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

:lol: Excellent.

ps. It's not a ticket. And you close it by marking it as "Resolved" which you've done. So we good.
-craig

"You can never have too many knives" -- Logan Nine Fingers
Krazykoolrohit
Charter Member
Charter Member
Posts: 560
Joined: Wed Jul 13, 2005 5:36 am
Location: Ohio

Post by Krazykoolrohit »

won't using a left outer join help here?

Ofcourse it will help only after the oracle stage has been replaced with the hash file as suggested by ray.

But, I think you can try using the join and run the job till the time you get additional space.
Post Reply