Performance suggestion.

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
pimmit22043
Participant
Posts: 38
Joined: Thu Jul 26, 2007 6:11 am

Performance suggestion.

Post by pimmit22043 »

Oracle db source has 10,000 rows and joining with 8mill rows table. Even thought it is a parallel job it runs in sequential mode; job is taking more than 2 hrs to complete.
lookup table or source table is not oracle partitioned so we are not partitioning the datastage job and runs sequential mode in parallel job.



Can anybody suggests me how I can improve the performance parallel job.


Thansk for time;
pt
pimmit22043
Participant
Posts: 38
Joined: Thu Jul 26, 2007 6:11 am

Post by pimmit22043 »

any suggestions please !
singhald
Participant
Posts: 180
Joined: Tue Aug 23, 2005 2:50 am
Location: Bangalore
Contact:

Post by singhald »

i want to know few more details about the join conditions,

what are the column you are using to join, \are these column have index or not ?. are u performing join in sql or using DS join stage
Regards,
Deepak Singhal
Everything is okay in the end. If it's not okay, then it's not the end.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

pimmit22043 wrote:any suggestions please !
Learn patience. Your two posts were 11:51pm and 3:24am my time.
How long does the Oracle query take by itself to complete (not to return the first row)?
What is your job design?
Are you doing normal lookup or sparse lookup?
How many rows are there in the reference data? These have to be loaded into a virtual Data Set before the main stream can start.
Would a join in Oracle be more efficacious?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply