Improve Performance of a job

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
jagadam
Premium Member
Premium Member
Posts: 107
Joined: Wed Jul 01, 2009 4:55 pm
Location: Phili

Improve Performance of a job

Post by jagadam »

Hi Everyone,

I have a job design which pulls data from source compares with old data using change capture stage and then performs inserts/updates/deletes.

Design:
Oracle EE stage ( OLD DATA )
|
Source (Oracle EE stage)---> CC ---- updates/inserts/deletes.(Oracle EE stage)

Apart from these stages job has 3 join stages and one transformer.

Now data has grown huge and it's taking more than 2 hours to compare 6-8 million records and some additional time for inserts/updates/deletes.

Source doesn't have any time stamp fields to pull only delta records. So i am doing full image comparison.

Could anyone suggest how to increase the performance.

Any inputs would really appreciated.

Thanks in Advance
SURA
Premium Member
Premium Member
Posts: 1229
Joined: Sat Jul 14, 2007 5:16 am
Location: Sydney

Re: Improve Performance of a job

Post by SURA »

1. You can create a checksum to compare.

2. You need to split the job.

DS User
Post Reply