Performance Issue

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
xch2005
Participant
Posts: 85
Joined: Fri Apr 29, 2005 3:13 am
Location: India

Performance Issue

Post by xch2005 »

Basically we are loading from Sybase to Oracle tables, it takes close to 30 mins. for loading 2 million records. It is direct mapping and no validation.

For the first run it will be full referesh but then onwards it would be delta records. Hence we are using a upsert process (Update then insert)

SYB -> TRN -> ORA

Apart from this when we add a column to the existing record and load the overall load time for 2 million records is 1 hour.

Any suggestions on how to bring the load time down please.

Thanks
DSguru2B
Charter Member
Charter Member
Posts: 6854
Joined: Wed Feb 09, 2005 3:44 pm
Location: Houston, TX

Post by DSguru2B »

Ask your oracle dba to drop indexes and turn off logging for the first run. once your run is complete, ask him to recreate the indexes and turn logging back on and perform a reorg. That should speed up your process.
Creativity is allowing yourself to make mistakes. Art is knowing which ones to keep.
greggknight
Premium Member
Premium Member
Posts: 120
Joined: Thu Oct 28, 2004 4:24 pm

Post by greggknight »

I would do Insert then Update as opposed to Update then insert.
Less scanning.
"Don't let the bull between you and the fence"

Thanks
Gregg J Knight

"Never Never Never Quit"
Winston Churchill
greggknight
Premium Member
Premium Member
Posts: 120
Joined: Thu Oct 28, 2004 4:24 pm

Post by greggknight »

And actually
We have two seperate jobs Inserts and Updates
We don't use upsert
but if you .do the before post would be the better way
"Don't let the bull between you and the fence"

Thanks
Gregg J Knight

"Never Never Never Quit"
Winston Churchill
xch2005
Participant
Posts: 85
Joined: Fri Apr 29, 2005 3:13 am
Location: India

Post by xch2005 »

Thanks.
I have now split the load into 2 jobs.. Now the time taken for load is better.

Thank you.
Post Reply