Performance tuning for one-to-one jobs

Post questions here related to DataStage Server Edition, in areas such as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

120267
Participant
Posts: 30
Joined: Tue Jun 07, 2005 12:27 am

Performance tuning for one-to-one jobs

Post by 120267 »

Hi all,

I am receiving 20 million source records. Using a one-to-one mapping, I am loading them directly into the target, but it is taking nearly 45 minutes to load.
How can I improve the performance in this case?
Any help is appreciated.
kerensho
Participant
Posts: 13
Joined: Mon Jul 11, 2005 5:36 am

Not enough information

Post by kerensho »

Hi,

What is your target? Oracle, Informix, SQL Server...? Are you using a plug-in stage or ODBC?

In any case, one of the things you should check is the number of rows per transaction (i.e. after how many rows DataStage will send a commit request).
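
In SQL terms, a purely illustrative sketch (table and column names are invented here) of what a rows-per-transaction setting of 1000 amounts to on the Oracle side:

Code: Select all

-- DataStage inserts rows, then issues a COMMIT after every 1000th row:
INSERT INTO target_table (col1, col2) VALUES (:1, :2);  -- rows 1..1000
COMMIT;
INSERT INTO target_table (col1, col2) VALUES (:1, :2);  -- rows 1001..2000
COMMIT;
-- ...and so on until all rows are written.

With a setting of "0" there is only a single COMMIT at the very end of the job.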

Good luck,
Keren
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

In addition to the target, what type is your Source?
120267
Participant
Posts: 30
Joined: Tue Jun 07, 2005 12:27 am

Re: Not enough information

Post by 120267 »

My source and target are both Oracle stages. I am using a transaction size of zero. If I change this to 100 or 1000, what would the impact on performance be?
Thanks in advance
:)
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

We can't answer that, as there are too many factors. But why don't you try it in your system and get a definitive answer?

Oracle -> Oracle. Hmmm, if they are in the same instance/schema then it might be fastest to stay within Oracle and do the copy.
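
For instance, a minimal sketch of the in-database copy, assuming the target table can be created fresh and both tables are visible from one connection (names are invented):

Code: Select all

-- Create-table-as-select keeps all 20M rows inside Oracle,
-- avoiding the round trip through the DataStage server entirely:
CREATE TABLE target_table AS
SELECT * FROM source_table;

If the source lives in a different instance, the same idea works across a database link (e.g. source_table@source_db).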

You can make the job multi-instance and add a parameter that limits the SELECT of the source to a subset of the total data, then call several instances in parallel, each getting a portion of the total work. This creates a parallel load which can be much faster.
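
As an illustration, one common way to slice the source is a MOD() on a numeric key. Here key_col is an invented column, and PartCount and PartNum are hypothetical job parameters substituted through DataStage's #Param# syntax in user-defined SQL:

Code: Select all

-- Each instance runs with PartNum = 0 .. PartCount-1 and therefore
-- reads a disjoint slice of the 20 million source rows:
SELECT *
FROM   source_table
WHERE  MOD(key_col, #PartCount#) = #PartNum#;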

If you are loading to Oracle you can use the bulk load capabilities of DataStage, but most likely you will be constrained by the speed of the read in that case. You should read up on the Oracle stage in the documentation, specifically the array size setting.
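
As a rough feel for what the bulk path buys you, the SQL analogue of a direct-path load is an APPEND insert (names invented; NOLOGGING trades recoverability until the next backup for speed):

Code: Select all

ALTER TABLE target_table NOLOGGING;

-- The APPEND hint writes above the high-water mark, bypassing the
-- buffer cache and, with NOLOGGING, generating minimal redo:
INSERT /*+ APPEND */ INTO target_table
SELECT * FROM source_table;
COMMIT;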
kerensho
Participant
Posts: 13
Joined: Mon Jul 11, 2005 5:36 am

Number of rows

Post by kerensho »

Hi,

Like Arnd said, there are too many factors to give you a definite answer. But just to make sure you fully understand transaction size: the number you put there is how many rows DataStage will write before issuing a commit. In other words, all of those rows are held in Oracle's rollback (undo) buffers in case the transaction needs to be rolled back. When you put "0" you tell DataStage to write everything in one transaction; with 20 million rows you are probably filling those buffers, and you will see the rows-per-second figure in the DataStage statistics drop as the job progresses.
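
If you want to watch this happen, a query along these lines (it needs SELECT access to the v$ views) shows how much undo the running transaction is holding; USED_UBLK keeps growing for the entire life of a single 20-million-row transaction:

Code: Select all

SELECT s.sid, s.username, t.used_ublk, t.used_urec
FROM   v$transaction t
JOIN   v$session     s ON s.taddr = t.addr;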

In short, you probably don't want to use 0 on such a big insert :wink:

Keren