Hello all,
We have a multi-instance DataSet-to-Oracle job that runs slowly when I run it against all database instances in parallel. However, it runs quite fast when I run it against a single database instance using bulk load.
My job design is pretty straightforward.
DataSet ----> Transformer ----> Oracle Connector (Bulk Load)
I am not using a Copy stage to load multiple databases, since I am doing multi-instance runs.
Do you have any ideas on how to get the same throughput in a parallel multi-instance run as when running against a single database instance?
Thanks & Regards
Koti
Multi-instance dataset to Oracle job - slow
Re: Multi-instance dataset to Oracle job - slow
Hi,
As a first step, you can monitor (or ask your admins to monitor) the resources on the DataStage server and the Oracle server to find the bottleneck, and then take the necessary actions.
Cheers,
Samyam
Yes, that's right Craig. I am loading the same table in separate databases.
Hi,
As you said, the connections go from one host to multiple target hosts, so there might be a bottleneck on the DataStage server side.
It may be hitting the bandwidth limit of the DataStage server host. (I'm pretty sure the bandwidth usage won't reach 100 MiB/s if you are on a gigabit network; the actual maximum throughput is a function of the network latency and the TCP receive/send buffers on both sides.) If you have reached about a third of the ethernet bandwidth (roughly 30 MiB/s on gigabit ethernet), I suggest you contact your network admin to tune the network performance.
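As a sanity check on those numbers, here is a quick back-of-envelope calculation (just a sketch; 1 Gb/s is the nominal link speed, and real throughput will be somewhat lower):

```python
# Theoretical ceiling of a gigabit link, converted to MiB/s.
bits_per_sec = 1_000_000_000            # nominal 1 Gb/s ethernet link
mib_per_sec = bits_per_sec / 8 / 2**20  # bits/s -> bytes/s -> MiB/s
print(f"theoretical max: {mib_per_sec:.0f} MiB/s")  # ~119 MiB/s
```

So the "won't reach 100 MiB/s" figure above is close to the theoretical ceiling of the link.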
Another thing I noticed is that when using bulk load (with DB2), VARCHAR fields are transferred over the network at their FULL declared length.
For example, suppose I have a table to load with bulk load via the DB2 connector.
The table structure is like:
num int,
desc varchar(255)
The source is also a DB2 connection from another database, and the maximum length of the values in desc is 10.
On the DataStage host, I can see the outbound network usage is 10+ times the inbound.
That means if you have a big VARCHAR field, you need to either tune the VARCHAR maximum length, tune the network bandwidth, or avoid using bulk load.
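To see why the full-length transfer matters, here is a rough estimate using the column sizes from the example above (a sketch; the row count is made up purely for illustration):

```python
# Bytes on the wire for one column when a VARCHAR(255) is sent at its
# full declared length vs. at the actual data length (max 10, as above).
declared_len = 255      # desc varchar(255)
actual_len = 10         # longest real value observed in the data
rows = 1_000_000        # hypothetical row count, for illustration only

full_length = rows * declared_len   # padded transfer, as with bulk load
trimmed = rows * actual_len         # what a right-sized column would send
print(f"overhead factor: {full_length / trimmed:.1f}x")  # 25.5x
```

That factor lines up with the 10+ times outbound-vs-inbound traffic observed above.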
On Linux/Unix, network bandwidth usage can be seen with:
sar -n DEV 3    # live samples, every 3 seconds
sar -n DEV      # today's history