Need to Improve Oracle to Netezza Performance

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
Hema.k
Participant
Posts: 43
Joined: Wed Dec 30, 2009 3:28 am

Need to Improve Oracle to Netezza Performance

Post by Hema.k »

Hi ,

We are using 8.1 DS parallel job with 4 node configuration.
Job Design: Oracle db------->netezza db
issue: Slow performance.

Can anybody discuss steps to improve the performance.

Thanks in advance.

Regards,
Hema K
zulfi123786
Premium Member
Premium Member
Posts: 730
Joined: Tue Nov 04, 2008 10:14 am
Location: Bangalore

Post by zulfi123786 »

Performance improvement has always been a topic of debate.
well you could use the performance measurement tools to find what is actually going beyond the limits.

Are both databases over the same server? if not your job will be limited by the network bandwidth

Are you doing an upsert? For bulky data you could separate the Insert and Update for better results and for huge data you could use bulk loading options

Look at the select query if you could redo the SQL trying to use indexed columns in join if you have any.

Try increasing the array sizes for database stages to a suitable limit for better results.
MarkB
Premium Member
Premium Member
Posts: 95
Joined: Fri Oct 27, 2006 9:13 am

Post by MarkB »

Considering Netezza is a database appliance and his source database is Oracle, they are obviously on a different server :roll: .

It's possible your network is slowing you down. Are you running directly from Oracle to Netezza with no other steps (Transformer, etc)? Have you tried writing the Oracle data to a sequential file and using nzload at the shell level to load the file and compare load times?
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Slow compared to what?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Hema.k
Participant
Posts: 43
Joined: Wed Dec 30, 2009 3:28 am

Post by Hema.k »

Hi All,

Thanks for the suggestions.

@Ray,
Normally the jobs that uses the same db sever is fine.
If we use different servers say oracle ---->netezza ,its taking longer time for the extraction of data from the source.Is there any options to overcome this.
Please suggest.

Regards,
Hema K
jwiles
Premium Member
Premium Member
Posts: 1274
Joined: Sun Nov 14, 2004 8:50 pm
Contact:

Post by jwiles »

To improve performance, you should first identify what is causing your perceived performance problem.

With reference to MarkB's post, have you isolated the extract and load steps to separately test their performance? Run each several times to get a good average, with each run several minutes apart to help negate caching effects.

Original job:
- Oracle --------------> Neteeza

Test jobs:
- Oracle ------> dataset
- dataset ------> Neteeza

Also, as MarkB asks, is there any other work being done? Transformers, sorts, etc.?

You are running with a 4-node configuration file, but are your database stages running in parallel mode? zulfi has provided some good suggestions as to where to begin looking, as well as a valid reason or for slower performance.

Are there differences in database configuration? Partitioning/Hardware/other applications

You've only told us it's running "slower". Please quantify that: How much slower? For all we know, you could now be averaging just 100 rows/sec slower than before, from 30000/sec to 29900/sec.

Regards,
- james wiles


All generalizations are false, including this one - Mark Twain.
Post Reply