Reduce time taken to read data/rows from sql table

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
dsuser7
Participant
Posts: 41
Joined: Sat Nov 29, 2008 2:21 am

Reduce time taken to read data/rows from sql table

Post by dsuser7 »

Hi,

I'm trying to read data from 1 SQL table in the DataStage parallel job and write it to a Sequential File.

However it is taking around 2hrs of time for this process with near 760,000 records data (there is no transformer stage-just an ODBC stage, copy stage and the output Sequential stage). The output file generated is of 900MB size.
There are 700 columns per row.

Please suggest how to improve performance and fine tune.

Thank you.
miwinter
Participant
Posts: 396
Joined: Thu Jun 22, 2006 7:00 am
Location: England, UK

Post by miwinter »

1. Refine your SQL to ensure it's as performant as it can be
2. Which RDBMS platform are you running against?
3. Do you need to be writing to a sequential output? Can a dataset be employed instead?
4. The filesystem you are writing to for output could be suffering contention with other processes.
Mark Winter
<i>Nothing appeases a troubled mind more than <b>good</b> music</i>
sajidkp
Participant
Posts: 114
Joined: Thu Apr 30, 2009 12:17 am
Location: New Delhi

Post by sajidkp »

Try to utilise parellism in the source ODBC connector optimised with your resources
Regards,
Sajid KP
Post Reply