Number of Readers per node

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
arun@sinmax
Premium Member
Premium Member
Posts: 13
Joined: Tue Jun 14, 2005 6:25 am

Number of Readers per node

Post by arun@sinmax »

Hi,
When I worked with sequensial file I have tried 'No of Readers per Node' option to make the extraction process faster.But the result I have got is just against :shock: .
Following are result of my trails.

Start Time - Production Run Time (in Sec) - No of Nodes

03 - 08 - 1
03 - 09 - 2
04 - 09 - 4
07 - 10 - 8
09 - 13 - 16

Can you help me in this issue that how to apply this to make it faster.
gbusson
Participant
Posts: 98
Joined: Fri Oct 07, 2005 2:50 am
Location: France
Contact:

Post by gbusson »

how many rows do you want to read?
kumar_s
Charter Member
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

For a lengthier record, sometime it may not be effecient to increate the number of reader per node.
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Never use anything less than a couple of minutes runtime for any timing comparison purposes! Your startup-time accounts for almost all your activity, so of course using more readers is going to add significantly to your times.

Don't even think about using multiple processes before your runtime goes over 5 or 10 minutes; it will just cause more overall system load and make your server run slower. This means that even though you might get your job to run in 30 seconds instead of 60 seconds the system will need to work 3 times as hard so your overall performance has gone down - unless you are not running in a multiuser/multijob environment.
kumar_s
Charter Member
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

Arnd - The sample shows the increase in elapse time of Runtime. Shouldnt it get shorter.
arun@sinmax - If you have large volume of data to run, pls do it and produce the result, which will be optimal to analyze.
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

The times depend upon repartitioning after the read as well, and we don't know what has been done with the data.
Post Reply