Round robin partition

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
vcsasikala
Participant
Posts: 26
Joined: Wed Jun 20, 2007 1:13 am
Location: Chennai

Round robin partition

Post by vcsasikala »

Hi,
I want to split the number of records equally in each partiton. So I have used Roun Robin partition in Transformer stage.

Job looks like


Dataset Datasets(3)
| | | |
| | | |
Dataset---join------Lookup-------Transformer-----Seq. File


I'm having 50 records in input.

In 2 node processing each node should get 25 records resp .
But in my job it is spliting into 24 and 26.

Why it is so...


Thanks!
Sasikala
Regards,

Sasikala V C
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Because there is a minimum quantum of data that DataStage moves at a time. 24/26 is close enough to evenly spread.

For what it's worth, Round Robin is almost certainly completely wrong for your Join and Lookup stages.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply