Entire Partitioner issue in Data set

Poornimayvs · Post by **Poornimayvs** » Tue May 24, 2011 9:52 am

Hi all,

I am trying to see the difference between different types of Partitioner. I used a flat file as my input stage and output is a Data set. My input has got 11 records, When i use the Entire Partitioner in the Partitioning tab i am seeing that the output generated contains duplicate records i mean my output is 22 records instead of 11.

Can any one help me regarding this issue.

Thanks.

greggknight · Post by **greggknight** » Tue May 24, 2011 11:04 am

Entire:
means just that, that the entire data is written to all nodes.
I am assuming you have a two node config.

SURA · Post by **SURA** » Tue May 24, 2011 6:28 pm

You should read the doc and understand where to use which partition!

singhald · Post by **singhald** » Wed May 25, 2011 12:07 am

when you select "entire partition" it basically copy all records to number of nodes defined in node configuration file.

for more details you go through Advance parallel job developer guide

DSXchange

Entire Partitioner issue in Data set

Entire Partitioner issue in Data set

Re: Entire Partitioner issue in Data set