Entire Partitioner issue in Data set

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
Poornimayvs
Participant
Posts: 5
Joined: Fri Apr 08, 2011 9:32 am

Entire Partitioner issue in Data set

Post by Poornimayvs »

Hi all,

I am trying to see the difference between different types of Partitioner. I used a flat file as my input stage and output is a Data set. My input has got 11 records, When i use the Entire Partitioner in the Partitioning tab i am seeing that the output generated contains duplicate records i mean my output is 22 records instead of 11.

Can any one help me regarding this issue.

Thanks.
greggknight
Premium Member
Premium Member
Posts: 120
Joined: Thu Oct 28, 2004 4:24 pm

Post by greggknight »

Entire:
means just that, that the entire data is written to all nodes.
I am assuming you have a two node config.
"Don't let the bull between you and the fence"

Thanks
Gregg J Knight

"Never Never Never Quit"
Winston Churchill
SURA
Premium Member
Premium Member
Posts: 1229
Joined: Sat Jul 14, 2007 5:16 am
Location: Sydney

Re: Entire Partitioner issue in Data set

Post by SURA »

You should read the doc and understand where to use which partition!
singhald
Participant
Posts: 180
Joined: Tue Aug 23, 2005 2:50 am
Location: Bangalore
Contact:

Post by singhald »

when you select "entire partition" it basically copy all records to number of nodes defined in node configuration file.

for more details you go through Advance parallel job developer guide
Regards,
Deepak Singhal
Everything is okay in the end. If it's not okay, then it's not the end.
Post Reply