Page 1 of 1

Sequential file partition

Posted: Wed Jul 24, 2013 8:22 am
by srini.dw
Hi,

Got a question from DS exam.

You have a job that reads in sequential file followed by transformer stage. When you run this job, which partitioning method will be used by default?

The answer is Round Robin.

Any idea why.

Thanks,

Posted: Wed Jul 24, 2013 8:48 am
by ArndW
Because that is the way that they designed it. The round robin algorithm requires no knowledge of the data or column contents and effectively distributes the data across all parallel nodes.

Posted: Wed Jul 24, 2013 8:52 am
by srini.dw
Thank you.

Posted: Wed Jul 24, 2013 4:50 pm
by ray.wurlod
This thread (final post) explains what (Auto) uses.