Datasets

sshettar · Post by **sshettar** » Fri Sep 11, 2009 8:45 am

Hi All,

I think i have asked this question before , but i still have certain doubts while using a dataset.
If we have created one dataset using 4 nodes and the other using 1 nodes can i do a lookup of dataset1 with that of dataset2 using 2 nodes in another job , will my job run successfully ?
cause i'm thinking it would abort as one dataset is created using 2 nodes and other using 4 nodes and i'm trying use these two datasets in another job which uses 2 nodes only ?

Any advice is highly appreciated?

Thanks in advance

Sainath.Srinivasan · Post by **Sainath.Srinivasan** » Fri Sep 11, 2009 8:48 am

The resulting job must work fine.

PX will repartition the data as long as you don't force it to preserve yours.

ArndW · Post by **ArndW** » Fri Sep 11, 2009 8:50 am

DataStage will automatically repartition unless you explicitly tell it otherwise (there are various means of doing this including explicitly setting links or changing default system APT_ variables).

If you created a job whith which reads a 2 node dataset and joins it to a 4 node dataset while running in a 3-node configuration file it will repartition for you at runtime.

sshettar · Post by **sshettar** » Fri Sep 11, 2009 9:31 am

Thanks for information.That was really helpful!!!!