Datasets

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
sshettar
Premium Member
Premium Member
Posts: 264
Joined: Thu Nov 30, 2006 10:37 am

Datasets

Post by sshettar »

Hi All,

I think i have asked this question before , but i still have certain doubts while using a dataset.
If we have created one dataset using 4 nodes and the other using 1 nodes can i do a lookup of dataset1 with that of dataset2 using 2 nodes in another job , will my job run successfully ?
cause i'm thinking it would abort as one dataset is created using 2 nodes and other using 4 nodes and i'm trying use these two datasets in another job which uses 2 nodes only ?

Any advice is highly appreciated?

Thanks in advance
Sainath.Srinivasan
Participant
Posts: 3337
Joined: Mon Jan 17, 2005 4:49 am
Location: United Kingdom

Post by Sainath.Srinivasan »

The resulting job must work fine.

PX will repartition the data as long as you don't force it to preserve yours.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

DataStage will automatically repartition unless you explicitly tell it otherwise (there are various means of doing this including explicitly setting links or changing default system APT_ variables).

If you created a job whith which reads a 2 node dataset and joins it to a 4 node dataset while running in a 3-node configuration file it will repartition for you at runtime.
sshettar
Premium Member
Premium Member
Posts: 264
Joined: Thu Nov 30, 2006 10:37 am

Post by sshettar »

Thanks for information.That was really helpful!!!!
Post Reply