surrogate key in the initial loading

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
cetzhbo
Premium Member
Premium Member
Posts: 38
Joined: Tue Aug 28, 2007 10:20 am

surrogate key in the initial loading

Post by cetzhbo »

Hello Gurus,

when creating job for initial loading dimension tables, do we have to make surrogate key column for these dimension tables with surrogate key stage ?

or I guess if we can leave the creation of all surrogate key to the phrase of recurring loading .

thanks very much!
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

This is more of a design question than a DS implementation one. Using a generated key from the beginning ensures that there are no duplicates and less "holes".
cetzhbo
Premium Member
Premium Member
Posts: 38
Joined: Tue Aug 28, 2007 10:20 am

Post by cetzhbo »

thanks ArndW,

will you please give more detail about why there will occur duplicate or hole for surrogate key created when recurring loading ?
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

I didn't say that these would occur, just that they could occur. If the initial load uses a different method from the recurrent one then there is a chance that the two methods won't synchronize (particularly after problems such as job aborts or retries with new data). If you only use one method then you have remove a potential source of error.
Post Reply