SCD2



suryadev
Premium Member
Posts: 211
Joined: Sun Jul 11, 2010 7:39 pm

SCD2

Post by suryadev »

Hello,

I need to develop a job that implements SCD Type 2. The DataStage version is 7.5.2.

Which is the best way to do it? I have two options:
1) Lookup and Transformer, collecting the rejected records and then applying the updates and inserts
2) Change Data Capture and Transformer

Please let me know which would be the better option.


Thanks,
Surya
SURA
Premium Member
Posts: 1229
Joined: Sat Jul 14, 2007 5:16 am
Location: Sydney

Re: SCD2

Post by SURA »

If you use the Change Capture stage, the work will be straightforward.

If you use a Join and a Transformer, you need to build your own comparison conditions and null handling.

You won't find much difference between the two.

Otherwise, compute a checksum of the compared columns on each side and compare that.

DS User
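
To illustrate the checksum idea outside DataStage, here is a minimal Python sketch (not DataStage code). It hashes the tracked columns of each row and compares the hashes by business key. The column names (`cust_id`, `name`, `city`), the choice of MD5, and the NULL encoding are assumptions made for the example only.

```python
import hashlib

def row_checksum(row, columns):
    """Hash the tracked columns; None is encoded differently from an empty string."""
    parts = []
    for col in columns:
        value = row.get(col)
        parts.append("\x00NULL" if value is None else str(value))
    # A separator keeps ("ab", "c") from hashing the same as ("a", "bc").
    return hashlib.md5("\x1f".join(parts).encode("utf-8")).hexdigest()

def classify(source_rows, target_rows, key, tracked):
    """Split source rows into new keys (inserts) and changed keys (changes)."""
    current = {t[key]: row_checksum(t, tracked) for t in target_rows}
    inserts, changes = [], []
    for s in source_rows:
        if s[key] not in current:
            inserts.append(s)
        elif row_checksum(s, tracked) != current[s[key]]:
            changes.append(s)
    return inserts, changes

# Hypothetical sample data
source = [
    {"cust_id": 1, "name": "Ann", "city": "Sydney"},
    {"cust_id": 2, "name": "Bob", "city": None},
    {"cust_id": 3, "name": "Cy", "city": "Perth"},
]
target = [
    {"cust_id": 1, "name": "Ann", "city": "Sydney"},
    {"cust_id": 2, "name": "Bob", "city": "Melbourne"},
]
inserts, changes = classify(source, target, "cust_id", ["name", "city"])
```

The explicit NULL marker matters: without it, a NULL and an empty string would hash identically and a real change could be missed.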
suryadev
Premium Member
Posts: 211
Joined: Sun Jul 11, 2010 7:39 pm

Post by suryadev »

Sure, will do CDC.
Thanks,
Surya
vishal_rastogi
Participant
Posts: 47
Joined: Thu Dec 09, 2010 4:37 am

Post by vishal_rastogi »

I think it depends on your requirements.
What is your data like?
Does it change frequently (every day),
or only occasionally (monthly, quarterly, etc.)?
Vish
suryadev
Premium Member
Posts: 211
Joined: Sun Jul 11, 2010 7:39 pm

Post by suryadev »

It changes twice a month.

I developed an SCD2 job using CDC. The source fields are transformed and then passed to the CDC stage, and the target fields are passed to the CDC stage as well.
I have a small issue with begindate and enddate. These two fields come from the target table, and when they are passed to CDC there are no source columns to match them against, because the corresponding source values are only transformed after CDC. (begindate and enddate are derived from the source's creationdate and lastupdatedate, each with a different transformation.)

So do I need to pass these columns from the target (begdt, enddt) and the source (credt, lasdt) to CDC? If so, how can I pass them when they are different?
Is there any other way to do it? Please let me know.
Thanks,
Surya
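
One general way to look at the date question is that the change comparison only needs the key and the tracked attribute columns, and the validity dates can be derived after the comparison has decided what changed. The Python sketch below (not DataStage code) shows that shape. The high end date, the `cust_id` and `city` columns, and the rules "begdt = credt for a new key, begdt = lasdt for a changed key" are assumptions for illustration and are not the actual transformations from the job described above.

```python
from datetime import date

HIGH_DATE = date(9999, 12, 31)  # assumed marker for the open (current) version

def scd2_actions(source_rows, target_rows, key, tracked):
    """Return (rows to expire, rows to insert); dates never enter the comparison."""
    current = {t[key]: t for t in target_rows if t["enddt"] == HIGH_DATE}
    expire, insert = [], []
    for s in source_rows:
        cur = current.get(s[key])
        if cur is not None and all(s[c] == cur[c] for c in tracked):
            continue  # tracked columns unchanged: nothing to do
        if cur is None:
            begdt = s["credt"]  # hypothetical rule for a brand-new key
        else:
            begdt = s["lasdt"]  # hypothetical rule for a changed key
            expire.append({key: s[key], "enddt": s["lasdt"]})
        new_row = {key: s[key], "begdt": begdt, "enddt": HIGH_DATE}
        new_row.update({c: s[c] for c in tracked})
        insert.append(new_row)
    return expire, insert

# Hypothetical sample data
source = [
    {"cust_id": 1, "city": "Perth", "credt": date(2010, 1, 5), "lasdt": date(2011, 3, 1)},
    {"cust_id": 2, "city": "Sydney", "credt": date(2011, 2, 1), "lasdt": date(2011, 2, 1)},
    {"cust_id": 3, "city": "Hobart", "credt": date(2009, 6, 1), "lasdt": date(2011, 3, 2)},
]
target = [
    {"cust_id": 1, "city": "Darwin", "begdt": date(2010, 1, 5), "enddt": HIGH_DATE},
    {"cust_id": 3, "city": "Hobart", "begdt": date(2009, 6, 1), "enddt": HIGH_DATE},
]
expire, insert = scd2_actions(source, target, "cust_id", ["city"])
```

In this sample, key 3 has a newer `lasdt` but no change in the tracked column, so it produces no new version. That is the effect of keeping the date columns out of the comparison.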