SCD typeII implementation

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
Puli
Participant
Posts: 53
Joined: Mon May 26, 2008 2:59 am
Location: Australia

SCD typeII implementation

Post by Puli »

I need to Use SCD stage in data stage parallel 8. in my design, I have a source table which lookup dimension table based on primary key, and tgt table have startdate,end_date along with the Src table.

inside the SCD stage there are options to generate surrogate key using flat file option.I need to know whether this flatfile is created at runtime or do i need to create it manually before using the file. and if so how to create it?

there are some flat file created already,which is there in the repository(No idea how that was created), when i try to use that file,the SCD stage is not reading the particular flatfile. am getting the error as:

APT_CombinedOperatorController(0),0: Lookup table is empty, no further warnings will be issued.
APT_CombinedOperatorController(0),0: Unable to read state file /home/dsadm/dsinputfiles/s_key_src.csv: Invalid argument.
Unable to read state file /home/dsadm/dsinputfiles/s_key_src.csv: Invalid argument.

one more doubt: regarding state file in SCD stage...wat are statefiles and how they created...
Mike
Premium Member
Premium Member
Posts: 1021
Joined: Sun Mar 03, 2002 6:01 pm
Location: Tampa, FL

Post by Mike »

All answered in the product documentation. In particular study the Surrogate Key Generator stage and the Slowly Changing Dimension stage.

The Surrogate Key Generator stage produces a state file... basically the metadata that keeps track of the surrogate key value.

The Slowly Changing Dimension stage is one stage that can utilize the state file for generating surrogate keys. It is a very powerful stage that makes slowly changing dimension logic virtually trivial to implement.

Mike
nani0907
Participant
Posts: 155
Joined: Wed Apr 18, 2007 10:30 am

Post by nani0907 »

Hi Mike.

I have Console for IBM Information Server installed on my machine. could you please tell me where exactly i can find the product documentation
thanks n regards
nani
JRodriguez
Premium Member
Premium Member
Posts: 425
Joined: Sat Nov 19, 2005 9:26 am
Location: New York City
Contact:

Post by JRodriguez »

Hi,

Start --> Programs --> Information Server --> Documentation

The IBM Red book "IBM InfoSphere DataStage Data Flow and Job Design" explain this in detail
Julio Rodriguez
ETL Developer by choice

"Sure we have lots of reasons for being rude - But no excuses
Mike
Premium Member
Premium Member
Posts: 1021
Joined: Sun Mar 03, 2002 6:01 pm
Location: Tampa, FL

Post by Mike »

From a typical client install, you would just navigate to it from the Windows Start menu... otherwise you could always go back to your installation media for it.

Mike
nani0907
Participant
Posts: 155
Joined: Wed Apr 18, 2007 10:30 am

Post by nani0907 »

Thanks alot mike and JRodriguez for your support and help
thanks n regards
nani
Puli
Participant
Posts: 53
Joined: Mon May 26, 2008 2:59 am
Location: Australia

Post by Puli »

hi Mike,

thanks for ur comments,

if i seperately generate surrogate key using SKG stage and stores them in a file,so that i can refer them as a statefile inside the SCD stage? is it wat u r mentioning....if so, how DS keeps updating the surrogate key when it is in use...
Puli
Participant
Posts: 53
Joined: Mon May 26, 2008 2:59 am
Location: Australia

Post by Puli »

hi Mike,

thanks for ur comments,

if i seperately generate surrogate key using SKG stage and stores them in a file,so that i can refer them as a statefile inside the SCD stage? is it wat u r mentioning....if so, how DS keeps updating the surrogate key when it is in use...
Mike
Premium Member
Premium Member
Posts: 1021
Joined: Sun Mar 03, 2002 6:01 pm
Location: Tampa, FL

Post by Mike »

I'm not concerned with how Datastage manages the state file... just that it does manage it automatically. I haven't tested thoroughly enough to see how it handles concurrent key generating processes, but that's not a concern for me. I use a state file per table and only one job to generate keys for a given table.

Mike
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Good advice. Treat the state file as a "black box". It works - be happy with that.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply