Using DataStage to populate tables in Star schema

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
yaminids
Premium Member
Premium Member
Posts: 387
Joined: Mon Oct 18, 2004 1:04 pm

Using DataStage to populate tables in Star schema

Post by yaminids »

Friends,

All these days we were using DataStage only to extract data into Staging tables. From now on we want to use it to process the staging data and populate dim/fact tables in Star schema

Can someone please answer the following questions
1) Can DataStage be used effectively to populate tables containing millions of records in a Star schema
2) Are there previous posts discussing such topics

In my case I am planning to populate 1 fact table with 5 dim tables. The problem is I don't know how I can retrieve the keys from the dim tables before inserting data into fact table. I think building 'Hash' files is out of equation as some of dim tables contain multi million records.

Can someone guide me in this case as this is the first I am using DataStage to do this kind of work

Any help would be highly appreciated
Yamini
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

DataStage was originally developed precisely for loading star schemas (the preferred implementation for data warehouses and data marts). It works brilliantly. As with any other load sequence you need to load the tables in the correct order so that referential integrity is preserved - outriders first, then dimension tables, and finally fact tables.

As to whether there are previous topics, have you noticed that there is a Search facility available in the menu at the top of the page?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply