Page 1 of 1

Exact use of staging tables?

Posted: Thu Jun 04, 2009 1:36 am
by ravireddy25
Exact use of staging tables?

Posted: Thu Jun 04, 2009 1:58 am
by arvind_ds
Staging tables are used to store extracted data temporarily to avoid dependency on source systems. Once data is loaded into staging tables, it can further be loaded into desired dimension/detail/transaction/fact tables.
Its a kind of intermediate storage.

Posted: Thu Jun 04, 2009 6:09 am
by chulett
While I normally avoid crap sites like 'Geek Interview' there's a decent answer out there on it here. Typical nonsense here. In both cases the comments are fairly useless. Google up your own answers for more fun. :wink:

Posted: Thu Jun 04, 2009 4:10 pm
by ray.wurlod
Exact use of staging tables? Staging.

Posted: Thu Jun 04, 2009 5:00 pm
by chulett
That made me L-O-L. :wink:

Posted: Thu Jun 04, 2009 10:42 pm
by dsex100
They're also good to make sure that all your data has been loaded into the target data warehouse area (i.e. the "staging" area).

In the case where for example, not all your source data files where available then you could wait until all the staging jobs completed, at this point you'd know all your data is ready to be loaded into the data warehouse.

Then you can load your work tables using some transformations, the whole process happening in it's own separate environment.

Posted: Thu Jun 04, 2009 11:53 pm
by ray.wurlod
I prefer Data Sets for that (with parallel jobs). They are more efficiently loaded and unloaded than database tables.

:idea: Things like referential integrity should have been checked earlier in the ETL process - you don't need a "staging database" to do that.