Incremental Aggregation loading

ravij · Post by **ravij** » Thu Jan 05, 2006 2:35 am

Hi,

How to implement the Incremental Aggregation Loading in Server Jobs?

Any assistance can be appreciated.
thanks in advance

ArndW · Post by **ArndW** » Thu Jan 05, 2006 2:43 am

Ravij,

it is very important when posing sweeping questions like this to explain what you've already tried, otherwise we will assume that you haven't tried anything and are just looking for an easy answer.

The concept of incrementatl aggregation is a relatively new one. Much, if not most, of the code for such an incremental aggregation needs to be tailored to the specific environment and there is no single answer. The core is that you need to know which data you've already processed; and determining this is different when you have direct database access or just see export flat files. The approach is almost the same as for CDC.

kumar_s · Post by **kumar_s** » Thu Jan 05, 2006 4:37 am

HI,
As Arnd specifed you need to be specific.
The way to do the incremental aggregation is only possible by determining the incremetal load (data). So basically you need to do a change detection with the current and previous run file.
You have built in stages available to assist you.
Chage capture/Chagne apply.
Based on the captured code, you can aggregate the data.

-Kumar

Sreenivasulu · Post by **Sreenivasulu** » Thu Jan 05, 2006 6:03 am

Kumar,

I believe the question posed by ravi is with respect to server jobs and not parallel jobs. The change capture stage and change apply stage referred are present in PX.

Ravij,
Incremental aggregation can be done using a date window for extraction.
You load only a particular window's data from the source systems and
then store in a different partition (one partition for each window).
Beofre storing the in the partition do the aggregation as usual and it would
be an incremental one since the extraction was for a small window.

Regards
Sreeni

ray.wurlod · Post by **ray.wurlod** » Thu Jan 05, 2006 3:52 pm

"Hire a competent consultant" is the fastest solution! :D "Get some training" is the next best, and better for longer term maintainability.