Aggregator - problem with memory

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
eoyylo
Participant
Posts: 57
Joined: Mon Jun 30, 2003 6:56 am

Aggregator - problem with memory

Post by eoyylo »

Hi,
i have an aggragator that must aggregate 20M of records but it abort.
I suppose that can be a memory problem. If the aggregator manage until 6-7M of record work well. Over this amount don't work.
How can i resolve this problem?
i tried to use the sort plug_in but it is very slow. Can the sort plug_in aggregate the records?

thanks in advance

Mario
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne
Contact:

Post by vmcburney »

On those older versions of DS Aggregator should be renamed Aggrevator. If your source data is in a table you may get much better performance by doing the aggregation in the source database plug-in. In DataStage the sort stage will sort but not aggregate.

Run your job again and keep an eye on temp file space as the job runs. The aggregator writes a lot of data to temporary files while it aggregates the input data.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Actually, it's the Sort stage that uses temp files, the Aggregator works in memory without landing anything. Unless things were different back in 5.x but I don't believe so. :?

You can substantially reduce the amount of memory (and time) used by the Aggregator by presorting the data and asserting the sort order in the Aggregator stage by marking the appropriate fields. Then again, this advantage may be offset by the amount of time and resources it takes to sort the data in job. If the Sort stage is too slow, is there any way you can deliver the data to job sorted? Perhaps a simple sort at the UNIX level or some external sorting package you may have access to? Or can the data be created in the order required to support the aggregation?
-craig

"You can never have too many knives" -- Logan Nine Fingers
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne
Contact:

Post by vmcburney »

Thanks for the clarification.

I remember last time I had to aggregate a large amount of data the aggregation stage would eventually fail and I had to resort to putting the data into a staging table and aggregating in a database stage.
Post Reply