Abnormal termination of stage Error In Aggregator stage

csc.datastage
Premium Member
Posts: 23
Joined: Wed Sep 10, 2008 4:46 pm
Location: USA

Post by csc.datastage »

Hi,

My Job design is:

Sequential File -> Transformer -> Aggregator -> Sequential File

I am getting this error while running:

Abnormal termination of stage TestZOCTRHS2_ChkDuplicates_Job1_v1..Aggregator_26 detected

After resetting the job, I get the following message:

From previous run
DataStage Job 89 Phantom 16362
jobnotify: Unknown error
Job Aborted after Fatal Error logged.
Program "DSD.WriteLog": Line 239, Abort.
[16654] DSD.StageRun TestZOCTRHS2_ChkDuplicates_Job1_v1. TestZOCTRHS2_ChkDuplicates_Job1_v1.Aggregator_26 1 0/0 - core dumped.
Attempting to Cleanup after ABORT raised in stage TestZOCTRHS2_ChkDuplicates_Job1_v1..Transformer_23

DataStage Phantom Aborting with @ABORT.CODE = 1

Other Key items are:

Input data file size is 703 MB.

Thanks for any help in advance.
Focus on ABC: Audit , Balance and Control..
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Are your data sorted on the grouping keys?

Most Aggregator problems in server jobs relate to running out of memory because they have to keep the entire result set in memory. When input data are sorted this is not the case, and jobs run orders of magnitude faster as a result. You do, however, have to tell the Aggregator stage that its inputs are sorted in addition to providing sorted input.
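
To illustrate the point about memory, here is a minimal Python sketch (not DataStage code, and not how the Aggregator stage is actually implemented) contrasting the two approaches. The function names and sample rows are made up for illustration. The unsorted version has to hold one entry per distinct key until the last row arrives; the sorted version only ever holds the current group and can emit each result as soon as the key changes.

```python
from itertools import groupby
from operator import itemgetter


def aggregate_unsorted(rows):
    """Sum amounts per key when input order is unknown.

    Every distinct key stays in the dict until the input is exhausted,
    so memory grows with the number of groups.
    """
    totals = {}
    for key, amount in rows:
        totals[key] = totals.get(key, 0) + amount
    return list(totals.items())


def aggregate_sorted(rows):
    """Sum amounts per key, ASSUMING rows arrive sorted on the key.

    Only the current group is held; each total is yielded as soon as
    the key changes.
    """
    for key, group in groupby(rows, key=itemgetter(0)):
        yield key, sum(amount for _, amount in group)


# Hypothetical sample data, already sorted on the grouping key.
rows = [("A", 1), ("A", 2), ("B", 5), ("C", 3), ("C", 4)]
for key, total in aggregate_sorted(rows):
    print(key, total)
```

If the input passed to `aggregate_sorted` is not really sorted, the same key shows up in more than one output group, which is one way to see why the stage has to be told the truth about its input.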
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
csc.datastage
Premium Member
Posts: 23
Joined: Wed Sep 10, 2008 4:46 pm
Location: USA

Post by csc.datastage »

Thanks Ray!
You are right: the input to the Aggregator stage is not sorted.
However, I am unsure how best to sort the input data: UNIX sort, SyncSort, or the Sort stage?
Which method will give the best performance?
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

The third party sorters, like SyncSort and CoSort, cost money because they sort fast. The UNIX sort command is acceptable in many cases. Server Sort stage, however, is notoriously slow.
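
As a rough sketch of the UNIX sort option, the Python helper below builds and runs a `sort` command that pre-sorts a delimited file on its grouping columns before the job reads it. The file names, the comma delimiter, and the key positions are assumptions for illustration; only the standard `-t`, `-k`, and `-o` options of `sort` are used. Setting `LC_ALL=C` is also an assumption: it gives plain byte ordering, which may or may not match the order your job expects.

```python
import os
import subprocess


def build_sort_command(input_path, output_path, key_fields, delimiter=","):
    """Build the argv list for UNIX sort.

    key_fields are 1-based column positions, listed in grouping order.
    """
    cmd = ["sort", "-t", delimiter]
    for field in key_fields:
        # "-k N,N" restricts the sort key to exactly column N.
        cmd += ["-k", f"{field},{field}"]
    cmd += ["-o", output_path, input_path]
    return cmd


def presort(input_path, output_path, key_fields, delimiter=","):
    """Run the sort; raises CalledProcessError if sort fails."""
    env = dict(os.environ, LC_ALL="C")  # assumption: byte-wise collation
    cmd = build_sort_command(input_path, output_path, key_fields, delimiter)
    subprocess.run(cmd, check=True, env=env)


# Hypothetical file names and key columns.
print(" ".join(build_sort_command("input.txt", "sorted.txt", [1, 3])))
```

The same command could just as well be typed at the shell or run ahead of the job; the helper only makes the column and delimiter choices explicit.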
csc.datastage
Premium Member
Posts: 23
Joined: Wed Sep 10, 2008 4:46 pm
Location: USA

Post by csc.datastage »

ray.wurlod wrote: You do, however, have to tell the Aggregator stage that its inputs are sorted in addition to providing sorted input.
Thanks a lot Ray!
I am exploring different options.
How can we tell the Aggregator stage that its inputs are already sorted?

Thanks!
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

There are "Sort" columns in the Aggregator, one for order and another for direction. So, you can say "Fred" is the 1st field sorted ascending, etc. Make sure you don't lie to it, it will bust you! :wink:

And make sure you sort to match your grouping columns, or the Aggregator will throw the sort information away and still do its own thing, like it is doing now.
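
One way to avoid "lying" to the stage is to check the file before declaring it sorted. The Python sketch below is an illustration only: the function name, the comma delimiter, and the zero-based key positions are assumptions, and keys are compared as plain strings in ascending order, which may differ from the ordering your sort utility or the Aggregator uses.

```python
import csv
import io


def first_unsorted_row(lines, key_indexes, delimiter=","):
    """Return the 1-based number of the first row that breaks ascending
    order on the key columns, or None if every row is in order.

    key_indexes are 0-based column positions, in grouping order.
    Rows are read one at a time, so the file is never held in memory.
    """
    previous = None
    for number, row in enumerate(csv.reader(lines, delimiter=delimiter), start=1):
        key = tuple(row[i] for i in key_indexes)
        if previous is not None and key < previous:
            return number
        previous = key
    return None


# Hypothetical sample: the third row is out of order on column 0.
sample = io.StringIO("A,10\nC,20\nB,30\n")
print(first_unsorted_row(sample, [0]))
```

For a real file, pass the open file object in place of the `io.StringIO` sample.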
-craig

"You can never have too many knives" -- Logan Nine Fingers
csc.datastage
Premium Member
Posts: 23
Joined: Wed Sep 10, 2008 4:46 pm
Location: USA

Post by csc.datastage »

Thanks Craig!
I am working on 2 options: Sort Stage and SyncSort Utility.
I will post an update on how it goes.
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

You can tell your sorting is "working" when rows flow through the Aggregator rather than none coming out until all have gone in.