Abnormal termination of stage

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
vintipa
Participant
Posts: 136
Joined: Wed May 07, 2008 11:26 am
Location: Sydney, Australia
Contact:

Abnormal termination of stage

Post by vintipa »

Hi Experts,

We have Scheduled hundreds of Jobs to run every night. Jobs ran fine till last week. But since few days randomly some jobs get aborted with the error : Abnormal termination of stage.

There is no other warning or error message. Both sequencers and the jobs fail. these Jobs if reset and run afain then they run fine. So only scheduled jobs are getting aborted with this error.

Please suggest me how to know the reason for these aborts.

regards,
vinay.
Vinay
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

There must be other messages logged with this. If you manually Reset the aborted jobs, is there a 'From previous run...' message logged? If so, please post it.
-craig

"You can never have too many knives" -- Logan Nine Fingers
vintipa
Participant
Posts: 136
Joined: Wed May 07, 2008 11:26 am
Location: Sydney, Australia
Contact:

Post by vintipa »

Creg,

Error: Abnormal termination of stage RRE_LoadHRP1003..Transformer_183 detected.

this is the only error that we are getting. no other warning or error.

regards,
Vinay
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

And when you Reset the job?
-craig

"You can never have too many knives" -- Logan Nine Fingers
vintipa
Participant
Posts: 136
Joined: Wed May 07, 2008 11:26 am
Location: Sydney, Australia
Contact:

Post by vintipa »

When reset the job and run, it runs fine without any warings or errors.
Vinay
vintipa
Participant
Posts: 136
Joined: Wed May 07, 2008 11:26 am
Location: Sydney, Australia
Contact:

Post by vintipa »

hey But one perticular job that reads a table and writes into a hash file gave the following warning while it was reset.

error: TedEmp_LKP_TedEmpPSA..Ted_Emp_PSA_Hash.To_TGT_PSA: DSD.UVOpen Unable to open file '/dsdata/Tedweb/Lookup/Ted_Emp_PSA'.

This is the same job that gave the previous mentined error.
Vinay
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

:? That's not what I asked. Find an Aborted job, Reset the job from the Director and tell us if any 'From previous run...' messages get logged.

Find out what changed last week, obviously something did. To me it sounds like a resource issue if you have the problem when "hundreds" of jobs are running but they work fine when rerun on an individual (or small number) basis.
-craig

"You can never have too many knives" -- Logan Nine Fingers
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Those errors logged during the reset are meaningless and can ignored.
-craig

"You can never have too many knives" -- Logan Nine Fingers
vintipa
Participant
Posts: 136
Joined: Wed May 07, 2008 11:26 am
Location: Sydney, Australia
Contact:

Post by vintipa »

No...

there is no other warning or error when the job is reset.
Vinay
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

That's going to make your job difficult. I'd still pursue the resource angle and also involve your official support provider.
-craig

"You can never have too many knives" -- Logan Nine Fingers
vintipa
Participant
Posts: 136
Joined: Wed May 07, 2008 11:26 am
Location: Sydney, Australia
Contact:

Post by vintipa »

OK Craig,

I'll try in the directions mentioned by you.

thanks
Vinay
vintipa
Participant
Posts: 136
Joined: Wed May 07, 2008 11:26 am
Location: Sydney, Australia
Contact:

Post by vintipa »

Hi All ,

I restarted the Unix server where the datastage is installed.
this brought down the paging space utilization from 4.2 % to .5 %.
and the jobs are not geeting aborted now. so restarting to clear unreleased memory helped.

Thanks to all,
:D
Vinay
Post Reply