Jobs getting killed in Master Sequence without any error

parag.s.27
Participant
Posts: 221
Joined: Fri Feb 17, 2006 3:38 am
Location: India

Post by parag.s.27 »

Hi All,

In our IIS jobs we have been facing a new problem for the past few days. Jobs that had been running fine until now are getting killed, as if the OS were killing the processes.

The detailed log shows no warning, fatal error, or informational message, yet the status window shows the job as aborted.

Often the log gives this warning:

Job Control process (pid <process number>) has failed.
but this warning is not consistent either. Also, in a few instances, the job and the sequence hang indefinitely while the status still shows as running.

Can someone please help and let me know whether this is an OS-level error, where AIX is killing processes because a process limit is exceeded, or whether there is a problem with IIS 8?
Thanks & Regards
Parag Saundattikar
Certified for Infosphere DataStage v8.0
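
The process-limit question above can be narrowed down by listing the resource limits that a process actually inherits. The following is a minimal sketch, not a diagnosis: it uses Python's standard `resource` module, and it assumes Python is available on the server and that the script is run as the same OS user, in the same environment, as the DataStage engine. The helper names `fmt` and `current_limits` are hypothetical, and the choice of which limits to list is only an assumption about what might matter here.

```python
import resource

# Limits that commonly matter when processes die or cannot be started.
# Not every platform defines every constant, so each name is looked up defensively.
LIMIT_NAMES = [
    "RLIMIT_NPROC",   # maximum number of processes for the user
    "RLIMIT_NOFILE",  # maximum number of open file descriptors
    "RLIMIT_DATA",    # maximum size of the data segment
    "RLIMIT_STACK",   # maximum stack size
    "RLIMIT_FSIZE",   # maximum size of a file the process may write
    "RLIMIT_AS",      # maximum address space
]


def fmt(value):
    """Render a limit value, showing RLIM_INFINITY as 'unlimited'."""
    return "unlimited" if value == resource.RLIM_INFINITY else str(value)


def current_limits():
    """Return {limit name: (soft, hard)} for the limits this platform defines."""
    limits = {}
    for name in LIMIT_NAMES:
        const = getattr(resource, name, None)
        if const is None:
            continue  # this limit is not defined on this platform
        soft, hard = resource.getrlimit(const)
        limits[name] = (fmt(soft), fmt(hard))
    return limits


if __name__ == "__main__":
    for name, (soft, hard) in current_limits().items():
        print(f"{name:14} soft={soft:>12} hard={hard:>12}")
```

The values reported are those of the process running the script, so they only reflect the job's limits if the script is started the same way the jobs are. A low soft limit is a lead to take to the UNIX admin, not proof of the cause.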
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

No way for us to know, because we can't monitor your AIX system. You (or your UNIX admin) must do that. How busy is the system?
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.

Post by parag.s.27 »

ray.wurlod wrote: No way for us to know, because we can't monitor your AIX system. You (or your UNIX admin) must do that. How busy is the system? ...
Actually, that is the problem. The server is at the client site and we do not have access to check the OS-level settings. The infrastructure team at the client site suggests that the problem must lie in the way the IIS job is built, not in the OS.

But there seems to be no problem with the IIS jobs, because the jobs that are getting killed only load data into Oracle tables using OCI bulk load. No other processing is done apart from loading the data. Can the volume of data be responsible for the jobs getting killed? We load approximately 40 million records.

Post by parag.s.27 »

Can anyone please suggest what the possible reason could be? We have even raised a PMR with IBM, but it is still pending.

What I have found is that the jobs typically have this problem when there are more than 3 million records. It has also only started happening recently; earlier this was not the case.
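
A failure that appears only above a certain row count is consistent with, though by no means proof of, a per-process file size limit: a process that writes past its `RLIMIT_FSIZE` soft limit receives `SIGXFSZ`, which terminates it by default. The sketch below is one way to check whether that hypothesis is even plausible. It assumes the load writes an intermediate file whose size grows with the record count, and the average record width of 400 bytes is a made-up figure that would need to be replaced with a measured one. The function names are hypothetical.

```python
import resource


def estimated_bytes(record_count, avg_record_bytes):
    """Rough size of a flat file holding record_count records."""
    return record_count * avg_record_bytes


def exceeds_file_limit(record_count, avg_record_bytes, soft_limit=None):
    """Return True if the estimated file size is above the file size soft limit.

    soft_limit is in bytes; when omitted, the current process's
    RLIMIT_FSIZE soft limit is used.
    """
    if soft_limit is None:
        soft_limit, _hard = resource.getrlimit(resource.RLIMIT_FSIZE)
    if soft_limit == resource.RLIM_INFINITY:
        return False  # no file size limit in effect
    return estimated_bytes(record_count, avg_record_bytes) > soft_limit


if __name__ == "__main__":
    AVG_RECORD_BYTES = 400  # hypothetical; measure this from a real extract
    for count in (1_000_000, 3_000_000, 40_000_000):
        size = estimated_bytes(count, AVG_RECORD_BYTES)
        flag = exceeds_file_limit(count, AVG_RECORD_BYTES)
        print(f"{count:>11,} records ~ {size:>15,} bytes  over limit: {flag}")
```

As with any limit check, the result is only meaningful when the script runs as the same user and in the same environment as the jobs.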