Hi everyone:
We encountered the following problem with DataStage and particularly with KBA Job Control and we are trying to determine root cause so that we can prevent this from happening again.
KBA Job Control batches saw jobs as completed normal when DataStage hadn't even run the job. Thus we had a situation where dependent jobs were then kicked off when their parent jobs hadn't been run.
In other words, it seemed as if KBA told DataStage, "run job x". DataStage didn't/couldn't run it and sent back something to KBA Job Control. KBA Job control interpreted that something as "Job complete" instead of "Job couldn't be run".
We think that something was going on in DataStage at the time this happened. Some of our DataStage Sequences got errors (eg, "Error calling DSRunJob(jobName1), code=-14 [Timed out while waiting for an event]"). And, we got the dreaded "ACCESS DENIED: Project is currently locked. The project is currently locked by the Administrator." Please note, that soon after getting this message, we could log on to the projects again without a reboot. At the time we had a large number of processes running and some of them were very system intensive.
We are guessing that DataStage reached its server resource limit (based on a search of the -14 code here in the forums) and couldn't kick off any kind of process (be it a job or a login).
If this is the case, does KBA Job Control not correctly trap the -14 code? Or perhaps something else is going on and DataStage sent a different code to KBA Job Control causing KBA Job Control to think the job had run successfully?
Thanks for any input on this issue,
Pam
KBA Job Control and code=-14
Moderators: chulett, rschirm, roy
-
- Charter Member
- Posts: 15
- Joined: Mon Jan 31, 2005 1:53 pm
- Location: CO, USA
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
A suite of utilities from Ken Bland & Associates. You can find more (including their web address) by searching DSXchange. The web address is part of Ken's signature block.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Pam, the first question that comes to mind is: have there been any changes made to the code that I left there? I don't recall ever seeing the behaviour you've noted but have seen something almost the opposite where it goes to start a job and never hears back that it started and thus you get a 'living dead' running hang.
However, if the code is the same then I can poke around in the area that controls all that and see if anything looks... promising. I would imagine that you are correct that the system stress you were seeing manifest as those -14 errors played a role here as well.
However, if the code is the same then I can poke around in the area that controls all that and see if anything looks... promising. I would imagine that you are correct that the system stress you were seeing manifest as those -14 errors played a role here as well.
-craig
"You can never have too many knives" -- Logan Nine Fingers
"You can never have too many knives" -- Logan Nine Fingers
-
- Charter Member
- Posts: 15
- Joined: Mon Jan 31, 2005 1:53 pm
- Location: CO, USA