Page 1 of 1

Checkpoint Does not WorkWhen A Sequence is Stopped

Posted: Thu Oct 22, 2009 1:06 pm
by olgc
Hi there,

When a sequence is abended prematurely, it has status "Aborted/Restartable". When it's restarted, it starts from the failed point or the checkpoint.

When a sequence is stopped, it logged an item as below:

Item #: 176
Event ID: 175
Timestamp: 2009-10-08 16:20:55
Type: Info
User Name: xxx
Message: CxxJGGroup..JobControl (DSWaitForJob): Job (CxxJGGroup) Stopped

And it has status "Stopped/Restartable". When it's restarted, it can't and logged another item as below:

Item #: 177
Event ID: 176
Timestamp: 2009-10-09 15:13:27
Type: Warning
User Name: xxx
Message: Job CxxJGGroup must be reset before it can run.

Hello, if the sequence has to be reset then it can be started, its checkpoint will be gone and it is not restartable. So what's about the status "Stopped/Restartable"? The "restartable" here is not restartable. Any one knows how to make a stopped sequence restartable?

Thanks,

Posted: Thu Oct 22, 2009 1:10 pm
by chulett
You can't. I have no idea why it works that way or what "Stopped/Restartable" is really supposed to mean when it cannot be restarted from the point of failure. :roll:

Posted: Thu Oct 22, 2009 3:51 pm
by ray.wurlod
Who stopped it? "Stopped" means stopped by user intervention.

Posted: Thu Oct 22, 2009 5:22 pm
by chulett
I assume someone pressed the Stop button in Director. I've done the same in the past and made a crazy assumption that when I saw 'Stopped/Restartable' as the status that I would get a checkpoint restart when I ran the job next. Problem is you cannot, you must reset it before it can be run again and thus blow away all of your checkpoints.

Smells like a bug to me. I guess I need to see if it still works that way in the 8.x world.

Posted: Fri Oct 23, 2009 6:52 am
by olgc
[quote="ray.wurlod"]Who stopped it? "Stopped" means stopped by user intervention. ...[/quote]

It's stopped by another job having trouble. These jobs are running in parallel.

Posted: Fri Oct 23, 2009 6:56 am
by olgc
[quote="chulett"]You can't. I have no idea why it works that way or what "Stopped/Restartable" is really supposed to mean when it [i]cannot[/i] be restarted from the point of failure. :roll:[/quote]

That's really bad. It destroys the great checkpoint feature completely if this defect is not fixed. IBM should fix the problem. When a sequence is stopped by others and its status is "Stopped/Restartable", it's treated as "Aborted/Restartable".

Thanks,