Job run issue

Posted: Wed Sep 14, 2016 9:15 am
by kumar11
main_program: Unable to start ORCHESTRATE job: APT_PMwaitForPlayersToStart failed while waiting for players to confirm startup. This likely indicates a network problem.
Status from APT_PMpoll is 0; node name is etl01

Re: Job run issue

Posted: Wed Sep 14, 2016 9:55 am
by chulett
kumar11 wrote: This likely indicates a network problem.
Did you investigate this?

Re: Job run issue

Posted: Wed Sep 14, 2016 10:36 am
by kumar11
I searched DSXchange and found this error described as a network issue, so I reran the job and got the error below:
node_etl03: Fatal Error: Unable to start ORCHESTRATE process on node etl03 (mdcdetlndw01a): APT_PMPlayer::APT_PMPlayer: fork() failed, Not enough space

Posted: Wed Sep 14, 2016 10:53 am
by PaulVL
Look at your TMPDIR path and your scratch disk paths too.

Make sure you have enough room.
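
A rough sketch of that check, in case it helps. It assumes the usual `resource scratchdisk "<path>"` lines in the parallel configuration file, that unset TMPDIR falls back to /tmp, and that no path contains spaces. The function names `scratch_paths` and `check_space` are made up for illustration.

```shell
#!/bin/sh
# Print each scratchdisk path declared in an APT configuration file.
# Assumes lines of the form: resource scratchdisk "/some/path" {pools ""}
scratch_paths() {
    sed -n 's/^[[:space:]]*resource[[:space:]]\{1,\}scratchdisk[[:space:]]\{1,\}"\([^"]*\)".*/\1/p' "$1" | sort -u
}

# Report free space for TMPDIR (assumed to default to /tmp) and for
# every scratchdisk path found in the configuration file given as $1.
check_space() {
    for p in "${TMPDIR:-/tmp}" $(scratch_paths "$1"); do
        if [ -d "$p" ]; then
            # df -P prints one line per filesystem in 1024-byte blocks:
            # field 4 is the available space, field 5 the percentage used.
            df -P "$p" | awk -v p="$p" 'NR == 2 { print p, $4 " KB free", $5 " used" }'
        else
            echo "$p: not a directory"
        fi
    done
}
```

Something like `check_space "$APT_CONFIG_FILE"`, run as the user that runs the job on each node, should show whether TMPDIR and the scratch mounts are where you expect and how much room each has.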

Posted: Wed Sep 14, 2016 11:46 am
by kumar11
The TMPDIR path is set to /dsScrath
and there is enough space in that directory.

Posted: Wed Sep 14, 2016 1:38 pm
by PaulVL
But is there enough at the time of execution?

Does the job fail 100% of the time?

When you run the job, start watching the space consumption on that mount:

df -g /dsScrath

Watch the % used creep up. I bet it will max out at some point.

Your APT configuration file might be set up with multiple spillover scratch mounts, but TMPDIR does not relocate the same way when it is full. The job will fail with fork errors, broken pipes, and "Not enough space".
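
If you would rather not retype the df command while the job runs, a small polling loop can log the usage for you. This is only a sketch: it uses the portable `df -P` rather than `df -g`, and the function names `used_pct` and `watch_mount` are made up for illustration.

```shell
#!/bin/sh
# Print the percentage used (without the % sign) for the filesystem
# that holds the path given as $1.
used_pct() {
    df -P "$1" | awk 'NR == 2 { sub(/%/, "", $5); print $5 }'
}

# Print a timestamped usage line for mount $1 every $2 seconds
# (default 5), for $3 samples (default 60).
watch_mount() {
    mount_path=$1
    interval=${2:-5}
    count=${3:-60}
    i=0
    while [ "$i" -lt "$count" ]; do
        printf '%s %s %s%%\n' "$(date '+%H:%M:%S')" "$mount_path" "$(used_pct "$mount_path")"
        i=$((i + 1))
        # Skip the pause after the final sample.
        if [ "$i" -lt "$count" ]; then
            sleep "$interval"
        fi
    done
}
```

Start something like `watch_mount "$TMPDIR" 5 120` just before kicking off the job and see whether the percentage climbs toward 100 right before the fork() failure shows up.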

Posted: Wed Sep 14, 2016 2:26 pm
by kumar11
Thanks Paul!! I was able to resolve the issue.

Posted: Wed Sep 14, 2016 3:27 pm
by chulett
For future searchers, you resolved this by doing what exactly?

Posted: Mon Sep 19, 2016 11:04 am
by kumar11
From the Admin console, I set the environment variable TMPDIR to the /dsScratch path, and the job ran fine with that setting.