Limits on number of links out of a Switch/ into a Join

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
miwinter
Participant
Posts: 396
Joined: Thu Jun 22, 2006 7:00 am
Location: England, UK

Limits on number of links out of a Switch/ into a Join

Post by miwinter »

Hi guys,

This is a bit of a long shot but...

I have a job design here which uses a Switch stage which feeds a Join. The Switch and Join are both fed by datasets. On the Switch output, there are 18 links in all. We frequently get failures on this job, but it is not contention-related as I have tried running these one by one, in isolation, and furthermore, it doesn't seem to be volume-related as it happens equally on small and larger data volumes.

The error we see seems to actually relate to the Join stage (which takes in the 18 links from the Switch), namely:

"node_node1: Player 19 terminated unexpectedly"
"main_program: Unexpected termination by Unix signal 9(SIGKILL)"

Does anyone know if there is some kind of limit we should adhere to, when exiting data from a Switch or the number of links a Join can take as an input?

Multiple re-runs of these jobs do end successfully (usually after 3 or 4 attempts)

Cheers fellas
Mark Winter
<i>Nothing appeases a troubled mind more than <b>good</b> music</i>
JoshGeorge
Participant
Posts: 612
Joined: Thu May 03, 2007 4:59 am
Location: Melbourne

Post by JoshGeorge »

Set APT_NO_JOBMON to True. This should solve your problem. This was identified earlier in this mighty site :)
Joshy George
<a href="http://www.linkedin.com/in/joshygeorge1" ><img src="http://www.linkedin.com/img/webpromo/bt ... _80x15.gif" width="80" height="15" border="0"></a>
miwinter
Participant
Posts: 396
Joined: Thu Jun 22, 2006 7:00 am
Location: England, UK

Post by miwinter »

Cheers Josh, I'll give that a spin :D


EDIT...

PS. Does anyone know the reason behind this issue?
Mark Winter
<i>Nothing appeases a troubled mind more than <b>good</b> music</i>
attu
Participant
Posts: 225
Joined: Sat Oct 23, 2004 8:45 pm
Location: Texas

Post by attu »

miwinter wrote:Cheers Josh, I'll give that a spin :D


EDIT...

PS. Does anyone know the reason behind this issue?
We had the same issue and after disabling APT_NO_JOBMON the job ran fine. My question is, is it a Bug with 7.51A and AIX 5.3 (we have ML 6 SP 3) ? Why will only this job fail and why not the others ?
Appreciate your responses.
Thanks
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

So your job also features a Switch stage with 18 links?

If not, you're hijacking this thread, which we frown upon.

Please begin a new thread, with a meaningful subject.

This will assist future searchers.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply