DS Director and Designer is not responding
Moderators: chulett, rschirm, roy
-
- Participant
- Posts: 251
- Joined: Tue Dec 02, 2008 3:09 pm
DS Director and Designer is not responding
Currently we are having issue, in out Test environment, when we are trying to run the parallel job fromm designer Or director, both designer and director get freezed( not responding), we have to force fully kill the session, but while running sequence job we are not having issue from director and designer.please can you help us out.
We have already open the PMR with IBM guys they gave few steps to follow but still we are not able to figure it out.
Additional information
1) We are having same issue in all the projects on the that server
2)Running parallel job on single node, but still did not work(designer and director getting hanged for parallel job or sequence job with parallel calling)
3) Created new project and ran the job but still did not work (designer and director getting hanged for parallel job or sequence job with parallel calling)
4)every thing is fine while running sequence job only
Thank you
suneel
We have already open the PMR with IBM guys they gave few steps to follow but still we are not able to figure it out.
Additional information
1) We are having same issue in all the projects on the that server
2)Running parallel job on single node, but still did not work(designer and director getting hanged for parallel job or sequence job with parallel calling)
3) Created new project and ran the job but still did not work (designer and director getting hanged for parallel job or sequence job with parallel calling)
4)every thing is fine while running sequence job only
Thank you
suneel
thanks,
suneel
suneel
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
1. Is it the first time trying to run the job in this server?
2. Configuration did properly?
3. Create a sample job say RowGen --> Peak and test. This will ensure we are not accessing any file system.
4. Is there is any log entries after the job is terminated ?
5. Check the ISALite (I am not sure in UNIX) report
6. Check the version.xml
2. Configuration did properly?
3. Create a sample job say RowGen --> Peak and test. This will ensure we are not accessing any file system.
4. Is there is any log entries after the job is terminated ?
5. Check the ISALite (I am not sure in UNIX) report
6. Check the version.xml
Thanks
Ram
----------------------------------
Revealing your ignorance is fine, because you get a chance to learn.
Ram
----------------------------------
Revealing your ignorance is fine, because you get a chance to learn.
-
- Participant
- Posts: 251
- Joined: Tue Dec 02, 2008 3:09 pm
Thank you for the replay
@ Ray
I am trying to run dsjob command,let you know how it goes
@ Sura
This is not first we are running the job, till yesterday every thing was fine, we are having issue from today early morning. Even simple parallel job (peek and row generator), not able to run it, as it freezes the designer or director when we run it.
We have also generated the ISAlite stats and gave it to IBM. They said, there are no specific error messages
Version.xml is valid one
@ Ray
I am trying to run dsjob command,let you know how it goes
@ Sura
This is not first we are running the job, till yesterday every thing was fine, we are having issue from today early morning. Even simple parallel job (peek and row generator), not able to run it, as it freezes the designer or director when we run it.
We have also generated the ISAlite stats and gave it to IBM. They said, there are no specific error messages
Version.xml is valid one
thanks,
suneel
suneel
In your job, please set the following:
DS_PXDEBUG=1 (you will need to create this env var)
APT_STARTUP_STATUS=true
APT_DISABLE_COMBINATION=true
CC_MSG_LEVEL=1
Then force recompile the job and run again.
After you have killed the job, please find any useful info in the job log. And if you go to the project folder you should see a 'debugging' folder.
Inside will be a folder with the job name. Find if that helps you.
DS_PXDEBUG=1 (you will need to create this env var)
APT_STARTUP_STATUS=true
APT_DISABLE_COMBINATION=true
CC_MSG_LEVEL=1
Then force recompile the job and run again.
After you have killed the job, please find any useful info in the job log. And if you go to the project folder you should see a 'debugging' folder.
Inside will be a folder with the job name. Find if that helps you.
Thanks
Ram
----------------------------------
Revealing your ignorance is fine, because you get a chance to learn.
Ram
----------------------------------
Revealing your ignorance is fine, because you get a chance to learn.
-
- Participant
- Posts: 251
- Joined: Tue Dec 02, 2008 3:09 pm
@ SURA
After setting the parameters which you have mentioned in the job and when i try to run the job, director is freezing (unresponsive) and i don't see debugging director is been created.
@ Ray
When i ran the Seq job from DS with DSjob command i got following error message
/info_server/IBM/IS85/Server/DSEngine/bin/dsjob -domain ARLSPMWST01:9081 -user <username> -password <Password> -server ARLSPMDST01.CORP.CAT.COM:31539 -run SB3 Test12121_Jobs
Reply=255
Output from command ====>
Error running job
Status code = 30107
After setting the parameters which you have mentioned in the job and when i try to run the job, director is freezing (unresponsive) and i don't see debugging director is been created.
@ Ray
When i ran the Seq job from DS with DSjob command i got following error message
/info_server/IBM/IS85/Server/DSEngine/bin/dsjob -domain ARLSPMWST01:9081 -user <username> -password <Password> -server ARLSPMDST01.CORP.CAT.COM:31539 -run SB3 Test12121_Jobs
Reply=255
Output from command ====>
Error running job
Status code = 30107
thanks,
suneel
suneel
-
- Participant
- Posts: 251
- Joined: Tue Dec 02, 2008 3:09 pm
-
- Participant
- Posts: 251
- Joined: Tue Dec 02, 2008 3:09 pm
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
-
- Premium Member
- Posts: 1735
- Joined: Thu Mar 01, 2007 5:44 am
- Location: Troy, MI
-
- Participant
- Posts: 251
- Joined: Tue Dec 02, 2008 3:09 pm
-
- Participant
- Posts: 251
- Joined: Tue Dec 02, 2008 3:09 pm
Hi suneelchallagali
If you haven't, Please bounce the server and try to run the job.
If you have already bounced the server, then ensure that all the necessary services are running for xmeta too.
This is my guess only, but wont harm on trying this!
If you haven't, Please bounce the server and try to run the job.
If you have already bounced the server, then ensure that all the necessary services are running for xmeta too.
This is my guess only, but wont harm on trying this!
Thanks
Ram
----------------------------------
Revealing your ignorance is fine, because you get a chance to learn.
Ram
----------------------------------
Revealing your ignorance is fine, because you get a chance to learn.
Are you running on a cluster/grid?
Sequencers run on the Head Node, Jobs run on the Cluster, the rsh/ssh might be the issue.
Look at the user id that is executing the job. There may be something in that login that might be prompting you for data.
What is the LAST log message in your job execution log when the job hangs?
Sequencers run on the Head Node, Jobs run on the Cluster, the rsh/ssh might be the issue.
Look at the user id that is executing the job. There may be something in that login that might be prompting you for data.
What is the LAST log message in your job execution log when the job hangs?
-
- Participant
- Posts: 251
- Joined: Tue Dec 02, 2008 3:09 pm
Hi,
After running the trace on issue with help of IBM guys we got temporary solution,issue was file($DS.SIGNAL) under catdir(under DSEngine directory) got locked due to our DSEngine has been mounted to NAS.
Here is the tech notes from IBM guys for more details.
http://www.ibm.com/support/docview.wss?uid=swg21594973
Thank you,
suneel
After running the trace on issue with help of IBM guys we got temporary solution,issue was file($DS.SIGNAL) under catdir(under DSEngine directory) got locked due to our DSEngine has been mounted to NAS.
Here is the tech notes from IBM guys for more details.
http://www.ibm.com/support/docview.wss?uid=swg21594973
Thank you,
suneel
thanks,
suneel
suneel