Connection Lost Contact

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
DeepakCorning
Premium Member
Premium Member
Posts: 503
Joined: Wed Jun 29, 2005 8:14 am

Connection Lost Contact

Post by DeepakCorning »

Version : 7.5.2
Server : Windows OS

Issue : One of my sequencer triggers 4 jobs simultaneously out of which 3 complete , but 1 waits for ~90 minutes and then fails with ORA-03135: Connection lost contact.

The job which fails is not the same one all the time , and in addition to that the error is sporadic. Like it is fine for say 7 days and then suddenly one day it appears. When I restart the job and everything goes back to normal.

SQL trace file on datastage server shows the following connection attempt made , but strangely we do not even use the following TNS entry at all (not in the TNS file , and not in any of the jobs) -

(DESCRIPTION=(ADDRESS=(PROTOCOL=BEQ)(PROGRAM=oracle)(ARGV0=oracleORCL)(ARGS='(DESCRIPTION=(LOCAL=YES)(ADDRESS=(PROTOCOL=beq)))'))(CONNECT_DATA=(SID=ORCL)(CID=(PROGRAM=E:\Ascential\DataStage\Engine\bin\uvsh.exe)(HOST=XXX)(USER=XXXX))))
Protocol Error

To me it looks like somehow its not able to decipher the correct connection setting sometimes (may be CPU is too busy) and tries with a wrong TNS name. Any one has seen this????

Thanks
Dk
Thanks
Deepak Patil

Convince Them Confuse Them .. What's the difference?
DeepakCorning
Premium Member
Premium Member
Posts: 503
Joined: Wed Jun 29, 2005 8:14 am

Post by DeepakCorning »

Some more details - The failed job will have only 7 entries - Main ones to notice are

- Start Job
- Load Environment Variables
- Failure (after waiting for 90 Minutes).


No "queries" are issued.... probably just trying to ping??
Thanks
Deepak Patil

Convince Them Confuse Them .. What's the difference?
Post Reply