Link Collector Problem

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
cheerfu1
Participant
Posts: 37
Joined: Mon Apr 21, 2003 8:47 pm

Link Collector Problem

Post by cheerfu1 »

Hi All,
I have encountered the following error in one of my datastage jobs using one Link Collector SOMETIMES which results in abort. After i reset it and run again, it would be ok. Thus, im puzzled as it would occur unpredictably as there are situations whereby continuous runs of 3-4 times are successful.

S10..MergeAll.Rejected: ds_ipcclose() - Error in mutex_unlock() - Operation not permitted.

DS10..MergeAll.Rejected: DSD.INTClose Error 11 in GCI close.

DS10..Compare: ds_ipcopen() - Error in open(/tmp/TMOPT.DS10..Compare) - A file or directory in the path name does not exist.

DS10..MergeAll: |Error 11 in GCI Link initialisation.|

Attempting to Cleanup after ABORT raised in stage DS10..MergeAll

DataStage Job 15 Phantom 8325
DS10..MergeAll.Rejected: DSD.INTClose Error 11 in GCI close.
DataStage Phantom Finished

Can anyone please enlighten on what may be the problem?
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Just a guess, but I'd say you might be running out of temp space.

-craig
dlvsrinivas
Participant
Posts: 18
Joined: Mon Jul 07, 2003 12:56 am
Location: India

Post by dlvsrinivas »

Hi All,

Is there any solid reason for the above problem? I am also facing the same problem.

Regards,
Srinivas.
PhilHibbs
Premium Member
Premium Member
Posts: 1044
Joined: Wed Sep 29, 2004 3:30 am
Location: Nottingham, UK
Contact:

Re: Link Collector Problem

Post by PhilHibbs »

cheerfu1 wrote:Hi All,
I have encountered the following error in one of my datastage jobs using one Link Collector SOMETIMES which results in abort. After i reset it and run again, it would be ok. Thus, im puzzled as it would occur unpredictably as there are situations whereby continuous runs of 3-4 times are successful.
I am having the same problem. The job failed twice, then worked. Here are some error messages:

Message: AR_CES_IM_J05_DMR3_DOC_Create_Segments..Collate_GL_entries.E1BPACGL09_vendor: ds_ipcflush() - Error in mutex_unlock() - Not owner
Message: AR_CES_IM_J05_DMR3_DOC_Create_Segments..Collate_GL_entries.E1BPACGL09_vendor: DSD.INTClose Error 11 in GCI close.
Message: AR_CES_IM_J05_DMR3_DOC_Create_Segments..Populate_customer_segments.E1BPACGL09_customer: Run stopped after 1547 rows

Any ideas? The only thing that I did before the third run (which worked) is to delete the destination files, but there was no problem with file permissions.
Phil Hibbs | Capgemini
Technical Consultant
chucksmith
Premium Member
Premium Member
Posts: 385
Joined: Wed Jun 16, 2004 12:43 pm
Location: Virginia, USA
Contact:

Post by chucksmith »

Some of your problems may be version specific. In version 7.something, the collector and partitioner were rewritten to perform non-blocking I/O. This solves most problems. Prior to the rewrite, if a collector or partitioner became starved for resources, it could hang. Uneven row counts on the partitioned links was also a problem at earlier releases.

I also suspect that having more processes trying to write to the same job log may also create a problem. However, this may be solved by changes in the uvconfig file, but I have not tested this, yet.
chunsli
Participant
Posts: 23
Joined: Sun Aug 10, 2003 5:31 pm

Re: Link Collector Problem

Post by chunsli »

Unix file permissions may cause this kind of problem if you run the job under different userIDs, expecially when the userIDs are in different groups and have different rights.

chunsli

cheerfu1 wrote:Hi All,
I have encountered the following error in one of my datastage jobs using one Link Collector SOMETIMES which results in abort. After i reset it and run again, it would be ok. Thus, im puzzled as it would occur unpredictably as there are situations whereby continuous runs of 3-4 times are successful.

S10..MergeAll.Rejected: ds_ipcclose() - Error in mutex_unlock() - Operation not permitted.

DS10..MergeAll.Rejected: DSD.INTClose Error 11 in GCI close.

DS10..Compare: ds_ipcopen() - Error in open(/tmp/TMOPT.DS10..Compare) - A file or directory in the path name does not exist.

DS10..MergeAll: |Error 11 in GCI Link initialisation.|

Attempting to Cleanup after ABORT raised in stage DS10..MergeAll

DataStage Job 15 Phantom 8325
DS10..MergeAll.Rejected: DSD.INTClose Error 11 in GCI close.
DataStage Phantom Finished

Can anyone please enlighten on what may be the problem?
changming
Participant
Posts: 68
Joined: Wed Oct 13, 2004 3:35 am

link collector not effiective

Post by changming »

I have same problem, so I seldom use link collector.
when I have to use, I use hash file. or I write into several sequentail file and then cat then after jobs.
rumu
Participant
Posts: 286
Joined: Mon Jun 06, 2005 4:07 am

Re: link collector not effiective

Post by rumu »

Hi,
I think it is a performance issue,check out in performance tab which option is checked-user-project defaults if so then check all those parameter in administrator ,if not then check inter process under Enable row buffer option and increase the time out-say 60sec-since u r facing mutex error it might be problem of dead lock.
PhilHibbs
Premium Member
Premium Member
Posts: 1044
Joined: Wed Sep 29, 2004 3:30 am
Location: Nottingham, UK
Contact:

Re: Link Collector Problem

Post by PhilHibbs »

chunsli wrote:Unix file permissions may cause this kind of problem if you run the job under different userIDs,
Like I said, it was not a file permissions problem.
Phil Hibbs | Capgemini
Technical Consultant
tcj
Premium Member
Premium Member
Posts: 98
Joined: Tue Sep 07, 2004 6:57 pm
Location: QLD, Australia
Contact:

Re: Link Collector Problem

Post by tcj »

On a project I was working on we were having mutex errors while using link collectors. They were running datastage 7.5.1 on a AIX server. Seems that there was a glitch in he AIX operating system which was causing the mutex error on any jobs using a link collecter.

Anyone here that is having mutex errors running on a AIX server?

Tim
Post Reply