Page 1 of 1

Configuration file error in Datastage

Posted: Sat Mar 12, 2011 12:34 am
by meet_deb85
Hi all,

The server where my Datastage is installed has the hostname : kasitmisdb1, I have db2 installed in another server kasitmisdb3,
so I have created the following config to enable remote connectivity through Enterprise edition.

My config file has been designed as : -

Code: Select all

{
        node "node1"
        {
                fastname "kasitmisdb1"
                pools ""
                resource disk "/opt/IBM/InformationServer/Server/Datasets" {pools ""}
                resource scratchdisk "/opt/IBM/InformationServer/Server/Scratch" {pools ""}
        }
        node "node2"
        {
                fastname "kasitmisdb3"
                pools "db2" ""
                resource disk "/tmp" {pools "db2"}
                resource scratchdisk "/tmp" {pools "db2"}
        }

}
while checking it I am getting the following error : -

##I IIS-DSEE-TFCN-00001 12:00:20(000) <main_program>
IBM WebSphere DataStage Enterprise Edition 8.5.0.5746
Copyright (c) 2001, 2005-2008 IBM Corporation. All rights reserved



##I IIS-DSEE-TFCN-00006 12:00:20(001) <main_program> conductor uname: -s=Linux; -r=2.6.18-164.el5; -v=#1 SMP Tue Aug 18 15:51:48 EDT 2009; -n=kasitmisdb1; -m=x86_64
##I IIS-DSEE-TCOA-00067 12:00:20(002) <main_program> OS charset: UTF-8.
##I IIS-DSEE-TCOA-00068 12:00:20(003) <main_program> Input charset: UTF-8.
##I IIS-DSEE-TFSC-00001 12:00:20(004) <main_program> APT configuration file: /opt/IBM/InformationServer/Server/Configurations/db2_ee.apt
##E IIS-DSEE-TFIO-00211 12:00:20(000) <APT_RealFileExportOperator in APT_FileExportOperator,0> APT_Communicator::connectTo: connect() failed due to Unix error = 111 (Connection refused) on node kasitmisdb1 using ConnectionInfo object 'TCP, connection Host: localhost (127.0.0.1), TCP port number: 11001', RETRYING connect()
##E IIS-DSEE-TFIO-00211 12:00:20(001) <APT_RealFileExportOperator in APT_FileExportOperator,0> APT_Communicator::connectTo: connect() failed due to Unix error = 111 (Connection refused) on node kasitmisdb1 using ConnectionInfo object 'TCP, connection Host: localhost (127.0.0.1), TCP port number: 11001', RETRYING connect()
##F IIS-DSEE-TFIO-00112 12:00:20(002) <APT_RealFileExportOperator in APT_FileExportOperator,0> Fatal Error: APT_Communicator::pmSendPartitionInfo() failed on node kasitmisdb1 for partition 0 of dataset 0 (write failed to handle 8) Bad file descriptor
##E IIS-DSEE-TFPM-00192 12:00:20(000) <node_node1> Player 2 terminated unexpectedly.
##E IIS-DSEE-TFPM-00338 12:00:20(005) <main_program> APT_PMsectionLeader(1, node1), player 2 - Unexpected exit status 1.
##E IIS-DSEE-TFSC-00011 12:00:25(000) <main_program> Step execution finished with status = FAILED.
##E IIS-DSEE-TCOA-00069 12:00:25(001) <main_program> ERROR: check configuration file failed.


I have enabled SSH between the two servers and I have made the required change in the remsh file.

I can do password less ssh between the two servers.

I have gone through the other posts in this forum, and even tried a few solution without any luck.

Please help and thanks in advance.

Posted: Sat Mar 12, 2011 1:56 am
by ray.wurlod
You shouldn't need a configuration file for server edition.

Assuming, therefore, that you're using enterprise edition, the error is in TCP and described by exit status (error code) 111 - connection refused. Therefore we must conclude that, in spite of your assertions, communication between the machines has not been set up correctly. Can you perform an password-less remote shell command from kasitmisdb1 to kasitmisdb3?

Posted: Sat Mar 12, 2011 2:10 am
by meet_deb85
Hi Ray,

It is for Parallel edition not server, that wasa mistake from my end to put the post into Server.

Also my premium membership has lapsed, so could not read your reply.

Posted: Sat Mar 12, 2011 2:19 am
by lstsaur
The error message indicates "connect() failed due to Unix error = 111 (Connection refused) on node kasitmisdb1". Check your host file.

Posted: Sat Mar 12, 2011 2:23 am
by meet_deb85
This is the entry in my /etc/hosts file : -

127.0.0.1 kasitmisdb1 localhost.localdomain localhost
::1 localhost6.localdomain6 localhost6
172.21.13.53 kasitmisdb1
172.21.13.254 kasitmisdb3

Posted: Sat Mar 12, 2011 2:25 am
by meet_deb85
I fail to understand why it is having problem in connecting to the same server.

Posted: Sat Mar 12, 2011 8:36 am
by chulett
Moved.

Posted: Sat Mar 12, 2011 3:41 pm
by rupeshg
What is the content of your default.apt default configuration file?
Try using the same fastname as given in default.apt file.

Posted: Sun Mar 13, 2011 12:10 am
by meet_deb85
My default.apt config file looks like this : -

{
node "node1"
{
fastname "kasitmisdb1"
pools ""
resource disk "/opt/IBM/InformationServer/Server/Datasets" {pools ""}
resource scratchdisk "/opt/IBM/InformationServer/Server/Scratch" {pools ""}
}
}

I have used the fastname in the other config file... I want to setup DB2 enterprise edition settings, and the server where DB2 lies has the fastname kasitmisdb3.

Posted: Sun Mar 13, 2011 9:42 am
by ppgoml
meet_deb85 wrote:This is the entry in my /etc/hosts file : -

127.0.0.1 kasitmisdb1 localhost.localdomain localhost
::1 localhost6.localdomain6 localhost6
172.21.13.53 kasitmisdb1
172.21.13.254 kasitmisdb3
try removing "kasitmisdb1" from the first line.

Posted: Mon Mar 14, 2011 12:40 am
by meet_deb85
Tried it by removing kasitmisdb1, but getting the same result wihtout any luck

Posted: Mon Mar 14, 2011 2:49 am
by meet_deb85
Do I need to make entries in any other file to get this stuff done??

Posted: Mon Mar 14, 2011 6:33 am
by meet_deb85
Finally resolved it by removing the extra entries of hostnames from the second server's /etc/hosts.