Clarification required about Cluster Environment

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

maffan76
Participant
Posts: 110
Joined: Tue Aug 23, 2005 5:27 am

Post by maffan76 »

Hi,
I would like to clarify something. We have a Windows cluster running: one machine contains the OS and the other contains the data drives. From the DataStage point of view, is this a single-node or a multi-node system? In a multi-node system the DS-PX installation changes in terms of RSH configuration etc., and extra environment variables have to be set up. Or does DS have its own definition of a cluster, e.g. N independent computers connected by installing DS server on each machine and configuring RSH for communication among the nodes?

Thanks in advance.
Regards,
Affan
"Questioning is Half Knowledge"
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne

Post by vmcburney »

It depends on what resources you are "borrowing" from those other machines. Assuming your DataStage server gets installed on the OS machine, do you anticipate that this machine has the CPUs and RAM required, or will you be borrowing CPU and RAM from the cluster? Or are you only using the other machines for disk space? In that case it would be easy to link them to the OS server as network drives and reference them in the node configuration file.
maffan76
Participant
Posts: 110
Joined: Tue Aug 23, 2005 5:27 am

Post by maffan76 »

Thanks Vincent,
But what is the default configuration of DS, and how does it see the cluster during installation, given that it does not ask anything? I want to understand its behaviour so as to avoid any licensing problem.

Thanks in Advance.
Regards,
Affan
"Questioning is Half Knowledge"
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

The default configuration is the one you see as "default" in the configuration file viewer/editor in the Manager client.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
maffan76
Participant
Posts: 110
Joined: Tue Aug 23, 2005 5:27 am

Post by maffan76 »

So, another question:
Can I set the Data Set and scratch paths differently for each node, i.e. drive C for node 1, drive D for node 2, and so on?
Regards,
Affan
"Questioning is Half Knowledge"
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Yes.

Read the chapter on configuration files in the Manager manual, particularly the sections on resources.

Data Sets are automatically partitioned across all the processing nodes in the configuration, unless explicitly constrained into a node pool or a disk resource pool by means of stage properties.
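
As an illustration only, a parallel configuration file with different disk and scratch paths per node might look roughly like the sketch below. The node names, the fastname value "etlhost" and all drive paths are hypothetical placeholders and are not taken from this thread; both nodes are assumed to run on the same host.

```
/* Sketch only: hostnames and paths are made-up placeholders. */
{
	node "node1"
	{
		fastname "etlhost"
		pools ""
		resource disk "C:/DSData/Datasets" {pools ""}
		resource scratchdisk "C:/DSData/Scratch" {pools ""}
	}
	node "node2"
	{
		fastname "etlhost"
		pools ""
		resource disk "D:/DSData/Datasets" {pools ""}
		resource scratchdisk "D:/DSData/Scratch" {pools ""}
	}
}
```

With a file like this, a Data Set written by a job using both nodes would have one partition under each resource disk path, unless the stage is constrained to a node pool or disk pool.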
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
maffan76
Participant
Posts: 110
Joined: Tue Aug 23, 2005 5:27 am

Post by maffan76 »

Thanks Ray, I'll read the chapter
Regards,
Affan
"Questioning is Half Knowledge"
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne

Post by vmcburney »

You should get better results if you can split each node's resource disk and scratch disk onto different disk drives, as this should ease I/O bottlenecks.
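
A rough sketch of that layout for a single node, assuming two separate physical drives are available. The drive letters, paths and the fastname value "etlhost" are hypothetical placeholders:

```
/* Sketch only: drive letters, paths and hostname are placeholders. */
{
	node "node1"
	{
		fastname "etlhost"
		pools ""
		resource disk "D:/DSData/Datasets" {pools ""}
		resource scratchdisk "E:/DSData/Scratch" {pools ""}
	}
}
```

Here Data Set files go to one drive and temporary scratch files (for example, from sorts) go to another, so the two kinds of I/O are less likely to compete for the same disk.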