Creating new configuration file
Moderators: chulett, rschirm, roy
-
- Participant
- Posts: 2
- Joined: Fri Dec 01, 2006 2:47 am
Creating new configuration file
Hi All,
Before creating a configuration file how to decide the number of nodes,pools and disk space.. what are the design considerations..
Regards,
Subha
Before creating a configuration file how to decide the number of nodes,pools and disk space.. what are the design considerations..
Regards,
Subha
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Welcome aboard. :D
The default configuration file (default.apt) comprises two nodes, each using disk resource within the DataStage Engine directory - because the install script can be sure that this location exists.
Your main considerations will be the volume of data to be processed and the available hardware resources - CPUs, memory and disk space.
You will create more than one configuration file, because not all jobs will require the full degree of parallelism of which your system is capable. But in the development environment you only need a two-node configuration file, since if it runs on two it will run on 2000.
The default configuration file (default.apt) comprises two nodes, each using disk resource within the DataStage Engine directory - because the install script can be sure that this location exists.
Your main considerations will be the volume of data to be processed and the available hardware resources - CPUs, memory and disk space.
You will create more than one configuration file, because not all jobs will require the full degree of parallelism of which your system is capable. But in the development environment you only need a two-node configuration file, since if it runs on two it will run on 2000.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
-
- Participant
- Posts: 2
- Joined: Fri Dec 01, 2006 2:47 am
thanks for your quick reply.. I would like to know how to decide the number of nodes for optimized parallelism.. for optimised parallelism what are the design considerations for creating configuration file
[quote="ray.wurlod"]Welcome aboard. :D
The default configuration file (default.apt) comprises two nodes, each using disk resource within the DataStage Engine directory - because the install script can be sure that t ...[/quote]
[quote="ray.wurlod"]Welcome aboard. :D
The default configuration file (default.apt) comprises two nodes, each using disk resource within the DataStage Engine directory - because the install script can be sure that t ...[/quote]
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
-
- Premium Member
- Posts: 1255
- Joined: Wed Feb 02, 2005 11:54 am
- Location: United States of America
Great answer!. That's why you can have a customized configuration file assigned to particular job based on the design of your job. Am I right, Ray?ray.wurlod wrote:"Optimized" varies on a job by job basis, and indeed even on a run by run basis. There is no such thing as a "one size fits all" configuration file.
Anything that won't sell, I don't want to invent. Its sale is proof of utility, and utility is success.
Author: Thomas A. Edison 1847-1931, American Inventor, Entrepreneur, Founder of GE
Author: Thomas A. Edison 1847-1931, American Inventor, Entrepreneur, Founder of GE
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Partly. It's why best practice is always to set up $APT_CONFIG_FILE as a job parameter so that you can run a job using different configuration files, depending (for example) on the volume of data to be processed. For example, a retail DW might ordinarily use ten nodes, but during the post-Xmas sales have much more data, so run using sixteen nodes. But for a job that pre-loads Lookup File Sets, maybe one or two nodes suffices even at the busiest of times.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
-
- Premium Member
- Posts: 1255
- Joined: Wed Feb 02, 2005 11:54 am
- Location: United States of America