Server and client sizing

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
Devyani Sarkar
Participant
Posts: 26
Joined: Thu Apr 15, 2004 12:54 am

Server and client sizing

Post by Devyani Sarkar »

Hi,
Generally what should be the indicative server and client configuration assuming that we have around 1 TB data flowing through?
Are there any high level sizing guidelines for Ascential Server?
Thanks,
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

The client configuration, at least the minimums, are outlined by IBM/Ascential. Sizing the server is a bit tougher. I can flow 1Tb through my notebook with no problems, except it will take quite a bit longer than on a top-end Solaris box. So part of the sizing questions must be times and not just data amounts. Is the data being processed heavily (i.e. a lot "T" from the ETL equation) or just being moved? What are the sources and the database(s)? Are they local or remote?

Many factor play into sizing and capacity planning, making correct judgement a matter of experience (and luck).
kumar_s
Charter Member
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

Are lookin at the version of Datastage or the size of the server?
Assuming you are after the server size and capacity, the vauge answer would be, "it depends on your jobs desing that you use". That true. More the stage, more the resource used in Datastage. I have seen some site, where, ETL is used just to trigger Oracle Stored procedure, and Extract and load directly into other server. Which may not ponentially utilize the Datastage resource. Say if you have some lookup through hashed file need to be done, and the whole file need to be palce in Datastage server (atleast for better performance).
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
Devyani Sarkar
Participant
Posts: 26
Joined: Thu Apr 15, 2004 12:54 am

Post by Devyani Sarkar »

Thanks ArndW & kumar_w,
The requirement is that the expected data volume handled to be of 1TB and the tranformation and load will be carried out on Datastage Designer. What will be your suggestion based on your experience, we can get the leads from this ?
kumar_s
Charter Member
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

Since the input you have, just gives the size inoformation, I again assume that you are after Disk Size.
As mentioned, it all depends on the level of transformation involed. What is your source/targer? Is it generally a daily run/weekly/Fortnightly/monthly run? Based on your high level desing, how many days you require the staging files/tables to be retained. Is it in the same server or different?
IF you dont have any of this information, just go ahead with double or triple of your input (assuming that you are not going to process all the 1TB at the same time).
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Ok, I recommend using an old Intel 486 processor (or a Pentium if you feel like splurging) and the minimum amount of memory that the IBM/Ascential datastage guide suggests, but you could probably make do with 512Mb. This will be your cheapest solution.

(do you now understand that this is why you've been asked twice about performance This DS Server configuration will work, but your load might take over a week to complete).

This is like me asking you what sort of a car I should purchase and tell you that I have 60 boxes in my garage that I need to move.

Do you have operational constraints? Is the company a UNIX or a Windows shop The list of questions can go on quite a bit, but without any information about the general parameters you will not be able to get a good answer from anyone. Except perhaps a hardware salesman - he/she will sell you a box with pleasure.
Devyani Sarkar
Participant
Posts: 26
Joined: Thu Apr 15, 2004 12:54 am

Post by Devyani Sarkar »

Server will be hosted on Unix Box, and acceptable processing time should be about 1-2 hour for daily run for full load. Datastage server should be able to process about 1TB Data in 1 hour or 2 hours maximum, this includes read from sources transform and load the presentation layer.
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Devyani,

great (that rules out me being able to sell you my old PC, though).

What UNIX hardware platform is currently used in-house; usually companies prefer homogenous hardware?

Any UNIX system can process 1TB from a sequential file to a sequential file in that time period. But if your database isn't on your DS server and you have a gigabit ntework card you will only be able to transfer just under 3Tb per hour (if nothing else is going on and everything is perfect)
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

All these terabytes flying around in '1 or 2 hours maximum' using the Server product? Interesting...
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

NVLLVS ANXIETAS

If you have multiple CPUs (or even sufficient spare CPU capacity) you can create multiple instances each of which processes a subset of the rows - partition parallelism, in short.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

ray.wurlod wrote:NVLLVS ANXIETAS
Cvte... :wink:

Obviously, you can do pretty much anything with a big enough X - lever, server, whatever. Just wondering how practical it would be, that's all.
-craig

"You can never have too many knives" -- Logan Nine Fingers
Post Reply