How long time to need to start with DB2 64 partitions

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
dhwankim
Premium Member
Premium Member
Posts: 45
Joined: Mon Apr 07, 2003 2:18 am
Location: Korea
Contact:

How long time to need to start with DB2 64 partitions

Post by dhwankim »

Hi All

Now I am participating DW Project with DataStage PX and DB2 BCU Architecture.

This system will Near Real Time DW with huge volumn data.

So DBA draws 64 Partition Configuration on 8 SMP Machines.

My Concern is how long time to need to start DS PX Job with DB2 Enterprise Stage.

Does Any body have a experience with 64 Partition DB2 Archiecture.

Batch window is 20 minutes with 1000 more jobs.

So If DataStage PX start time takes long time(?). It should be affected ETL Performance.

I really really want to actual information.

Now I do not have Product machine. so I could not test this environment.

Please Help Me.

Thx in advance.
hamzaqk
Participant
Posts: 249
Joined: Tue Apr 17, 2007 5:50 am
Location: islamabad

Post by hamzaqk »

it would depend on several factors such as how good your hardware is, how well your jobs are designed / tuned . i dont think there is a general answer to this.
Teradata Certified Master V2R5
hamzaqk
Participant
Posts: 249
Joined: Tue Apr 17, 2007 5:50 am
Location: islamabad

Post by hamzaqk »

it would depend on several factors such as how good your hardware is ? network ? how well your jobs are designed / tuned ? . i don't think there is a general answer to this.
Teradata Certified Master V2R5
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

The startup time isn't as important as the high-performance throughput you are going to get by using a 64-way partitioned database. PX Jobs should be designed to process a lot of data. If you are worried about startup time then perhaps you should be using server jobs instead of PX jobs for those jobs whose ratio of startup time : processing time is high.
kcbland
Participant
Posts: 5208
Joined: Wed Jan 15, 2003 8:56 am
Location: Lutz, FL
Contact:

Post by kcbland »

You have all choices in partitioning from sources and thru transformation (any target lookups are the exception), it's only when loading into your 64-way do you need to consider aligning your data to the database partitioning scheme.
Kenneth Bland

Rank: Sempai
Belt: First degree black
Fight name: Captain Hook
Signature knockout: right upper cut followed by left hook
Signature submission: Crucifix combined with leg triangle
dhwankim
Premium Member
Premium Member
Posts: 45
Joined: Mon Apr 07, 2003 2:18 am
Location: Korea
Contact:

Suggested Machine and BCU Architecture

Post by dhwankim »

Hi

Now IBM has suggested Db2 BCU Architecture with datastage EE.

My role is a design of ETL Architecture with this platform.


The Platform is below =============
It consists of 10 P570 Macnines.
One of Machine is DB2 Admin BCU and another is ETL BCU (ETL Main Machine, ETL Main machine has 12 core (24cpu))
and others are Data BCU.

My Designe for General ETL Archiecture.
Generally, I will use 2 degress of Parallelism for PX Jobs. Because
I have run 1000 more jobs in 20 minutes batch window every hour.
and I will use transformation processing in ETL BCU Machine Only.

Big table has 20 billion record (About 10 tables have that volumn).

but generally Most of txn tables is fed 1 million record less in each Batch Window(20 minutes). (All handling records of tables is about 30 million in 20 minutes)

and during ETL Batch Time, I have to service data for Biz user.
so All Load mode is upsert like record level handling not load mode.
and I have to give low stress to DB2 during ETL Time.

Frankly speaking, My Concern is that It is possible to do ETL in the batch window with DB2 BCU Architecture.

so I want you to share your exprience with DB2 and DataStage.

Thx .
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Hello DaeHwan,

If you are concerned that the time window might be a problem, go back to IBM and ask them on what basis they recommended this particular configuration?

There are several very large corporations using 24+ nodes for DB2 (with large DataStage configurations such as 24 nodes in the DB2 node pool and more nodes allocated for processing). Data volumes (and execution time windows) are achieved comfortably.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply