How to decide which partition to be used in which stage

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
Rameshgoldenhill
Participant
Posts: 5
Joined: Tue Aug 26, 2008 9:00 am

How to decide which partition to be used in which stage

Post by Rameshgoldenhill »

How to use parallelism in data stage job and what is the best possible partition selection in which stage to optimize performance and parallelism. Thanks in advance
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Begin by reading Chapter 2 of the Parallel Job Developer's Guide then post any specific questions that you may have.

As it stands your question is too vague. There is no single "best" algorithm, there's no general solution to "optimize parallelism", and you would need to define what you mean by "performance" in an ETL context (rows/second is an almost meaningless metric).
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
John Smith
Charter Member
Charter Member
Posts: 193
Joined: Tue Sep 05, 2006 8:01 pm
Location: Australia

Post by John Smith »

Have you attended the IBM datastage training? that might be a good start.
DS consultant.
Post Reply