We have different type of partition ie.
Hash Partition
Entire Prtition
Round robin
Same.
How can we decide which partition has to be used?
How to decide which partition to be used in what kind of job
Moderators: chulett, rschirm, roy
-
- Participant
- Posts: 251
- Joined: Mon Jun 09, 2008 5:52 am
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Welcome aboard.
There are actually eight choices for partitioning algorithm, and four for collecting. However, the decision is usually easier than that.
If you don't need to keep like-valued keys together, use an algorithm that spreads rows as evenly as possible over processing nodes. If you do need to keep like-valued keys together, use a key-based algorithm (modulus for a single integer key, hash otherwise). Range partitioning is rarely used, and requires that you pre-process your data to generate a "range map". Entire for reference input to Lookup stage is handy in that it guarantees that all valid lookups will succeed, but comes at a cost on cluster/grid environments in that all records have to be sent to all nodes (in an SMP environment one copy is lodged in shared memory).
There are actually eight choices for partitioning algorithm, and four for collecting. However, the decision is usually easier than that.
If you don't need to keep like-valued keys together, use an algorithm that spreads rows as evenly as possible over processing nodes. If you do need to keep like-valued keys together, use a key-based algorithm (modulus for a single integer key, hash otherwise). Range partitioning is rarely used, and requires that you pre-process your data to generate a "range map". Entire for reference input to Lookup stage is handy in that it guarantees that all valid lookups will succeed, but comes at a cost on cluster/grid environments in that all records have to be sent to all nodes (in an SMP environment one copy is lodged in shared memory).
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.