Which Partition techniques for which stage

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply

Which Partition Techniques are used mostly

Poll ended at Mon Apr 11, 2011 12:38 pm

hash
4
100%
modulus
0
No votes
 
Total votes: 4

kstechno
Participant
Posts: 6
Joined: Tue Apr 05, 2011 9:14 pm

Which Partition techniques for which stage

Post by kstechno »

Hi I understood all the partition techniques.
Can any one tell me
which technique used for the which stages
and according to which situation which key based or key less techniques are used ( With small examples)

Thankyou
Last edited by kstechno on Wed Apr 06, 2011 9:37 pm, edited 1 time in total.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

"The correct one" is the one to choose. Your poll is meaningless.

It's not about stage type, it's about data type of the key for which adjacency needs to be established (Modulus is only for use when the key is an integer of some kind, Hash is the other key-based algorithm to be used for all other data types).

Stage types that need key-partitioned data are those that rely on same key values being on the same partition. These include Join, Merge, Remove Duplicates and Aggregator and may include Transformer and Lookup (and others).
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
kstechno
Participant
Posts: 6
Joined: Tue Apr 05, 2011 9:14 pm

Post by kstechno »

ray.wurlod wrote:"The correct one" is the one to choose. Your poll is meaningless.

It's not about stage type, it's about data type of the key for which adjacency needs to be established (Modulus is only for use wh ...

Hi ray thanks for your reply.
I think the way i asked in the poll is wrong one.

And I m not premium member of this website. So that I cant read your message reply.
Any way once again thanks for your reply.
ds@tcs
Participant
Posts: 24
Joined: Thu Mar 17, 2011 6:26 am
Location: chennai
Contact:

Post by ds@tcs »

If key column datatype is text then we use Hash
if key column datatype numeric we use modulous
if one key column numeric and another text then also we use Hash
if both key columns numeric then we use Modulous.

because modulous can be performed only on numbers
Regards
Sankar
kstechno
Participant
Posts: 6
Joined: Tue Apr 05, 2011 9:14 pm

Post by kstechno »

ds@tcs wrote:If key column datatype is text then we use Hash
if key column datatype numeric we use modulous
if one key column numeric and another text then also we use Hash
if both key columns numeric then we use Modulous.

because modulous can be performed only on numbers
thankyou shankar
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

kstechno wrote:And I m not premium member of this website. So that I cant read your message reply.
This link might help.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply