Which is better: database join or DataStage join?

apkselvam
Participant
Posts: 31
Joined: Mon Sep 04, 2006 2:37 am

Post by apkselvam »

Hi All,

I have a requirement to join three tables, each holding more than 10 million rows, and send the output to a target table, with a few transformations in between. My questions are:

1) Can I do the join in the database while extracting the data?
2) Can I use the DataStage Join stage, sorting and hash-partitioning all input tables on the same primary key?

Which will give better performance? Eagerly awaiting your reply.

Thanks in advance
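
For readers unfamiliar with option 2, here is a minimal Python sketch of the general technique it describes: hash-partition every input on the same key, sort each partition on that key, then merge-join partition by partition. It is only an illustration of the idea, not DataStage code; the function names (`hash_partition`, `merge_join`, `partitioned_join`) and the dictionary row layout are assumptions made for this example.

```python
from operator import itemgetter


def hash_partition(rows, key, n_parts):
    """Send each row to a partition chosen by hashing its join key,
    so rows with equal keys always land in the same partition."""
    parts = [[] for _ in range(n_parts)]
    for row in rows:
        parts[hash(row[key]) % n_parts].append(row)
    return parts


def merge_join(left, right, key):
    """Inner join of two row lists that are already sorted on `key`."""
    out = []
    i = j = 0
    while i < len(left) and j < len(right):
        lk, rk = left[i][key], right[j][key]
        if lk < rk:
            i += 1
        elif lk > rk:
            j += 1
        else:
            # Find the run of right-hand rows sharing this key.
            j_end = j
            while j_end < len(right) and right[j_end][key] == lk:
                j_end += 1
            # Pair every matching left row with that run.
            while i < len(left) and left[i][key] == lk:
                for r in right[j:j_end]:
                    out.append({**left[i], **r})
                i += 1
            j = j_end
    return out


def partitioned_join(tables, key, n_parts=4):
    """Join any number of inputs on `key`, one partition at a time."""
    partitioned = [hash_partition(t, key, n_parts) for t in tables]
    result = []
    for p in range(n_parts):
        # Sort each input's slice of this partition on the join key.
        slices = [sorted(parts[p], key=itemgetter(key)) for parts in partitioned]
        joined = slices[0]
        for nxt in slices[1:]:
            # merge_join output stays in key order, so it can be chained.
            joined = merge_join(joined, nxt, key)
        result.extend(joined)
    return result
```

Because all inputs are partitioned with the same hash on the same key, each partition can be joined independently, which is what lets a parallel engine spread the work across nodes. The sort on every input is the main cost of this approach at tens of millions of rows.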
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Are all the tables in the same database? That is probably the most important factor for this question.
apkselvam
Participant
Posts: 31
Joined: Mon Sep 04, 2006 2:37 am

Post by apkselvam »

[quote="ArndW"]Are all the tables in the same database? That is probably the most important factor for this question. ...[/quote]

Thanks for the quick response. All three tables are in the same database.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

The best answer is "it depends". If the database server has spare capacity and/or if the join columns are indexed then chances are that the join will be resolved faster in the database. But the job will have to wait until the first row is delivered from the database.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
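
As a rough illustration of the database-side option discussed above, the sketch below pushes a three-table join into the extraction query, with the join columns indexed through primary keys. It uses Python's built-in `sqlite3` module purely as a stand-in for the real database; the table names (`t1`, `t2`, `t3`), the columns, and the function `run_database_join` are hypothetical and do not come from the original poster's schema.

```python
import sqlite3

# The join is expressed in SQL, so the client only receives joined rows.
JOIN_SQL = """
SELECT t1.id, t1.a, t2.b, t3.c
FROM t1
JOIN t2 ON t2.id = t1.id
JOIN t3 ON t3.id = t1.id
ORDER BY t1.id
"""


def run_database_join():
    conn = sqlite3.connect(":memory:")
    # PRIMARY KEY gives each join column an index the database can use.
    conn.executescript("""
        CREATE TABLE t1 (id INTEGER PRIMARY KEY, a TEXT);
        CREATE TABLE t2 (id INTEGER PRIMARY KEY, b TEXT);
        CREATE TABLE t3 (id INTEGER PRIMARY KEY, c TEXT);
    """)
    conn.executemany("INSERT INTO t1 VALUES (?, ?)",
                     [(i, f"a{i}") for i in range(1000)])
    conn.executemany("INSERT INTO t2 VALUES (?, ?)",
                     [(i, f"b{i}") for i in range(0, 1000, 2)])
    conn.executemany("INSERT INTO t3 VALUES (?, ?)",
                     [(i, f"c{i}") for i in range(0, 1000, 3)])

    cursor = conn.execute(JOIN_SQL)
    # Downstream processing cannot start until the database
    # hands back the first joined row.
    first = cursor.fetchone()
    rows = [first] + cursor.fetchall() if first is not None else []
    conn.close()
    return rows
```

Whether this beats a join inside the ETL job still depends on the factors named in the reply: spare capacity on the database server and indexes on the join columns.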
apkselvam
Participant
Posts: 31
Joined: Mon Sep 04, 2006 2:37 am

Post by apkselvam »

[quote="ray.wurlod"]The best answer is "it depends". If the database server has spare capacity and/or if the join columns are indexed then chances are that the join will be resolved faster in the database. But the job ...[/quote]

Thanks Ray