Performance of Aggregator

dsuser08
Participant
Posts: 29
Joined: Wed Feb 06, 2008 3:56 am

Post by dsuser08 »

Hi

I need to produce the result of the following query:

SELECT empid, SUM(sal) + 100 FROM X
GROUP BY empid

Please let me know which is better with respect to performance: an Aggregator stage followed by a Transformer, or a database stage running the SQL query followed by a Transformer.


Thanks
niharranjan
Participant
Posts: 18
Joined: Mon Jul 23, 2007 1:32 am
Location: Singapore

Post by niharranjan »

I think using the Aggregator would be better.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

It depends (as always). If the grouping columns are supported by indexes, then it may well be faster to do it in the database. If you have sorted data, then it may be faster to use an Aggregator stage. Is the database local or remote? If remote, it may be better to aggregate in the database, because you would send fewer rows across the network. And so on. It depends.
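
To illustrate why sorted input can help, here is a minimal Python sketch of the two grouping strategies. It is not DataStage code: the function names and sample rows are made up for illustration, and it only models the idea behind hash-style versus sort-style aggregation. With unsorted input, every group's running total has to be held until the last row arrives; with input already sorted on empid, each group can be emitted as soon as the key changes.

```python
from itertools import groupby
from operator import itemgetter


def aggregate_hash(rows):
    """Hash-style grouping: accepts any input order, keeps one total per key in memory."""
    totals = {}
    for empid, sal in rows:
        totals[empid] = totals.get(empid, 0) + sal
    # Add 100 to each group's sum, as in SUM(sal) + 100.
    return {empid: total + 100 for empid, total in totals.items()}


def aggregate_sorted(rows):
    """Sort-style grouping: assumes rows arrive ordered by empid.

    Only the current group is held; a result is yielded when the key changes.
    """
    for empid, group in groupby(rows, key=itemgetter(0)):
        yield empid, sum(sal for _, sal in group) + 100


# Hypothetical sample data, already sorted on empid: (empid, sal)
rows = [(1, 1000), (1, 500), (2, 2000), (3, 700), (3, 300)]

hash_result = aggregate_hash(rows)
sorted_result = dict(aggregate_sorted(rows))
print(hash_result)
print(sorted_result)
```

Both functions return the same totals for sorted input. The difference is in memory use and in when the first result becomes available, which is the part that matters for large volumes.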
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.