Hi,
In the job we are giving 19 crore data to aggregator stage,it is taking 3 hrs time.here we are giving sorted data and hash partitioned data to agg and method in agg is sort method..please let me know if i can reduce the time in any other manner..
Thanks,
Rajashekar.
Aggregator Performance
Moderators: chulett, rschirm, roy
Re: Aggregator Performance
How about the data volume. no of columns used to aggregation?
Find out where the time is consumed more?
Split the job may help to reduce the time.
DS User
Find out where the time is consumed more?
Split the job may help to reduce the time.
DS User
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Please advise what the grouping columns for aggregation are.
Essentially, though, you need to partition on the first only of these (unless it has very few distinct values) and sort on all of them in order, to be able use Sort as the aggregation method.
Essentially, though, you need to partition on the first only of these (unless it has very few distinct values) and sort on all of them in order, to be able use Sort as the aggregation method.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
-
- Premium Member
- Posts: 783
- Joined: Mon Jan 16, 2006 10:17 pm
- Location: Sydney, Australia
Compare with a simple select Job Vs Aggregator in Job.
I assume the throughput from Source stage is a well to note measure in depicting overall performance of your Job.
I will also suggest dumping the data into dataset and using that as a source to compare your results and see if there is any improvement oppurtunity.
I assume the throughput from Source stage is a well to note measure in depicting overall performance of your Job.
I will also suggest dumping the data into dataset and using that as a source to compare your results and see if there is any improvement oppurtunity.