To speed up the aggregator
Moderators: chulett, rschirm, roy
To speed up the aggregator
Dear All,
Is there any way to speed up the aggregator?
Any suggestion pls.
Thanks,
ICE
Is there any way to speed up the aggregator?
Any suggestion pls.
Thanks,
ICE
pre-sort the incoming data and tell the aggravator about it.
<a href=http://www.worldcommunitygrid.org/team/ ... TZ9H4CGVP1 target="WCGWin">
</a>
</a>
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
-
- Participant
- Posts: 3593
- Joined: Thu Jan 23, 2003 5:25 pm
- Location: Australia, Melbourne
- Contact:
Wow, version 4. You could use a Universe stage, write all your data into it and pull it out with a group by SQL command. You could upgrade a couple versions and try multiple instance jobs. Upgrade to DataStage EE parallel jobs with much faster sort and aggregation functions.
Certus Solutions
Blog: Tooling Around in the InfoSphere
Twitter: @vmcburney
LinkedIn:Vincent McBurney LinkedIn
Blog: Tooling Around in the InfoSphere
Twitter: @vmcburney
LinkedIn:Vincent McBurney LinkedIn
Dear Ray,
Just now I am checking the option#6 from DS.tools.
I see there are some options such as ReportAfterRows, ReportAfterTime,TableSize, and so on ... to choose.
So may I know which option would be maximize the performance of the aggregator?
Is it the TableSize option???
Thanks in advance,
ICE
Just now I am checking the option#6 from DS.tools.
I see there are some options such as ReportAfterRows, ReportAfterTime,TableSize, and so on ... to choose.
So may I know which option would be maximize the performance of the aggregator?
Is it the TableSize option???
Thanks in advance,
ICE
ray.wurlod wrote:You might also investigate option #6 on the DS.TOOLS menu, where you can modify the reporting intervals and memory allocation model. ...
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
At the menu prompt enter each number followed by a question mark; for example - this will give you more information. Which ones to change depends on where your performance problem is. Increasing the reporting interval (so that the stage updates its status less frequently) will always help.
Sorted input (sorted by grouping keys) will give the best gains.
Code: Select all
2?
Sorted input (sorted by grouping keys) will give the best gains.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Thank you, Ray.
Thank you all for your advice.
Thank you all for your advice.
ray.wurlod wrote:At the menu prompt enter each number followed by a question mark; for example- this will give you more information. Which ones to change depends on where your performance problem is. ...Code: Select all
2?
-
- Participant
- Posts: 57
- Joined: Sat Jun 09, 2007 1:14 am
- Location: chicago
Hi always
sort the data before it is sent to aggregator
so that sll the data belonging to one particular group
will go to one chunk and hence grouping at aggr stage will become faster
thanks
suresh
sort the data before it is sent to aggregator
so that sll the data belonging to one particular group
will go to one chunk and hence grouping at aggr stage will become faster
thanks
suresh
ArndW wrote:pre-sort the incoming data and tell the aggravator about it. ...
Hi I have experience in parallel extender datastage I am ready to give/take help from other
hope we all help each other hand in hand
hope we all help each other hand in hand
As noted, there is more to it than that. You also have to assert the sorted order in the stage so it knows you've done this. And hopefully you've sorted in such a manner that supports the grouping being done, otherwise it's all for naught.karrisuresh wrote:always sort the data before it is sent to aggregator
-craig
"You can never have too many knives" -- Logan Nine Fingers
"You can never have too many knives" -- Logan Nine Fingers
Dear Chulett and karrisuresh,
Could u pls let me know whether I need to use sort stage before I sent to aggregator?
Currently I am using the sort stage to sort the data before I sent to aggregator. But I got below error message.
ImptoTEST.sort14: Unable to retrieve value for property 'SortSpec
I have no experience for sort stage.
Thanks you all for your advice.
ICE
Could u pls let me know whether I need to use sort stage before I sent to aggregator?
Currently I am using the sort stage to sort the data before I sent to aggregator. But I got below error message.
ImptoTEST.sort14: Unable to retrieve value for property 'SortSpec
I have no experience for sort stage.
Thanks you all for your advice.
ICE
chulett wrote:As noted, there is more to it than that. You also have to assert the sorted order in the stage so it knows you've done this. And hopefully you've sorted in such a manner that supports the grouping being done, otherwise it's all for naught.karrisuresh wrote:always sort the data before it is sent to aggregator
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
The correct spelling of the second person personal pronoun is "you" not "u".
You can use any method you like to sort the data. A Sort stage is one way. If the source is a database table, you could include an ORDER BY clause in the extraction SQL.
You can use any method you like to sort the data. A Sort stage is one way. If the source is a database table, you could include an ORDER BY clause in the extraction SQL.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
As noted, many sort options exist, the specific stage is only one such option available. Check the Property Help for the Sort Specification in the stage, it has examples of the syntax needed. Yes, it's a little confusing if you've never used it before, but check the help and give that a shot.
Just as an FYI, that stage is pretty slow. If you can do this in the source, say an order by in a database, that would generally be more performant. Or perhaps leverage a high-speed sort package or just the plain old 'sort' command from the O/S. All of them typically beat the pants off the Sort stage.
Just as an FYI, that stage is pretty slow. If you can do this in the source, say an order by in a database, that would generally be more performant. Or perhaps leverage a high-speed sort package or just the plain old 'sort' command from the O/S. All of them typically beat the pants off the Sort stage.
-craig
"You can never have too many knives" -- Logan Nine Fingers
"You can never have too many knives" -- Logan Nine Fingers
Hummmm...........
I use "u" as short word of "you" due to boring to type a lot of words
Ok. Later will try to type full word.
Thank you for your advice
Thank you,
ICE
I use "u" as short word of "you" due to boring to type a lot of words
Ok. Later will try to type full word.
Thank you for your advice
Thank you,
ICE
ray.wurlod wrote:The correct spelling of the second person personal pronoun is "you" not "u".
You can use any method you like to sort the data. A Sort stage is one way. If the source ...
Oop!!! Really???
Ok. I think I better use the order by clause in my sql. Thank you for your advice.
Thanks,
ICE
Ok. I think I better use the order by clause in my sql. Thank you for your advice.
Thanks,
ICE
chulett wrote:As noted, many sort options exist, the specific stage is only one such option available. Check the Property Help for the Sort Specification in the stage, it has examples of the syntax needed. Yes, it's a little confusing if you've never used it before, but check the help and give that a shot.
Just as an FYI, that stage is pretty slow. If you can do this in the source, say an order by in a database, that would generally be more performant. Or perhaps leverage a high-speed sort package or just the plain old 'sort' command from the O/S. All of them typically beat the pants off the Sort stage.