
Regarding aggregator

Posted: Thu Mar 23, 2006 3:31 pm
by Bilwakunj
Hello,
When we say the data is sorted in the aggregator or any other stage, where exactly is that done? Where does the data reside while it is being sorted? Could anyone please explain this process? Is it the same for Server and PX?

Thanks in advance.

Re: Regarding aggregator

Posted: Thu Mar 23, 2006 3:46 pm
by THEDSKID
The Sort checkbox is your way of telling the aggregator stage that your incoming data is already sorted. With that information the aggregator can cut down on its processing time, and you will notice sizeable performance gains on large data sets.

So the sorting is done outside of the aggregator, for example in your input file. I believe this is the same for Server and EE.
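
To illustrate why sorted input helps, here is a minimal Python sketch of the general idea. It is not DataStage code and does not show how the aggregator stage is actually implemented; the function names and sample rows are made up for the example. With input sorted on the grouping key, each group can be finished and emitted as soon as the key changes, so only one group is held at a time. Without that guarantee, a running total has to be kept for every key until the last row has been read.

```python
from itertools import groupby
from operator import itemgetter


def aggregate_sorted(rows):
    """Sum the amount per key, assuming rows arrive sorted by key.

    Only the current group is held at any time.
    """
    for key, group in groupby(rows, key=itemgetter(0)):
        yield key, sum(amount for _, amount in group)


def aggregate_unsorted(rows):
    """Sum the amount per key with no ordering assumption.

    Every distinct key stays in the dictionary until the input ends.
    """
    totals = {}
    for key, amount in rows:
        totals[key] = totals.get(key, 0) + amount
    return totals


# Hypothetical sample data, sorted on the key column.
rows = [("a", 1), ("a", 2), ("b", 5), ("c", 3), ("c", 4)]
sorted_result = list(aggregate_sorted(rows))
unsorted_result = aggregate_unsorted(rows)

# Declaring unsorted data as sorted splits key "a" into two groups.
unsorted_rows = [("a", 1), ("b", 2), ("a", 3)]
wrong_result = list(aggregate_sorted(unsorted_rows))
```

The last two lines show why the flag is only a promise: if the data is not really sorted, a sorted-input aggregation of this kind produces a key more than once instead of a single total.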

Hope this helps

Re: Regarding aggregator

Posted: Thu Mar 23, 2006 3:50 pm
by Bilwakunj
Thanks for your reply. But if the data is sorted in the aggregator, is it sorted in memory or on scratch disk?



Re: Regarding aggregator

Posted: Thu Mar 23, 2006 4:34 pm
by vcannadevula
If your data is not sorted and you check the sort option on the input link, the stage sorts it as follows.

If the data is less than 25MB, it is sorted in memory. Otherwise it uses the scratch disk in the sort pool. If that is full, it uses the default scratch disk; if that is full, it uses TMPDIR; if that is full, it uses /tmp on the server box.
If that is full as well, the job aborts.
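
The fallback order described above can be sketched as a small Python function. This is only a model of the order given in this post, not anything taken from the engine itself: the function name, the location labels and the free-space figures are hypothetical, and "full" is simplified to "less free space than the data needs".

```python
MEMORY_LIMIT_BYTES = 25 * 1024 * 1024  # the 25MB figure quoted in this post


def choose_sort_location(data_bytes, free_bytes_by_location):
    """Return where the sort would run, following the order in the post.

    free_bytes_by_location is an ordered list of (name, free_bytes) pairs,
    tried first to last. A location counts as full when it has less free
    space than the data needs (a simplification for this sketch).
    """
    if data_bytes < MEMORY_LIMIT_BYTES:
        return "memory"
    for name, free_bytes in free_bytes_by_location:
        if free_bytes >= data_bytes:
            return name
    raise RuntimeError("every sort location is full: the job aborts")


MB = 1024 * 1024
# Hypothetical free space: both scratch disks are full, TMPDIR has room.
locations = [
    ("sort pool scratch disk", 0),
    ("default scratch disk", 0),
    ("TMPDIR", 500 * MB),
    ("/tmp", 500 * MB),
]

small = choose_sort_location(1 * MB, locations)    # fits under the memory limit
large = choose_sort_location(100 * MB, locations)  # falls through to TMPDIR
```

A request larger than every location's free space, for example 1000 * MB with the figures above, raises the error, which corresponds to the abort at the end of the chain.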