Counts and sums in DataStage without the 'Aggregator' stage


anu123
Premium Member
Posts: 143
Joined: Sun Feb 05, 2006 1:05 pm
Location: Columbus, OH, USA

Post by anu123 »

'Auto', the default one.

DSguru2B wrote: In the Transformer, go to the stage properties. What partitioning are you providing?
Thank you,
Anu
DSguru2B
Charter Member
Posts: 6854
Joined: Wed Feb 09, 2005 3:44 pm
Location: Houston, TX

Post by DSguru2B »

And that's where you are going wrong. You need to partition by the keys. Select Hash as the partitioning method and choose the three keys. Now run your job again and see what happens.
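To see why key-based partitioning matters for counts, here is a minimal Python sketch, not DataStage code. It assumes two partitions and three hypothetical key columns, and it uses round robin as a stand-in for any partitioning that ignores the keys (that is an assumption for illustration, not a statement about what 'Auto' actually selects). Each partition counts its own rows independently, the way a per-partition counter in a Transformer would.

```python
import zlib
from collections import Counter

def stable_hash(key):
    # Deterministic hash of the key tuple (Python's built-in hash() varies per run).
    return zlib.crc32(repr(key).encode("utf-8"))

def partition_rows(rows, num_partitions, partition_of):
    # Distribute rows into partitions using the supplied partitioning function.
    parts = [[] for _ in range(num_partitions)]
    for index, row in enumerate(rows):
        parts[partition_of(index, row) % num_partitions].append(row)
    return parts

def count_per_partition(parts):
    # Each partition counts only the rows it received.
    return [Counter(part) for part in parts]

def partitions_holding(key, counts):
    # Number of partitions that saw at least one row with this key.
    return sum(1 for c in counts if c[key] > 0)

# Hypothetical rows; each tuple is the three-column key.
rows = [("A", 1, "x"), ("B", 2, "y"), ("A", 1, "x"), ("A", 1, "x"), ("B", 2, "y")]

rr_counts = count_per_partition(partition_rows(rows, 2, lambda i, row: i))
hash_counts = count_per_partition(partition_rows(rows, 2, lambda i, row: stable_hash(row)))

for key in sorted(set(rows)):
    print(key, "round robin:", [c[key] for c in rr_counts],
          "hash:", [c[key] for c in hash_counts])
```

With the key-blind split, the same key is counted partially in more than one partition, so no single partition ever sees the full count. With hashing on the keys, every row for a given key lands in the same partition and the count is complete.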
Creativity is allowing yourself to make mistakes. Art is knowing which ones to keep.
DSguru2B
Charter Member
Posts: 6854
Joined: Wed Feb 09, 2005 3:44 pm
Location: Houston, TX

Post by DSguru2B »

Coming back to "why are you not using the Aggregator stage": inconsistent and inaccurate data is not the fault of the Aggregator stage. It's the fault of how it was used. Take a small set of data and build a job around the Aggregator stage. Test it out. If you have problems, we are here to help you out. Once you get that working, feed in a couple of million rows. If that works too, feed in your complete data feed. Prove to "them" that the Aggregator works just fine, instead of reinventing the wheel. You do realize that doing all those current-row-to-previous-row comparisons will also take some time. It won't be drastically slow, but still.
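For readers unfamiliar with the current-row-to-previous-row approach, the following Python sketch shows the general idea under stated assumptions: the input is already sorted on the keys, the column names `k1`, `k2`, `k3` and `amt` are hypothetical, and the second function only imitates what a sort followed by a Remove Duplicates step (keeping the last row per key) would achieve. It is an illustration of the logic, not the job discussed in this thread.

```python
def running_counts(sorted_rows, key_cols, amount_col):
    # Walk rows in key order; reset the counters whenever the key changes.
    prev_key = None
    count = 0
    total = 0
    out = []
    for row in sorted_rows:
        key = tuple(row[c] for c in key_cols)
        if key != prev_key:
            count, total = 0, 0
            prev_key = key
        count += 1
        total += row[amount_col]
        # Every row is emitted with the running count and sum so far.
        out.append({**row, "count": count, "sum": total})
    return out

def last_per_key(rows_with_counts, key_cols):
    # Keep only the last row seen for each key (it carries the final totals).
    last = {}
    for row in rows_with_counts:
        last[tuple(row[c] for c in key_cols)] = row
    return list(last.values())

# Hypothetical input, already sorted on the three keys.
rows = [
    {"k1": "A", "k2": 1, "k3": "x", "amt": 10},
    {"k1": "A", "k2": 1, "k3": "x", "amt": 5},
    {"k1": "B", "k2": 2, "k3": "y", "amt": 7},
]
keys = ["k1", "k2", "k3"]

with_counts = running_counts(rows, keys, "amt")
final_rows = last_per_key(with_counts, keys)
for row in final_rows:
    print(row)
```

The comparison only works if all rows for a key arrive together and in order, which is why both the sort and the key-based partitioning matter.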
anu123
Premium Member
Posts: 143
Joined: Sun Feb 05, 2006 1:05 pm
Location: Columbus, OH, USA

Post by anu123 »

I got it. Thanks, Guru.

One last question: is there any way to write only the last record, with its count, to the output, instead of having Sort and Remove Duplicates stages after the Transformer?
DSguru2B wrote: And that's where you are going wrong. You need to partition by the keys. Select Hash as the partitioning method and choose the three keys. Now run your job again and see what happens.
Thank you,
Anu
DSguru2B
Charter Member
Posts: 6854
Joined: Wed Feb 09, 2005 3:44 pm
Location: Houston, TX

Post by DSguru2B »

Not that I can think of right now. Maybe someone else knows and will shed some light on it.
anu123
Premium Member
Posts: 143
Joined: Sun Feb 05, 2006 1:05 pm
Location: Columbus, OH, USA

Post by anu123 »

Anyway, thank you, Guru and all.
DSguru2B wrote: Not that I can think of right now. Maybe someone else knows and will shed some light on it.
Thank you,
Anu
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Tail stage set to 1 row and executing on one node only (perhaps in sequential mode).
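The point about running on one node can be illustrated with a small Python model. It assumes the Tail row count applies per partition, so the function below (a hypothetical stand-in, not the stage itself) returns the last `n` rows of each partition separately.

```python
def tail(partitions, n=1):
    # Return the last n rows of each partition, handled independently.
    return [part[max(len(part) - n, 0):] for part in partitions]

# Hypothetical data: the same five rows split over two nodes, then on one node.
two_nodes = [[1, 2, 3], [4, 5]]
one_node = [[1, 2, 3, 4, 5]]

print(tail(two_nodes))  # one row per partition, so two rows in total
print(tail(one_node))   # a single row, the last one overall
```

Under that assumption, a parallel run yields one "last" row per partition, while a single-node (sequential) run yields exactly one row.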
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.