Aggregator stage Inconsistent output

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.


nvuradi
Participant
Posts: 54
Joined: Tue Dec 12, 2006 11:03 pm


Post by nvuradi »

Hi,
The requirement is to aggregate the Amount column by Amount Type.

I am dealing with around 45 million records, and currently the data has only 2 amount types.

I am getting different results each time I run the job.

I used the Sort method with a preceding Sort stage, sorting on Amount Type with Hash partitioning. I also tried a single-node configuration and also running sequentially, but the results still differ between runs. Is this a bug in DataStage?


Any help regarding this will be highly appreciated.
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Not sure what you mean by 'it only has 2 amount types'. And is your source static? If it is a database table, perhaps you're seeing normal changes over time in the data.
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Please confirm:
(a) the names of your grouping columns
(b) the names of your hash key columns
(c) the names of your sort key columns
(d) the Aggregator operation method (hash or sort)
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
nvuradi
Participant
Posts: 54
Joined: Tue Dec 12, 2006 11:03 pm

Post by nvuradi »

(a) the names of your grouping columns - Amount Type
(b) the names of your hash key columns - Amount Type
(c) the names of your sort key columns - Amount Type
(d) the Aggregator operation method (hash or sort) - Sort ...

And Amount Type has only 2 values in my data sample.

The data is static; I am reading it from a flat file.
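
For readers following along, here is a minimal sketch of why the hash key, sort key and grouping key were asked about. It is plain Python, not DataStage: the four-row sample and the helper names (`hash_partition`, `aggregate_partition`, `run_job`) are assumptions made up for illustration. When rows are partitioned on the grouping column, each Amount Type lands in exactly one partition, so the per-partition totals need no further merging.

```python
import zlib
from collections import defaultdict
from decimal import Decimal


def hash_partition(rows, num_partitions):
    """Route each (amount_type, amount) row to a partition by hashing the key."""
    partitions = [[] for _ in range(num_partitions)]
    for amount_type, amount in rows:
        # crc32 is used only because it is stable across runs (hypothetical choice).
        index = zlib.crc32(amount_type.encode("utf-8")) % num_partitions
        partitions[index].append((amount_type, amount))
    return partitions


def aggregate_partition(rows):
    """Sort the partition on the key, then sum the amounts per key."""
    totals = defaultdict(Decimal)  # Decimal() is Decimal('0')
    for amount_type, amount in sorted(rows):
        totals[amount_type] += amount
    return dict(totals)


def run_job(rows, num_partitions):
    """Partition, aggregate each partition, and collect the group totals."""
    result = {}
    for partition in hash_partition(rows, num_partitions):
        for amount_type, total in aggregate_partition(partition).items():
            # A key seen in two partitions would mean the partitioning key
            # does not match the grouping key.
            assert amount_type not in result
            result[amount_type] = total
    return result


# Hypothetical sample with two amount types, held as exact decimals.
SAMPLE = [
    ("A", Decimal("10.25")),
    ("B", Decimal("5.00")),
    ("A", Decimal("0.75")),
    ("B", Decimal("1.10")),
]

one_node = run_job(SAMPLE, 1)
four_nodes = run_job(list(reversed(SAMPLE)), 4)
```

With exact decimal amounts, the totals in this sketch do not depend on the number of partitions or on the input order.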
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

The answers to the other questions are important as well. Also, how are you detecting that the output is different? Are there just 2 output rows and values are different? If so, what is the output column type?
nvuradi
Participant
Posts: 54
Joined: Tue Dec 12, 2006 11:03 pm

Post by nvuradi »

ArndW wrote:The answers to the other questions are important as well. Also, how are you detecting that the output is different? Are there just 2 output rows and values are different? If so, what is the output col ...
I have given all the required information for the questions...

If I run it the first time I get, say, 10,000 as the amount, but when I run it the next time it changes to 10,001. This is just an example; the real values are huge.
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Floating point.
nvuradi
Participant
Posts: 54
Joined: Tue Dec 12, 2006 11:03 pm

Post by nvuradi »

chulett wrote:Floating point. ...
Any idea on how to resolve this?
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Lots of discussions here on the Aggregator and floating point issues. One such example is here.
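
To illustrate the general floating point issue, here is a minimal sketch in plain Python (not DataStage). It assumes the sum is accumulated in double precision and that rows within a group can reach the accumulator in a different order on different runs; the four sample amounts and the helper names `float_sum` and `decimal_sum` are made up for the illustration.

```python
import math
from decimal import Decimal


def float_sum(values):
    """Naive left-to-right accumulation in double precision."""
    total = 0.0
    for value in values:
        total += value
    return total


def decimal_sum(values):
    """Accumulate with exact decimal arithmetic instead of binary floats."""
    return sum((Decimal(str(value)) for value in values), Decimal(0))


# Hypothetical amounts: two huge values that cancel, plus two small ones.
amounts = [1e16, 1.0, -1e16, 1.0]

arrival_order = float_sum(amounts)         # small values are partly lost
sorted_order = float_sum(sorted(amounts))  # same rows, different order
exact_float = math.fsum(amounts)           # correctly rounded float sum
exact_decimal = decimal_sum(amounts)       # exact decimal result

print(arrival_order, sorted_order, exact_float, exact_decimal)
```

The same four rows give 1.0 in one order and 0.0 in another, while the true total is 2.0. Where the tool allows it, carrying the amounts and the aggregate as a decimal type rather than a float avoids this kind of order dependence.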
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

nvuradi wrote:I have given all the required information for the questions...
I see that you went back and re-edited an earlier post to add that missing information.
budiman.bun
Participant
Posts: 2
Joined: Wed Dec 03, 2008 10:44 pm

Post by budiman.bun »

So what was the outcome?
Is it just because of floating point?
What if the difference in the output amount is huge?
Let's say on the 1st run the job returns a total of 10,000,
and on the 2nd run it returns 12,000.

Any ideas?
I'm using the Sort operation method in the Aggregator. The source is static and comes from an Oracle DB through the Oracle stage.
The output goes to an Oracle stage.
Is this a bug in the Oracle stage?

Thanks