Aggregator stranges

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
snassimr
Premium Member
Premium Member
Posts: 281
Joined: Tue May 17, 2005 5:27 am

Aggregator stranges

Post by snassimr »

Hi !

I meet sometimes starnge behavior of aggreagtor.
I have a simple job :

DB2 -> TRANSFORMER -> AGGREGATOR -> SEQFILE

In DB2 I have 1000000 rows when the job run statistics of rows in link between DB2 and TRANSFORMER rich 3000000 rows (3 times exactly f the source number rows)

Any explanation
loveojha2
Participant
Posts: 362
Joined: Thu May 26, 2005 12:59 am

Re: Aggregator stranges

Post by loveojha2 »

snassimr wrote:Hi !

I meet sometimes starnge behavior of aggreagtor.
I have a simple job :

DB2 -> TRANSFORMER -> AGGREGATOR -> SEQFILE

In DB2 I have 1000000 rows when the job run statistics of rows in link between DB2 and TRANSFORMER rich 3000000 rows (3 times exactly f the source number rows)

Any explanation
What is the source query that you are running in DB2?
May be there are duplicates within the source table,
most probably it is result of a cartesian within the source query.
Success consists of getting up just one more time than you fall.
snassimr
Premium Member
Premium Member
Posts: 281
Joined: Tue May 17, 2005 5:27 am

Post by snassimr »

Its not the case .

After the job finished the counter retutned to 1000000 but I feel the DS do more work that reqiered for 1000000 rows
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

If you run the job with just 1 row input do you get 3 rows output? This will make your debugging a lot easier. Also, what are you doing in the aggregator stage?
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

I have also seen something like this behaviour. It also occurs in the job monitor, so is probably a problem with how the link row count is being collected when there is a direct link from a Transformer to an Aggregator stage. As noted, the final row counts are correct. It might be an artifact of row buffering - I never pursued it since I was only interested in the final row counts. Does someone have lots of free time to construct a duplicate case? (It doesn't need to be as many as 1 million rows, but does need to be a substantial number.)
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply