Too Many Aggregators

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
sureshreddy2009
Participant
Posts: 62
Joined: Sat Mar 07, 2009 4:59 am
Location: Chicago
Contact:

Too Many Aggregators

Post by sureshreddy2009 »

Hi Viewers, :D

Please give me a thought on this requirement
I have one input dataset, from that I have to create one sequential file report which will contains 30 aggregate count columns, There are 4 key columns and 30 additional columns in source file, I have to calculate non missing values on every column based on group of this 4 key columns. Non missing in the sense which are not null. So for 30 columns I am using 30 aggregators, Is there any way to achieve this with out using this many aggregators
Suresh Reddy
ETL Developer
Research Operations

"its important to know in which direction we are moving rather than where we are"
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Use one Aggregator. Process 30 columns (plus grouping columns, if any) in it.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
sureshreddy2009
Participant
Posts: 62
Joined: Sat Mar 07, 2009 4:59 am
Location: Chicago
Contact:

Post by sureshreddy2009 »

The function need to perform in aggregator is Non Missing Columns, It is not allowing for multiple columns, even though grouping columns are same but non missing column is calculated on each column, Could any one help..?
Suresh Reddy
ETL Developer
Research Operations

"its important to know in which direction we are moving rather than where we are"
jwiles
Premium Member
Premium Member
Posts: 1274
Joined: Sun Nov 14, 2004 8:50 pm
Contact:

Post by jwiles »

You can certainly do what you want in a single aggregator.

For each column you want to calculate an aggregation on, add the "Column for Calculation" option. Then under that option, specify the type of aggregation and the output column it will be placed in.

For example:

Column for Calculation = A
Non-missing Values Count Output Column = CountA
Column for Calculation = B
Non-missing Values Count Output Column = CountB

You can't specify the Non-missing Values Count aggregation multiple times under a single "Column for Calculation". I suspect this is what you were trying to do.

Regards,
- james wiles


All generalizations are false, including this one - Mark Twain.
sureshreddy2009
Participant
Posts: 62
Joined: Sat Mar 07, 2009 4:59 am
Location: Chicago
Contact:

Post by sureshreddy2009 »

Thanks jwiles

I thought we cannot repeat column for calculation multiple times but this is possible, Now I had done my job , Thanks a lot
Suresh Reddy
ETL Developer
Research Operations

"its important to know in which direction we are moving rather than where we are"
Post Reply