Funnel Stage

kittu.raja
Premium Member
Posts: 175
Joined: Tue Oct 14, 2008 1:48 pm

Post by kittu.raja »

Hi,

I have two datasets.

In one of my jobs I am using a Funnel stage to merge the data from the two datasets.

My design is:

DS -----> Xfm ----->
                     Funnel -----> Remove Dup -----> Oracle
DS -----> Xfm ----->

When I run the job, the Funnel takes a long time to pass the data from the Transformers to Remove Duplicates.

I can see 2 million records arriving at the Funnel, and I am using a Continuous Funnel.

I don't know why it is so slow.

Can anybody help me out?

Thanks,
Rajesh Kumar
chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Probably not the Funnel. Have you dumped the score for this? The Remove Duplicates stage requires sorted data from what I recall, so there may be a tsort operator in there it added that is holding up the show.
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

You can use Performance Analysis to determine where the "hot spots" are in a job.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
sandeepgs
Participant
Posts: 87
Joined: Wed Jul 02, 2008 12:22 am

Post by sandeepgs »

Hi,

When a Remove Duplicates stage is used, the data must be sorted on the key column before it reaches the stage.

While sorting, the data is written to spool space, i.e. temporary space, whose location is defined in the configuration file.

Since you mention 2 million records, make sure there is enough space on the server at that location.

You can use a link sort instead of a separate Sort stage.

The data at the Funnel stage may be waiting to be sorted before the Remove Duplicates stage. It is confusing, but that is how DataStage works.
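
To illustrate why a sort in front of Remove Duplicates can make the Funnel look stalled, here is a rough Python sketch. It is not DataStage code: the function names, the "id" key column and the sample rows are all made up for the example, and chain() reads its inputs one after another, so it is only a stand-in for a Continuous Funnel. The point is that the sort has to read every row before it can release the first one.

```python
from itertools import chain, groupby
from operator import itemgetter


def funnel(*links):
    """Stand-in for the Funnel: passes rows through without reordering.
    (chain() drains the links one after another; a real Continuous
    Funnel interleaves them, which does not matter for this sketch.)"""
    return chain(*links)


def remove_duplicates(sorted_rows, key):
    """Keep the first row of each key group. Only correct on sorted input."""
    for _, group in groupby(sorted_rows, key=key):
        yield next(group)


def run(link_a, link_b, key=itemgetter("id")):
    rows = funnel(link_a, link_b)
    # sorted() must consume every incoming row before it can emit the
    # first one, so nothing reaches remove_duplicates until it finishes.
    ordered = sorted(rows, key=key)
    return list(remove_duplicates(ordered, key))


# Hypothetical sample data; "id" is an assumed key column.
a = [{"id": 3, "src": "A"}, {"id": 1, "src": "A"}]
b = [{"id": 1, "src": "B"}, {"id": 2, "src": "B"}]
result = run(a, b)
```

With 2 million rows the same thing happens at scale: the rows pile up in front of the sort, and the stages after it see nothing until the sort completes.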

As mentioned above, dump the score or run Performance Analysis to see this more clearly.

Hope this helps.