Append into the LookUp Fileset?

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
Thanh Dalton
Premium Member
Premium Member
Posts: 57
Joined: Wed Nov 15, 2006 10:25 am
Location: Ottawa ON

Append into the LookUp Fileset?

Post by Thanh Dalton »

Is it possible to append into the LookUp Fileset in a later job after it was created and loaded by a job?
Anyone knows?
Thanks in advance!
TPQ
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

I don't think it's possible. There is no overwrite/append property in the Lookup File Set stage. But you can construct something like the following.

Code: Select all

LookupFileSetprevioussource
     |
     V
  Funnel  ---->  LookupFileSet
     ^
     |
(new data)
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Thanh Dalton
Premium Member
Premium Member
Posts: 57
Joined: Wed Nov 15, 2006 10:25 am
Location: Ottawa ON

Post by Thanh Dalton »

Thanks for your response, Ray.

I tried but the LookupFileSetprevioussource doesn't support an output link into Funnel or anything. I did the test to put the data into a dataset at the sametime as into the LookupFileSetprevioussource. Later, I could append into the dataset in a different job and reload the LookupFileSet.

I'm just wondering about this kind of performance. Is there any better way?

Thanks a bunch!
TPQ
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

What sort of a problem are you having going from a Lookup Fileset to a join or funnel stage?
Thanh Dalton
Premium Member
Premium Member
Posts: 57
Joined: Wed Nov 15, 2006 10:25 am
Location: Ottawa ON

Post by Thanh Dalton »

When I use LookUpFileset as a source and Funnel stage as a destination and I got a message saying :The source stage does not support stream output links!
TPQ
PhilHibbs
Premium Member
Premium Member
Posts: 1044
Joined: Wed Sep 29, 2004 3:30 am
Location: Nottingham, UK
Contact:

Post by PhilHibbs »

Thanh Dalton wrote:When I use LookUpFileset as a source and Funnel stage as a destination and I got a message saying :The source stage does not support stream output links!
That's right, it doesn't. Once your data goes into a LookUpFileset, the only thing you can do is to look up against it. You have to jump through hoops in order to accumulate data from several sources over time, re-creating the file set from another copy of the source data that can be streamed.

I don't think that Lookup File Sets are a good idea anyway. Your stream data has to be partitioned the same way that the Lookup File Set was partitioned when it was created, which is not efficient (you might have a small set of look up data, but have to re-partition millions of rows of stream data just to do the look up on that key). I put all my look up data in regular File Sets.
Phil Hibbs | Capgemini
Technical Consultant
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Phil has it - I wasn't thinking before I responded! I never use lookup filesets.
Thanh Dalton
Premium Member
Premium Member
Posts: 57
Joined: Wed Nov 15, 2006 10:25 am
Location: Ottawa ON

Post by Thanh Dalton »

Thanks to Phil and ArndW!

My experience with the table to do the LookUp for 300M rows take so long. That's why I prefer to use LookUpFileset instead. It has proved to perform very well. In this case, I can append into a dataset and reload to the LookUp Fileset. I tested for 100M rows to load the LookUp Fileset from a dataset and it took me only 9 minutes. So I guess, I go for the dataset then!

Thanks to all for the help!
TPQ
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

By LookupFileSetprevioussource I meant the job that populated the Lookup File Set. The Lookup File Set stage itself does not support stream output.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply