Ideal conditions for using a sparse lookup

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.


opdas
Participant
Posts: 115
Joined: Wed Feb 01, 2006 7:25 am


Post by opdas »

Hi All,

What would be an ideal condition for using a sparse lookup, and when should we avoid the temptation of using it?
Om Prakash


"There are things that are known, and there are things that are unknown, and in between there are doors"
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany

Post by ArndW »

A pretty good time to think about using a sparse lookup is when your reference table contains 1 million rows and your incoming data stream contains only a few hundred.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

More precisely, your main input stream can contain very many rows too, but only a small number of distinct lookup key values.

The idea of a sparse lookup is to avoid loading a huge reference table into either a virtual Data Set or a Lookup File Set.
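
To make the trade-off concrete, here is a rough sketch in Python. It is only an analogy and not DataStage code: the table name `ref`, the helper functions, and the use of SQLite are all assumptions made for illustration. The "normal" version loads the whole reference table into memory before processing any rows, while the "sparse" version sends one keyed query to the database per incoming row.

```python
import sqlite3


def build_reference(conn, n_rows):
    """Create a hypothetical reference table 'ref' with n_rows rows."""
    conn.execute("CREATE TABLE ref (key INTEGER PRIMARY KEY, descr TEXT)")
    conn.executemany(
        "INSERT INTO ref VALUES (?, ?)",
        ((k, f"desc-{k}") for k in range(n_rows)),
    )


def normal_lookup(conn, stream):
    """Load the entire reference table into memory, then probe it per row."""
    table = dict(conn.execute("SELECT key, descr FROM ref"))
    return [(k, table.get(k)) for k in stream], len(table)


def sparse_lookup(conn, stream):
    """Issue one keyed query per incoming row; nothing is preloaded."""
    out = []
    queries = 0
    for k in stream:
        row = conn.execute(
            "SELECT descr FROM ref WHERE key = ?", (k,)
        ).fetchone()
        queries += 1
        out.append((k, row[0] if row else None))
    return out, queries


conn = sqlite3.connect(":memory:")
build_reference(conn, 100_000)

# A small input stream against a large reference table.
stream = [5, 99_999, 123_456, 42]

normal_rows, rows_loaded = normal_lookup(conn, stream)
sparse_rows, queries_sent = sparse_lookup(conn, stream)
```

Both approaches return the same result. In this sketch the normal version reads all 100,000 reference rows to serve four input rows, while the sparse version sends four queries. With a large input stream the balance tips the other way, because the per-row round trips add up.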
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.