Page 1 of 1

join vs lookup

Posted: Tue Aug 19, 2014 12:02 am
by vamsi.4a6
When i had discussion with my team mate he told look up stage should be used when the reference data is small.Not sure what could be the reason and even he do not know.Any thoughts on this

Posted: Tue Aug 19, 2014 11:25 am
by chulett
There is a discussion of Lookup versus Join in the documentation. Suggest you start there. You can also find discussions here if you search, for example this one.

Posted: Tue Aug 19, 2014 2:20 pm
by ray.wurlod
Lookup loads the reference data set into memory, so that your lookups are performed in memory (at memory speed) and there is no need to sort the data (a hash table index is also created in memory). Join does require that both its inputs be sorted.

Posted: Tue Aug 19, 2014 2:22 pm
by qt_ky
Then there is also sparse lookup, also documented.