I have been using datasets with the lookup stage. But then I read about the fileset lookup stage.
What is the difference between using
one master dataset, and one reference dataset and a lookup stage to join on a key
versus
one master dataset, one fileset lookup stage, and one lookup stage to join on a key.
Is it mainly the size of the data, over 2GB or not?
thanks again.
using fileset lookup stage or not?
Moderators: chulett, rschirm, roy
According to the documentation the fileset will have better performance, since the data is stored in correct form. I've not noticed significant performance differences, but lookup filesets cannot be 'viewed', which is a distinct disadvantage.
<a href=http://www.worldcommunitygrid.org/team/ ... TZ9H4CGVP1 target="WCGWin">
</a>
</a>