remove duplicate stage compared to sort stage
Posted: Wed Nov 02, 2005 3:23 pm
I want to remove duplicate records from a data set. I was going to use the remove duplicates stage. The documentation says the data must be sorted before removing duplicates. However the sorter stage also has the capability of removing duplicates.
Why would I want to use the removed duplicate records stage when the sort stage can do it?
What does the remove duplicate records stage do that the sort doesn't?
Why would I want to use the removed duplicate records stage when the sort stage can do it?
What does the remove duplicate records stage do that the sort doesn't?