Query about Remove duplicates, join stage

zulfi123786 · Post by **zulfi123786** » Wed Mar 24, 2010 7:38 am

Hi

Is is mandatory that the remove duplicates stage should be provided with sorted data? What if the data is not sorted explicity and forcing DataStage not to insert any sorts.... would it cause any data issues ?

Same question goes for Join stage and Change Data Capture stage

Consider that we are hashing the data on the keys.

Please advice

ArndW · Post by **ArndW** » Wed Mar 24, 2010 8:59 am

This really is a case where a 1-minute job (2 x row generator, 1 join, 1 peek) will answer your question for you. If you disable sort generation and feed the stages unsorted data they will fail.