Query on SCD Stage

balu536 · Post by **balu536** » Fri Sep 13, 2013 7:10 am

Hi,
I've a query on Slow Changing Dimensions (SCD) Stage.

Do we need to sort the data (on the input links) on the Business Keys being used in the stage?
Is it mandatory for proper functioning of SCD stage, like we do for Join and Merge stages?

Thanks.

ray.wurlod · Post by **ray.wurlod** » Fri Sep 13, 2013 5:30 pm

Data do not need to be sorted on business key. When the dimension table is loaded into memory the business key column is identified. That is sufficient.

balu536 · Post by **balu536** » Thu Sep 19, 2013 3:10 pm

Thanks Ray.

Also is it fine if we do partitioning (Hash) on the source link alone or is it required on both source and reference link?

Please clarify.

IBM Analytics Champion 2009 - 2020 · Post by **asorrell** » Fri Sep 20, 2013 8:00 am

Yes, the partitioning should match on both links. However, if the other link is set to "auto" then DataStage will determine that it needs to be set to match the other input stream and set it to "Hash" on the right keys without tell you. I believe if you look at the osh you can verify that.

However, I always recommend setting stages explicitly so it is more evident where the partitioning is occurring.

IBM Analytics Champion 2009 - 2020 · Post by **asorrell** » Fri Sep 20, 2013 8:02 am

Yes and No. Yes, the partitioning should match on both links. However, if the other link is set to "auto" then DataStage will determine that it needs to be set to match the other input stream and set it to "Hash" on the right keys without tell you. I believe if you look at the osh you can verify that.

However, I always recommend setting stages explicitly so it is more evident where the partitioning is occurring.

balu536 · Post by **balu536** » Fri Sep 20, 2013 11:35 am

Thanks Andy.