Page 1 of 1

Check sum stage

Posted: Thu Jun 16, 2011 1:39 pm
by deva
Hi I need one quick information regarding check sum stage.

I want encript one of the key field. To make distinct and uniq out of given values, I am using two columns and passing those two columns through check sum stage. The result I am getting 32 numbers.

But I need 15 to 16 numbers only. I dont want take substring. the output of check sum need 16 digits.

Currently I am using 7.5 version, we are going to upgrage into 8.5, So I want to know how this will works.

Please help me with right information.

Posted: Thu Jun 16, 2011 1:50 pm
by chulett
A "checksum" is not an encyption mechanism since there is no decryption side to it and it will not generate anything "distinct and uniq"... and you don't have any choice on the length of the output. Sounds like you need to look into other methodologies like MD5 and others.

Posted: Thu Jun 16, 2011 2:20 pm
by deva
Thanks for your reply. I don't want to decript again, my requirement is I have 5 systems, out of wihich , the key column should encript, (meaning end user should not find out this id is from particular system. ) We are loading 5 systems information into one datawarehouse.

to deidentify the key column I am using this process.

Thats reason to encript I want to use system name + key column and pass through checksum stage that will give a uniqe number.

In this case I am geting 32 bit number, If I want only 16 bin , how can I do that.

Posted: Thu Jun 16, 2011 2:41 pm
by ray.wurlod
Checksum will not necessarily give you a unique number. There is a small possibility that two values will generate the same checksum value - small enough that most people accept the risk when using the checksum as a comparison technique, its intended purpose.

Posted: Thu Jun 16, 2011 10:15 pm
by chulett
Perhaps you need to think about leveraging a surrogate key for this? :?