Anonymise data through DataStage

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
laxman.ds
Premium Member
Premium Member
Posts: 66
Joined: Thu Mar 02, 2006 9:00 am

Anonymise data through DataStage

Post by laxman.ds »

Hello

We have a requirment where we need to anonymise data coming from source files.
can anyone please let me know the strategy to achive this through datastage

Thanks & Regards
2 B 1 4 ALL
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Does it need to be reliably repeatable? Does it have to preserve the relationships between and the statistics of the data?

If any of these is true investigate the Optim product from IBM (or another data masking product). Otherwise you're going to be re-inventing a whole lot of wheels.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

We've had various conversations here on the topic and it really depends on what you mean by 'anonymise', probably 'masking' as Ray notes. For example, one such conversation is here.
-craig

"You can never have too many knives" -- Logan Nine Fingers
Post Reply