Dirty Data

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.


ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Planning, planning and more planning!
Use Quality Manager to audit your source data, to get a feel for exactly how dirty they are, and what steps might be needed to cleanse them.
Use INTEGRITY, perhaps, to do the same kind of thing, and possibly even to do the cleansing as well.
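To give a feel for what an audit of source data looks for, here is a minimal sketch in Python. It is not how Quality Manager or INTEGRITY work internally; the function name `profile_column`, the column names and the sample rows are all invented for illustration.

```python
from collections import Counter


def is_missing(value):
    """Treat None and blank/whitespace-only strings as missing."""
    return value is None or str(value).strip() == ""


def profile_column(rows, column):
    """Summarise one column: row count, missing count, distinct count, top values."""
    values = [row.get(column) for row in rows]
    present = [v for v in values if not is_missing(v)]
    counts = Counter(present)
    return {
        "rows": len(values),
        "missing": len(values) - len(present),
        "distinct": len(counts),
        "top": counts.most_common(3),
    }


# Hypothetical sample rows (assumption, not real data)
sample = [
    {"cust_id": "1001", "state": "NSW"},
    {"cust_id": "1002", "state": "nsw"},
    {"cust_id": "", "state": "NSW"},
    {"cust_id": "1004", "state": None},
]

report = {col: profile_column(sample, col) for col in ("cust_id", "state")}
for col, stats in report.items():
    print(col, stats)
```

Even on four rows this kind of summary shows a blank key, a missing state and inconsistent case ("NSW" versus "nsw"), which is the sort of thing you want to know before designing the cleansing steps.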
Cleansing in DataStage is largely performed via output link constraint expressions.
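The idea behind constraint-based cleansing can be sketched as follows. This is Python, not DataStage BASIC, and it only mimics the pattern: a boolean expression decides whether a row goes down the "clean" link, and anything that fails is sent to a reject list for later inspection. The function names, column names and validation rules are assumptions made up for the example.

```python
def is_clean(row):
    """Stand-in for a constraint expression: True only if every check passes."""
    cust_id = (row.get("cust_id") or "").strip()
    amount = (row.get("amount") or "").strip()
    # Assumed rule 1: customer id must be all digits
    if not cust_id.isdigit():
        return False
    # Assumed rule 2: amount must be a non-negative number
    try:
        return float(amount) >= 0
    except ValueError:
        return False


def route(rows):
    """Send each row to the clean list or the reject list."""
    clean, rejects = [], []
    for row in rows:
        (clean if is_clean(row) else rejects).append(row)
    return clean, rejects


# Hypothetical input rows (assumption, not real data)
rows = [
    {"cust_id": "1001", "amount": "25.50"},
    {"cust_id": "10A2", "amount": "10.00"},
    {"cust_id": "1003", "amount": "-5"},
    {"cust_id": "1004", "amount": ""},
]

clean, rejects = route(rows)
print(len(clean), "clean,", len(rejects), "rejected")
```

Keeping the rejected rows, rather than silently dropping them, is what lets you answer the question of how dirty rows got through.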
Question: how do you know you have dirty data in your DW? From the answer you should be able to track backwards to determine how they sneaked through your processing, and then make that processing more robust.
There are consultants around who can help you with these processes.


Ray Wurlod
Education and Consulting Services
ABN 57 092 448 518