Page 1 of 1

Hash files cleanup

Posted: Mon Feb 04, 2013 10:04 am
by mdbatra
Hi All

Is there any way we can retrieve the information that what are all the hash files still being used by a Data Stage project?

Unix production box contains an external hash directory where huge no. of hash files are being accumulated since last 5 yrs. Even for the decommissioned applications, files, which are big in sizes, are still occupying the space . In order to free up some space, we first need to know what all is still being used in the project.

Any alternative approach to reach the desired goal is also welcomed.
Thanks.

Posted: Mon Feb 04, 2013 10:12 am
by chulett
I've posted about this before and still feel like it's worth posting. One approach would be to use a 3rd party tool to farm all of the current hashed files being used (as in read from or written to) from a dsx export of the project. Chuck Smith has such a beastie here and it is easily modifiable to (as I did many moons ago) store the results in a database table. That way the results are queriable for whatever combination of things you are looking for.

Posted: Mon Feb 04, 2013 10:44 am
by mdbatra
Looks like very doable one( just had the overview, though, for now-will check in detail tonight )

Many thanks for sharing this, Craig. Cheers :D