I have mistakenly deleted some datasets using the rm command.
In order to clean up the orphan data files should I just go onto the nodes where the orphan files are and delete those too or is there a cleaner way ?
Thanks
Deletion of datasets using rm
Moderators: chulett, rschirm, roy
Deletion of datasets using rm
Colin Larcombe
-------------------
Certified IBM Infosphere Datastage Developer
-------------------
Certified IBM Infosphere Datastage Developer
-
- Participant
- Posts: 222
- Joined: Tue Aug 30, 2005 2:07 am
- Location: pune
- Contact:
Hi,
You cannot use the UNIX rm command to delete a data set because DataStage represents a single data set with multiple files. Using rm simply removes the descriptor file, leaving the much larger data files behind.
So, you can use Data set management in Designer or Manager to delete the entire data or cleanup the data.
You cannot use the UNIX rm command to delete a data set because DataStage represents a single data set with multiple files. Using rm simply removes the descriptor file, leaving the much larger data files behind.
So, you can use Data set management in Designer or Manager to delete the entire data or cleanup the data.
NageshSunkoji
If you know anything SHARE it.............
If you Don't know anything LEARN it...............
If you know anything SHARE it.............
If you Don't know anything LEARN it...............
The orphaned dataset data files have no link back to their "descriptor" so you will need to manually delete them.
I wrote a program to search the machine for all descriptor files and link them to the corresponding data files, then any files that aren't linked in the data directory are junk files and can be deleted. It wasn't too difficult to do (I'd post it but the code is proprietary to the customer I was at).
Basically from UNIX create your own magic file and do a find to get the descriptors. You can then use orchadmin ll {dataset} to get information on the part files and parse that out. Then compare your total list with the contents of any data directories listed in your system's configuration directory for APT files.
I wrote a program to search the machine for all descriptor files and link them to the corresponding data files, then any files that aren't linked in the data directory are junk files and can be deleted. It wasn't too difficult to do (I'd post it but the code is proprietary to the customer I was at).
Basically from UNIX create your own magic file and do a find to get the descriptors. You can then use orchadmin ll {dataset} to get information on the part files and parse that out. Then compare your total list with the contents of any data directories listed in your system's configuration directory for APT files.
<a href=http://www.worldcommunitygrid.org/team/ ... TZ9H4CGVP1 target="WCGWin">
</a>
</a>