Cleaning up datastage internal files

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
mouthou
Participant
Posts: 208
Joined: Sun Jul 04, 2004 11:57 pm

Cleaning up datastage internal files

Post by mouthou »

hello everyone,

there are some files created by Datastage at some specific locations. is there a way to automatically clean up the files or should have been there any option during server installation?

the location /opt/IBM/InformationServer/Server/Scratch has some files like
c51node1.000000000000001e.00000000
c51node1.000000000000001d.00000000
c51node1.000000000000001c.00000000
c51node1.000000000000001b.00000000

and under project directory, there are so many files (even older than few months)

DS_TEMP1707
DS_TEMP1708
DS_TEMP1709
DS_TEMP1710

RT_BP80.O
RT_BP800
RT_BP800.O
RT_BP801
RT_BP801.O
RT_BP802

RT_LOG1666
RT_LOG1667
RT_LOG1668
RT_LOG1669

RT_STATUS990
RT_STATUS991
RT_STATUS993

Thought of getting experts help about those files and as to know why they are not getting cleaned and how to clean them.

thanks in advance,
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Never, ever delete files and directories inside a project directory from outside of DataStage. The only acceptable exception is the contents of the &PH& directory. Doing such a deletion will cause problems in the project, ranging from mildly annoying pop-boxes to complete project corruption.

The Scratch directory is a different story, if those files are old then they can be deleted.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

All files/directories in the project that have names of the form DS_namennn or RT_namennn are the Repository tables in which information about job number nnn are stored. The only safe way to delete these is to delete the job (using one of the clients). If you need the job, then you need these files/directories.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
smishra.ds
Premium Member
Premium Member
Posts: 9
Joined: Wed Apr 23, 2008 12:11 pm
Location: Global

Post by smishra.ds »

All files/directories in the project that have names of the form DS_namennn or RT_namennn are the Repository tables in which information about job number nnn are stored.


Hi Ray,

Thanks for the info.
From where i can get more details about these?

How we can view the repository contents in V 7.5, as it is still in Universe?

How we will know the Job number in the repository and the log files name & place associated with that job?

Is there any specific method of viewing the repository?
Can you please suggest/provide any documentation for this?

Thanks in advance, as i was searching for this kind of info.
Thanks again.
infranik
Participant
Posts: 20
Joined: Tue Oct 11, 2005 8:11 am

Post by infranik »

Try the following command in the DS Administrator:

SELECT * FROM DS_JOBS WHERE NAME='jobname'
Or
SELECT JOBNO FROM DS_JOBS WHERE NAME='jobname'

you should get the job number from the above query ex: 123.
The RT_LOG123 will contain log information for the job. similarly rest of the files with 123 number are associated with the same job.
Never delete/alter the files manually as it might result in corrupting your job.

alternatively if you use the number from the RT_** files that are very old, and put them in below query:
SELECT NAME FROM DS_JOBS WHERE JOBNO=<RT_number>
you would get the job name.
go to the DS director and clear the status file/log for the job if required.
If you delete the job from the designer, the files automatically get deleted.

rgds,
Nik
Post Reply