Auto Purging of Dataset

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
ds_avatar
Premium Member
Premium Member
Posts: 6
Joined: Sun Dec 11, 2011 8:51 pm

Auto Purging of Dataset

Post by ds_avatar »

Hi,

Our DS server is getting crashed very frequently, after analysis we found that disk space is eaten by datasets (huge size), since we have used them heavily in our parallel jobs.

My Question: Is there any way to delete datasets once the correp. job is processed successfully? or what is the ideal way to deal with this situation?

As work around we have written a shell script and scheduled it in corn, which delete all dataset files older than 2 days.
Thanks,
ds_avatar
pandeesh
Premium Member
Premium Member
Posts: 1399
Joined: Sun Oct 24, 2010 5:15 am
Location: CHENNAI, TAMIL NADU

Post by pandeesh »

Once the job is finished , delete them using orchadmin in a after job subroutine or using execute command activity in a sequence.
pandeeswaran
roy
Participant
Posts: 2598
Joined: Wed Jul 30, 2003 2:05 am
Location: Israel

Post by roy »

I think that a periodical cleansing or a preload cleansing will also leave your data sets available for some debuging in case you need them.
Roy R.
Time is money but when you don't have money time is all you can afford.

Search before posting:)

Join the DataStagers team effort at:
http://www.worldcommunitygrid.org
Image
ds_avatar
Premium Member
Premium Member
Posts: 6
Joined: Sun Dec 11, 2011 8:51 pm

Post by ds_avatar »

pandeesh wrote:Once the job is finished , delete them using orchadmin in a after job subroutine or using execute command activity in a sequence.
I also thought to use execute command in job sequence, but what will be the selection criteria to pick-up the datasets belong to corresponding job.
roy wrote:I think that a periodical cleansing or a preload cleansing will also leave your data sets available for some debuging in case you need them.
How come dataset can be used for debug when it is available in non-readable format. :(
Thanks,
ds_avatar
pandeesh
Premium Member
Premium Member
Posts: 1399
Joined: Sun Oct 24, 2010 5:15 am
Location: CHENNAI, TAMIL NADU

Post by pandeesh »

ds_avatar wrote:I also thought to use execute command in job sequence, but what will be the selection criteria to pick-up the datasets belong to corresponding job.
Just pass the parameter for dataset name as what you do in job level.(else hard code if you are hard coding in job level)
ds_avatar wrote:How come dataset can be used for debug when it is available in non-readable format. :(
It can be as Data set management option is there in the client tool.
IHTH(I Hope This Helps) :)
pandeeswaran
Post Reply