Delete data from hashed files

chulett
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Never tried that way. :wink:

We've taken two different approaches: One is to rebuild the hashed file such that the deleted rows are automagically removed. Another approach is to add a 'deleted' column and mark them as logically deleted by updating that indicator during the delete process. Your jobs then need to know that a hashed file hit is only a hit if that column isn't set.
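
A minimal sketch of that logical-delete check, assuming a reference link named lnk_Hash and an indicator column DELETED_FLAG (both names made up for the example): the constraint only treats a lookup as a hit when the row was found and the flag isn't set.

Code: Select all

* Transformer constraint: a hit only counts when the row exists
* and has not been logically deleted.
NOT(lnk_Hash.NOTFOUND) AND lnk_Hash.DELETED_FLAG <> 'Y'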

Your mileage may vary.
-craig

"You can never have too many knives" -- Logan Nine Fingers
jozef
Participant
Posts: 7
Joined: Thu Jun 10, 2004 5:46 am
Contact:

Post by jozef »

chulett wrote: Never tried that way. :wink: [...] Your mileage may vary.
Hi Craig,

Thanks. I'm currently doing the rebuild, but the volume of data has grown from 3 million records to more than 8 million, causing my job to run an extra two hours.

The hashed file normally stays much the same, with a few updates, inserts and some records that need to be deleted. Rebuilding the whole hashed file for the sake of 20,000 records seems a bit over the top, hence the search for a better solution.

Regards
kcbland
Participant
Posts: 5208
Joined: Wed Jan 15, 2003 8:56 am
Location: Lutz, FL
Contact:

Post by kcbland »

You could always stream the data to an output link using a function in the constraint that isn't really a constraint but does a delete on the hashed file from within the function.

It's quite easy, actually: open the hashed file on the first row passed and put the handle into a COMMON file-handle variable, then use the passed key to issue a DELETE yourfilehandle, key ... ELSE ... statement. It's not pretty, but it saves having to use the UV/ODBC stage and it's a whole lot faster.
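
As a rough sketch of what that constraint could look like (the routine name and column are illustrative; ArndW posts a concrete routine below), the function is called for its side effect and OR @TRUE keeps the expression true so every row still streams down the link:

Code: Select all

* Output link constraint: the delete happens as a side effect
* of the function call; OR @TRUE lets every row through.
DeleteHashedRecord('MyHashedFile', InLink.THE_KEY) OR @TRUE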
Kenneth Bland

Rank: Sempai
Belt: First degree black
Fight name: Captain Hook
Signature knockout: right upper cut followed by left hook
Signature submission: Crucifix combined with leg triangle
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Jozef,

Using SQL in your case to delete a relatively small number of records is going to be very inefficient. You could instead write a routine, used in a derivation, to delete the records. Here is an example (most of the error handling skipped) that is tuned for efficiency but only handles one hashed file.

Code: Select all

FUNCTION DeleteHashedRecord(Filename, Key)
EQUATE ProgramName TO 'DeleteHashedRecord'
COMMON /DeleteCommon/ Initialized, FilePtr

   IF UNASSIGNED(Initialized) THEN Initialized = @FALSE

   IF NOT(Initialized) THEN
      * Open the hashed file once and keep the handle in COMMON
      * so subsequent rows skip the OPEN.
      OPEN Filename TO FilePtr ELSE CALL DSLogFatal('Error, Unable to open "':Filename:'" hashed file with error ':STATUS(), ProgramName)
      Initialized = @TRUE
   END ;** of if this is the first call to the routine in this session

   Ans = 0
   DELETE FilePtr, Key ELSE Ans = 1

RETURN(Ans)
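
A hedged example of calling it from a transformer derivation (the file and column names are hypothetical); the function returns 0 when the key was deleted and 1 when the key was not found:

Code: Select all

* Derivation: delete this row's key from the hashed file.
DeleteHashedRecord('HF_CustomerKeys', InLink.CUST_ID)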