Hash File: Is there any way to insert duplicates?

suma · Post by **suma** » Tue Jun 22, 2004 1:12 am

I am using a hashed file to load data into the target. Is there any way to load duplicate records into the target? For example, I have EmpName and DOJ as my source columns and I am making EmpName the key. In the Hashed File stage, all the duplicate records are automatically removed!

One option is to generate keys and use the generated value as the key.

Is there any other way to add the duplicate records without generating a key column?

What I mean is: is there anything like a multi-valued hashed file that supports this concept?

ray.wurlod · Post by **ray.wurlod** » Tue Jun 22, 2004 2:25 am

No.

Only non-key columns can be multi-valued in hashed files.

You need a generated key. Make it an Integer, and generate it as @OUTROWNUM in the Transformer stage that loads the hashed file. Specify, in the hashed file creation options, that the hashing algorithm is SEQ.NUM.
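To see why a generated key keeps the duplicates, here is a minimal Python sketch that uses a dictionary as a stand-in for a hashed file. It is only an analogy, not DataStage code: the sample rows are invented, and `enumerate` plays the role of @OUTROWNUM under the assumption that numbering starts at 1.

```python
# Hypothetical source rows: (EmpName, DOJ). Names and dates are invented.
source_rows = [
    ("Smith", "2001-03-15"),
    ("Jones", "2002-07-01"),
    ("Smith", "2003-11-20"),  # same EmpName as the first row
]

# Keyed on EmpName: a later write with the same key replaces the earlier record.
by_name = {}
for emp_name, doj in source_rows:
    by_name[emp_name] = (emp_name, doj)

# Keyed on a generated row number (standing in for @OUTROWNUM, assumed to
# start at 1): every row gets its own key, so nothing is overwritten.
by_rownum = {}
for row_num, (emp_name, doj) in enumerate(source_rows, start=1):
    by_rownum[row_num] = (emp_name, doj)

print(len(by_name))    # 2
print(len(by_rownum))  # 3
```

With EmpName as the key, the second "Smith" row silently replaces the first. With the row number as the key, all three rows survive.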

If you're using a UV stage to load the hashed file, you can create a hashed file with auto-incrementing key by customising the CREATE TABLE statement to include DEFAULT NEXT AVAILABLE in the primary key column.

suma · Post by **suma** » Tue Jun 22, 2004 5:51 am

I am not very clear on system variables. What does @OUTROWNUM do?
And why should I set the hashing algorithm to SEQ.NUM?

"If you're using a UV stage to load the hashed file, you can create a hashed file with auto-incrementing key by customising the CREATE TABLE statement to include DEFAULT NEXT AVAILABLE in the primary key column"

And why should we go for the UniVerse stage? I have imported a hashed file using UniVerse file definitions from the Manager. Can you tell me how to customize the CREATE TABLE statement?

ray.wurlod · Post by **ray.wurlod** » Tue Jun 22, 2004 6:13 am

@OUTROWNUM numbers the rows leaving a Transformer on an output link.

When the keys form an unbroken integer sequence, the dynamic hashed file SEQ.NUM hashing algorithm is far more efficient than the default, GENERAL, hashing algorithm.
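As a rough intuition only, the Python sketch below shows why an unbroken integer sequence is easy to spread evenly across groups. The modulo rule and the group count of 7 are assumptions made for illustration; this is not UniVerse's actual SEQ.NUM implementation.

```python
from collections import Counter

GROUPS = 7  # hypothetical number of groups (buckets) in the file


def toy_group(key: int, groups: int = GROUPS) -> int:
    """Toy placement rule: key modulo group count. Not UniVerse's algorithm."""
    return key % groups


keys = range(1, 701)  # an unbroken integer sequence 1..700
load = Counter(toy_group(k) for k in keys)

# Each of the 7 groups receives the same number of records.
print(sorted(load.values()))
```

Sequential keys fill each group in turn under a rule like this, so no group overflows ahead of the others and no work is spent scrambling the key.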

Every UniVerse table is physically a hashed file. So, when you create a table with the UV stage, you're creating a hashed file. You can access this through a UV stage or through a Hashed File stage once it has been created.

Of course, if the hashed file exists already, then you can't use CREATE TABLE.

The UV stage has a "create table" check box which, when checked, enables an "Edit DDL" tab. It is on this tab that you can modify the CREATE TABLE statement.

Similarly, the Hashed File stage has the ability to create the hashed file. You edit the creation properties by clicking the Options button that is enabled when the "create file" check box is checked.