use of hash files

shivan
Participant
Posts: 70
Joined: Mon Jul 25, 2005 9:29 am

Post by shivan »

Hi,
I'm new to DataStage. I was going through the documentation but couldn't find what the real use of hash files is. The only thing I understand is that it uses a hashing algorithm to feed the rows.

shivan
diamondabhi
Premium Member
Posts: 108
Joined: Sat Feb 05, 2005 6:52 pm
Location: US

Post by diamondabhi »

Hi Shivan,
Hash files can be used for many things. Search the forum for "hash files" and you will find a lot of information about them.

Thanks,
Abhi.
kduke
Charter Member
Posts: 5227
Joined: Thu May 29, 2003 9:47 am
Location: Dallas, TX

Post by kduke »

The main use of hash files is fast lookups.
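
To show why a keyed lookup can be fast, here is a rough Python sketch of the general idea, not of DataStage's actual implementation. The class name `ToyHashedFile`, the `modulus` parameter, and the use of `zlib.crc32` as the hashing function are all assumptions made for illustration. The key is hashed to pick one group, so a read only has to scan that group instead of the whole file.

```python
import zlib


class ToyHashedFile:
    """Toy model of a hashed file: records are spread across a fixed number of groups."""

    def __init__(self, modulus=7):
        # modulus = number of groups; 7 is an arbitrary value chosen for this sketch
        self.groups = [[] for _ in range(modulus)]

    def _group_for(self, key):
        # crc32 is a stand-in hashing function (an assumption, not the real algorithm)
        index = zlib.crc32(key.encode("utf-8")) % len(self.groups)
        return self.groups[index]

    def write(self, key, record):
        group = self._group_for(key)
        for i, (existing_key, _) in enumerate(group):
            if existing_key == key:
                # Same key already present: replace the stored record
                group[i] = (key, record)
                return
        group.append((key, record))

    def read(self, key):
        # Only the one group selected by the hash is scanned
        for existing_key, record in self._group_for(key):
            if existing_key == key:
                return record
        return None


reference = ToyHashedFile()
reference.write("C001", {"name": "Acme"})
reference.write("C002", {"name": "Globex"})
found = reference.read("C001")
missing = reference.read("C999")
```

Here `found` holds the record written for `C001`, and `missing` is `None` because no record with that key was written.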
Mamu Kim
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Very fast.

And it's "hashed file", not "hash file".
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
pnchowdary
Participant
Posts: 232
Joined: Sat May 07, 2005 2:49 pm
Location: USA

Post by pnchowdary »

They can also be used for eliminating duplicates. :)
Thanks,
Naveen
ppalka
Participant
Posts: 118
Joined: Thu Feb 10, 2005 7:25 am

Post by ppalka »

And to handle multivalue fields :)
kumar_s
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

pnchowdary wrote: They can also be used for eliminating duplicates. :)
May I know how to eliminate duplicates using this stage? Is it through a lookup against the same file, or through some other method?

regards
kumar
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany

Post by ArndW »

Kumar,

The key in a hash file is always unique, so a subsequent WRITE to the same key overwrites the previous value, thereby removing duplicates.
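
As a rough illustration of that overwrite behaviour, here is a Python sketch in which a plain `dict` stands in for the hash file. The function name `dedupe_by_key` and the sample columns `cust_id` and `city` are made up for the example. Note that the last row written for a key is the one that survives.

```python
def dedupe_by_key(rows, key_column):
    """Keep one row per key; a later row overwrites an earlier row with the same key."""
    hashed_file = {}  # the dict plays the role of the hash file (an assumption)
    for row in rows:
        # Writing to an existing key replaces the previous row
        hashed_file[row[key_column]] = row
    return list(hashed_file.values())


rows = [
    {"cust_id": "C001", "city": "Dallas"},
    {"cust_id": "C002", "city": "Sydney"},
    {"cust_id": "C001", "city": "Austin"},  # duplicate key, written last
]
unique_rows = dedupe_by_key(rows, "cust_id")
# unique_rows has two rows; the C001 row is the "Austin" one
```

If the first occurrence should win instead, the write would need to be skipped when the key is already present.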

-Arnd.
kumar_s
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

ArndW wrote: Kumar,

The key in a hash file is always unique, so a subsequent WRITE to the same key overwrites the previous value, thereby removing duplicates.

-Arnd.
Thanks, Arnd. I never thought about this.

regards
kumar
kollurianu
Premium Member
Posts: 614
Joined: Fri Feb 06, 2004 3:59 pm

Post by kollurianu »

ppalka wrote: And to handle multivalue fields

What does the above statement mean, ppalka? Can you explain it to me?


Thanks a bunch,
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

A multi-valued field (or column) may contain a list, rather than an atomic value. Some databases support the concept, though sometimes not using this terminology. This is particularly so with a database that promises OLAP capability (such as Oracle 9i/10g and MS SQL Server Plato) and databases with a Pick heritage (such as UniVerse, UniData, D3 and so on).
The main "support" for multi-valued data in DataStage is the ability automatically to "explode", or "un-nest", the multiple values to expose the "nested table" in at least first normal form.
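
A small Python sketch of what "un-nesting" means, assuming the multiple values are stored in one string separated by a delimiter. The `VALUE_MARK` constant, the function name `unnest`, and the sample order record are illustrative assumptions, not DataStage's own API. Each value position becomes its own flat row, with the single-valued columns repeated.

```python
VALUE_MARK = "\xfd"  # delimiter between values; chosen here for illustration


def unnest(record, multivalued_columns):
    """Yield one flat row per value position in the multi-valued columns."""
    split = {col: record[col].split(VALUE_MARK) for col in multivalued_columns}
    # Number of output rows = longest list of values (1 if nothing is multi-valued)
    depth = max((len(values) for values in split.values()), default=1)
    for i in range(depth):
        row = dict(record)  # single-valued columns are repeated on every row
        for col, values in split.items():
            # Shorter lists are padded with None
            row[col] = values[i] if i < len(values) else None
        yield row


order = {
    "order_id": "1001",
    "item": VALUE_MARK.join(["pen", "ink"]),
    "qty": VALUE_MARK.join(["2", "1"]),
}
flat_rows = list(unnest(order, ["item", "qty"]))
# flat_rows:
#   {"order_id": "1001", "item": "pen", "qty": "2"}
#   {"order_id": "1001", "item": "ink", "qty": "1"}
```

The result is one row per order line, which is the "nested table" exposed in first normal form.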