What is the Difference between Hashfile and Sequential File

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
Sourav
Participant
Posts: 17
Joined: Tue Sep 04, 2007 5:34 pm
Location: Delhi

What is the Difference between Hashfile and Sequential File

Post by Sourav »

Difference between Hashfile and Sequential File
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Parallel jobs do not have hashed files, only server jobs do. Hashed files are similar to database tables and one can do indexed or keyed reads on them, as well as update the contents; this cannot be done with sequential files.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

When's the interview?

There is no such thing as a hash file, other than a tool for shaping a block of hash.

The correct terminology in DataStage is "hashed" file. It is so called because it uses a hashing algorithm, rather than a primary key index or other mechanism, to determine the unique location of each record within its structure.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply