DataSets Vs Hash Files

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
srekant
Premium Member
Premium Member
Posts: 85
Joined: Wed Jan 19, 2005 6:52 am
Location: Detroit

DataSets Vs Hash Files

Post by srekant »

Hi,

Is DataSets in DS7.5 EE equivalent to server HashFiles .If not any other stage in DS 7.5 EE that have the functionality of Hashfiles.
Sree
kcbland
Participant
Posts: 5208
Joined: Wed Jan 15, 2003 8:56 am
Location: Lutz, FL
Contact:

Post by kcbland »

No. They are like apples and oranges. They're in the fruit family (both hold data) but are not even close in usage capabilities.
Kenneth Bland

Rank: Sempai
Belt: First degree black
Fight name: Captain Hook
Signature knockout: right upper cut followed by left hook
Signature submission: Crucifix combined with leg triangle
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

You are probably noting a similarity between the fact that a hashed file and a data set can be in memory and can service lookups.

While that is true, the architecture of each is radically different. For example there is no mechanism for automatically partitioning a hashed file over the processing nodes defined in the configuration file. There is no published information as to whether data sets use hashing or some other mechanism for finding key values, and apparently this may be different depending on the source from which the virtual data set was loaded.

Conceptually, then, there are some similarities, but they are definitely not the same animal.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply