Hi,
I am new to DataStage Parallel Extender. I would like to know the
difference between the Data Set stage, the Sequential File stage and the file stage, and how each is used in various scenarios.
Can anyone help me?
Thanks,
sundar
diff between Data set, sequential file stage & file stag
Welcome aboard! :D
Reading the appropriate chapters describing each stage type in the Parallel Job Developer's Guide (parjdev.pdf) will aid your understanding.
A quick, and necessarily incomplete, summary is:
- A Sequential File stage accesses regular operating system files, such as CSV files, text files, and so on. In general that access must be sequential rather than parallel.
- A Data Set stage accesses a persistent Data Set, which is an on-disk copy of a virtual Data Set, that is, the set of partitioned data with which all operators in parallel jobs deal. Data in Data Sets are held in internal format; in particular, numeric data are stored in binary form.
- A File Set stage accesses a File Set, which is partitioned across all nodes specified in the configuration file (as is a Data Set) but which contains human-readable data in each of its files. A Lookup File Set includes a key definition in its schema.
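To make the three storage shapes above a little more concrete, here is a rough Python analogy. It is not DataStage code and does not reproduce the real on-disk formats: the record layout, the round-robin partitioning, the two-node count and all names (partition, to_text, to_binary, NODES) are assumptions made up for illustration only.

```python
import csv
import io
import struct

# Hypothetical sample rows: (id, name).
records = [(1, "alpha"), (2, "beta"), (3, "gamma"), (4, "delta")]
NODES = 2  # assumed number of processing nodes


def partition(rows, nodes):
    """Round-robin the rows into one list per node."""
    parts = [[] for _ in range(nodes)]
    for i, row in enumerate(rows):
        parts[i % nodes].append(row)
    return parts


def to_text(rows):
    """Serialize rows as human-readable CSV text."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(rows)
    return buf.getvalue()


def to_binary(rows):
    """Serialize rows in a made-up binary layout:
    little-endian int32 id, uint16 name length, UTF-8 name bytes."""
    out = bytearray()
    for rec_id, name in rows:
        data = name.encode("utf-8")
        out += struct.pack("<iH", rec_id, len(data)) + data
    return bytes(out)


# Sequential File analogue: a single text file, read start to end.
sequential_file = to_text(records)

# File Set analogue: one readable text file per node.
file_set = [to_text(part) for part in partition(records, NODES)]

# Data Set analogue: one binary (non-readable) file per node.
data_set = [to_binary(part) for part in partition(records, NODES)]
```

In this sketch the File Set and Data Set analogues share the same partitioning and differ only in representation: text that can be opened in an editor versus packed binary that can only be read by code that knows the layout.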
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.