Dataset Limitations

coehcl
Charter Member
Posts: 16
Joined: Tue Oct 10, 2006 8:42 am

Post by coehcl »

Hi,

In our process, the first job will load approximately 4-5 GB of data into a dataset, and the second job has to use that dataset as its source.
Will a dataset handle that much data, or are there any limitations?

Thanks in Advance,
Kishore
Nageshsunkoji
Participant
Posts: 222
Joined: Tue Aug 30, 2005 2:07 am
Location: pune

Post by Nageshsunkoji »

Hi,

The amount of data that can be stored in each destination data file is limited by the characteristics of the file system and the amount of free disk space available.

In your case there is no problem unless you run short of disk space. You can also use that dataset as a source in another job.
NageshSunkoji

If you know anything SHARE it.............
If you Don't know anything LEARN it...............
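
Since free disk space is the practical limit described above, a quick check before the first job runs can help. The following is only a minimal sketch in Python, not anything DataStage provides: the function name `has_room_for`, the 20% headroom factor, and the use of "." as the path are all assumptions for illustration. In practice you would point it at the resource disk directories named in your configuration file.

```python
import shutil

GIB = 1024 ** 3  # bytes in one gibibyte


def has_room_for(path, required_bytes, headroom=1.2):
    """Return True if the filesystem holding `path` has at least
    `required_bytes * headroom` bytes free.

    `headroom` is an illustrative safety margin (assumption), not a
    DataStage setting.
    """
    free = shutil.disk_usage(path).free
    return free >= required_bytes * headroom


# "." is a placeholder; substitute a real resource disk directory.
print(has_room_for(".", 5 * GIB))
```

Run it once per resource disk, because each partition's data files are written to the disks assigned to that node.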
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany

Post by ArndW »

I'm pretty sure that PX datasets and filesets avoid any 2 GB limit your filesystem might have by adding part files, but I'm not absolutely certain. If you can create files larger than 2 GB in your OS, then there is no effective limit other than disk space.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia

Post by ray.wurlod »

Even if your operating system limits file sizes to 2 GB, there is no effective limit. A Data Set's data is stored in multiple files per partition. So, if you had 10000 files per partition and 1000 partitions, that's 2.0E7 GB of data. And you're allowed more than that; I only chose those numbers for illustrative purposes.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
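
To make the arithmetic in the post above easy to rerun with other figures, here is a small Python sketch. It simply multiplies files per partition by partitions by the per-file size cap. The function name `dataset_capacity_gb` is made up for illustration, and the 2 GB default only models the hypothetical OS file size limit discussed in the thread.

```python
def dataset_capacity_gb(files_per_partition, partitions, max_file_gb=2):
    """Upper bound on data volume, in GB, when every data file is capped
    at `max_file_gb` (the hypothetical OS file size limit)."""
    return files_per_partition * partitions * max_file_gb


# Illustrative figures from the thread: 10000 files per partition,
# 1000 partitions, 2 GB per file.
total = dataset_capacity_gb(10_000, 1_000)
print(f"{total:.1e} GB")  # prints "2.0e+07 GB"
```

Either way, a 4-5 GB dataset is many orders of magnitude below that bound.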