XML Input Stage

Post by saikir »

Hi,

I am new to the XML capabilities of DataStage. Is there any limit on the number of rows that can be read, either in row count or in size? For example, is there something like the 2GB limit on a DataSet?

Regards,
Sai

Post by mahadev.v »

The 2GB limit on a DataSet is imposed by the operating system, not by DataStage (it mostly applies to the data files on the resource disk). The number of rows is never a problem, and size is not a constraint either, as long as the OS does not limit it.

Post by eostic »

You are likely to hit in-memory limits in the current version of XMLPack long before you have issues with ultimate file size. A "single document" cannot be more than about 300 to 500 MB in size, depending on the number of nodes and your environment.

However, that limit applies to a "single document". You could certainly run a job that retrieves, parses, and processes hundreds of such documents in a single run, where the documents are a collection of XML instance documents in a single subdirectory.
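
If it helps, a quick pre-flight check can tell you which files in that subdirectory are likely to be trouble before the job runs. The following is only a rough sketch in Python, run outside DataStage; the function name `split_by_size` and the 300 MB cutoff are assumptions picked for illustration (the low end of the range above), not anything XMLPack itself defines, and the real ceiling depends on node count and environment.

```python
import os

# Assumed cutoff: the low end of the "about 300 to 500 MB" range quoted above.
# This is an illustrative value, not a documented XMLPack setting.
SIZE_LIMIT_BYTES = 300 * 1024 * 1024


def split_by_size(directory, limit=SIZE_LIMIT_BYTES):
    """Return (ok, too_big): lists of .xml file paths in `directory`,
    split by whether each file's size exceeds `limit` bytes."""
    ok, too_big = [], []
    for name in sorted(os.listdir(directory)):
        path = os.path.join(directory, name)
        # Skip subdirectories and anything that is not an XML file.
        if not (os.path.isfile(path) and name.lower().endswith(".xml")):
            continue
        if os.path.getsize(path) > limit:
            too_big.append(path)
        else:
            ok.append(path)
    return ok, too_big
```

Anything that lands in `too_big` is a candidate for splitting into smaller instance documents before the job reads it. File size on disk is only a proxy for the in-memory footprint, so treat the result as a warning rather than a guarantee.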

Ernie