Limitation of DataStage to Process XML data?

A forum for discussing DataStage<sup>®</sup> basics. If you're not sure where your question goes, start here.

Moderators: chulett, rschirm, roy

Post Reply
snt_ds
Premium Member
Premium Member
Posts: 280
Joined: Wed Oct 18, 2006 11:53 am
Location: Pune, India

Limitation of DataStage to Process XML data?

Post by snt_ds »

Hi All,
Does any body having particular experience about limitation of DataStage to process XMS?
This might be generalized question, but we are exploring different options to process XML data.
I searched thru forums but couldnt find proper information.

Thanks for your time.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

The forums are not the place to search for documentation. You may have the documentation that was delivered with the software (that is, all the manuals). If not, request your support provider to get them for you.

I can't help with the actual question, having no experience with XMS.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

XMS or XML? You've mentioned both, which is it? :?
-craig

"You can never have too many knives" -- Logan Nine Fingers
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

I'm not really aware of any 'limitation' per se, what kind of concerns are you having? Have you gone through the XML PACK documentation in your Docs directory? Have you gone through the XML Best Practices document that Kim Duke is hosting for us on his website?

That would be where you should start.
-craig

"You can never have too many knives" -- Logan Nine Fingers
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Image

Hopefully Ernie will see the signal.
-craig

"You can never have too many knives" -- Logan Nine Fingers
eostic
Premium Member
Premium Member
Posts: 3838
Joined: Mon Oct 17, 2005 9:34 am

Post by eostic »

Cool image. The kids are playing "lego batman" on the Wii..... nice. XML limitations. Too broad a subject. Let's get specific. What are you trying to accomplish?

Ernie
Ernie Ostic

blogit!
<a href="https://dsrealtime.wordpress.com/2015/0 ... ere/">Open IGC is Here!</a>
lstsaur
Participant
Posts: 1139
Joined: Thu Oct 21, 2004 9:59 pm

Post by lstsaur »

If you know how to use Java to proceess XML documents, especially how to use StAX, don't even bother to use the Datastage. Datastage is using DOM based technique, if the XML document is larger than 2 GB, then DS aborts.
eostic
Premium Member
Premium Member
Posts: 3838
Joined: Mon Oct 17, 2005 9:34 am

Post by eostic »

A good point lstsaur makes. Size. Something to certainly consider, as DS uses XSLT in the internals, and this requires that the document be loaded entirely into memory. This has advantages, but at the same time limits the overall size. Then again, many XML applications have far less volume than this in a "single" document.

Skills is another consideration. If you already have the Java skills, and not the DS skills, it may not even matter. If the team doing the development is mostly DS and not Java, then you have something else to consider.

What are you "doing" with the XML after you get it? This is another significant consideration. Are you reading it to write to an rdbms? Most relationally based ETL tools do what is often referred to as "shredding" the document --- into rows and columns. DataStage does this as well as anything, and if you are already working with said rdbms targets and other sources, you can hopefully re-use parts of your logic.

If you are "writing" XML, it's a bit more tricky. Of course, if DataStage is already being used to read from a whole host of sources, it still may be the best candidate --- but if the XML has a very complex hierarchy, it can get very complicated in DS... do-able, but you'll need some strong DataStage skills.

If you are reading a complex hierachical XML document, with the sole purpose of then writing a different, alternative complex hierarchical XML document, TX may be a better option, depending on the complexity of the document and/or the existence of other DataStage skills and assets, and the number of such Transformations you need to build.

Ernie
Ernie Ostic

blogit!
<a href="https://dsrealtime.wordpress.com/2015/0 ... ere/">Open IGC is Here!</a>
snt_ds
Premium Member
Premium Member
Posts: 280
Joined: Wed Oct 18, 2006 11:53 am
Location: Pune, India

Post by snt_ds »

Thanks Ernie/lstsaur.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

eostic wrote:A good point lstsaur makes.
Ah... Ernie Ostic, the Yoda of DSXchange. :wink:
-craig

"You can never have too many knives" -- Logan Nine Fingers
snt_ds
Premium Member
Premium Member
Posts: 280
Joined: Wed Oct 18, 2006 11:53 am
Location: Pune, India

Post by snt_ds »

eostic wrote:What are you "doing" with the XML after you get it?
We are writing to RDBMS.
Post Reply