Limitation of DataStage to Process XML data?
Moderators: chulett, rschirm, roy
Limitation of DataStage to Process XML data?
Hi All,
Does any body having particular experience about limitation of DataStage to process XMS?
This might be generalized question, but we are exploring different options to process XML data.
I searched thru forums but couldnt find proper information.
Thanks for your time.
Does any body having particular experience about limitation of DataStage to process XMS?
This might be generalized question, but we are exploring different options to process XML data.
I searched thru forums but couldnt find proper information.
Thanks for your time.
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
The forums are not the place to search for documentation. You may have the documentation that was delivered with the software (that is, all the manuals). If not, request your support provider to get them for you.
I can't help with the actual question, having no experience with XMS.
I can't help with the actual question, having no experience with XMS.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
I'm not really aware of any 'limitation' per se, what kind of concerns are you having? Have you gone through the XML PACK documentation in your Docs directory? Have you gone through the XML Best Practices document that Kim Duke is hosting for us on his website?
That would be where you should start.
That would be where you should start.
-craig
"You can never have too many knives" -- Logan Nine Fingers
"You can never have too many knives" -- Logan Nine Fingers
Cool image. The kids are playing "lego batman" on the Wii..... nice. XML limitations. Too broad a subject. Let's get specific. What are you trying to accomplish?
Ernie
Ernie
Ernie Ostic
blogit!
<a href="https://dsrealtime.wordpress.com/2015/0 ... ere/">Open IGC is Here!</a>
blogit!
<a href="https://dsrealtime.wordpress.com/2015/0 ... ere/">Open IGC is Here!</a>
A good point lstsaur makes. Size. Something to certainly consider, as DS uses XSLT in the internals, and this requires that the document be loaded entirely into memory. This has advantages, but at the same time limits the overall size. Then again, many XML applications have far less volume than this in a "single" document.
Skills is another consideration. If you already have the Java skills, and not the DS skills, it may not even matter. If the team doing the development is mostly DS and not Java, then you have something else to consider.
What are you "doing" with the XML after you get it? This is another significant consideration. Are you reading it to write to an rdbms? Most relationally based ETL tools do what is often referred to as "shredding" the document --- into rows and columns. DataStage does this as well as anything, and if you are already working with said rdbms targets and other sources, you can hopefully re-use parts of your logic.
If you are "writing" XML, it's a bit more tricky. Of course, if DataStage is already being used to read from a whole host of sources, it still may be the best candidate --- but if the XML has a very complex hierarchy, it can get very complicated in DS... do-able, but you'll need some strong DataStage skills.
If you are reading a complex hierachical XML document, with the sole purpose of then writing a different, alternative complex hierarchical XML document, TX may be a better option, depending on the complexity of the document and/or the existence of other DataStage skills and assets, and the number of such Transformations you need to build.
Ernie
Skills is another consideration. If you already have the Java skills, and not the DS skills, it may not even matter. If the team doing the development is mostly DS and not Java, then you have something else to consider.
What are you "doing" with the XML after you get it? This is another significant consideration. Are you reading it to write to an rdbms? Most relationally based ETL tools do what is often referred to as "shredding" the document --- into rows and columns. DataStage does this as well as anything, and if you are already working with said rdbms targets and other sources, you can hopefully re-use parts of your logic.
If you are "writing" XML, it's a bit more tricky. Of course, if DataStage is already being used to read from a whole host of sources, it still may be the best candidate --- but if the XML has a very complex hierarchy, it can get very complicated in DS... do-able, but you'll need some strong DataStage skills.
If you are reading a complex hierachical XML document, with the sole purpose of then writing a different, alternative complex hierarchical XML document, TX may be a better option, depending on the complexity of the document and/or the existence of other DataStage skills and assets, and the number of such Transformations you need to build.
Ernie
Ernie Ostic
blogit!
<a href="https://dsrealtime.wordpress.com/2015/0 ... ere/">Open IGC is Here!</a>
blogit!
<a href="https://dsrealtime.wordpress.com/2015/0 ... ere/">Open IGC is Here!</a>