Which is the best tool for handling 20TB to 30TB of data?

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Subu
Participant
Posts: 9
Joined: Fri Mar 18, 2005 4:08 am

Which is the best tool for handling 20TB to 30TB of data?

Post by Subu »

Hi,
Which is the best tool for handling 20TB to 30TB of data? DataStage or Ab Initio?
Please advise.

Thanks
Subu
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Subu,

It really depends on a lot of factors, and there is no single answer. DataStage PX is great for large amounts of data, as is Ab Initio. Both tools will do the job. The deciding factors are then price, availability of resources, the salesperson, etc.

Apart from that, this question has been asked in different forms several times in this forum and a search of the headers should give you a number of interesting threads with many different opinions and views. But since this is a DataStage site the view will be somewhat slanted.
roy
Participant
Posts: 2598
Joined: Wed Jul 30, 2003 2:05 am
Location: Israel

Post by roy »

Hi,
Do they have a forum like this one? (That must carry some weight, in my opinion ;))
Roy R.
Time is money but when you don't have money time is all you can afford.

Search before posting:)

Join the DataStagers team effort at:
http://www.worldcommunitygrid.org
Subu
Participant
Posts: 9
Joined: Fri Mar 18, 2005 4:08 am

Post by Subu »

Thanks for your help. I am still confused :). I need to give our client a good reason why I chose DataStage and not Ab Initio.

Thanks
Subu
***
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Subu,

In order to recommend one over the other you would need to know more about the data, platforms, databases, metadata, infrastructure, etc. How is the 20-30TB of data organized: is that 300 rows at 100GB apiece? 10 tables? Is it sourced from complex sources, from PL/1 or COBOL? Does the client require metadata management at a low or high level?

All ETL tools will move data from A to B with no real transformations or logic in about the same amount of time; so using size alone as a differentiator is no use.
vmcburney
Participant
Posts: 3593
Joined: Thu Jan 23, 2003 5:25 pm
Location: Australia, Melbourne
Contact:

Post by vmcburney »

You are asking the question on a DataStage forum, so people will tend to favour DataStage over Ab Initio. Both ETL tools have a robust parallel processing engine that is better than any other data integration tool on the market. They both have data quality plugins and metadata management. We cannot tell you which one is better. It comes down to price, availability of resources and best fit in your enterprise. Why don't you let each company present to your client and let the client decide?
T42
Participant
Posts: 499
Joined: Thu Nov 11, 2004 6:45 pm

Re: Which is the best tool for handling 20TB to 30TB of data?

Post by T42 »

Subu wrote: Hi,
Which is the best tool for handling 20TB to 30TB of data? DataStage or Ab Initio?
DataStage EE. Seriously. Ascential has issued a challenge to anyone who can beat its benchmarked performance (I don't have the specific URL handy at the moment), and no one, not even Ab Initio, has come close, performance-wise.

Ascential has plenty of very large clients who throw terabytes of data left n' right daily, so they are very experienced with your data size. I have worked on databases that are 2-3 terabytes in total, and DataStage does a decent job handling them.

One interesting piece of trivia: DataStage EE was initially developed by Torrent. Torrent and Ab Initio used to be a single company before they went their separate ways over a disagreement on how to handle data. The combined company was founded by people who developed the Thinking Machines back in the late 80s/early 90s. My memory is fuzzy on this right now. I do think that Torrent chose the right solution, and I bet Ascential would agree. :-)
diamondabhi
Premium Member
Posts: 108
Joined: Sat Feb 05, 2005 6:52 pm
Location: US

Post by diamondabhi »

DataStage EE, of course. No question about it, whether in terms of performance, cost, or technical support.
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Go to Ascential's web site and do a search for "benchmark". There are some success stories (with big data volumes) that you can use to support your pro-DataStage case.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Subu
Participant
Posts: 9
Joined: Fri Mar 18, 2005 4:08 am

Post by Subu »

Thanks for the reply. I also found some material on Google.

Subu
***