Page 1 of 1

Greenplum

Posted: Wed Apr 30, 2008 2:24 am
by ratna
Hi all,

i have a question, can we use Greenplum Database on Datastage 7.5.2?
And can you tell me how? is it using the ODBC?

Thanks,
Ratna

Posted: Wed Apr 30, 2008 2:42 am
by ray.wurlod
Probably. You would need to obtain an ODBC driver (presumably from GreenPlum themselves) that is capable of managing their parallel access functionality. You could also write text files, and have their parallel bulk loader do the heavy lifting. And you could, of course, write your own custom stage (if you can get access to documentation about any client GreenPlum or PostGres API).

A recent press release indicates that GreenPlum has "certified interoperability with ... IBM DataStage", so it may be well worth it to contact them directly to see precisely what this means. And then you can post the answer here!

Posted: Wed Apr 30, 2008 3:45 am
by tkbharani
Yes , you can use it in DS 7.5.2
We have implemented in 7.1 server job itself. But Insert/update is slow when using ODBC.
Best Way for using GreenPlum with DataStage is use "gpfdist" fast greenplum loader using unix. Very soon they are coming with native connectivity between GreenPlum and DataStage . GP is working on it.

When u have tones of data to be loaded and quried, use GP and DS for best results. :wink:

Posted: Wed Apr 30, 2008 4:07 am
by ray.wurlod
We don't actually know the volumes of data that U is processing. We haven't heard from U for some while and even then no details were vouchsafed as to data volumes.

Posted: Wed Apr 30, 2008 4:37 am
by ethanr
If it is regarding size of datalaoding in GreenPlum,
then approximately you can load 1 TeraByte data in less than 3 hours(12 cpu,dual core)
For query'ing you can scan 1 Tera Byte of data in 16 minutes.
For more accurate bench marks you can always contact GreenPlum.

Posted: Sun Mar 25, 2012 8:04 pm
by mike369
tkbharani, can you tell config steps details? thank you

Posted: Sun Mar 25, 2012 11:57 pm
by ray.wurlod
From almost four years ago?
:shock: