Difference between Teradata Enterprise & Teradata API st

Shamitha_Shetty · Post by **Shamitha_Shetty** » Tue Aug 14, 2007 1:59 am

Which Teradata stage is more efficient when there is a huge volume of input data to extract? Input volume: approximately 450 million records.
Which Teradata stage (Enterprise or API) extracts the data more efficiently?
Also, once the extraction from either stage is done, I need to check whether it returned any rows; if it returned none, the interface should abort. How can this be done?

JoshGeorge · Post by **JoshGeorge** » Tue Aug 14, 2007 2:15 am

The Teradata (TD) API stage is definitely not the right choice if you have millions of records. TD EE (Enterprise), when used as a source, calls FastExport (the TD unload utility). For explicit exports or big data pulls TD EE works fine, or you can use the MLOAD stage (which uses FastExport when it is a source stage).

ray.wurlod · Post by **ray.wurlod** » Tue Aug 14, 2007 6:30 pm

To check whether no rows were extracted before proceeding, you need a modular design: one module (job) to perform the extraction into a staging area (Data Set), and a later module to process the data from that staging area. Perform your check before starting the second module.
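A minimal sketch of that gating step, written in Python purely for illustration. It assumes the first module also lands its output in a plain delimited text file (a real Data Set is not a flat file and would need to be inspected with the tools your installation provides). The names `has_rows` and `gate` and the exit-code convention are hypothetical, not part of any DataStage API.

```python
import sys
from pathlib import Path


def has_rows(staging_file: Path) -> bool:
    """Return True as soon as one non-blank line is found.

    Only the start of the file is read, so the cost does not grow
    with the number of staged records.
    """
    with staging_file.open("r", encoding="utf-8") as fh:
        for line in fh:
            if line.strip():
                return True
    return False


def gate(staging_file: Path) -> int:
    """Return 0 if the second module may start, 1 if the run should abort."""
    if not staging_file.exists() or not has_rows(staging_file):
        print(f"No rows staged in {staging_file}; aborting.", file=sys.stderr)
        return 1
    return 0
```

A wrapper script could call `sys.exit(gate(Path("/path/to/staged_file")))` between the two modules, so that whatever schedules the jobs sees a non-zero exit code and does not start the second one.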

Shamitha_Shetty · Post by **Shamitha_Shetty** » Wed Aug 15, 2007 11:01 pm

Yes, for the volume of input data to extract, a modular design is suitable. But how do we go about checking whether the extraction is empty? Taking a count of the table is not appropriate given the huge volume of data. Is there any alternative?

ray.wurlod · Post by **ray.wurlod** » Thu Aug 16, 2007 12:46 am

Try selecting just the first row from the table.
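As a rough illustration of the first-row probe, here is a small Python sketch. It uses the standard-library `sqlite3` module as a stand-in database so that it runs anywhere; the table name `staging_extract` and the function `table_has_rows` are made up for the example. Against Teradata the equivalent query would be something like `SELECT TOP 1 ...` rather than `LIMIT 1`, and the connection would come from whatever driver you use.

```python
import sqlite3


def table_has_rows(conn, table: str) -> bool:
    """Fetch at most one row instead of counting the whole table."""
    cur = conn.cursor()
    # The table name is interpolated directly, so pass only trusted
    # identifiers. LIMIT 1 is SQLite syntax (an assumption of this sketch).
    cur.execute(f"SELECT 1 FROM {table} LIMIT 1")
    return cur.fetchone() is not None


# Demo against an in-memory stand-in table (hypothetical name).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE staging_extract (id INTEGER, payload TEXT)")
print(table_has_rows(conn, "staging_extract"))  # empty table

conn.execute("INSERT INTO staging_extract VALUES (1, 'abc')")
print(table_has_rows(conn, "staging_extract"))  # one row present
```

The point of the design is that the database can stop after the first qualifying row, so the check stays cheap no matter how many rows were extracted.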

hamzaqk · Post by **hamzaqk** » Mon Aug 20, 2007 3:39 am

TD API reads the records sequentially and is preferred when you have to read a small number of records, although it does not have any of the limits pertaining to the TD utilities.

The TD Ent stage is the only stage which can read and write in parallel and is preferred for large volumes of data. It uses TeraRead and TeraWrite, or in other words the FastExport and FastLoad protocols, and thus has the same limitations as the TD utilities.

Teradata Certified Master V2R5

JoshGeorge · Post by **JoshGeorge** » Mon Aug 20, 2007 4:16 am

hamzaqk wrote: TD API reads the records sequentially and is preferred when you have to read a small number of records, although it does not have any of the limits pertaining to the TD utilities.

Interesting... I think TD API has the option to run in parallel mode as well. One can write/insert records using TD API in parallel mode, and it gives much better performance than using it in sequential mode.

hamzaqk wrote: The TD Ent stage is the only stage which can read and write in parallel and is preferred for large volumes of data. It uses TeraRead and TeraWrite, or in other words the FastExport and FastLoad protocols, and thus has the same limitations as the TD utilities.

I don't think TD EE is the 'only' parallel stage. TD MLoad is also widely used.