Row count differs between execution modes

KadetG · Post by **KadetG** » Thu Dec 07, 2006 2:07 am

Hi All

So... If I set the execution mode to "Default (Sequential)", DB2_UDB_API extracts the data correctly. But if I set the execution mode to "Parallel", the stage extracts more rows than the table contains, although the extraction is faster.

Why is that possible? What am I doing wrong?

kumar_s · Post by **kumar_s** » Thu Dec 07, 2006 2:15 am

Can you explain in more detail? How many rows are in the table, and how many rows were output? How many stages are there? What partitioning is used in each stage? What is the SELECT query, if any?

KadetG · Post by **KadetG** » Thu Dec 07, 2006 2:23 am

Hi

I have 294886 rows for PRCSS_WK_ENDING_DT 10.11.2006. But if I set parallel mode, DataStage extracts 1474430 rows. The job consists of only 3 stages (DB2_UDB_API for the extract, a Copy stage in parallel mode by default, and DB2_UDB_API in parallel mode for the load).

This is the SQL query sent to DB2: SELECT UTILDM_EMP_GEO_SPRCSS.CMPNY_CD,UTILDM_EMP_GEO_SPRCSS.CTRY_CD,UTILDM_EMP_GEO_SPRCSS.PRCSS_WK_ENDING_DT FROM UTILDM_EMP_GEO_SPRCSS WHERE PRCSS_WK_ENDING_DT = '2006-11-10';

kumar_s · Post by **kumar_s** » Thu Dec 07, 2006 2:31 am

Is 1474430 the figure shown in the job statistics, or the count on your target table? If it's the latter, you might be counting previously inserted data as well. If not, do you see any duplicates in the extracted data for any specific key?
Ensure the partitioning method in the Copy stage is 'Same' and not 'Entire'.
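
One way to check for duplicates outside the job is to count how often each extracted row occurs. The sketch below is only an illustration: the sample rows are made up, and `find_duplicates` is a hypothetical helper, not part of DataStage. It also assumes the three selected columns should be unique in the source, which the thread does not confirm.

```python
from collections import Counter

# Hypothetical sample of extracted rows: (CMPNY_CD, CTRY_CD, PRCSS_WK_ENDING_DT).
# These values are invented for illustration only.
extracted = [
    ("A01", "US", "2006-11-10"),
    ("A01", "US", "2006-11-10"),
    ("B02", "DE", "2006-11-10"),
]

def find_duplicates(rows):
    """Return each row that occurs more than once, mapped to its count."""
    return {row: n for row, n in Counter(rows).items() if n > 1}

duplicates = find_duplicates(extracted)
print(duplicates)
```

If every row of the real extract shows up the same number of times, that points at the extract being repeated rather than at bad source data.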

ray.wurlod · Post by **ray.wurlod** » Thu Dec 07, 2006 2:59 am

If you're executing the same SELECT on each partition you're effectively getting Entire partitioning. If that's not what you want, you're going to have to constrain the query appropriately and/or select in sequential mode and choose an appropriate partitioning algorithm.
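
Note that 1474430 is exactly 5 × 294886, which is what you would expect if five partitions each ran the same SELECT. The rough simulation below illustrates the effect. The partition count of 5 is an assumption inferred from that ratio, and the function names and the modulus-style constraint are hypothetical stand-ins, not DataStage or DB2 features.

```python
# Hypothetical simulation; none of these names come from DataStage or DB2.
TABLE_ROWS = 294886   # rows matching the WHERE clause (from the thread)
PARTITIONS = 5        # assumption: inferred from 1474430 / 294886

def extract_unconstrained(rows, partitions):
    """Every partition runs the same SELECT and receives the full result set."""
    return [list(rows) for _ in range(partitions)]

def extract_constrained(rows, partitions):
    """Each partition keeps only the rows whose key maps to it (modulus-style)."""
    return [[r for r in rows if r % partitions == p] for p in range(partitions)]

rows = range(TABLE_ROWS)  # stand-in row keys 0..TABLE_ROWS-1
unconstrained_total = sum(len(part) for part in extract_unconstrained(rows, PARTITIONS))
constrained_total = sum(len(part) for part in extract_constrained(rows, PARTITIONS))

print(unconstrained_total)  # 1474430
print(constrained_total)    # 294886
```

With the constraint, the partitions hold disjoint subsets that add up to the original row count, which is the behaviour you get from reading sequentially and then partitioning.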

KadetG · Post by **KadetG** » Fri Dec 08, 2006 5:12 am

Hi

To kumar_s:
It's both what the statistics show and the row count after the data transfer.
Hmm, I think they are duplicates...
But why are they extracted and copied to the target table?
I'm sure of this: it happens if the source DB2_UDB_API stage and the Copy stage both have "Parallel" as the execution mode. Everything is fine if I set Sequential for DB2_UDB_API and leave the Copy stage in Parallel.