Page 1 of 1

Row generator and Column generator

Posted: Fri Jul 12, 2013 7:17 am
by asmita
Hi All,
i am new to this site. Can any one of you suggest me ,what i can do to improve performance tuning in Row generaor, Column generator and Peek stage. Please do guide me.

Thanks,
Asmita

Posted: Fri Jul 12, 2013 7:31 am
by ArndW
There is very little that can be done to improve the performance of the row and column generators.

The Peek stage is used for debugging purposes and the only way to make it work faster is to have it log less information, in which case it probably isn't needed and should be removed from the job in question.

Row and Column generators are incredibly fast active stages. I can't think of any stage which can process as much data as these stages can produce.

Posted: Fri Jul 12, 2013 7:33 am
by chulett
Start by posting in the correct forum. :wink: I moved your post.

Secondly, provide more details as I don't really see any of those stages requiring any 'performance tuning'. Did you have a specific job or job design in mind when posted this question? And since you started in a forum that doesn't require them, can you let us know what version you are running and your server platform - Windows or UNIX.

Posted: Fri Jul 12, 2013 12:15 pm
by rameshrr3
Try toggling the eexecution mode from Sequential to Parallel and see if you get an improvement in performance. Its wise to post the context in which you are trying this . Other wise our answers are a shot in the dark . If you are on a Grid/MPP, check if any Node Map or node pool constraint is applied to it and try changing the constraint so that it runs on a node local to the conductor.

Posted: Fri Jul 12, 2013 7:57 pm
by jwiles
Column Generator is probably already running in parallel.
Row Generator runs in sequential mode by default. You should understand that by setting it to parallel, you will have more records generated (number of nodes executing * number of records requested). You may not want to do that, or you may want to adjust the number of rows requested to a lower value when switching it to parallel.

Do you have a job in which you--or someone else--believe that these stages ARE performance bottlenecks (and why do you think so?), or perhaps has someone asked this as part of an interview?

Regards,