Hi all,
My question is about the best practice for two consecutive Transformer stages.
Both Transformers are used to do lookups.
Which is better, 1 or 2?
1. Put a hashed file or sequential file between the two lookups
2. Put nothing between the Transformers
Thanks !
Best practices for 2 consecutive transforms
Hi
I would suggest avoiding two Transformers if you can. Try to implement all the transformations in one Transformer.
If you can't avoid it, then I think linking the first Transformer directly to the second is better than putting a stage between them.
I hope that answers your question.
Regards
Ashwin
kumar_s wrote: Two Transformers placed next to each other will be combined into a single process at runtime unless inter-process buffering is introduced. And placing an intermediate passive stage will add unnecessary I/O.

Kumar, I have one doubt here.
I am aware that consecutive Transformers are executed by a single process unless row buffering is enabled.
My doubt is about how they are actually executed:
1. Are all the transformations combined together and then executed?
2. Or are the transformations of each Transformer executed first, with the row then passing through the transformations of the succeeding Transformers in order?
Thanks
Success consists of getting up just one more time than you fall.
Not only Transformers: all active stages are merged into a single process during compilation. Passive stages mark the boundaries. All the transformation logic is built into that single process, so there won't be pipeline parallelism among the stages.
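
To picture the difference between the two options, here is a rough Python sketch. It is only an analogy and not how DataStage is implemented. The lookup tables, the function names and the in-memory CSV standing in for the intermediate hashed/sequential file are all made up for illustration.

```python
import csv
import io

# Hypothetical lookup tables standing in for the two lookups.
CUSTOMER_NAMES = {1: "Acme", 2: "Globex"}
REGION_BY_NAME = {"Acme": "EU", "Globex": "US"}


def transformer_1(row):
    """First lookup: add the customer name for the row's id."""
    return {**row, "name": CUSTOMER_NAMES.get(row["id"])}


def transformer_2(row):
    """Second lookup: add the region for the name found by the first lookup."""
    return {**row, "region": REGION_BY_NAME.get(row["name"])}


def run_direct(rows):
    """Option 2: stages linked directly, each row goes through both in memory."""
    return [transformer_2(transformer_1(row)) for row in rows]


def run_staged(rows):
    """Option 1: land the first stage's output in a file, then read it back."""
    staging = io.StringIO(newline="")  # stands in for the intermediate file
    writer = csv.DictWriter(staging, fieldnames=["id", "name"])
    writer.writeheader()
    for row in rows:
        writer.writerow(transformer_1(row))  # extra write
    staging.seek(0)
    results = []
    for row in csv.DictReader(staging):  # extra read
        row["id"] = int(row["id"])  # CSV returns text, so restore the type
        results.append(transformer_2(row))
    return results


rows = [{"id": 1}, {"id": 2}]
print(run_direct(rows))
print(run_staged(rows))
```

Both functions produce the same rows for this input. The staged version just does an extra write and read of every row, which is the unnecessary I/O mentioned above.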
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
Re: Best practices for 2 consecutive transforms
In reply to kjaouhari's original question (option 1 or option 2):
Nothing between the Transformer stages. If you have multiple CPUs, or the job in its current form consumes less than 40% of a single CPU, enable inter-process row buffering.
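
As a rough picture of what a row buffer between two stages buys you, here is a small Python sketch. It is a conceptual stand-in only: it uses threads and a bounded queue rather than real operating system processes, and the names `stage`, `run_buffered` and `buffer_rows` are hypothetical, not DataStage settings.

```python
import queue
import threading

SENTINEL = None  # marks end of data on a link


def stage(transform, inbox, outbox):
    """Run one stage: read rows from inbox, transform them, write to outbox."""
    while True:
        row = inbox.get()
        if row is SENTINEL:
            outbox.put(SENTINEL)  # pass end-of-data downstream
            return
        outbox.put(transform(row))


def run_buffered(rows, first, second, buffer_rows=128):
    """Run two stages concurrently, linked by bounded row buffers."""
    link_in = queue.Queue(maxsize=buffer_rows)
    link_mid = queue.Queue(maxsize=buffer_rows)  # buffer between the stages
    link_out = queue.Queue()  # unbounded, so the last stage never blocks
    workers = [
        threading.Thread(target=stage, args=(first, link_in, link_mid)),
        threading.Thread(target=stage, args=(second, link_mid, link_out)),
    ]
    for worker in workers:
        worker.start()
    for row in rows:
        link_in.put(row)
    link_in.put(SENTINEL)
    for worker in workers:
        worker.join()
    results = []
    while True:
        row = link_out.get()
        if row is SENTINEL:
            break
        results.append(row)
    return results


print(run_buffered(range(5), lambda x: x + 1, lambda x: x * 2))
```

The point of the buffer is that the second stage can start on early rows while the first stage is still producing later ones, which only pays off when there is spare CPU, as noted above. Python threads do not run CPU-bound work in parallel, so the sketch shows the structure and not the speed-up.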
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.