Improve performance with multiple transformers
Moderators: chulett, rschirm, roy
I would like to improve the performance of my job, and would be grateful for any ideas. Please suggest whether a change in design or a change of stage could give me anything better.
My job is like this.
Job1: Dataset (XMLs) --> XML Input stage --> move data from 40 different complex elements into 50 datasets.
Before writing to the datasets I do an inline sort, hash-partition the data, and store it in the datasets.
(I am happy with this job; it finishes in less than a minute.)
Job2: I read these 50 datasets ----> Transformer (simple transformation rules) ---> push to dataset. If any records meet my rejection criteria in a transformer, they go to a shared container; that is, the rejects from all 50 transformers are collected by a funnel and pushed to the shared container. Since I partitioned the data when storing it in the previous job, I use "Same" partitioning here.
This Job2 is taking almost 15 minutes to complete.
Is there any way to improve the performance? Even if I run the job without a single XML it still takes the same time, which tells me the problem is not the data volume; it's the design that is creating the problem.
Any help would be appreciated.
Re: Improve performance with multiple transformers
myukassign wrote: Job2. I read these 50 datasets ----> Transformer (simple transformation rules) ---> push to dataset.
I guess the problem is with the read. How do you read these datasets?
Is there anything available for datasets similar to the "file pattern" option in the Sequential File stage? If yes, are you making use of it?
Also, can you look in the job monitor to find out which stage is taking more time?
Re: Improve performance with multiple transformers
No, there is no file pattern option for datasets, so I did not use that.
It's a normal read.
Anything else you can tell us about the job and the source file? How many rows end up taking the 15 minutes?
Also, what happens if you just have "Source File ---- Transformer ---- (some target)" with a constraint in the Transformer that is impossible (such as 1=0)? Or you could just have the source file go into a dummy Copy stage. How fast does that read the source data?
Ernie
Ernie Ostic
blogit!
<a href="https://dsrealtime.wordpress.com/2015/0 ... ere/">Open IGC is Here!</a>
eostic wrote: Anything else you can tell us about the job and the source file? How many rows end up taking the 15 minutes? Also, what happens if you just have "Source File ---- Transformer ---- (some target)" with an impossible constraint in the Transformer (such as 1=0), or have the source file go into a dummy Copy stage? How fast does that read the source data?
In my first post I mentioned that even if I run the job without a single record it takes almost the same time, so it has nothing to do with the data. I think these 50 transformers are killing my peace...
Is operator combination enabled or disabled?
Use the Monitor or the Performance Analyzer to report on CPU consumption. This will provide guidance about which sets of stages could benefit from disabling operator combination.
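As a rough sketch of how to try this: operator combination can be toggled per session (or per job, via the job's environment variables in Administrator/Designer) with the standard APT_DISABLE_COMBINATION variable before you invoke the run. Assuming a Unix engine:

```shell
# Disable operator combination for the next run. Each stage then runs
# as its own osh process and shows up separately in the job monitor,
# so you can see which stage (not which combined operator) burns CPU.
export APT_DISABLE_COMBINATION=True

# Re-enable it afterwards; combination usually helps once you know
# which specific stages are worth leaving uncombined.
export APT_DISABLE_COMBINATION=False
```

With combination disabled the monitor and Performance Analyzer report per-stage figures, which is what makes the comparison Ray suggests possible.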
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
ray.wurlod wrote: Is operator combination enabled or disabled? Use the Monitor or the Performance Analyzer to report on CPU consumption. This will provide guidance about which sets of stages could benefit from disabling operator combination.
When I start the Performance Analyzer it gives me a warning window: "No performance data available".
How do I use this?
ray.wurlod wrote: It's all in the manual. You have to capture the performance data when you run the job; then the Performance Analyzer reports on the captured statistics.
I tried different combinations of enabling and disabling operator combination, but there was not much improvement in performance.
As I said before, the layout of my job is like this:
DS -----------> Transformer --------------> DS
DS -----------> Transformer --------------> DS
DS -----------> Transformer --------------> DS
Sometimes in a job I have 50 such DS-->TRNS--->DS.
When I reduce the number of transformers and break the flow into multiple jobs, I get somewhat better performance.
What would be the best approach? Is there anything else I should try other than enabling/disabling operator combination?
myukassign wrote:1. As I said before....The layout of my job is like this
DS -----------> Transformer --------------> DS
DS -----------> Transformer --------------> DS
DS -----------> Transformer --------------> DS
Sometimes in a job I have 50 such DS-->TRNS--->DS.
No, this is the first time you've fully described your job layout in such a manner as to make it clear to us.
myukassign also wrote: When I reduce the number of transformers and break the flow into multiple jobs, I get somewhat better performance.
I would certainly hope so. What an... interesting... approach.
-craig
"You can never have too many knives" -- Logan Nine Fingers
My bet is that this is simply an initialization issue. You are saying that it takes that long even if the source file is completely empty? Sorry I missed that point earlier.
There are a ton of processes being started here. Hopefully you are using a single node config file, but even if not, you seem to be doing your own level of parallelism on top of the pipeline parallelism that is inherent in the platform. Look at the OS, when this thing is finally running, you probably have a LOT of osh processes running.
I'd like to know more about why you need the multiple levels of "designer based" parallelism...what are you trying to accomplish.....
...and then further, consider using Server for your solution. Even if you still end up having the "designer based" parallelism, you will end up with vastly fewer processes and it will probably start up much sooner.
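To make Ernie's single-node suggestion concrete: a one-node parallel configuration file along these lines cuts the process count to one set per stage instead of one per node. This is only a sketch; the hostname and both paths are placeholders for your environment:

```text
{
  node "node1"
  {
    fastname "your_engine_host"
    pools ""
    resource disk "/data/datasets" {pools ""}
    resource scratchdisk "/data/scratch" {pools ""}
  }
}
```

Pointing $APT_CONFIG_FILE at a file like this for Job2 means each of the 50 DS --> Transformer --> DS branches starts far fewer osh processes, which should shrink the fixed startup cost that shows up even with zero rows.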
Ernie
OK, let me explain what I am trying to do.
1. I have an input XML with almost 100 complex elements. The idea is that the source system sends me all the related table information in each complex element of that XML.
2. I need to move each complex element into 100 datasets, with some small transformation rules (null checks, adding timestamps, load dates, etc.) for each of them, and finally load to tables.
So my design is like that.
If I don't do designer-based parallelism, then imagine: I would need to create 100 different jobs, one for each complex element.
I hope you understand why such a design was implemented.
Please point out any design flaw you see here, or tell me whether the server job approach is better in this case.
I hope with this answer I will be able to close this thread.
Thanks a lot for your valuable suggestions.