Transfering data from one job to another in a sequencer

splayer · Post by **splayer** » Fri Jun 23, 2006 3:31 pm

I am trying to transfer some rows between 2 jobs in a job sequencer. Let's say, there are some rows which have been acted upon by JobA. I would like JobB to see these changes. I want to do this in memory so using a data set file is not an option. I can use parameters of the job sequencer but it'll be cumbersome. Also, the number of rows will change.
Can anyone suggest a solution?

ray.wurlod · Post by **ray.wurlod** » Fri Jun 23, 2006 3:45 pm

No, unless you're prepared to write a pair of Custom stages to manage the memory. DataStage has nothing out of the box.

Using a Sequential File stage to write to a named pipe in one job and one to read from that named pipe in the other job is a possibility, but co-ordination of the jobs is crucial. Unlike server Sequential File stage (which can handle named pipes automatically) you will need to create and manage the named pipe yourself in a parallel job. Use mknod or mkfifo to create it, for example.

thompsonp · Post by **thompsonp** » Mon Jun 26, 2006 1:51 am

What is the purpose of the job sequence in this case?

Both jobs will have to be running simultaneously for the data to be held 'between them' in memory.

Why not just put it all in one job? If that makes for a complicated looking job try using containers.

ray.wurlod · Post by **ray.wurlod** » Mon Jun 26, 2006 6:05 am

Job sequences manage flow of control, not flow of data.

thompsonp · Post by **thompsonp** » Mon Jun 26, 2006 6:30 am

Why do rows acted on by JobA need to be accessed by JobB without first landing them to disk?

I think the fact that a job sequence is used to control these two jobs is diverting attention from the real issue of how to get data from one job to another. Does it make any difference if the jobs are started from a job seqeunce, from Director or from the command line?

There are a variety of ways that data can pass from one job to another most of which land to disk. As Ray metioned named pipes are the obvious exception and you could perhaps argue MQ be included as well.

I think you need to reconsider what problem you are trying to solve and why keeping data in memory is so important in this case.

ArndW · Post by **ArndW** » Mon Jun 26, 2006 6:34 am

Named pipes are usually used for simple interprocess communication. DataStage supports named pipes.

Klaus Schaefer · Post by **Klaus Schaefer** » Mon Jun 26, 2006 6:45 am

If you store the records it in a EE DataSet - quick and VERY performant - they will be in memory if you act on them with a lookup in job b.

Klaus

DSXchange

Transfering data from one job to another in a sequencer

Transfering data from one job to another in a sequencer

Re: Transfering data from one job to another in a sequencer