Joiner, union etc
Moderators: chulett, rschirm, roy
Joiner, union etc
Hi all,
I'm very new to Datastage, have just completed installation on my local machine (7.5x2) and am now messing around with it.
I come from an Informatica background, and well, DS's GUI confuses me. I successfully managed to do simple loads (DB table>flat file; DB table>simple transformation>flat file) using Server jobs. But I didn't notice any objects that are frequently found and used in Informatica, ie Joiner, Union, Sorter, Filter.
However I did notice that more objects are available if a Parallel job is created.
How does one do the similar activities in Datastage, like in Informatica?
I'm very new to Datastage, have just completed installation on my local machine (7.5x2) and am now messing around with it.
I come from an Informatica background, and well, DS's GUI confuses me. I successfully managed to do simple loads (DB table>flat file; DB table>simple transformation>flat file) using Server jobs. But I didn't notice any objects that are frequently found and used in Informatica, ie Joiner, Union, Sorter, Filter.
However I did notice that more objects are available if a Parallel job is created.
How does one do the similar activities in Datastage, like in Informatica?
You cant say that Parallel jobs are restricted only to UNIX systems, but considering the stability and security what UNIX provides it certainly has an edge. As far as functions you want to build and use in your server jobs you can also do it with the help of subroutines(written in BASIC) to be used in your jobs.
Sufiyan
I think Datastage 7.5X2 supports running parallel jobs on windows. And in the transformer(parallel) you can use various inbuilt system variables, functions, type conversion and various others. And if you feel that these are less then also you can build your routines in C/C++ for parallel jobs. Understand the difference BASIC for server jobs routines and C/C++ for parallel jobs routines.
Sufiyan
I think Datastage 7.5X2 supports running parallel jobs on windows. And in the transformer(parallel) you can use various inbuilt system variables, functions, type conversion and various others. And if you feel that these are less then also you can build your routines in C/C++ for parallel jobs. Understand the difference BASIC for server jobs routines and C/C++ for parallel jobs routines.
Sufiyan
Well, as long as you have Enterprise Edition Server and you can create parallel jobs you don't have to worry about whether you are on Windows or UNIX. However, based on your server's hardware configuration you will gain by utilizing the Parallel Engine's capabilities.Stupe wrote:Ah... so, if I do continue using Server jobs, I'll have to code my subroutines.
And these transformations will be available in Parallel jobs. Can I run Parallel jobs on my local machine (containing Source, Datastage, Target)?
Also, you design the jobs using the Designer and compile them. Execution happens on the server. So you are not running the jobs on your "local machine" per se.... unless you have the client and server on the same box and you are using the client on the server box to develop.
I am sure other Gurus can throw more light on the subject.... This is my understanding...
Good luck
-V
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
You have version 7.5x2, which is parallel-capable.
The main thing is that the design process is a very different mindset. In Informatica you're used to a bottom-up design strategy - start with the details and build the structure (maps and the like) on those. In DataStage it's top-down - start with the big picture (essentially the picture you draw in Designer) and fill in the details subsequently.
The main thing is that the design process is a very different mindset. In Informatica you're used to a bottom-up design strategy - start with the details and build the structure (maps and the like) on those. In DataStage it's top-down - start with the big picture (essentially the picture you draw in Designer) and fill in the details subsequently.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Thanks for the replies guys, but unfortunately can't see the whole of Ray's comments.
But just thinking out loud, I assume the licensing structure for Parallel-capable and a non parallel-capable is different. One of the added benefits of parallel-capable will definitely be being able to split the load to different processors. I get that.
But it looks like the non parallel-capable version is very much inferior, since 'simple' transformation objects like joiners, filters, sorts etc needs some manual coding (can't say how difficult to code though, never/haven't tried it).
But just thinking out loud, I assume the licensing structure for Parallel-capable and a non parallel-capable is different. One of the added benefits of parallel-capable will definitely be being able to split the load to different processors. I get that.
But it looks like the non parallel-capable version is very much inferior, since 'simple' transformation objects like joiners, filters, sorts etc needs some manual coding (can't say how difficult to code though, never/haven't tried it).
-
- Participant
- Posts: 11
- Joined: Thu Sep 13, 2007 12:56 am
Re: Joiner, union etc
Hi...
In Server 7.5x2, if you are using OCI, RDBMS or any other Database stages, you can join two tables using SQL query. If you are using sequential file or any flat file, you can do all the join types except right outer join by using a transformer stage.
Coming to Sorting, there is a sort stage. Again you can use this stage when your source is sequential or flat files. If the data is coming from database, you can sort the data in OCI stage or any other datbase stages.
In Server 7.5x2, if you are using OCI, RDBMS or any other Database stages, you can join two tables using SQL query. If you are using sequential file or any flat file, you can do all the join types except right outer join by using a transformer stage.
Coming to Sorting, there is a sort stage. Again you can use this stage when your source is sequential or flat files. If the data is coming from database, you can sort the data in OCI stage or any other datbase stages.