DSXchange

Posted: **Mon Mar 01, 2010 6:26 am**

I tried to have a good understanding of Run Time Column Propagation (RCP) in term of when and how to use it properly.

Under what circumstances would it be a good idea to enable RCP on the project level?

What are the downside for enabling RCP at the project level.

When not to turn on RCP at the project level.

thanks.

Posted: **Mon Mar 01, 2010 10:01 am**

We use RCP to only show columns that have transforms. That way it makes it easier to view. The other columns have not changes and are just passing thru. Much cleaner.

Posted: **Tue Mar 02, 2010 5:08 am**

Are there any disadvantages, if we choose RCP in Datastage?

Posted: **Tue Mar 02, 2010 8:09 am**

RCP can be confusing at first, since you can do manipulations on columns that don't seem to exist. Along the same lines, you can get confusing warnings in joins when a column is marked as "duplicate" because it is coming from RCP.
In terms of execution and compile times there is no difference.

Posted: **Tue Mar 02, 2010 9:56 am**

I agree, it can be confusing.

Most people do not need it because they carry all the columns across manually. Why do they use RCP?

Posted: **Tue Mar 02, 2010 10:22 am**

To answer the question as originally asked, IMHO I would never simply turn it on at the Project level unless you had an expert developer team all of whom were very comfortable with it.

Posted: **Tue Mar 02, 2010 11:36 am**

The best reason I've seen for RCP is when you have a set of functions (stage types) that you want to use to process a vast set of tables, each that has a different structure ....especially in cases where you don't need "really specific" transformations on certain column names...... Entire generic jobs are written that have no columns at all...... These scenarios may be rare, but it becomes a very powerful option to have a job that works against nearly "any" table.

Ernie

Posted: **Tue Mar 02, 2010 11:42 am**

Exactly... turn it on at the job level when you really need or can leverage it properly, case by case. All IMHO, of course.

Posted: **Tue Mar 02, 2010 1:14 pm**

Hi All,
When i use schema file for reading a data file then i use RCP since i do not want to specify the column names in the source and target.

Regards
Sreeni

Posted: **Tue Mar 02, 2010 1:55 pm**

I agree with Craig. Turn it on at the job level. I think it helps with shared containers. Only specifying the columns needed.

Posted: **Tue Mar 02, 2010 3:28 pm**

I've also used it shops with very large structures (for example a record with hundreds and hundreds of columns) where only a few columns needed to be worked on, but all columns needed to be transmitted along to the end result.

I only enable it on a per-job basis - it can cause a lot of problems otherwise.

DSXchange

when to use RCP

when to use RCP