tsort and psort

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

qt_ky
Premium Member
Premium Member
Posts: 2895
Joined: Wed Aug 03, 2011 6:16 am
Location: USA

Post by qt_ky »

When tsort or psort executes in sequential mode, it's called a total sort.

When tsort or psort executes in parallel mode, it's called a partition sort or a parallel sort.

When you combine a parallel sort with a sort merge collector, it's called a total sort.

Records processed by psort must be less than 32 KB in length (psort... prohibitive sort).

In the Sort stage:

The default "Sort Utility = DataStage" option leads to the tsort operator.

The "Sort Utility = Unix" option leads to the psort operator. You can find "psort -sorter unix" in the OSH script.
Choose a job you love, and you will never have to work a day in your life. - Confucius
Post Reply