can target dataset preserve pre sorted data?

Post questions here relative to DataStage Enterprise/PX Edition for such areas as Parallel job design, Parallel datasets, BuildOps, Wrappers, etc.

Moderators: chulett, rschirm, roy

Post Reply
tverma27
Participant
Posts: 8
Joined: Tue Mar 10, 2009 3:56 am

can target dataset preserve pre sorted data?

Post by tverma27 »

Hi

In my project we are doing Sorting on some business columns and want to load the sorted data in a dataset.

But my concern is since input data will move to different nodes of the dataset, Will the sorting remain preserved in the dataset?


Thanks in advance!
mahadev.v
Participant
Posts: 111
Joined: Tue May 06, 2008 5:29 am
Location: Bangalore

Post by mahadev.v »

Sorting and partition of the data can be maintained in a DataSet. Use Same partition on the DataSet input link. As long as you are not repartitioning the sorted data this would work.
"given enough eyeballs, all bugs are shallow" - Eric S. Raymond
tverma27
Participant
Posts: 8
Joined: Tue Mar 10, 2009 3:56 am

Post by tverma27 »

Thanks for your quick reply
i am doing a database sorting so will SAME partition help?
mahadev.v
Participant
Posts: 111
Joined: Tue May 06, 2008 5:29 am
Location: Bangalore

Post by mahadev.v »

Might work. Data would get repartitioned but the sort order in each partition should remain.
"given enough eyeballs, all bugs are shallow" - Eric S. Raymond
stefanfrost1
Premium Member
Premium Member
Posts: 99
Joined: Mon Sep 03, 2007 7:49 am
Location: Stockholm, Sweden

Post by stefanfrost1 »

Depends on which database (stage) and read method you are using. If you're not reading data from each database partition(node) your data will be returned on a single node and thereby your dataset will remain as a single node when using SAME partitioning.

Using DB2EE-stage for example and read method table would return data on as many nodes as your db is configured on and using db2 hash algoritm, so using SAME partitioning in this case would preserve your sort and partitioning as from the db.
-------------------------------------
http://it.toolbox.com/blogs/bi-aj
my blog on delivering business intelligence using agile principles
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

Repartitioning will, as a general rule, destroy any sorting (which is per-partition).
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
Post Reply