dataset problem

chvenkat.v
Participant
Posts: 94
Joined: Fri Dec 14, 2007 3:22 am

Post by chvenkat.v »

Hi,

Can anyone please help?

How can I view the data in a dataset without using DataStage? Is there any tool to view dataset data in a Unix/AIX environment?

Thanks,
venkat
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany

Post by ArndW »

You can use the DataStage command "orchadmin" to do this. Running "orchadmin dump" without arguments prints its usage and options:
$ orchadmin dump
##I IIS-DSEE-TFCN-00001 10:52:14(000) <main_program>
IBM WebSphere DataStage Enterprise Edition 8.0.1.4665
Copyright (c) 2001, 2005-2007 IBM Corporation. All rights reserved



##I IIS-DSEE-TUTL-00031 10:52:14(001) <main_program> The open files limit is 1024; raising to 65536.

COMMAND: dump [ -options... ] descriptor-files...

Dump the specified ORCHESTRATE parallel files as text to the
standard output. If no options are specified, all records are
dumped in order from the first record of the first partition to
the last record of the last partition. Each field value is
followed by a space, and each record is followed by a newline.
Specific top-level fields may be dumped with the -field option.

OPTIONS:
  -field name   Dump the specified top-level field. The default is
                to dump all fields. This option can occur multiple
                times. Each occurrence adds to the list of fields.

  -name         Precede each value by its field name and a colon.

  -n numrec     Limit the number of records dumped per partition.
                The default is not to limit.

  -part N       Dump only the specified partition. The default is
                to dump all partitions.

  -p period     Dump every N'th record in a partition, starting
                with the first record not skipped (see -skip).
                The period must be greater than 0. The default
                is 1.

  -skip N       Skip the first N records in each partition. The
                default is 0.

  -x            Use the system config file rather than the one
                stored in the dataset.

If an option occurs multiple times, the last one takes effect.
The -field option is an exception: each occurrence adds to the
list of fields to be dumped.

EXAMPLES:
Dump all records of all partitions of a parallel file named
small.ds. Precede each value by its field name and a colon.

orchadmin dump -name small.ds

Dump the value of the customer field of the first 99 records
of partition 0 of big.ds.

orchadmin dump -part 0 -n 99 -field customer big.ds
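
The -skip, -p and -n options from the help text above can be combined to sample a large dataset instead of dumping all of it. The following is only a sketch: the function name ds_sample and the DRY_RUN switch are made up for illustration, and it assumes the DataStage environment is already set up in the shell so that orchadmin is on the PATH.

```shell
# Hypothetical helper (not part of DataStage): sample a dataset with
# "orchadmin dump", using only options listed in the help text.
# Usage: ds_sample DATASET [SKIP] [PERIOD] [LIMIT]
ds_sample() {
    ds=$1
    skip=${2:-0}      # -skip: records to skip in each partition
    period=${3:-1}    # -p: dump every N'th record after the skip
    limit=${4:-10}    # -n: maximum records dumped per partition
    # If DRY_RUN is set and non-empty, print the command instead of running it.
    ${DRY_RUN:+echo} orchadmin dump -name -skip "$skip" -p "$period" -n "$limit" "$ds"
}

# Example: skip 10 records, then take every 100th, at most 5 per partition.
# ds_sample big.ds 10 100 5
```

Setting DRY_RUN=1 before calling the function is a cheap way to check the generated command line before running it against a real descriptor file.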
BugFree
Participant
Posts: 82
Joined: Wed Dec 13, 2006 6:02 am

Post by BugFree »

I don't know of any such tool, but the "orchadmin dump" command can be used from the Unix command line to view the data. It has options such as -n, to limit the number of records fetched from each partition, and -part, to select the partition the records are fetched from. Run "orchadmin -help" to get the documentation for the command.
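
As a rough illustration of -n and -part together, a small loop can show the first few records of each partition in turn. This is a sketch under assumptions: the function name ds_peek_parts and the DRY_RUN switch are hypothetical, the partition count has to be supplied by the caller, and orchadmin is assumed to be available in the current shell environment.

```shell
# Hypothetical helper (not part of DataStage): show the first LIMIT records
# of each partition, one partition at a time.
# Usage: ds_peek_parts DATASET NPARTS [LIMIT]
ds_peek_parts() {
    ds=$1
    nparts=$2         # number of partitions; assumed to be known by the caller
    limit=${3:-5}     # -n: maximum records to dump from each partition
    part=0
    while [ "$part" -lt "$nparts" ]; do
        echo "== partition $part =="
        # If DRY_RUN is set and non-empty, print the command instead of running it.
        ${DRY_RUN:+echo} orchadmin dump -name -part "$part" -n "$limit" "$ds"
        part=$((part + 1))
    done
}

# Example: first 5 records of partitions 0 and 1 of big.ds.
# ds_peek_parts big.ds 2
```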
Ping me if I am wrong...