DataStage Internal Tables

Post questions here relative to DataStage Server Edition for such areas as Server job design, DS Basic, Routines, Job Sequences, etc.

Moderators: chulett, rschirm, roy

Post Reply
RCMURTHY
Participant
Posts: 1
Joined: Tue Aug 30, 2005 10:50 pm

DataStage Internal Tables

Post by RCMURTHY »

How to Get all table details like DS_AUDIT ,DS_JOBS and how to describe internal tables.
chulett
Charter Member
Charter Member
Posts: 43085
Joined: Tue Nov 12, 2002 4:34 pm
Location: Denver, CO

Post by chulett »

Welcome!

Keeping in mind the fact that these 'table details' are purposefully not published, you can find alot of information regarding them posted to these forums. Searching on the individual table names would be one way to start. You'll soon see who the Rock Stars* are. :wink:

Oh, and you would use LIST DICT tablename to describe them.

* hint: not me
-craig

"You can never have too many knives" -- Logan Nine Fingers
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

The tables that comprise the repository are deliberately incompletely described. "They" reserve the right to change them, and do so at most major releases. Further, some tables contain multiple record structures.
On this forum, and by inspecting Kim Duke's ETL toolkit, you can find out pretty much all that it is possible to know. Beware that this structure ceases to exist at the next release ("Hawk"); the Repository will henceforth be "open" with your choice of database, provided (initially) that you choose one of DB2, Oracle or SQL Server.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
kumar_s
Charter Member
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

So is the Hawk's Repository is built on a 3rd party database, and not on UV like Informatica?
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

The Hawk repository is no longer the DataStage flavor of UniVerse. Informatica has never used UniVerse files and cannot access them (without the 3rd party ODBC drivers).
kumar_s
Charter Member
Charter Member
Posts: 5245
Joined: Thu Jun 16, 2005 11:00 pm

Post by kumar_s »

Thanks for the repsonse.
BTW I actually ment Informatica uses 3rd party database like oracle to build and maintain its repository.
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
ray.wurlod
Participant
Posts: 54607
Joined: Wed Oct 23, 2002 10:52 pm
Location: Sydney, Australia
Contact:

Post by ray.wurlod »

In Hawk the same repository is used by not only DataStage but also InformationAnalyzer (formerly AuditStage, ProfileStage and some of the QualityStage investigation pieces), QualityStage and MetaStage. And you won't have direct access to the repository at all - there is a layer of services between you and it.
IBM Software Services Group
Any contribution to this forum is my own opinion and does not necessarily reflect any position that IBM may hold.
bradh
Participant
Posts: 2
Joined: Wed Nov 09, 2005 7:41 am

Post by bradh »

I am creating a DataStage job that will extract job name and last modification date from the DS_AUDIT file, sort the data and create a .csv file that I will mail to my manager. He needs the modification date for auditing. I understand that DS_AUDIT may change in the future. Does anyone see anything wrong with getting a list of jobs and change dates this way?


Thanks,

Brad
gateleys
Premium Member
Premium Member
Posts: 992
Joined: Mon Aug 08, 2005 5:08 pm
Location: USA

Post by gateleys »

Use the command LIST DICT DS_AUDIT to describe the DS_AUDIT file. You will see what are the fields in that file. Now, you can use either a hashed file or a UV stage to get the required fields, which in your case would be KEY (for job) and DTM (for Modification Timestamp).

gateleys
ArndW
Participant
Posts: 16318
Joined: Tue Nov 16, 2004 9:08 am
Location: Germany
Contact:

Post by ArndW »

Brad,

as long as you and/or the customer is aware that this method will cease to function (without modification) at some point in the next release then go ahead.
gateleys
Premium Member
Premium Member
Posts: 992
Joined: Mon Aug 08, 2005 5:08 pm
Location: USA

Post by gateleys »

Hey Arnd,
Is there any other way that we can collect the statistics other than from DS_AUDIT? If NO and also since the current repository will be massively revised in the next version, then we will be left with 2 options:
1. Wait for the client to upgrade to Hawk, and then figure out a way by going through it docs. Till then, tell the client that we cannot mess with DS_AUDIT since it will cease to exist in the next release.
2. Tell our client that the next release is no good/too expensive and use DS_AUDIT.


gateleys
kduke
Charter Member
Charter Member
Posts: 5227
Joined: Thu May 29, 2003 9:47 am
Location: Dallas, TX
Contact:

Post by kduke »

DS_AUDIT has been explained in several posts by Ray. EtlStats includes about 60 scripts most report on these tables like DS_JOBS and DS_JOBOBJECTS but a few also look at RT_LOG and RT_STATUS. There are lots of jobs included which extract this information to create your own metadata repository. These are all examples of how to use this information to make your ETL more powerful because you can intergrate row counts or job metadata.

I promise the next version will be much easier to install and use. I plan on cleaning up the jobs as well. A lot of these jobs you may only use once in a while. I have some jobs which I will share which extract all the table names which are cleared or truncated. I also extract all the table names from the system tables so I can find which tables are not truncated by comparing the results of the 2 jobs.

I did similar things with job sequences. I wanted to know which parameters were used in a sequence. To extract this and write your own SQL against the results to see if all developers setup each sequence the same. How consistent are your developers? Why not find out for sure instead of guessing.
Mamu Kim
Post Reply