Hi All,
I have many parallel jobs, which create and populate datasets. After all the jobs finish, I want to get the count of records present in each of the datasets. All the datasets are stored in a single directory. I am aware of just one approach which doesn't look efficient to me - creating new jobs to pull the records from datasets and getting their counts.
Is there any easy and better approach?
Thanks in advance!
Record count from datasets
Moderators: chulett, rschirm, roy
Record count from datasets
Nitin Jain | India
If everything seems to be going well, you have obviously overlooked something.
If everything seems to be going well, you have obviously overlooked something.
You can use the UNIX command "orchadmin describe {dataset}" to get the information you are looking for.
<a href=http://www.worldcommunitygrid.org/team/ ... TZ9H4CGVP1 target="WCGWin">
</a>
</a>
Kumar - please tell me what the command line is (apart from the "orchadmin" one I mentioned)? I'm not aware of any other method of doing this so I am curious what option you are referring to.
<a href=http://www.worldcommunitygrid.org/team/ ... TZ9H4CGVP1 target="WCGWin">
</a>
</a>
ArndW wrote:Kumar - please tell me what the command line is (apart from the "orchadmin" one I mentioned)? I'm not aware of any other method of doing this so I am curious what option you are referring to.
$dsrecords ds_name
This can be found in UserGuide.pdf, and I dont find it any where else.
Impossible doesn't mean 'it is not possible' actually means... 'NOBODY HAS DONE IT SO FAR'
-
- Participant
- Posts: 54607
- Joined: Wed Oct 23, 2002 10:52 pm
- Location: Sydney, Australia
- Contact:
Thanks, that is probably more efficient than firing up the whole orchadmin.kumar_s wrote:$dsrecords ds_name
<a href=http://www.worldcommunitygrid.org/team/ ... TZ9H4CGVP1 target="WCGWin">
</a>
</a>