Dataset Write Failure
Posted: Fri Feb 29, 2008 9:48 pm
I am getting the following fatal errors (in order) while loading data into datasets:
1. SRC_oe_order_header_all,0: Write to dataset on [fd 24] failed (Success) on node node1, hostname lxdscon.beckman.com
2. SRC_oe_order_header_all,0: Orchestrate was unable to write to any of the following files:
3. SRC_oe_order_header_all,0: /dstage1/Server/Datasets/Data_frm_OAGCRD.txt.dsadm.lxdscon.beckman.com.0000.0000.0000.5ba6.c9920787.0000.246beea0
4. SRC_oe_order_header_all,0: Block write failure. Partition: 0
5. SRC_oe_order_header_all,0: Failure during execution of operator logic.
8. SRC_oe_order_header_all,0: Fatal Error: File data set, file "/dstage1/store/Data_frm_OAGCRD.txt".; output of "SRC_oe_order_header_all": DM getOutputRecord error.
9. node_node1: Player 1 terminated unexpectedly.
10. main_program: APT_PMsectionLeader(1, node1), player 1 - Unexpected exit status 1.
11. main_program: Step execution finished with status = FAILED.
I read the related previous posts in the forum and did the following research.
1. I was running the job as isadmin.
2. Checked permissions for the folder where datasets are saved.
We have set the following directories for datasets and for the scratch disk:
resource disk "/dstage1/Server/Datasets"
resource scratchdisk "/dstage1/Server/Scratch"
We have full permissions on 'Server', where all the datasets are stored. 'store' is another folder where we save output files; it also has full permissions.
drwxrwxrwx 4 root root 4096 Feb 12 14:56 Server
drwxrwxrwx 6 root root 4096 Feb 13 10:18 Projects
drwxrwxrwx 4 root root 4096 Feb 29 19:11 store
drwxrwxrwx 2 root root 4096 Feb 22 15:16 Scratch
drwxrwxrwx 2 root root 4096 Feb 29 19:11 Datasets
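The listing above only shows the mode bits. A minimal sketch of a stricter check is to create a small file in each directory as the user the job runs under. The helper name `check_writable` and the probe file name are hypothetical; the paths are the ones from the configuration above.

```shell
#!/bin/sh
# Sketch: confirm the current user can actually create and write a file
# in a directory, rather than relying on the mode bits alone.
# check_writable and the .write_probe name are illustrative only.
check_writable() {
    dir=$1
    probe="$dir/.write_probe.$$"
    # Braces keep the shell's own redirection error off the terminal.
    if { echo test > "$probe"; } 2>/dev/null; then
        rm -f "$probe"
        echo "writable: $dir"
        return 0
    fi
    rm -f "$probe" 2>/dev/null
    echo "NOT writable: $dir"
    return 1
}

# Directories taken from the post; adjust for your own configuration file.
for d in /dstage1/Server/Datasets /dstage1/Server/Scratch /dstage1/store; do
    check_writable "$d" || failed=1
done
```

This should be run as the same operating system user that owns the job's processes, since a different account may see different results.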
3. Checked available space using the df command. We have plenty of space left in '/dstage1', where the data is stored.
[isadmin@lxdscon dstage1]$ df -h
Filesystem Size Used Avail Use% Mounted on
/dev/mapper/VG00-LogVol00 3.1G 175M 2.8G 6% /
/dev/cciss/c0d0p1 190M 13M 169M 7% /boot
none 3.8G 0 3.8G 0% /dev/shm
/dev/mapper/VG01-LogVol00 29G 13G 15G 47% /dstage1
/dev/mapper/VG00-LogVol01 6.0G 4.1G 1.7G 72% /home
/dev/mapper/VG00-LogVol05 10G 7.6G 1.9G 81% /opt
/dev/mapper/VG00-LogVol02 3.1G 54M 2.9G 2% /tmp
/dev/mapper/VG00-LogVol03 10G 2.5G 7.0G 26% /usr
/dev/mapper/VG00-LogVol04 3.1G 99M 2.9G 4% /var
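`df -h` reports free blocks for every mount, but a write can also fail when a filesystem runs out of inodes. The following is a rough sketch that reports both figures for the filesystem holding each directory. The helper names `free_kb` and `free_inodes` are hypothetical, and the `-i` option is assumed to behave as it does in GNU df.

```shell
#!/bin/sh
# Sketch: report free space and free inodes for the filesystem that
# holds a given directory. Helper names are illustrative only.
free_kb() {
    # -P forces the one-line-per-filesystem POSIX format; column 4 is "Available".
    df -Pk "$1" 2>/dev/null | awk 'NR==2 {print $4}'
}

free_inodes() {
    # Assumes GNU df, where column 4 of `df -i` output is IFree.
    df -Pi "$1" 2>/dev/null | awk 'NR==2 {print $4}'
}

# Directories taken from the post; empty values mean df could not read the path.
for d in /dstage1/Server/Datasets /dstage1/Server/Scratch /dstage1/store; do
    echo "$d: $(free_kb "$d") KB free, $(free_inodes "$d") inodes free"
done
```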
4. Checked file size limits using the ulimit -a command.
[isadmin@lxdscon dstage1]$ ulimit -a
core file size (blocks, -c) 0
data seg size (kbytes, -d) unlimited
file size (blocks, -f) unlimited
pending signals (-i) 1024
max locked memory (kbytes, -l) 32
max memory size (kbytes, -m) unlimited
open files (-n) 1024
pipe size (512 bytes, -p) 8
POSIX message queues (bytes, -q) 819200
stack size (kbytes, -s) 10240
cpu time (seconds, -t) unlimited
max user processes (-u) 131071
virtual memory (kbytes, -v) unlimited
file locks (-x) unlimited
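These limits are the ones for my interactive shell. A process started some other way may have inherited different limits, so here is a sketch of how the limits of a running process could be read on Linux. It assumes the kernel exposes `/proc/<pid>/limits`, which older kernels may not; the helper name `proc_limits` is hypothetical.

```shell
#!/bin/sh
# Sketch: print the file size and open files limits of a running process.
# Assumes a Linux kernel that provides /proc/<pid>/limits.
proc_limits() {
    limits_file="/proc/$1/limits"
    if [ ! -r "$limits_file" ]; then
        echo "cannot read $limits_file" >&2
        return 1
    fi
    grep -E 'Max (file size|open files)' "$limits_file"
}

# Example: inspect the current shell. Replace $$ with the PID of a job process.
proc_limits "$$" || true
```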
Any thoughts?
Thanks in advance!