Documentation/Cluster job logs

From systems
Jump to: navigation, search

To find out why a job died look at the log file with the queue id number.

grep <queue id> <path to log file which is /opt/gridengine/default/spool/qmaster/messages>

example:

qstat -f

all.q@gaianode-0-0.local BIP 2/4 2.01 lx26-amd64 1239010 0.48611 j1u78 oa2134 r 04/13/2007 19:27:32 1


grep 1239010 /opt/gridengine/default/spool/qmaster/messages

errors for this job only will be listed.