SGE and Fedora 8 (or any bleeding edge glibc)

Posted by chris Wed, 28 Nov 2007 17:49:44 GMT

Running Grid Engine on an OS with a bleeding-edge version of glibc? The following (taken directly from here) may help:

Responding to a question about which versions of Linux would support a particular version of Grid Engine, Ben had this to say in his reply:

"...The only thing you need to change for this to work properly is ${SGE_ROOT}/util/arch, since only glibc up to 2.5 is defined as "supported" in that script, and Fedora 8 ships with glibc 2.7. Change the glibc check (around line 248) from 3|4|5) to 3|4|5|6|7)".

This is good advice in general for anyone running an operating system that ships a newer version of glibc than the one the SGE developers use in their lab and build facilities.
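Before editing the script, it can help to confirm which glibc your host actually reports. The following is only a rough sketch, not code from Grid Engine: it assumes a glibc-based Linux where Python's os.confstr("CS_GNU_LIBC_VERSION") returns something like "glibc 2.7", and the helper names and the two sets of minor versions are made up for illustration, mirroring the 3|4|5 and 3|4|5|6|7 patterns from Ben's reply.

```python
import os
import re

# Minor versions mirroring the patterns quoted above (illustrative only).
STOCK_MINORS = {3, 4, 5}
PATCHED_MINORS = {3, 4, 5, 6, 7}


def glibc_minor(version_string):
    """Return the minor number from a string such as 'glibc 2.7', or None."""
    match = re.search(r"(\d+)\.(\d+)", version_string or "")
    return int(match.group(2)) if match else None


def is_listed(minor, listed_minors):
    """True if the given glibc minor version appears in the listed set."""
    return minor in listed_minors


if __name__ == "__main__":
    try:
        # Assumption: only available on glibc-based systems.
        reported = os.confstr("CS_GNU_LIBC_VERSION")
    except (AttributeError, ValueError, OSError):
        reported = None

    minor = glibc_minor(reported)
    if minor is None:
        print("Could not determine a glibc version on this host.")
    else:
        print("Reported:", reported)
        print("Matches 3|4|5:", is_listed(minor, STOCK_MINORS))
        print("Matches 3|4|5|6|7:", is_listed(minor, PATCHED_MINORS))
```

If the first check prints False and the second prints True, your host is in the same situation as the Fedora 8 case described above.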

No IO usage measurements on Linux

Posted by chris Wed, 28 Nov 2007 17:41:36 GMT

Ever wonder why IO usage for Grid Engine jobs running on Linux systems is not captured in either the SGE accounting or reporting logs?

This message posted to the Users mailing list kicked off an interesting thread and even led to a new Enhancement Issue and a submitted patch.

It turns out that IO usage is always reported as "0.00000" under Linux because the built-in PDC code within Grid Engine does not have an easy way (under Linux) to learn about IO consumption on a per-task or per-process basis.

Some additional digging by the original poster revealed some interesting Linux kernel options:

The Linux kernel can be compiled with the CONFIG_TASKSTATS and CONFIG_TASK_IO_ACCOUNTING options, which enable simple per-process I/O usage to be counted through /proc/(PID)/io as well as the taskstats interface. The execd's PDC module is not aware of these interfaces, and therefore makes no attempt to count this usage under Linux.

Issue 2429 includes a submitted patch that makes the SGE PDC code aware of the I/O values reported in /proc/(PID)/io.
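To get a feel for what /proc/(PID)/io exposes, here is a small sketch that reads and parses the file. It is not the patch from the issue and has nothing to do with how the PDC code is written; it simply assumes the usual "name: value" layout of that file (counters such as rchar, wchar, read_bytes and write_bytes), and the function names are my own.

```python
def parse_proc_io(text):
    """Parse 'name: value' lines from /proc/<pid>/io into a dict of ints."""
    counters = {}
    for line in text.splitlines():
        name, sep, value = line.partition(":")
        value = value.strip()
        # Skip anything that is not a simple "name: <integer>" line.
        if sep and value.isdigit():
            counters[name.strip()] = int(value)
    return counters


def read_proc_io(pid="self"):
    """Read the I/O counters for a PID (defaults to the current process)."""
    with open(f"/proc/{pid}/io") as handle:
        return parse_proc_io(handle.read())


if __name__ == "__main__":
    try:
        for name, value in sorted(read_proc_io().items()):
            print(f"{name}: {value}")
    except OSError as err:
        # Typically means the kernel lacks I/O accounting or access is denied.
        print("No per-process I/O data available:", err)
```

If the file is missing on your system, that is a strong hint the kernel was built without the options mentioned above.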

How you can help:

  • For your particular flavor of Linux, determine whether the kernel options "CONFIG_TASKSTATS" and "CONFIG_TASK_IO_ACCOUNTING" are enabled in the default vendor-supplied kernel. Add this data as a comment on Issue 2429.
  • Test out the patch yourself
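
For the first item, a quick way to check the two options is to look at the kernel's build configuration. The sketch below is only a starting point under some assumptions: many distributions install the config as /boot/config-<kernel release>, and some kernels expose it as /proc/config.gz, but neither location is guaranteed to exist on your system. The function names are hypothetical.

```python
import gzip
import os
import platform

OPTIONS = ("CONFIG_TASKSTATS", "CONFIG_TASK_IO_ACCOUNTING")


def option_states(config_text, options=OPTIONS):
    """Map each option name to True if the config text sets it to 'y'."""
    states = {name: False for name in options}
    for line in config_text.splitlines():
        name, sep, value = line.strip().partition("=")
        # Lines like "# CONFIG_X is not set" have no "=" and are skipped.
        if sep and name in states:
            states[name] = (value == "y")
    return states


def load_kernel_config():
    """Return the running kernel's config text, or None if not found."""
    boot_path = f"/boot/config-{platform.release()}"  # common, not universal
    if os.path.exists(boot_path):
        with open(boot_path) as handle:
            return handle.read()
    if os.path.exists("/proc/config.gz"):  # only present on some kernels
        with gzip.open("/proc/config.gz", "rt") as handle:
            return handle.read()
    return None


if __name__ == "__main__":
    try:
        text = load_kernel_config()
    except OSError:
        text = None
    if text is None:
        print("Kernel config not found in the locations this sketch checks.")
    else:
        for name, enabled in option_states(text).items():
            print(f"{name}: {'enabled' if enabled else 'not enabled'}")
```

The output, along with your distribution and kernel version, is the kind of data worth adding to the issue.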