4000 nodes and 62,976 cores in one Grid Engine cluster

Posted by Rayson Wed, 23 Jan 2008 05:31:00 GMT

The Texas Advanced Computing Center (TACC) will be bringing the Ranger Supercomputer online soon. And, SGE will be the batch system for the cluster.

As Ranger has close to 4000 nodes and 62,976 cores (each Sun Blade X6420 node has 4 quad-core Opteron processors), SGE 6.2 adds a number of scalability features to support this huge cluster:

  • running the scheduler as a thread of the qmaster
  • reducing the load report overhead
  • At the SC07 conference, DanT talked about TACC, the scalability improvements and other features in SGE 6.2.