Status
From Wiki
There is a ganglia server running at http://www.cs.unc.edu/ganglia that shows the current load on the nodes.
Contents |
Issues
Outstanding
InfiniBand bandwidth for larger packets on connections originating on comp nodes is operating about about %40 of what it could be. We have a ticket open with QLogic attempting to resolve this.
When MPI jobs are deleted via qdel, they sometimes keep running on the grid nodes.
--Eddale 12:09, 29 June 2009 (EDT)
Resolved
MPI is now running fine through the grid engine, even when running jobs larger than 100 nodes in size.
Downtime
The first Wednesday of every month is tentatively planned as a maintenance day. A posting will be made to the mailing list detailing the anticipated downtime in advance of the maintenance.
