CERN Accelerating science

SLC6 Batch Service degraded

 
Date & time of incident: 
Saturday, June 22, 2013 - 14:00
Incident Description: 

A significant fraction of the SLC6 batch capacity has crashed recently with kernel panics. We are investigating the cause and are restarting the machines.

The SLC5 batch capacity is unaffected.

(Jobs on nodes which have crashed will appear in the UNKWN state until the node recovers, at which point the jobs will be marked as failed.)

Service Element Affected: 
Batch Service
Impact: 
Service is degraded
Status: 
Resolved
Resolution date: 
Mon, Aug 12, 10:00
Posted by: 
IT-PES
Unit responsible for resolution: 
IT Department