CERN Accelerating science

BATCH Service affected over the weekend

 
Date & time of incident: 
Friday, November 4, 2011 - 23:00
Incident Description: 

Due to a hardware problem, the response time of the LSF master nodes for the BATCH service was severely degraded over the weekend.

On Monday morning, a failover to the secondary LSF master was performed while the primary machine is being repaired.

We will update this incident when we revert to normal operations. There will be another short interruption when the primary node is brought back into production.

Service Element Affected: 
Batch Service
Impact: 
Service is degraded
Status: 
Resolved
Resolution date: 
Monday, November 7, 2011 - 17:30
Expected resolution or Next Update Time: 
Monday, November 7, 2011 - 14:00
Posted by: 
IT-PES
Unit responsible for resolution: 
IT Department