Date & time of incident:
Thursday, July 4, 2013 - 20:12
Post date:
Thursday, July 4, 2013 - 20:59
Incident Description:
There is a triple disk failure on the user home directory server afs135. We are looking into it.
Service Element Affected:
AFS Service
Impact:
Service is degraded
Status:
Resolved
Resolution date:
Friday, July 5, 2013 - 14:30
Posted by:
IT-DSS
Unit responsible for resolution:
IT Department
Updates
All volumes should now be restored.
All user and "p." project volumes are back online. The "q." project volumes are still being restored.
The root cause of this incident is a multiple disk failure in one of the storage enclosures that took out 7 disks at the same time.
About 93% of the affected volumes could be brought back online via a FibreChannel reconfiguration.
The remaining volumes (those on the completely broken array) are currently being restored from backup and should be back online by midday on Friday.
The following user home directories are affected:
user.amccrea
user.aponni
user.battagl
user.cddmgr
user.fcaponio
user.jawillia
user.jbernhub
user.lish
user.oreardon
user.orestano
user.sblusk
In addition, parts of the projects lemondump and acc need to be restored.