Date & time of incident:
Tuesday, May 14, 2013 - 10:00
Post date:
Tuesday, May 14, 2013 - 11:14
Incident Description:
Some users are experiencing intermittent login failures to lxplus.cern.ch (SLC6). We're investigating the problem.
For users blocked by this, the SLC5-based lxplus service is still available via the lxplus5.cern.ch alias.
Please note that this also affects user acron jobs.
Updates on this incident will be provided regularly (see below).
Service Element Affected:
LXPLUS Service
Impact:
Service is degraded
Status:
Resolved
Resolution date:
Thu, Jul 25, 10:00
Posted by:
IT-PES
Unit responsible for resolution:
IT Department

Updates
Update on 25 July:
Update on 25 July:
A patched version of sssd has now been deployed across lxplus for last two days. Since there have been no instances of sssd crashing and logins to lxplus are reliable.
Update on 19 July:
Update on 19 July:
A temporary fix has been provided by Red Hat for this problem. It was deployed in LXPLUS yesterday (18/07/2013).
All LXPLUS users are encouraged to report any problem they may encounter when logging to LXPLUS.
Update on 21 June:
Update on 21 June:
Although a final solution has not yet been found, further countermeasures have been applied to LXPLUS service which have drastically reduce the number of interactive login failures. Investigation is still ongoing.
Update on 10 June:
Update on 10 June:
While the underlying cause of the lxplus intermittant login problems is not resolved there is now enough monitoring and automation in place to fix the problem when it arises in a timely fashion.
We will provide more news asap.