Date & time of incident:
Wednesday, February 1, 2012 - 10:00
Post date:
Wednesday, February 1, 2012 - 10:29
Incident Description:
The LCGR database is experiencing problems currently. We suspect the network connectivity issues of one of the database nodes (LCGR2), which influences whole database. In order to confirm it and isolate the problem we need to shut down this node temporarily. It means that some of your applications can suffer performance deteriorations and some can notice broken connections.
Database services which can feel the strongest performance deteriorations:
- CMS_DASHBOARD,
- ATLAS_DASHBOARD
Other affected services which can fell performance deteriorations (with lower probability):
- ALICE_DASHBOARD
- ATLAS_DASHBOARD_DM
- ATLAS_DASHBOARD_PROD
- LCG_DASHBOARD
- LCG_FCR
- LCG_FTS
- LCG_FTS_MONITOR
- LCG_FTS_T2
- LCG_FTS_T2_W
- LCG_FTS_W
- LCG_GRIDMAP
- LCG_GRIDOPS
- LCG_GRIDVIEW2
- LCG_LFC
- LCG_OPS
- LCG_SAM_PI
- LCG_SAM_PORTAL
- LCG_SAM_PPS
- LCG_SAME
- LCG_SITEMON
- LCG_SWAT
- LCG_SYSTEM_JOBS
- LCG_VOMS
- LCGR_BACKUP
- LHCB_DASHBOARD
Service Element Affected:
DB & Application Platform Service for Projects & Experiments
Impact:
Service is degraded
Status:
Resolved
Resolution date:
Wed, Feb 1, 11:50
Posted by:
IT-DB
Unit responsible for resolution:
IT Department