Date: 2nd October 2017
Authors: Chris Mills
Status: In progress
Summary: Multiple containers went down on production applications and monitoring
Impact: Users were unable to log in

Root Cause

Investigating

The root cause is split into sections, one for each part of the system this incident affected.

Monitoring:

The monitoring containers were not running as a result of the stack being redeployed following the Keycloak issue on the 28th.

Prod Applications:

We are currently looking into the root cause of this.

Trigger

Investigating

Resolution

...

Reuben Noot [8:56 AM]
production doesn't look to healthy - getting internal server error just trying to get to https://apps.tis.nhs.uk/ui/

Action Items

Action Item | Type | Owner | Issue
 | mitigate/prevent | | 

Timeline


Supporting Information

...