Date

Authors

Marcello Fabbri (Unlicensed) Liban Hirey (Unlicensed)

Status

Done

Summary

When a new RabbitMQ broker was created in AWS, we omitted migrating the ESR DataExport service to it (all other services were migrated fine). When the old broker was deleted it triggered errors on the DataExport service.

Impact

No Users reported any problems during the incident.

Non-technical Description

The ESR DataExport service was not migrated over in time therefore there was a possibility of having some application and notification confirmations missing following the deletion of the old broker.


Trigger

Deletion of the old RabbitMQ broker in AWS.


Detection

Slack notification from Sentry @ 10:11 on 30/07/2021.

Resolution

This list of GeneratedAppRecords appears to have been successfully exported by the EsrDataExportService, as seen on the PendingExport table. However some Placements have not updated their esr status on TIS and will require to be manually corrected.

Timeline

: 15:00 - Other services migrated to new RabbitMQ broker (STAGE)

: 11:25 - Other services migrated to new RabbitMQ broker (PROD)

: 10:05 - Old RabbitMQ broker deleted from AWS by Liban

: 10:11 - An alert in the #sentry-esr channel

: 10:32 - Pepe alerted the dev team

: 10:56 - Liban switches DataExport service over to the new RabbitMQ broker after realising that it doesn’t exist in it

: 11:11 - Liban and Marcello start investigating to verify whether any data is missing

e.g. check DB for Applicants exported 27th-29th:

Root Cause(s)

Deletion of the old RabbitMQ broker in AWS.

Dispersed/scattered configuration

Lessons Learned