Date

Authors

Cai Willis

Status

Resolved

Summary

Impact

Revalidation application was functionally unavailable

Non-technical Description

A process which refreshes all of the data in the revalidation system was triggered accidentally on the production (“live”) environment - the intention was that it was only to be run on the staging (“testing”) environment. Once this process has started, it must be allowed to complete for data to be restored, and unfortunately it takes a very long time to complete.

Trigger

Accidental triggering of full production data sync

Detection

Noticed by developer


Resolution

Allowed process to complete naturally


Timeline

All times in GMT unless indicated

Root Cause(s)

Why did the connections list disappear on production

Why was an unscheduled full production data-resynchronisation triggered?

Why was the sync not terminated?

Why did the recommendations list disappear?

Why did the DTO naming conversion in the integration service stop working?


Action Items

Action Items

Owner

Fix the JSON mapping issue in recommendations, and revert any patchwork done in the FE to compensate

Cai Willis

Done

Automated backups or similar “built-in” to the sync process so that it can be aborted and restored as required

Cai Willis

Introduce batch messaging to speed up biggest bottleneck - judging by the work on the overnight doctor sync this could reduce the whole process down to a couple of hours

Already have work lined up in addressing the raised action e.g GMC sync work


Lessons Learned