Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 4 Current »

Date

Authors

Joseph (Pepe) Kelly Reuben Roberts

Status

Resolved

Summary

Person Search Sync Failed

Impact

Person search page was not showing some data between 07:35 and 08:00

Non-technical Description

The overnight sync procedure for TIS was unable to run. This meant only some trainees were being shown on the person search page.


Trigger


Detection

  • The issue was detected when errors were noted in the ‘Sync [Person sync job] started’ Slack notification, and the ‘Sync [Person sync job] finished’ notification failed to appear.


Resolution

  • Created a new production elasticsearch cluster to use based on the terraform description (instance_type = t3.medium.elasticsearch instead of the incorrectly set instance_type = t3.small.elasticsearch)

  • Manually triggered the Person sync job to rebuild the Person elasticsearch index.


Timeline

  • - 06:20 - Noted Person sync job errors on STAGE and PROD

  • - 06:29 - Quickest fix (simply re-running the job) observed not to resolve the issue on STAGE

  • - 07:35 - Question raised by user on Teams

  • - 07:40-07:50 - Rebuilt the elasticsearch cluster infrastructure as noted in ‘Resolution’ above.

  • - 07:51-07:58 - Manually re-ran the Sync job; trainees becoming visible during this time.

  • - 08:00 - Confirmed issue resolved with users


Root Cause(s)

  • The Person elasticsearch cluster had been incorrectly reconfigured on 11 Jan 2021 to use t3.small.elasticsearch instances. This caused the sync job to fail.


Action Items

Action Items

Owner

n/a


Lessons Learned

  • Manually running the terraform script without going through the normal pull-request and approval process for the TIS-OPS project exposes the infrastructure to a greater risk of being accidentally misconfigured.

  • No labels

0 Comments

You are not logged in. Any changes you make will be marked as anonymous. You may want to Log In if you already have an account.