Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

Date

Authors

Joseph (Pepe) Kelly John Simmons (Deactivated) Yafang Deng Reuben Roberts Jayanta Saha Edward Barclay

Status

Resolved

Summary

Person search sync job failed

Impact

Non-technical Description

We run a number of sync jobs overnight. This one failed - another process was taking place that prevented it from successfully running.

We re-ran the job shortly afterwards and it completed successfully.

We investigated what tripped it up and will work to mitigate a recurrence.


Trigger

  • garbage collection activity taking longer than expected and eating into the sync job schedule.

Detection

  • monitoring-prod Slack alert.


Resolution

  • re-running the job as soon as it was noticed.


Timeline

  • 2022-02-18|01:41: Sync [Person sync job] failed with exception…” message in Slack monitoring-prod channel.

  • 2022-02-18|03:13: Team member restarted the job when they notice the issue.

  • 2022-02-18|03:24: Rerun job completed successfully.


Root Cause(s)


Action Items

Action Items

Owner


Lessons Learned

  • .

  • No labels

0 Comments

You are not logged in. Any changes you make will be marked as anonymous. You may want to Log In if you already have an account.