Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Date

Authors

Joseph (Pepe) Kelly Yafang Deng

Status

Documenting

Summary

Jira Legacy
serverSystem JIRA
serverId4c843cd5-e5a9-329d-ae88-66091fcfe3c7
keyTIS21-4513

There was an issue processing a Placement Create bulk upload file and a number of other files were not processed until the service was restarted

Impact

Users uploaded files were not processed.

We restarted the service to “skip over” the file that was problematic.

...

  • 13:55 - File uploaded and starts being processed.

  • 13:55-15:24 - Other users upload files and the file was being processed

  • 15:24 - Error logged

  • 15:25 - User report on Teams

  • 15:24-15:40 - Service monitored for signs that data was still being processed and once it became clear it wasn’t, the service was restarted

  • 15:53 - Service processes queued files

  • 17:29 - Admin user tried uploading the same Placement Create file again, and then it was processed successfully

...

Root Cause(s)

  • When admin users raised the query, the job has already spent 1.5 hours. (12:55:51 UTC - 14:25 UTC)

  • We thought the job was stalled, but it was not. Until generic upload service was restarted, the job had been processing.

  • Below image shows the record of 1702nd row in the spreadsheet. And there’re 1788 rows in total.

...

Action Items

Action Items

Comments

Owner

Lessons Learned

  • Look for logs to check if the job is really stalled.