Incident Summary: There was a recent incident related to delays in the evaluation of Asset Governance Rules, stemming from a queue build-up that caused temporary slowness in rule execution.
Timeline:
Root Cause Analysis: The delay was traced back to a build-up in the job queue utilized by the CCM Asset Governance feature. This model employs an asynchronous execution approach using a job queue, where rule executions are enqueued for processing. Workers asynchronously dequeue jobs from this queue to perform actual rule evaluations.
Analysis: The queue build-up was notable for specific types of evaluations with customers noticing slowness in Asset governance execution.
Immediate Resolution: To promptly address the issue, the team increased the replica count for the services involved, facilitating quicker job consumption from the queue.
Total Downtime: There was no downtime during the incident
Follow-up Actions: