Some customers who are using the Protection module in the APAC cluster experienced significant delays in anomaly detection. Processing of traces was delayed by 4–5 hours, resulting in late anomaly events. The issue was isolated to the APAC cluster; all other product areas and clusters continued to function normally.
The service that the Anomaly Detector depends on kept failing, which prevented the detector from getting the information it needed to run. Because of this, the system stopped processing new data and a large delay built up. Once the underlying issue was fixed, processing resumed, but extra capacity was needed to clear the backlog.
A subset of customers using the Protection module in the APAC cluster experienced delayed anomaly detections, with results appearing 4–5 hours later than expected. No data loss occurred, and no other clusters or product areas were affected.
To prevent this from happening again, we are
To be proactive and react faster, we are