Between 19:15 PST and 23:15 PST on 23 November 2025, customers on the EU1 cluster experienced an STO service outage. The outage was caused by a spike in memory usage, which pushed the pods into an unhealthy state.
The refid-cache sidecar was configured with a 256 MiB memory limit. During an unusually large CVE/EPSS data sync, its memory usage exceeded this limit and the container was OOMKilled. The repeated kills caused the pod to be marked unhealthy and enter a CrashLoopBackOff state, rendering sto-core unavailable in the EU1 cluster.
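For readers less familiar with how such a limit is expressed, the fragment below is a minimal sketch of a container-level memory limit in a Kubernetes pod spec. Only the container name (refid-cache) and the 256 MiB value come from this report. The surrounding structure is an assumption for illustration and is not copied from the actual manifest.

```yaml
# Illustrative fragment only; not the actual EU1 manifest.
spec:
  containers:
    - name: refid-cache        # sidecar named in this report
      resources:
        limits:
          memory: 256Mi        # a container exceeding this limit is OOMKilled
```

Because the limit applies to each container separately, a single sidecar that exceeds its own limit can be killed even when the other containers in the pod are well within theirs.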
Customer Impact: All customers on the EU1 cluster were unable to use STO.
Other Environments: No impact on non-EU clusters.
Duration: Approximately four hours (19:15 PST – 23:15 PST, 23 November 2025)
Immediate Fix
Action Items