On August 28 customers using deployment dashboards in Production encountered “data staleness”. No data loss occurred. All other product areas remained fully functional.
The ETL service that powers these dashboards had a memory spike which paused further processing of raw pipeline data.
Deployment dashboards served stale data via custom dashboards.
Enhance monitoring & alerting – Add monitoring on ETL service to alert when a certain threshold of memory is breached and update capacity if needed.