On March 2, the prod8 environment became temporarily inaccessible due to a configuration issue during a platform deployment. The issue affected ingress routing for the platform UI, resulting in HTTP 404 responses when users attempted to access the environment.
The issue was quickly identified as an ingress configuration problem. A temporary mitigation was applied by updating the ingress configuration, which immediately restored access. A permanent fix is being implemented to prevent recurrence.
The issue was caused by a service configuration that generated incorrect ingress configuration during deployment. This caused the ingress controller to misroute incoming requests that did not match the expected path. As a result, these requests were directed to the default backend and returned 404 responses.
The problem was isolated to the ingress routing layer. Network connectivity and the Google Cloud Network Load Balancer were functioning normally.
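The fallback behavior described above can be illustrated with a small model of ingress rule matching. This is a simplified sketch, not the ingress controller's actual algorithm: the hostnames, the `Rule` type, and the `route` function are hypothetical, and path matching is reduced to a plain string-prefix check. It only shows why a request that matches no generated rule ends up at the default backend.

```python
from dataclasses import dataclass

# Label for the catch-all backend that answers unmatched requests with 404.
DEFAULT_BACKEND = "default-backend (404)"


@dataclass(frozen=True)
class Rule:
    host: str     # hostname the rule applies to (hypothetical values below)
    path: str     # path prefix the rule applies to
    backend: str  # service that receives matching requests


def route(rules, host, path):
    """Return the backend for a request, or the default backend if no rule matches."""
    candidates = [r for r in rules if r.host == host and path.startswith(r.path)]
    if not candidates:
        return DEFAULT_BACKEND
    # Prefer the most specific (longest) matching path prefix.
    return max(candidates, key=lambda r: len(r.path)).backend


# Assumed shape of a correct rule set versus a mis-generated one.
expected_rules = [Rule("prod8.example.com", "/", "platform-ui")]
misgenerated_rules = [Rule("wrong-host.example.com", "/", "platform-ui")]

print(route(expected_rules, "prod8.example.com", "/login"))
print(route(misgenerated_rules, "prod8.example.com", "/login"))
```

With the expected rule set the request reaches `platform-ui`; with the mis-generated rule set nothing matches and the request falls through to the default backend, which is consistent with the 404 responses users saw while the load balancer and network remained healthy.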
Engineering teams applied a temporary mitigation by patching the platform-ui ingress configuration in production to remove the incorrect host entries. This restored correct routing behavior and resolved the accessibility issue.
Access to the prod8 environment was fully restored after the ingress configuration update.
To prevent recurrence of this issue, the following steps are underway:
These improvements are intended to ensure that similar misconfigurations are caught before reaching production environments.
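One way such a misconfiguration could be caught before deployment is a check that compares the hosts in a generated ingress manifest against an allow-list. The sketch below is illustrative only and is not the permanent fix described in this report: the `unexpected_hosts` function, the allow-list, and the hostnames are assumptions, and the manifest is assumed to follow the `networking.k8s.io/v1` Ingress layout (`spec.rules[].host`).

```python
def unexpected_hosts(ingress, allowed_hosts):
    """Return hosts from the manifest's rules that are not in the allow-list.

    A rule without a host applies to every hostname, so it is reported too.
    """
    rules = ingress.get("spec", {}).get("rules") or []
    found = []
    for rule in rules:
        host = rule.get("host")
        if host is None:
            found.append("<no host>")
        elif host not in allowed_hosts:
            found.append(host)
    return found


# Hypothetical generated manifest containing one stale host entry.
manifest = {
    "apiVersion": "networking.k8s.io/v1",
    "kind": "Ingress",
    "metadata": {"name": "platform-ui"},
    "spec": {
        "rules": [
            {"host": "prod8.example.com", "http": {"paths": []}},
            {"host": "stale.example.com", "http": {"paths": []}},
        ]
    },
}

bad = unexpected_hosts(manifest, {"prod8.example.com"})
if bad:
    print("Blocking deployment, unexpected ingress hosts:", bad)
```

Run as a step in the deployment pipeline, a check like this would fail the rollout when the generated manifest contains host entries outside the expected set, instead of letting the ingress controller pick them up in production.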