Customers were unable to access https://app.harness.io/ for 2 minutes.
A recent deployment for the gateway component in the prod-1 environment had an incorrect configuration that downscaled all the gateway pods.
Resolution
Configuration was reverted to restore the service availability.
Time(UTC) | Event |
---|---|
5 Nov 12:52:50 PM | Service deployment downscaled the gateway pods. |
5 Nov 12:54:50 PM | Scaled-up gateway pods. New pods were up and running to serve traffic. |
On Nov 5, 2024, for 2 minutes, users experienced an HTTP 503 (service unavailable) error when attempting to access https://app.harness.io. This occurred due to the downscaling of the gateway service. The issue originated from a recent deployment that applied an incorrect configuration. The configuration was immediately reverted to restore service availability.
Improve Pre-Deployment Checks: Enhance pre-deployment checks to validate critical service configurations, to prevent unintended downscaling.