All Systems Operational

About This Site

For information on identifying the Cluster hosting your Harness account, visit our support page at https://developer.harness.io/docs/platform/Get-started/platform-concepts/view-account-info-and-subscribe-to-alerts

Prod 1: Operational, 99.98% uptime (past 90 days)
Continuous Delivery (CD): Operational, 100.0% uptime
Continuous Delivery - Next Generation (CDNG): Operational, 100.0% uptime
Cloud Cost Management (CCM): Operational, 99.94% uptime
Continuous Error Tracking (CET): Operational, 100.0% uptime
Chaos Engineering: Operational, 100.0% uptime
Continuous Integration Enterprise (CIE) - Cloud Builds: Operational, 100.0% uptime
Continuous Integration Enterprise (CIE) - Self Hosted Runners: Operational, 100.0% uptime
Custom Dashboards: Operational, 99.94% uptime
Feature Flags (FF): Operational, 99.98% uptime
Security Testing Orchestration (STO): Operational, 100.0% uptime
Service Reliability Management (SRM): Operational, 100.0% uptime
Prod 2: Operational, 99.97% uptime (past 90 days)
Continuous Delivery (CD): Operational, 99.98% uptime
Continuous Delivery - Next Generation (CDNG): Operational, 99.98% uptime
Cloud Cost Management (CCM): Operational, 99.92% uptime
Continuous Error Tracking (CET): Operational, 99.98% uptime
Chaos Engineering: Operational, 100.0% uptime
Continuous Integration Enterprise (CIE) - Cloud Builds: Operational, 99.98% uptime
Continuous Integration Enterprise (CIE) - Self Hosted Runners: Operational, 100.0% uptime
Custom Dashboards: Operational, 99.9% uptime
Feature Flags (FF): Operational, 99.95% uptime
Security Testing Orchestration (STO): Operational, 99.96% uptime
Service Reliability Management (SRM): Operational, 99.98% uptime
Prod 3: Operational, 99.98% uptime (past 90 days)
Continuous Delivery (CD): Operational, 100.0% uptime
Continuous Delivery - Next Generation (CDNG): Operational, 100.0% uptime
Cloud Cost Management (CCM): Operational, 99.94% uptime
Continuous Error Tracking (CET): Operational, 100.0% uptime
Continuous Integration Enterprise (CIE) - Cloud Builds: Operational, 100.0% uptime
Continuous Integration Enterprise (CIE) - Self Hosted Runners: Operational, 100.0% uptime
Custom Dashboards: Operational, 99.93% uptime
Feature Flags (FF): Operational, 100.0% uptime
Security Testing Orchestration (STO): Operational, 100.0% uptime
Service Reliability Management (SRM): Operational, 100.0% uptime
Security Testing Orchestration FirstGen (fka ZeroNorth): Operational, 100.0% uptime (past 90 days)
Service Reliability Management - Error Tracking FirstGen (fka OverOps): Operational, 100.0% uptime (past 90 days)
Software Engineering Insights FirstGen (fka Propelo): Operational, 99.94% uptime (past 90 days)
Operational
Degraded Performance
Partial Outage
Major Outage
Maintenance
Past Incidents
Oct 1, 2023

No incidents reported today.

Sep 30, 2023

No incidents reported.

Sep 29, 2023

No incidents reported.

Sep 28, 2023
Resolved - This incident has been resolved.
Sep 28, 17:36 PDT
Update - We are continuing to monitor the workaround in place for any issues. Additionally, we are seeing responses come back from our primary upstream provider (GCP), but we are awaiting a further status update or confirmation from the provider before confirming recovery.
Sep 28, 16:33 PDT
Monitoring - Harness has implemented a workaround for the majority of customers, which should restore CCM Perspectives as well as Custom Dashboards. However, our primary upstream provider (GCP), while showing some signs of recovery, is still encountering issues. Until the incident is fully resolved, Harness will continue to provide status updates as we closely monitor the workaround put in place to mitigate the issues with CCM Perspectives and Dashboards.

(Note: Existing GCP/CCM connector validation/test functionality will error out due to the workaround in place, but this does not affect product functionality.)

Sep 28, 14:51 PDT
Update - Harness has implemented a workaround for the majority of customers, which should restore CCM Perspectives as well as Dashboards. Our primary upstream provider is still experiencing issues but is showing signs of recovery, and we expect an update to the upstream status within the next 30 minutes.

Note: Existing GCP/CCM connector validation/tests will error out due to the workaround in place.

Sep 28, 14:39 PDT
Identified - We have identified that the issue we are experiencing is related to the Google BigQuery incident:

https://status.cloud.google.com/incidents/V8br4RDzg1RsCw6zWQEv

Sep 28, 13:53 PDT
Investigating - CCM Perspectives and Dashboards are down because of a GCP BigQuery incident. We are currently investigating this issue.
Sep 28, 13:23 PDT
Resolved - We are resolving the incident on the DockerHub side for now. Please follow the DockerHub status page for any further updates.
Sep 28, 17:24 PDT
Monitoring - Docker has identified and mitigated the issue and they are actively monitoring for any recurrences, and we will continue to monitor their status in parallel. Latest update from the provider:

We have mitigated the issue as of about 21:50 UTC, but will continue to monitor until an underlying incident at our provider is cleared.

Sep 28, 16:31 PDT
Investigating - DockerHub currently has an active incident involving timeouts when authenticating with or pulling from DockerHub (https://www.dockerstatus.com/pages/incident/533c6539221ae15e3f000031/6515dbc89fabff05350ae18d). Harness customers using Docker in their pipelines may run into issues because of the ongoing DockerHub incident. Please follow the DockerHub status page for further updates.
Sep 28, 14:16 PDT
Sep 27, 2023

No incidents reported.

Sep 26, 2023

No incidents reported.

Sep 25, 2023

No incidents reported.

Sep 24, 2023

No incidents reported.

Sep 23, 2023

No incidents reported.

Sep 22, 2023
Resolved - While the Google Cloud team continues to work on the mitigation on their end, we will close this incident as resolved since app.propelo.ai is functional in the U.S. region.
Sep 22, 16:12 PDT
Update - We are continuing to monitor for any further issues.
Sep 22, 14:40 PDT
Monitoring - app.propelo.ai is back up and running. We applied the workaround that Google suggested and made some changes to the configuration of the External Application Load Balancer, which resolved the issue. We will continue to monitor the situation.
Sep 22, 14:38 PDT
Investigating - app.propelo.ai is currently down in the U.S. region because of the GCLB issues on the GCP side that we reported this morning (https://status.cloud.google.com/incidents/skbvoobU4btcxyAthEmu#2c2sBHWU84yPDJ8y1ar4). This module uses an Application Load Balancer, and we are now seeing the impact. app.harness.io has no issues and continues to be functional.
Sep 22, 14:31 PDT
Resolved - While the Google Cloud team continues to work on the mitigation on their end, we will close this incident as resolved since it is isolated to the External Application Load Balancer and won't impact Harness. Thank you.
Sep 22, 13:47 PDT
Monitoring - Based on the GCP status page, the issue is isolated to the GCP External Application Load Balancer and shows up as elevated HTTP 4xx error rates. Harness should not be impacted; we will continue to monitor the situation.
Sep 22, 11:29 PDT
Investigating - GCP has posted an incident on their status page affecting Google Cloud Networking and Cloud Load Balancing (https://status.cloud.google.com/incidents/skbvoobU4btcxyAthEmu#2c2sBHWU84yPDJ8y1ar4). We have validated that app.harness.io is not impacted. We will continue to monitor the situation and post updates as they become available.
Sep 22, 11:02 PDT
Sep 21, 2023
Postmortem - Read details
Sep 21, 14:26 PDT
Resolved - We noticed spikes in Redis that caused the Feature Flag service to become inoperative. We have resolved the problem and are monitoring our services.

Get Ship Done!

Sep 21, 07:59 PDT
Investigating - We noticed a Redis CPU spike on FF that is causing FF services to stop responding, and we are working to identify the issue.
Sep 21, 07:53 PDT
Sep 20, 2023

No incidents reported.

Sep 19, 2023

No incidents reported.

Sep 18, 2023
Resolved - We can confirm normal operation. Get Ship Done!
We will continue to monitor and ensure stability.

Sep 18, 04:19 PDT
Monitoring - The issues with creating environments or target groups in Harness Feature Flags have been addressed, and normal operations have resumed. We are monitoring the service to ensure normal performance continues.
Sep 18, 03:43 PDT
Investigating - Users are not able to create new environments or target groups in the Feature Flag Module. We are working to identify the cause and restore normal operations as soon as possible.
Sep 18, 02:18 PDT
Sep 17, 2023

No incidents reported.