CI pipelines whose Git connectors connect through a Harness Delegate did not report build status back to SCM/Git providers. Pipelines whose Git connectors connect through the Harness Platform were not affected.
Steps to resolve the issue immediately
Time | Event |
---|---|
10/05/2023 | |
3:30 AM | Delegate deployment |
10:53 AM | First customer ticket for status check not reporting back |
11 AM-12 PM | Two more customer tickets for the same issue |
12:20 PM | Incident channel created |
2:20 PM | Reproduced the issue: PR status checks were not reported when the connector used a delegate |
2:40 PM | Rolled back delegate in rings |
3:30 PM | Informed customer about rollback |
3:40 PM | Customer confirmed restoration |
Harness CI engineers tried to reproduce the issue in-house on several infrastructure types (Kubernetes, Harness Cloud, Virtual Machines, etc.), but it took some time to realize that it occurs only when the Git connector is set up to connect via a Harness Delegate instead of the Harness Platform. As soon as we realized this, we engaged the delegate engineering team, who helped roll the delegate back to a previous version that did not contain the offending code.
Add automation to catch this case, and set up internal alerts that fire when the issue occurs so that it can be handled proactively.
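
As an illustration only, a check of this kind could compare the status contexts a finished build was expected to report against what the Git provider actually received, and flag any that are still missing after a grace period. The sketch below is not Harness code: the function names, the `ci/build` context name, the five-minute grace period, and the shape of the status records (a list of dicts with a `context` key, loosely modeled on a typical commit-status API response) are all assumptions.

```python
from datetime import datetime, timedelta, timezone


def missing_contexts(reported_statuses, expected_contexts):
    """Return the expected status contexts the Git provider has not received."""
    # Hypothetical record shape: each reported status is a dict with a "context" key.
    reported = {status["context"] for status in reported_statuses}
    return sorted(set(expected_contexts) - reported)


def contexts_to_alert_on(build_finished_at, now, reported_statuses,
                         expected_contexts, grace=timedelta(minutes=5)):
    """Return the contexts that should raise an alert for one finished build.

    Nothing is returned while the build is still inside the grace period,
    so a status update that is merely slow does not trigger an alert.
    """
    if now - build_finished_at < grace:
        return []
    return missing_contexts(reported_statuses, expected_contexts)


if __name__ == "__main__":
    # Hypothetical build that finished ten minutes ago and reported nothing.
    finished = datetime(2023, 1, 1, 12, 0, tzinfo=timezone.utc)
    now = finished + timedelta(minutes=10)
    print(contexts_to_alert_on(finished, now, [], ["ci/build"]))
```

Running such a check separately for delegate-connected and platform-connected connectors would exercise the path that regressed here; how the reported statuses are fetched from the provider is left out of the sketch.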