Between May 27 and June 1, 2026, some Harness CI customers experienced pipeline execution failures with the error:
failed to call LE.RetryStartStep: context deadline exceeded
Affected customers saw intermittent CI pipeline failures during step execution.
Existing pipeline definitions, customer data, source code, and artifacts were not impacted.
The root cause was a deadlock in the Light Engine logging path. When the log service returned an error, the Light Engine log writer attempted to reacquire a mutex it already held. This caused the Light Engine process to freeze, which led to step execution timeouts and pipeline failures.
Harness Engineering took multiple mitigation steps during the incident, including:
The final fix addressed the mutex deadlock in the Light Engine log writer and prevented the same lock from being reacquired while already held.
Harness is taking the following actions to reduce recurrence risk: