Track and report non-responsive followers when quorum rounds age-out #17

Open
GeoffreyPS wants to merge 2 commits into chassisframework:main from GeoffreyPS:gps/report-node-responses-on-aged-out-heartbeats

@GeoffreyPS GeoffreyPS commented Mar 16, 2026

What it does

This pull request includes minor changes to the QuorumStatus module to allow Craft to report on nodes that did not send heartbeat responses by the time a quorum round stops being tracked.

With these changes, Craft can publish a telemetry event for each follower that did not respond in time whenever a quorum round ages out. The new telemetry event is [:craft, :quorum, :miss].

Effectively, this gives us information on non-responsive followers without having to wait for such a follower to eventually send us its heartbeat response, which is when we ordinarily publish the :missed_deadline telemetry event.
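
To illustrate the idea, here is a minimal sketch of how the followers to report could be derived when a round ages out. It is not the code in this pull request: the module name QuorumMissSketch, the function miss_events/3, and the measurement and metadata keys (count, round, follower) are all assumptions made for the example. Only the event name [:craft, :quorum, :miss] comes from the description above.

```elixir
defmodule QuorumMissSketch do
  @moduledoc false
  # Hypothetical helper for illustration only; not part of Craft.

  # Event name taken from the pull request description.
  @event [:craft, :quorum, :miss]

  # Builds one {event, measurements, metadata} tuple for every follower that
  # had not responded by the time the round aged out.
  # The payload keys (:count, :round, :follower) are assumed, not Craft's.
  def miss_events(round_id, followers, responded) do
    followers
    |> MapSet.new()
    |> MapSet.difference(MapSet.new(responded))
    # Sort so the result order is deterministic.
    |> Enum.sort()
    |> Enum.map(fn follower ->
      {@event, %{count: 1}, %{round: round_id, follower: follower}}
    end)
  end
end
```

For example, QuorumMissSketch.miss_events(7, [:b, :c, :d], [:c]) yields one tuple for :b and one for :d. In an application that depends on the telemetry library, each tuple could then be passed to :telemetry.execute/3 as its event name, measurements, and metadata.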

Extra Info

Reviewing notes

  • Note: A few of the changes here are formatting only. I noticed this early enough to run the formatter before doing any new work and to commit the formatting change separately, which should make it easier to view the diff of what really changed. I may have a formatter mismatch and can look into getting my local environment to match if this is not desirable.

  • Additional work will be needed on either this branch or Add observability docs #16, depending on the order of their merging (assuming changes are accepted).

  • I do not know the best place to insert new tests to exercise this behavior. I would welcome suggestions.

@GeoffreyPS GeoffreyPS changed the title from "Track and report non-responsive nodes when quorum rounds age-out" to "Track and report non-responsive followers when quorum rounds age-out" Mar 16, 2026