Skip to content

Conversation

@dergeberl
Copy link

What this PR does / why we need it:

If a machine is deleted the it is possible that the load balancer do not know yet that the node is not available anymore. This results eventually in unsuccessful new connections as the node where the load balancer is forwarding the traffic is not existing anymore.

There are 2 ways a load balancer gets informed: 1. Via a reconcile of the load balancer in the cloud-provider package or the kube-proxy health check. Unfortunately the "drain" or conditions set by the MCM dose not trigger a reconcile or the health check to fail.

The ToBeDeletedByClusterAutoscaler Taint is used in both cloud-provider and kube-proxy, therefore we want to add this Taint to improve the load balancing during machine terminations.

This is new step is added between Initiate node drain/Delete Volume Attachments and Initiate VM deletion with one additional ShortRetry (5s) to give the cloud-provider and kube-proxy time to react before removing the server.

CC: @kon-angelo

Which issue(s) this PR fixes:
Fixes #

Special notes for your reviewer:

Release note:

Use `ToBeDeletedByClusterAutoscaler` Taint to improve load balancing during machine terminations.

@dergeberl dergeberl requested a review from a team as a code owner November 25, 2025 15:36
@gardener-robot gardener-robot added needs/review Needs review size/l Size of pull request is large (see gardener-robot robot/bots/size.py) needs/second-opinion Needs second review by someone else labels Nov 25, 2025
@aaronfern aaronfern self-assigned this Dec 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

needs/review Needs review needs/second-opinion Needs second review by someone else size/l Size of pull request is large (see gardener-robot robot/bots/size.py)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants