A way to enforce leadership loss/switch. #4477

Closed
shreemaan-abhishek opened this issue Oct 5, 2022 · 5 comments · Fixed by #4977


shreemaan-abhishek commented Oct 5, 2022

Is your enhancement related to a problem? Please describe

In the scope of leader election in a distributed environment, one of the best practices is that a given instance should not remain the leader forever. After a certain interval, the leader instance should give up leadership so that a new election happens and a new leader is chosen.

Using the fabric8 Kubernetes client to implement leader election among the replicas of a pod, we can observe that a leader is elected, and that leadership is transferred to a different pod once the leader pod dies. The only missing piece of the puzzle is that the fabric8 Kubernetes client has no way to force the leader pod to give up its leadership status.

Such a capability would let the client enforce a leadership switch periodically, using standard scheduling facilities provided by the JDK.

Describe the solution you'd like

The solution can be as simple as this method being marked as public.
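If such a method were public (called stopLeading() below purely as a hypothetical name; the actual method's name may differ), the periodic switch could be driven by a standard JDK scheduler. A minimal sketch:

// hypothetical sketch: assumes a public stopLeading()-style method on LeaderElector
ScheduledExecutorService scheduler = Executors.newSingleThreadScheduledExecutor();
scheduler.scheduleAtFixedRate(elector::stopLeading, 30, 30, TimeUnit.MINUTES);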

Describe alternatives you've considered

To enforce leadership loss we can:

  • call ExecutorService.shutdownNow() on the ExecutorService running the leader election (that is, interrupt/kill the leader-election thread); see the sketch after this list

  • update the lease lock with a LeaderElectionRecord having an empty holder identity.
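
A minimal sketch of the first alternative, assuming the blocking LeaderElector.run() is submitted to a dedicated executor (the elector variable here is illustrative):

// run the blocking election loop on an executor we control
ExecutorService executor = Executors.newSingleThreadExecutor();
executor.submit(elector::run);
...
// later: force leadership loss by interrupting the election thread
executor.shutdownNow();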

Additional context

After taking a look at the leader-election example code shared in this repo, I think the fabric8 leader-election interface was never intended for a use case like this (leader election among replicas).


shawkins commented Oct 5, 2022

With #4318 you can do:

LeaderElector elector = kubernetesClient.leaderElector().withConfig(new LeaderElectionConfigBuilder().withReleaseOnCancel(true)...

Future<?> theFuture = elector.start();
...
// then when you call cancel on the future, it will release the leadership if currently held
theFuture.cancel(true);


shreemaan-abhishek commented Oct 6, 2022

@shawkins - Thank you so much for the response.

We tried this out; the idea was to switch the leadership periodically from one instance to another by making the leader instance lose its leadership. We observed the leadership status being passed on to other instances a couple of times, but after that, theFuture.cancel() did not yield leadership loss.

This is why I am looking for an appropriate way to do this using an API method.


shawkins commented Oct 6, 2022

> We observed the leadership status being passed on to other instances a couple of times, but after that, theFuture.cancel() did not yield leadership loss.

Yes, it will only release if the current instance is the leader. That means you can get the behavior you want by using the onStartLeading callback to create a timer task (or similar) that cancels the current leader after whatever interval you see fit. Something like:

  private void startLeaderElector() {
    AtomicReference<Future<?>> startFuture = new AtomicReference<>();
    LeaderElector elector = kubernetesClient.leaderElector()
        .withConfig(
            new LeaderElectionConfigBuilder()
                .withReleaseOnCancel(true)
                ...
                .withLeaderCallbacks(new LeaderCallbacks(
                    () -> {
                      // onStartLeading: schedule a cancel to give up leadership later.
                      // Fine to run over the default pool as the start work is non-blocking.
                      CompletableFuture.delayedExecutor(30, TimeUnit.MINUTES).execute(() -> {
                        startFuture.get().cancel(true);
                      });
                      // do other stuff
                    },
                    () -> {
                      // onStopLeading: re-enter the election so this instance can lead again
                      startLeaderElector();
                      // do other stuff
                    },
                    s -> { /* onNewLeader: do something with the new leader's identity */ }))
                .build())
        .build();
    startFuture.set(elector.start());
  }

Obviously that's not very elegant, but I don't see that the Go client implementation is designed for this either.

Enhancements that would make this easier:

  • another config value for the total number of allowed renews or the total renewal duration (a hypothetical sketch follows this list)
  • add a cancel method to the LeaderElector, and pass the elector to at least the onStartLeading callback - but the signature change is breaking.
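
A hypothetical sketch of what the first enhancement might look like (withMaximumLeadershipDuration does not exist in the current builder; the name and semantics are illustrative only):

LeaderElectionConfig config = new LeaderElectionConfigBuilder()
    .withReleaseOnCancel(true)
    // hypothetical option: voluntarily give up leadership after this total duration
    .withMaximumLeadershipDuration(Duration.ofMinutes(30))
    ...
    .build();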

It may also be good to clarify / enforce that a LeaderElector instance should only have 1 election running at a time - simultaneous calls to run / start should be disallowed.

@manusa any thoughts?


stale bot commented Jan 4, 2023

This issue has been automatically marked as stale because it has not had any activity for 90 days. It will be closed if no further activity occurs within 7 days. Thank you for your contributions!

stale bot added the status/stale label Jan 4, 2023
shreemaan-abhishek commented

ping!

stale bot removed the status/stale label Jan 5, 2023
shawkins added a commit to shawkins/kubernetes-client that referenced this issue Mar 18, 2023
shawkins added a commit to shawkins/kubernetes-client that referenced this issue Mar 18, 2023
shawkins self-assigned this Mar 19, 2023
manusa added this to the 6.6.0 milestone Mar 22, 2023