-
Notifications
You must be signed in to change notification settings - Fork 451
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Permanent cluster member #222
Comments
Any workaround for this ? Can the client simply attempt to rejoin periodically as a workaround ? |
That is what we now do in MetalLB, periodic reJoin |
I noticed this in a fairly simple implementation. If there's a network outage to a single node, the other nodes correctly kick it out of the list, but the single node kicks everyone else out of their list as well, becoming isolated and never rejoining, even after the network comes back. Current workaround is to periodically check the list for a member count of 1 and rejoin if so. I'd love for |
Hi All,
I'm using MemberList to provide fast dead node detection in MetalLB, and I feel some feature that I'm writing around Memberlist should be included:
If I have 4 members on 4 nodes, and I have a network outage for 1 or 2 minutes, Memberlist communication will timeout and Memberlist will not recover, considering that the only member that is alive is the local member.
Would it make sense to you to:
The idea is to have the external code just call PermanentJoin(hostlist) when they see a change in K8S api
The text was updated successfully, but these errors were encountered: