-
Notifications
You must be signed in to change notification settings - Fork 5.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[BUG] salt-minion gets disconnected from salt-master #66715
Comments
Hi there! Welcome to the Salt Community! Thank you for making your first contribution. We have a lengthy process for issues and PRs. Someone from the Core Team will follow up as soon as possible. In the meantime, here’s some information that may help as you continue your Salt journey.
There are lots of ways to get involved in our community. Every month, there are around a dozen opportunities to meet with other contributors and the Salt Core team and collaborate in real time. The best way to keep track is by subscribing to the Salt Community Events Calendar. |
@tjyang I presume that the issue is not related to salt-master as it only depends on salt-minion version by my observation so far. salt-master:3006, salt-minion:3004 - the problem does not appear. There is no MultiMinionProcessManager in salt-minion:3004, so my assumption is that this very entity is somehow related to the problem. |
@gvfnix thanks for your usage sharing.
|
If anyone finds this while googling for their variant of
This is what happened to me: The new "Broadcom-Flavoured" Debian packages apparently mark /etc/salt/minion as a proper config file. Now debconf asks whether to overwrite the pre-existing salt config files, which won't work if running e.g. in cloud-init. This left the After starting the salt-minion service, the minion did start up, but did run (accorting to it's debug log, which I enabled for discovering this), some The effect for me was that the salt-minion was first running, but then after 90 seconds terminated itself during this phenomenon. I suspect that this SIGTERM from within the salt-minion process tree itself makes salt-minion struggle? My solution will be to suppress the interactive "overwrite? (Y/I/N/O/D/Z)" with apt/dpkg means. |
Description
During the day I found out that recently updated salt-minion does not respond:
Minion itself seems to be pretty happy though:
When I try to restart the minion, it does not stop in a neat way:
Seems like MultiMinionProcessManager hangs and does not get terminated by systemd:
After killing MultiMinionProcessManager and restarting salt-minion.service, minion normally connects to the master for the next couple of hours. During that time of normal work minion can be restarted with no problem.
The issue does not reproduce in salt-minion 3004, but reproduces in salt-minion 3006.
Setup
salt-master 3007.1 runs in a container (based on Ubuntu 22.04) orchestrated by Kubernetes, has 4 CPUs and 8Gi of memory, resources are guaranteed.
Master config:
Minion 3007.1 run on Ubuntu Server 22.04.
Minion config:
Steps to Reproduce the behavior
Wait a couple of hours, then try to run any module at salt-master targeting salt-minion.
Expected behavior
The module should run normally.
Versions Report
The text was updated successfully, but these errors were encountered: