-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add alert for rhel6 running vms #486
Conversation
4bb299e
to
d9748c2
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good. I have only small comments.
@akrejcir I saw that a |
Can you add code to the failing test, that will skip this specific alert? Then later we can remove it, when the runbook exists. |
d9748c2
to
253fb74
Compare
Thanks @akrejcir for your review! I pushed the changes to address your comments. PTAL! Thanks |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we actually need a new controller for this or could this be implemented as an operand like we already do for other parts of ssp?
We discussed this before, and I suggested creating a controller. The reason is that the operands in ssp-controller are only active if the SSP object is present in the cluster, and they can only watch objects they own. In the case of this PR, the VMs are not owned by ssp, so operand would not work. Also, it would have worse performance, because on every update of a VM, the whole ssp-controller logic would run. |
Expr: intstr.FromString("sum by (namespace, name) (kubevirt_vm_rhel6) > 0"), | ||
Annotations: map[string]string{ | ||
"summary": "VM {{ $labels.namespace }}/{{ $labels.name }} is based on RHEL6 template, and this will not be supported in the next release", | ||
//"runbook_url": runbookURLBasePath + Rhel6AlertName, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How can the runbook / runbook_url
be created?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks! The runbook will be created under kubevirt/monitoring project
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Can it be created in advance? Or only after the metric is available? (Sorry, I don't know much about runbooks)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't know much neither :) In general, I think the two things can go in parallel. Personally, I think that documentation that says there is something, but in fact there is not, is worse than providing something without saying it. But I am open to other points of view. If you prefer to firstly update the documentation, no problem at all. WDYT?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@fossedihelm every alerts must have a runbook in advance. Changing this test just to get the alert in the code is not good practice. The test was set in place for a reason.
Please revert the change to the test and add a runbook and runbook_url annotation to the alert asap. Thank you.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks @sradco kubevirt/monitoring#177 for the runbook.
ssp changes ready as soon the monitoring pr is merged. Thanks and sorry
I'm not against this approach tbh, however I'd like to see more justification in the PR/ReleaseNote/Commits about why we need this (RHEL 6 is ELS downstream but what does that mean here?) and some docs somewhere in the tree about the controller before we merge this if possible. |
RHEL6 guests will be not supported on RHEL9 hosts anymore [1]. And CNV 4.13 is planned to be on RHCOS 9, so RHEL6 cannot be supported in CNV 4.13 anymore. Adding a vm controller that watch for rhel 6 vms. Rhel6 template put a label in the vm with the format: rhel6-<workload>-<flavor>. This label is checked to determine whether a vm is rhel6 or not. [1] https://access.redhat.com/articles/973163#rhelkvm Signed-off-by: fossedihelm <[email protected]>
253fb74
to
f8be23c
Compare
Kudos, SonarCloud Quality Gate passed! 0 Bugs No Coverage information |
/hold |
/unhold |
/retest |
/lgtm |
/retest |
/test e2e-single-node-functests |
@danilo-gemoli: No presubmit jobs available for kubevirt/ssp-operator@master In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/retest |
@akrejcir Can you have a look again, thank! |
/lgtm |
/approve |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: akrejcir The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
What this PR does / why we need it:
RHEL6 guests will be not supported on RHEL9 hosts anymore [1].
And CNV 4.13 is planned to be on RHCOS 9, so RHEL6 cannot be supported in CNV 4.13 anymore.
Adding a vm controller that watch for rhel 6 vms.
Rhel6 template adds a label in the vm in the format:
rhel6-<workload>-<flavor>
.This label is checked to determine whether a vm is rhel6 or not.
[1] https://access.redhat.com/articles/973163#rhelkvm
Which issue(s) this PR fixes:
Fixes #
Special notes for your reviewer:
Release note:
/cc @akrejcir