When rpm package installation fails during Import Cluster operation, information about this problem is not provided #627
Comments
During installation of gluster-integration, modified the logic to check for any errors reported by ansible and populate them back to the job status. tendrl-bug-id: Tendrl/node-agent#627 Signed-off-by: Shubhendu <[email protected]>
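The idea behind this change can be illustrated with a minimal sketch. It is not the actual node-agent code: the function name `summarize_install_failure` and its parameters are hypothetical, and the only assumption about ansible is that a module result is a dict that may carry the conventional `failed`, `rc`, `msg` and `stderr` keys.

```python
def summarize_install_failure(result, package, node_label):
    """Return an error string for a failed install result, or None on success.

    `result` is assumed to be a dict shaped like an ansible module result
    (keys such as "failed", "rc", "msg", "stderr"); all other names here
    are hypothetical and used only for illustration.
    """
    failed = bool(result.get("failed")) or result.get("rc", 0) != 0
    if not failed:
        return None
    # Prefer the module's own message, then stderr, then a generic fallback.
    detail = result.get("msg") or result.get("stderr") or "no details reported"
    return "Could not install %s on %s: %s" % (package, node_label, detail)


# Example: a failed result is turned into a message for the job status.
error = summarize_install_failure(
    {"failed": True, "msg": "No package tendrl-gluster-integration available."},
    "tendrl-gluster-integration",
    "node-1",
)
if error is not None:
    print(error)
```

The point of the sketch is that the failure detail from the installer is carried into the reported message instead of being dropped, so the first error a user sees names the package and the reason.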
@mbukatov can you please verify this and mark as closed?
Using snapshot builds:
I reproduced the issue and see that the problem is detected and reported correctly:
For comparison, the same grep as used in the original report:
And full message as returned by api:
The only problem I notice with the error message is that the node on which the installation failed is referenced via its ID.
@shtripat How could I translate the node ID used in the message to a hostname? Is it described somewhere?
Also, after the error, I see that the cluster looks fine in the cluster list of the Tendrl UI:
But some values related to the affected node are not there, such as the "Brick Status" widget on the Host Dashboard:
Is this ok?
I have an additional question: what should I do when an error like this happens during import cluster, assuming I'm able to find out the root cause and fix it as in this case? @Tendrl/tendrl-qe @Tendrl/tendrl-core
It would need changes in code to show the node's FQDN instead of the node ID in the message. For the time being, to correlate, you can refer
I will send a separate PR for the same anyway.
Regarding
@brainfunked @r0h4n @nthomas-redhat I remember we talked about tagging the cluster with an error if the import failed and depicting the same in the UI. Comments?
@mbukatov sent Tendrl/commons#768 for showing the FQDN of the node in log messages instead of node IDs.
Fixed: Tendrl/commons@33ac94f
Using snapshot builds:
I reproduced the issue and see that the problem is detected and reported correctly, with a full hostname of the affected node:
So I consider this verified.
Fixed in current master branch, as noted in #627 (comment)
Description
When rpm package installation fails during the Import Cluster operation, neither information about this problem nor any further details are provided. One can only guess what has happened based on the errors which consequently follow.
Version
Latest snapshot build from master branch (it's part of upcoming 1.5.2 version):
Steps to reproduce
baseurl to something invalid (e.g. baseurl=file://root/tendrl-repo-which-does-not-exist/).
tendrl-gluster-integration is not possible on the selected machine.
jobs/${JOB_ID}/messages api call as well:
Actual Results
Even though the installation of the tendrl-gluster-integration package failed on one storage server, there is no direct indication of this on the page. The first error is about a missing configuration file, which is a consequence of the missing package:
The other way to check that no information about the installation failure is reported is to grep for details in the pretty version of the task messages.
No error related to the package name:
All errors reported by Tendrl (just 2 in this case) are just a consequence of the installation failure:
Expected Results
Tendrl should report an error stating that installation of the package failed, with details indicating why (based on the yum error) if possible.
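The grep-based check described under Actual Results can also be sketched in Python, which makes it easy to assert that at least one reported error mentions the package. This is only an illustration: the message shape used here (a list of dicts with `priority` and `text` keys) and the function name `errors_mentioning` are assumptions, not the confirmed format returned by the jobs/${JOB_ID}/messages call.

```python
def errors_mentioning(messages, needle):
    """Return the texts of error messages that mention `needle`.

    `messages` is assumed to be a list of dicts with hypothetical
    "priority" and "text" keys; adapt the key names to the real payload.
    """
    hits = []
    for message in messages:
        if message.get("priority") != "error":
            continue
        text = str(message.get("text", ""))
        # Case-insensitive match, similar in spirit to `grep -i`.
        if needle.lower() in text.lower():
            hits.append(text)
    return hits


# Hypothetical sample data resembling the situation in this report.
sample = [
    {"priority": "info", "text": "Installing tendrl-gluster-integration"},
    {"priority": "error", "text": "Configuration file is missing"},
]
# An empty result means no error names the package, which is the bug reported here.
print(errors_mentioning(sample, "tendrl-gluster-integration"))
```

With the expected behaviour in place, the same call on real job messages would return at least one entry naming the package and the reason for the failure.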
Additional Details
This problem was discussed at a daily meeting of Tendrl developers and is reported here with a full reproducer, based on agreement with @r0h4n.