critical target error message on VCH console #8112
Comments
If you still have that VCH around, please could you supply the Additionally, given this recreates for you, please could you:
This gives us a method by which we can gather additional logging if/when the problem recreates. Given the message about tools not running I am wondering if this is a possible recreate of #7680 and would love to get actionable data on that. Regarding the message on the console. I don't know why you're seeing this but...
vmware.log
Hi @hickeng, I've reproduced this on a VCH with debug=2. The VCH admin portal is unavailable, but I've attached the logs from /var/log/vic on the VCH.
@hickeng Any chance you could look at this? I'm seeing panic: runtime error: integer divide by zero appearing less than 24 hours after I create a VCH, necessitating a redeploy. This is currently a showstopper for my VIC testing as it interrupts the test run.
@RebeccaYo Apologies for the delay. I've taken a look at the logs you attached. It's possible that this is a variant of #7680. One very effective way of determining if this is the same issue is deploying with a static IP on the management network and seeing if the problem persists. If it does not then you can try with DHCP again and specifying If this is confirmed to be the same issue I'm very interested in knowing whether the panic is also present in the tether.debug when deployed with debug=2. We've struggled to get any traction on #7680 and any insight would be invaluable.
Hi George, thank you, I'll try that. In the meantime, here's the tether.debug for the VCH deployed with debug=2. tether.debug.txt
VIC version:
1.4.0
Deployment details:
./vic-machine-linux create --name WV-VCH --public-network-ip 192.168.10.100/24 --management-network PublicNetwork --insecure-registry 10.115.68.63:443 --compute-resource ordos12.eng.vmware.com --no-tlsverify --no-tls --thumbprint ... --target 10.115.68.198 --user [email protected] --image-store Tegile-Lun3a --volume-store Tegile-Lun3a:default --volume-store Tegile-Lun1a:Tegile-Lun1a --volume-store Tegile-Lun1b:Tegile-Lun1b --bridge-network VCH1BridgeNetwork --client-network Testbed --public-network Testbed --container-network Testbed:cn1 --dns-server 192.168.10.101 --public-network-gateway 192.168.10.101 --endpoint-cpu 1 --endpoint-memory 2048
Steps to reproduce:
After deploying the VCH, the error appears within a number of hours to days.
This same error has appeared on many different deployments of VCH, which were located on different datastores (I was making sure this wasn't because of a bad disk). Every time, the error is the same (including sector number) except for the number at the beginning of the error.
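To check that the message really is identical across deployments apart from the leading timestamp, the console lines can be compared field by field. The following is a minimal sketch, not part of VIC: the function names `parse_blk_error` and `same_error` are hypothetical, and the regular expression assumes the line format shown in this report (`[ <timestamp>] blk_update_request: <error>, dev <device>, sector <number>`).

```python
import re

# Assumed format, based on the console line quoted in this issue:
# [ 10.487831] blk_update_request: critical target error, dev sda, sector 15958016
BLK_ERROR = re.compile(
    r"\[\s*(?P<ts>\d+\.\d+)\]\s+blk_update_request:\s+(?P<kind>.+?),\s+"
    r"dev\s+(?P<dev>\S+),\s+sector\s+(?P<sector>\d+)"
)


def parse_blk_error(line):
    """Return the fields of a blk_update_request line, or None if it does not match."""
    m = BLK_ERROR.search(line)
    if m is None:
        return None
    return {
        "timestamp": float(m.group("ts")),
        "kind": m.group("kind"),
        "dev": m.group("dev"),
        "sector": int(m.group("sector")),
    }


def same_error(line_a, line_b):
    """True if both lines report the same error, device and sector (timestamp ignored)."""
    a, b = parse_blk_error(line_a), parse_blk_error(line_b)
    if a is None or b is None:
        return False
    return all(a[key] == b[key] for key in ("kind", "dev", "sector"))
```

Calling `parse_blk_error` on the console line from this report yields the device `sda` and sector `15958016`; `same_error` can then be applied to lines captured from two different VCH consoles.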
Actual behavior:
There is an error message on the console of the VCH.
[ 10.487831] blk_update_request: critical target error, dev sda, sector 15958016. See attachment.
Also, I cannot connect to the Docker daemon of this VCH at tcp://192.168.10.100:2375.
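For anyone trying to catch the moment the endpoint goes away during a long test run, a plain TCP probe is one way to poll it. This is a rough sketch under assumptions: the helper name `docker_endpoint_reachable` is hypothetical, and a successful connect only shows that something is listening on the port, not that the Docker API itself is healthy.

```python
import socket


def docker_endpoint_reachable(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port can be opened within the timeout."""
    try:
        # create_connection resolves the host and performs the TCP handshake.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False
```

With the address from this report, the call would be `docker_endpoint_reachable("192.168.10.100", 2375)`. Running it periodically and logging the first `False` would narrow down when the VCH stopped responding.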
Expected behavior:
After deploying the VCH, there was no error on the console and I did have connectivity to the Docker daemon.
Logs:
VCH Admin portal is inaccessible.
When I tried to enable ssh on the VCH, I received the following output:
INFO[0005] ### Configuring VCH for debug ####
INFO[0005] Validating target
INFO[0005]
INFO[0005] VCH ID: VirtualMachine:vm-2075
INFO[0005] Creating directory [Tegile-Lun3a] WV-VCH
INFO[0005] Datastore path is [Tegile-Lun3a] WV-VCH
INFO[0006]
INFO[0006] Installer version: v1.4.0-18893-6c385b0
INFO[0006] VCH version: v1.4.0-18893-6c385b0
ERRO[0006] Tools is not running in the appliance, unable to continue
ERRO[0006] Unable to enable ssh on the VCH appliance VM: Tools is not running in the appliance, unable to continue
INFO[0006] Collecting 67e398ac-44ad-4777-b58a-848f8b56df0f vpxd.log
ERRO[0006] Tools is not running in the appliance, unable to continue
ERRO[0006] --------------------
ERRO[0006] vic-machine-linux debug failed: debug failed
Additional details as necessary: