-
Notifications
You must be signed in to change notification settings - Fork 28
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
1.5.0 Stability issues when boiler is not supporting enabled sensors #115
Comments
I believe I'm having the same issue. The symptoms are the same, and I have also recently upgraded to 1.5.0 but have not been able to get it running in any stable or reliable way. I think the issue might have something to do with MQTT as I get better results with it turned OFF but have not absolutely verified this yet. |
Hi both, |
Thanks @Daveblanche for the tip, but yes I definitely did do that as recommended. I think the issue is on the OT Gateway end not the HA end though. I'm going to continue to try to figure this out and will post more here as I make progress. |
@Daveblanche Regarding my stability, I had few reconnections yesterday, but at 23:00 I disabled logging (serial and telnet) and it failed only once a whole day for 40 seconds. |
Hi guys, I use S3 myself and tested the project on C3, but I can't reproduce the problem. Indeed, the web works slower when there is no connection to the boiler via openterm, and I will try to fix this. As for the fact that polling some IDs breaks the bus - I don't know why this could be. Perhaps there is some kind of bug in the boiler firmware. In the logs I did not see a poll of these IDs and loss of connection via OT. If you have more information it will help. |
Anyone who has problems with losing connection, test this build. And happy holidays! |
Thank you @Laxilef I will give it a try! |
I have tested 1.5.1, no reconnection since 2 hour, it works now. Page is responsive. No complaints. This state was not achievable on 1.5.0, Good job :) I'm interested about @tincanpete feedback. attaching logs.txt, maybe something minor or expected, I have few warnings. edit, I hit ctrl c in telnet terminal (my bad lol), then closed putty and wanted to turn it off in settings and page is laggy, problem detected in HA. RIP lol. After turning off telnet and restarting the ESP everythings is laggy again, so issue is still open :/ after 2 mins since boot it might went back to normal (but not sure, like 70% of the responsivity)?, I will see if I got any reconnections I think we need more detailed logs I have swapped to c3 and its about not usable. so it looks like s3 barely handles the problem, where c3 not. |
Hello, I have also just tested 1.5.1 and while I thought there was some initial success, its seems no. Having MQTT enabled definitely makes things significantly worse. I have attached a file showing repeated 'ping' to the board. When it's working well, I always see about 10-50ms. However as you can see it quickly becomes unstable and unresponsive; and will eventually sort-of come back to life but it seems quite random. During the time when ping is very slow or dropped, the UI is also unresponsive, and if connected, the MQTT server will report the device is off-line. Sometimes my boiler will also report an OpenTherm communication error on its display. When the problem goes away, everything goes back to normal and works OK, but often not long enough to be useful. I'm using S2-mini board, and this ping trace was done with the gateway serial port, telnet, and logging all turned OFF to try and make sure that high level logging wasn't causing high load to be part of the problem. Thanks for your help! |
@tincanpete In my case esp32-s3 seems fine, but when I changed to weaker esp-c3 it was nightmare, just not working. I guess I would have similar results as you. If you still have problems maybe try disabling sensors that are not available on your boiler (to minimize warnings in logs) I belive my pings.txt are fine, its wifi with signal 68-76% reported on OTG page. |
@SanFable I will try that, I did not use the "factory" image, just the normal one and upgraded via the UI. |
Still no joy unfortunately even after fully erasing and re-flashing the S2 and using the factory bin. I have attached a larger log but just look at this extract from the end:
Note the time stamps from 2:59 onwards, there's a big delay between each one. At the same time, the pings to the board are looking like this screenshot: Full log file below: All of this was with MQTT turned OFF by the way, which I thought would be better, but did not help actually. thanks! |
To add to my previous comment, I did try disabling the "power" sensor which my boiler does not support, but it didn't seem to make any difference. |
From my side, I had 2 reconnections every 24h using esp32s3. I assume esp32 s3 is powerful enough to handle something bad thats going on in the background. C3 was not usable. |
Guys, I can assume that the problem may be in the router. Let's check it: check how the web interface works when connected to the ESP access point, i.e. when the network is not yet configured on the ESP. P.S. What routers do you use? Can you disable telnet and check the web? To view the logs at this moment, you can use the serial port. |
I'm using ubiquiti U7 pro. Everything was fine on previous versions (1.4.5 in my case with mini D1). When first time configuring connected to ESP access point web interface is blazing fast. I will try to differ if it slows down after connecting to boiler or not. UniFi controller doesn't show me problems, it says wifi experience excellent (95%+), some spikes to good (88%) |
Test 1.5.0 on your D1 mini. Now you are testing on ESP32, these are different boards and there is a different SDK. upd: If you are using mesh and multiple access points, this may not work correctly with ESP. I don't know why, but sometimes it happens. And I don't recommend using 2G and 5G APs with the same SSID. |
Interesting idea, I had considered it might be a wifi/router issue, however the network is stable and has been for a long time, running Home Assistant and many other ESP-based devices (Shelly Relays and similar) without a problem. Do you think there's a chance my Wemos S2-mini board just "doesn't like" the wifi network? The problem was present, although not as severe, with software 1.4.5. |
I’m on UniFi, too. Check your retry rates on the front page of UniFi network app. I had an issue a few weeks ago, with high retries, and it was down to channel choice/availability. My network is incredibly stable, too. |
Something similar, and there was UniFi there too: https://community.home-assistant.io/t/opentherm-gateway-thermostat-with-full-integration-for-home-assistant/617228/128 |
ESP32 C3 connected to mikrotik, OT not connected esp.c3.webm |
I've not had a chance to try with Wireless AP only as the hardware is running in a shop we own and I've not been there for a couple of days. However, remotely monitoring it I have just noticed the "Uptime" on the UI homepage has been reset and the "Last Reset Reason" is showing "Reset due to other watchdogs". Does this offer any clues to you? |
No, "save debug data" just gives me this:
Under what circumstances will Watchdog cause a reboot? |
Hmm, strange, there is no backtrace in the debug data. Without a backtrace it is impossible to find out the reason. |
Hi, I have a similar issue with system instability (frequent disconnections). By default, I have these parameters:
My system is configured with a minimum flow temperature of 50 degrees. I’ve tried this three times (always with firmware 1.5.0), and the issue has replicated every time. If I leave the Emergency mode values unchanged (40°C and 120 seconds), everything works correctly. I hope this can help. |
Never mind… the problem has now reappeared even without modifying the parameters. The disconnection issue occurs even if the device is not connected to the boiler. |
If your ESP is powered via USB, try replacing the power supply with a different one. |
Initially, the D1 mini was connected with an external stabilized power supply (via 5V pin). |
Guys, I'm not saying there is no problem. But I don't know what causes the problem and how to fix it because I can't reproduce it. Perhaps for some reason the router is disconnecting the client. I think that you need to compile the firmware with core logs and see what is happening in more detail via COM port. Example of additional |
Ok, I’ve run all the tests I could, including using version 1.5.1 and disabling the 5G WiFi network, but I haven’t seen any improvement. Sorry for the question, but on the release page, what’s the difference between these two firmware files: firmware_s2_mini_1.5.1.bin |
Hi thereSorry, I’m travelling for work and don’t have great internet access. I’m making a massive assumption that this plugin is based on esphome?If so, have a look on the esphome site. It covers connection issues whilst using the single wire thermostat recommended in this project. But I may be wrong, obviously. D
Sent from Sky Yahoo Mail for iPhone
On Wednesday, January 8, 2025, 8:56 am, Simone ***@***.***> wrote:
Ok, I’ve run all the tests I could, including using version 1.5.1 and disabling the 5G WiFi network, but I haven’t seen any improvement.
I’ll try setting up an S2 mini to see if things get better.
Sorry for the question, but on the release page, what’s the difference between these two firmware files:
firmware_s2_mini_1.5.1.bin
firmware_s2_mini_1.5.1.factory.bin
Thank you very much!
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
Maybe you have some other wifi router to compare with it?
Factory for flashing via esptool, not factory for OTA.
No, we are talking about the firmware from this repository :) |
Yes, but replacing the router is not a simple operation (I have many devices connected, including 4 D2 mini that have been working without problems for about a year..)
Ok! I've tried with an wemos s2 mini, but I have the same problem.. As soon as I have some free time I will do more tests.. |
You can just turn on another router, connect the ESP and your computer to it. You do not need to configure Internet access for this router. To test, you don't need to change the router for all devices :) |
In this case I will try as soon as possible and I will check through a ping, even if it will not be connected to home Assistant via MQTT. Thanks! |
Well, if you placed the ESP inside the boiler, then you made a shielded box for it 😄 Now I wonder if other users from this issue have the same reason or not. |
@SanFable can you try moving the ESP further away from the boiler? |
Its glued and powered by meanwell power supply, I will try to disconnect it from shield and move about meter away (i hope that meter won't affect OT communication) |
😅 No, the ESP is outside the boiler, about 20 cm away inside a plastic box, but it is probably still too close to the boiler that somehow interferes with wireless communication. |
I need to revise my previous statement. The connection actually remained stable for almost two days, but then the same issues resurfaced. I tried moving the device closer to the router, but the problem persists. I downgraded to version 1.4.6, but this didn’t bring any improvements. I’m investigating other possible causes. It seems that the Wemos gradually slows down over time, eventually causing intermittent Wi-Fi disconnections. I’ll share an update if I find a solution to the issue. |
In case of memory leaks, ESP restarts would occur. |
Hello,
I’ve been using OTGateway with my Beretta Ciao Green 25 C.S.I. boiler for about a year. It works fine with smart TRVs and Home Assistant, although the functionality is somewhat limited.
Recently, I updated to version 1.5.0 and switched to an ESP32-S3 (I also tried the ESP32-C3 with the same results). Previously, I was using version 1.4.5 with an ESP8266.
The Issue
After the update, the system became unstable:
After about a minute, the OpenTherm Gateway status shows "problematic," and everything becomes unavailable.
The ESP32-S3 seems to struggle:
The web interface is very slow and unresponsive.
Logs over Telnet are delayed and limited.
With the ESP32-C3, the system was completely unreachable.
What I Found
I think the issue is caused by new sensors/IDs, such as:
In the logs, I saw warnings like:
[WARN] Failed to receive ...
It looks like the boiler doesn’t support these IDs, and they might be overloading the OpenTherm communication, causing it to crash.
What Worked
I turned off these new sensors and reconnected the OTGateway to the boiler. Since then, it’s been running stable for over 30 minutes, and the OpenTherm Gateway status is "OK."
Attaching logs_1.txt from a setup where I didn’t disable all the mentioned sensors (minimum modulation and maximum power were still enabled).
In the end, the ESP32 crashed, and the Telnet connection was lost.
I just realized that even after turning off the mentioned sensors in my successful run, I’m still seeing invalid request IDs 15 and 14 (minimum modulation, maximum power, and maximum modulation).
Attaching logs_2.txt from over 30 minutes of stable operation. However, there are still warnings in the logs from sensors that should be disabled. Could it be that something is overlapping when these sensors are enabled?
EDIT:
After about 6h I had few reconnections:
PS is there anything that I could do to improve support for my boiler?
The text was updated successfully, but these errors were encountered: