Some TOOL1LC boards give incorrect temperature readings using 3.6.0-beta.2 firmware #1060

dc42 · 2024-11-18T09:24:44Z

When running 3.6.0-beta.2 firmware some TOOL1LC boards give incorrect temperature readings for thermistors connected to TEMP0, for example about 2C instead of about 21C. Reported by Andy S and also seen by me. Probably related is this forum post https://forum.duet3d.com/topic/36964/3-6-0-beta-2-issue-1lc-inconsistent-temp-reading.

In my toolchanger the version 1.2a board at CAN address 22 exhibits this issue but the version 0.6 board at address 23 does not. If I swap the CAN addresses over and restart then the issue moves to address 23, i.e. the issue stays with the board hardware; so the difference in behaviour is not caused by differences in board configuration.

dc42 · 2024-11-18T09:38:58Z

Further investigation: using commit 4cf3 of the Duet3Expansion project and commit 40c9 of CANlib, the issue occurs when commit 4980 of CoreN2G is used but not when the earlier commit fa3a is used. The difference is that in the later commit, the DMA descriptors and write-back sections are put in a separate RAM section just after the CAN message buffers, which is cleared to zero by the startup code, instead of in bss. Here are some possible causes of the issue:

Moving the DMA writeback address has upset the DMA controller in some way so that DMA transfers from the ADC on the SAMC21 sometimes yield incorrect results.
The area of memory that the DMA descriptors have been moved to gets corrupted by something, causing incorrect DMA from the ADC.
Moving the DMA descriptors has caused the address of the TEMP0 filter buffer to move, and the address it moved to is getting corrupted.

Changing the DMA channel used by the ADC from 2 to 5 does not solve the problem, which makes it less likely that possible cause 2 above is the reason.

dc42 · 2024-11-18T10:30:23Z

Further investigation revealed that even if the DMA descriptors are left in bss the problem still exists. Removing the startup code to initialise the [empty] DmaBuffers section then fixes the problem.

dc42 · 2024-11-18T11:16:00Z

Solved the problem on my toolchanger by reducing the optimisation level of the startup code. The old code turned the loops into calls to memcpy and memset. Adding attribute((optimize ("-fno-tree-loop-distribute-patterns"))) to prevent that didn't fix it, but changing the optimisation level of ResetHandler to -Os did (the project setting is -O3). Also changed optimisation level of InitClocks to -Os. New binary for others to test is at https://www.dropbox.com/scl/fo/kaxbpbsyq2lxbae4tmzpk/AAPVUAh6CO_FMO-cG9ttX6U?rlkey=f0aesppc8d2p7bzwwi2s1jayo&dl=0.

dc42 · 2024-12-04T16:10:48Z

Confirmed fixed, see https://forum.duet3d.com/topic/36964/3-6-0-beta-2-issue-1lc-inconsistent-temp-reading/5.

dc42 added the bug Bug that has been reproduced label Nov 18, 2024

dc42 added this to the 3.6.0 milestone Nov 18, 2024

dc42 self-assigned this Nov 18, 2024

dc42 added the Done - Needs Testing label Nov 18, 2024

dc42 closed this as completed Dec 4, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some TOOL1LC boards give incorrect temperature readings using 3.6.0-beta.2 firmware #1060

Some TOOL1LC boards give incorrect temperature readings using 3.6.0-beta.2 firmware #1060

dc42 commented Nov 18, 2024 •

edited

Loading

dc42 commented Nov 18, 2024 •

edited

Loading

dc42 commented Nov 18, 2024

dc42 commented Nov 18, 2024 •

edited

Loading

dc42 commented Dec 4, 2024

Some TOOL1LC boards give incorrect temperature readings using 3.6.0-beta.2 firmware #1060

Some TOOL1LC boards give incorrect temperature readings using 3.6.0-beta.2 firmware #1060

Comments

dc42 commented Nov 18, 2024 • edited Loading

dc42 commented Nov 18, 2024 • edited Loading

dc42 commented Nov 18, 2024

dc42 commented Nov 18, 2024 • edited Loading

dc42 commented Dec 4, 2024

dc42 commented Nov 18, 2024 •

edited

Loading

dc42 commented Nov 18, 2024 •

edited

Loading

dc42 commented Nov 18, 2024 •

edited

Loading