Fix issue #83: thingbuf hangs when buffer is full #85

tukan · 2024-04-15T18:00:47Z

Fixes: #83

Previously, to determine if the buffer was full, we checked whether the head and tail were pointing to the same slot with the head one generation behind. However, this check fails if we skip slots, leading to scenarios where the head and tail point to different slots even though the buffer is full.

For example, consider a buffer with 3 slots. First, we write to the buffer three times (gen + 0). Then we read from slot 0 and slot 1, keeping hold of the reference to slot 1, and read from slot 2 (gen + 0). Next, we write to slot 0 (gen + 1) and read from slot 0 (gen + 1), which moves the head to slot 1 (gen + 1). We then try to write to slot 1 (gen + 1) but skip it because its reference is still held, so we write to slot 2 (gen + 1) instead. We write to slot 0 again (gen + 2). Finally, we attempt to write to slot 1, skip it, and attempt to write to slot 2 (gen + 2). However, we can’t write to it because it still contains data from the previous generation (gen + 1), and the head points to slot 1 instead of slot 2, so the old check never reports the buffer as full and the writer keeps retrying.

This fix ensures the buffer full condition accurately reflects the actual status of the slots, particularly when writes are skipped.
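The scenario above can be replayed with a small single-threaded model. This is only a sketch under stated assumptions, not thingbuf's actual implementation: `Model`, `Slot`, `Outcome`, and the `slot_aware` flag are hypothetical names, head and tail are plain counters (slot = position % 3, generation = position / 3), and a bounded retry loop stands in for the real spin.

```rust
// Toy model of the 3-slot scenario. All names here are hypothetical.
const CAP: usize = 3;

#[derive(Clone, Copy, Debug, PartialEq)]
enum Outcome {
    Written { slot: usize, generation: usize },
    Full,
    // The real writer would retry forever; the model gives up instead.
    WouldSpin,
}

#[derive(Clone, Copy, Default)]
struct Slot {
    occupied: bool, // holds data that has not been read yet
    held: bool,     // a reader still holds a reference to this slot
}

struct Model {
    slots: [Slot; CAP],
    head: usize, // next position to read
    tail: usize, // next position to write
}

impl Model {
    fn new() -> Self {
        Model { slots: [Slot::default(); CAP], head: 0, tail: 0 }
    }

    // `slot_aware == false` models the old check (head exactly one
    // generation behind tail); `true` treats unread data in the target
    // slot as "full".
    fn push(&mut self, slot_aware: bool) -> Outcome {
        for _ in 0..2 * CAP {
            let idx = self.tail % CAP;
            let slot = &mut self.slots[idx];
            if slot.held {
                self.tail += 1; // skip a slot that is still referenced
                continue;
            }
            if !slot.occupied {
                slot.occupied = true;
                let generation = self.tail / CAP;
                self.tail += 1;
                return Outcome::Written { slot: idx, generation };
            }
            if slot_aware || self.head + CAP == self.tail {
                return Outcome::Full;
            }
        }
        Outcome::WouldSpin
    }

    // Reads the slot at head; `hold` keeps the reference alive afterwards.
    fn pop(&mut self, hold: bool) -> Option<usize> {
        let idx = self.head % CAP;
        let slot = &mut self.slots[idx];
        if !slot.occupied {
            return None;
        }
        slot.occupied = false;
        slot.held = hold;
        self.head += 1;
        Some(idx)
    }
}

fn run_scenario(slot_aware: bool) -> Outcome {
    let mut m = Model::new();
    for _ in 0..3 {
        m.push(slot_aware); // slots 0, 1, 2 (gen + 0)
    }
    m.pop(false); // read slot 0
    m.pop(true); // read slot 1 and keep the reference
    m.pop(false); // read slot 2
    m.push(slot_aware); // slot 0 (gen + 1)
    m.pop(false); // head now points to slot 1 (gen + 1)
    m.push(slot_aware); // skips slot 1, writes slot 2 (gen + 1)
    m.push(slot_aware); // slot 0 (gen + 2)
    m.push(slot_aware) // skips slot 1, reaches slot 2 with unread data
}

fn main() {
    println!("old check:        {:?}", run_scenario(false));
    println!("slot-aware check: {:?}", run_scenario(true));
}
```

In this model the final push ends with head at position 4 (slot 1) and tail at position 8 (slot 2), so the head/tail comparison never matches, while looking at the slot itself reports the buffer as full.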


hawkw

thanks for fixing this! i had a few small suggestions, and i'd love to see a loom test exercising this, but overall, this looks right to me. thank you!

hawkw · 2024-04-15T18:28:37Z

src/lib.rs

@@ -696,4 +708,72 @@ mod tests {
        // don't panic in drop impl.
        core.has_dropped_slots = true;
    }
+


it would be nice to also add a loom model exercising what happens when a buffer fills up due to concurrent pushes from multiple threads? we could do something where we spawn multiple threads and have each one try to push in a loop until the buffer is full, which would check that all of those threads eventually complete.

we could also exercise slot skipping by adding a thread that calls pop_ref and either mem::forgets the guards or stuffs them someplace to hang onto them.

@hawkw I need your advice here. I added a simple loom test, but I find it difficult to construct a good test with reads and writes under loom. Imagine we have one thread that reads from the buffer (for example, exactly three times) and two threads that write to the buffer until three elements have been read and the buffer is full. Under loom this fails because we hit the max iterations limit in interleavings where the writers attempt many writes while the reader doesn't read, so we spin in "buffer is full" and make no progress.
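For reference, the shape of the fill-until-full test discussed above (several producers pushing until they see "full", then checking that every producer terminates) can be sketched with the standard library alone. This is a rough analogue, not a loom model and not thingbuf code: it uses `std::sync::mpsc::sync_channel` as a stand-in bounded buffer, and `fill_concurrently` is a hypothetical helper name.

```rust
use std::sync::mpsc::{sync_channel, TrySendError};
use std::thread;

// Spawns `producers` threads that each push until the bounded channel
// reports "full", then returns how many items were accepted in total.
// `capacity` must be greater than zero (zero would be a rendezvous channel).
fn fill_concurrently(capacity: usize, producers: usize) -> usize {
    let (tx, rx) = sync_channel::<usize>(capacity);

    let handles: Vec<_> = (0..producers)
        .map(|id| {
            let tx = tx.clone();
            thread::spawn(move || {
                let mut sent = 0usize;
                loop {
                    match tx.try_send(id) {
                        Ok(()) => sent += 1,
                        // A full buffer must end the loop, not spin.
                        Err(TrySendError::Full(_)) => break,
                        Err(TrySendError::Disconnected(_)) => {
                            panic!("receiver dropped unexpectedly")
                        }
                    }
                }
                sent
            })
        })
        .collect();

    // Every producer has to finish; a hang here is the bug from #83.
    let total: usize = handles.into_iter().map(|h| h.join().unwrap()).sum();

    // Nothing was received while filling, so the buffer holds every item.
    let buffered = rx.try_iter().count();
    assert_eq!(total, buffered);
    total
}

fn main() {
    let accepted = fill_concurrently(3, 2);
    println!("accepted {accepted} items before the buffer was full");
}
```

Because no thread receives while the producers run, the total accepted equals the capacity regardless of how the pushes interleave. A real loom version would additionally need a bound on retries or a reader that is guaranteed to make progress, which is the difficulty described in the comment above.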

src/lib.rs

tukan · 2024-04-15T19:48:08Z

Thank you for your comments. I will address them this week, hopefully tomorrow.

hawkw

I had some last questions about whether we need to add more loom tests. But, since this PR fixes the bug, I'd like to go ahead and merge it, and we can add more tests in subsequent branches.

src/mpsc/tests/mpsc_blocking.rs

## v0.1.6 (2024-04-18)

#### Bug Fixes

* fix senders hanging when the buffer is full (#85) ([723c44a](723c44a), closes [#83](#83))

tukan added 3 commits April 15, 2024 20:28

fix(core): update current tail/head when possible to avoid extra cycles

92f78d5

fix(mpsc): fix mpsc_test_skip_slot tests

fb73a3b

tukan mentioned this pull request Apr 15, 2024

thingbuf::mpsc::Sender hanging up for parallel try_send_ref and send / send_ref from sync thread and async tokio::task #83

Closed

hawkw self-requested a review April 15, 2024 18:13

hawkw approved these changes Apr 15, 2024


tukan added 3 commits April 16, 2024 21:40

fix: improve code style and debug output (address review comments)

95d980d

fix: improve grammar (address review comments)

d639975

add simple loom test to test that we stop when buffer is full

c11720e

hawkw approved these changes Apr 18, 2024



Update src/mpsc/tests/mpsc_blocking.rs

9512e03

hawkw enabled auto-merge (squash) April 18, 2024 17:03

hawkw merged commit 723c44a into hawkw:main Apr 18, 2024
25 checks passed

hawkw added a commit that referenced this pull request Apr 18, 2024

chore: prepare to release v0.1.6

b7f4772


hawkw mentioned this pull request Apr 18, 2024

chore: prepare to release v0.1.6 #86

Merged

hawkw added a commit that referenced this pull request Apr 18, 2024

chore: prepare to release v0.1.6 (#86)

3cccebf

