WT-13428 Implement block manager file write unit tests #11016

jiechenbo · 2024-09-11T02:03:01Z

Summarize the reason behind this change (this might be the problem you're solving, or the context around the request) and the solution you have chosen.

github-actions · 2024-09-11T02:03:16Z

Thanks for creating a pull request! The below questions and checklist are intended to help with verifying your change is well tested. Response is optional, but if you choose to respond please edit this comment.

What makes this change safe?

A good answer to this question helps the reviewers understand where they should focus their attention, so please consider these questions:

Is the change risky or not? Why?
What tests are you adding or changing? Why?
What existing tests are you relying on?
What, if anything, are you concerned about that you'd like the reviewer to focus on?
References:
Risk level guide
Testing frameworks

Checklist before requesting a review

I have performed a self-review of my code.
I have made corresponding changes to the documentation (if applicable).
I have added/updated tests that demonstrate my fix is effective or that my feature works correctly.

tod-johnson-mongodb · 2024-09-11T16:31:58Z

test/unittest/tests/block/test_block_file_write.cpp

+
+    // Test that the block was correctly written.
+    std::string buf(expected_str.length(), ' ');
+    block->fh->handle->fh_read(block->fh->handle, (WT_SESSION *)session, offset,


Use static_cast<WT_SESSION *>(session) here, since the Server Code Style says "Do not use C-style casts, ever."
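Which named cast can replace a C-style cast depends on how the two types are related: static_cast only compiles between related types, while unrelated struct pointers need reinterpret_cast or no cast at all. The sketch below uses hypothetical stand-in types (Session, SessionImpl) and assumes the implementation struct embeds the public struct as its first member. That layout is an assumption made for illustration, not something this thread confirms.

```cpp
#include <cassert>

// Hypothetical stand-ins for illustration only; not the WiredTiger definitions.
struct Session {
    int id;
};

struct SessionImpl {
    Session iface; // Assumed layout: the public interface is the first member.
    int flags;
};

// Option 1: no cast at all, just take the address of the embedded interface.
inline Session *
to_public(SessionImpl *impl)
{
    assert(impl != nullptr);
    return &impl->iface;
}

// Option 2: a named cast. For unrelated struct types this has to be
// reinterpret_cast; static_cast<Session *>(impl) would not compile here.
inline Session *
to_public_cast(SessionImpl *impl)
{
    assert(impl != nullptr);
    return reinterpret_cast<Session *>(impl);
}
```

Under the stated layout assumption both helpers return the same address. Option 1 keeps working if the member ever moves, which is a reason to prefer it where the member is accessible.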

luke-pearson · 2024-09-11T16:42:26Z

test/unittest/tests/block/test_block_file_write.cpp

+    return block;
+}
+
+TEST_CASE("Block: __wti_block_write_off", "[block_write]")


I'm leaving some notes for our meeting later today. The name of this test suggests that it tests __wti_block_write_off, but it only calls __ut_block_write_off. Is that intentional?

luke-pearson · 2024-09-11T16:42:54Z

src/block/block_ext.c

@@ -545,7 +545,7 @@ __wti_block_alloc(WT_SESSION_IMPL *session, WT_BLOCK *block, wt_off_t *offp, wt_
    WT_ASSERT_SPINLOCK_OWNED(session, &block->live_lock);

    /* If a sync is running, no other sessions can allocate blocks. */
-    WT_ASSERT(session, WT_SESSION_BTREE_SYNC_SAFE(session, S2BT(session)));
+    // WT_ASSERT(session, WT_SESSION_BTREE_SYNC_SAFE(session, S2BT(session)));


We could add a fairly minor mock of the session here to provide a btree with zero flags, if we want.

luke-pearson · 2024-09-11T16:43:33Z

src/block/block_write.c

+
+#ifdef HAVE_UNITTEST
+int
+__ut_block_write_off(WT_SESSION_IMPL *session, WT_BLOCK *block, WT_ITEM *buf, wt_off_t *offsetp,


If we opt to test this via __wti_block_write_off we won't need to wrap this.

tod-johnson-mongodb · 2024-09-11T17:10:15Z

test/unittest/tests/block/test_block_file_write.cpp

+         * std::cout << str2 << std::endl;
+         * REQUIRE((__ut_block_write_off(session->get_wt_session_impl(), block, buf, &offset, &size,
+         *   &checksum, false, false, false)) == 0);
+         * validate_block_write(session->get_wt_session_impl(), block, str2, offset, size, checksum, 2,


validate_block_write() parameters are in the wrong order.

tod-johnson-mongodb · 2024-09-11T17:11:11Z

test/unittest/tests/block/test_block_file_write.cpp

+        validate_block_write(session->get_wt_session_impl(), block, offset, size, checksum,
+          expected_str, buf->size, expected_offset);
+
+        /*


It is more readable to use #if 0 to "comment out" code than an actual comment.
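A minimal sketch of the suggestion, using a hypothetical function rather than the code from this PR. One practical difference is that a /* ... */ comment cannot contain another block comment, whereas a region disabled with #if 0 can.

```cpp
#include <cassert>

// Hypothetical example: counts how many checks actually ran.
int
run_enabled_checks()
{
    int checks_run = 0;

    assert(1 + 1 == 2);
    ++checks_run;

#if 0
    /* Disabled for now. This block comment would end an enclosing comment early. */
    assert(2 + 2 == 4);
    ++checks_run;
#endif

    return checks_run;
}
```

The disabled region is skipped by the preprocessor, so only the first check is counted.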

tod-johnson-mongodb · 2024-09-11T18:25:29Z

test/unittest/tests/block/test_block_file_write.cpp

+    }
+
+    /*
+     * Does this need a test? It is hard to test whether the function below respects this or not.


The test with caller_locked=true is needed. It is actually easy to check that caller_locked=true works; what is hard to test is whether caller_locked=false works.

To test caller_locked=true:

1. __wt_spin_lock(session, &block->live_lock);
2. Call __ut_block_write_off() with caller_locked = true. If this does not hang, __block_write_off() correctly did not take the lock.
3. validate_block_write()
4. __wt_spin_trylock(session, &block->live_lock) should return EBUSY to show the lock is still locked.
5. __wt_spin_unlock(session, &block->live_lock) to unlock the lock.

To test caller_locked=false you need two tests.

First test:

1. __wt_spin_trylock(session, &block->live_lock) returns 0 to show the lock was unlocked.
2. __wt_spin_unlock(session, &block->live_lock); to unlock it again.
3. __ut_block_write_off() with caller_locked = false.
4. validate_block_write()
5. __wt_spin_trylock(session, &block->live_lock) returns 0 to show the lock was unlocked.
6. __wt_spin_unlock(session, &block->live_lock) to unlock it again.

This shows the write worked but does not show the block was ever locked.

The second test is something like the below, but more complicated.

Thread 1:

1. __wt_spin_lock(session, &block->live_lock);
2. int expected_write_time = how long the write is expected to take.
3. pthread_create() thread 2.
4. start_time = read time.
5. __ut_block_write_off() with caller_locked=false. Thread 1 hangs here.
6. end_time = read time.
7. WT_ASSERT(session, (end_time - start_time) > 2 * expected_write_time); to verify it hung.
8. validate_block_write()
9. __wt_spin_trylock(session, &block->live_lock) should return 0 to show the lock is unlocked.
10. __wt_spin_unlock(session, &block->live_lock);

Thread 2:

1. sleep(10 * expected_write_time);
2. __wt_spin_unlock(session, &block->live_lock);
3. pthread_exit(0);

I said "something like" the above because that test would fail if, right after pthread_create(), there is a context switch from thread 1 to thread 2 and thread 2 completes both steps 1 and 2 before there is a context switch back to thread 1. In that case thread 1 would not hang and the difference end_time - start_time would be too small. This is unlikely on current multicore CPUs, and some OSs context switch during sleep. However, it could happen. A more complicated test with critical sections could completely eliminate the false failure.
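One possible shape for these checks is sketched below. It is only an illustration under stated assumptions: SpinLock and write_off are hypothetical stand-ins for the WiredTiger spinlock and __ut_block_write_off(), and the thread roles are arranged differently from the outline above. Here the writer runs in the second thread and the first thread keeps ownership of the lock, so the check relies on a completion flag rather than on elapsed time.

```cpp
#include <atomic>
#include <cassert>
#include <cerrno>
#include <chrono>
#include <thread>

// Hypothetical stand-in for a spinlock whose trylock returns EBUSY when held.
struct SpinLock {
    std::atomic<bool> held{false};

    void lock()
    {
        while (held.exchange(true, std::memory_order_acquire))
            std::this_thread::yield();
    }
    int trylock() { return held.exchange(true, std::memory_order_acquire) ? EBUSY : 0; }
    void unlock()
    {
        assert(held.load());
        held.store(false, std::memory_order_release);
    }
};

// Hypothetical stand-in for the function under test: it takes the lock itself
// only when the caller has not already done so.
int
write_off(SpinLock &live_lock, bool caller_locked)
{
    if (!caller_locked)
        live_lock.lock();
    /* The block write would happen here. */
    if (!caller_locked)
        live_lock.unlock();
    return 0;
}

// caller_locked = true: the call must return without touching the lock.
bool
check_caller_locked_true()
{
    SpinLock lock;
    lock.lock();
    int ret = write_off(lock, true); // Would spin forever if it tried to lock.
    bool still_locked = (lock.trylock() == EBUSY);
    lock.unlock();
    return ret == 0 && still_locked;
}

// caller_locked = false: the call must wait while another thread holds the lock.
bool
check_caller_locked_false_blocks()
{
    SpinLock lock;
    std::atomic<bool> write_returned{false};

    lock.lock();
    std::thread writer([&] {
        write_off(lock, false);
        write_returned.store(true);
    });

    // Give the writer time to run; a correct writer cannot finish before unlock.
    std::this_thread::sleep_for(std::chrono::milliseconds(50));
    bool blocked = !write_returned.load();

    lock.unlock();
    writer.join();

    bool unlocked_after = (lock.trylock() == 0);
    lock.unlock();
    return blocked && write_returned.load() && unlocked_after;
}
```

With this arrangement a correct implementation can never fail the check, because the writer cannot return while the lock is held. The remaining weakness is the opposite one: if the writer thread is not scheduled within the sleep, an implementation that skips the lock could still pass.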

luke-pearson · 2024-09-11T18:35:53Z

test/unittest/tests/block/test_block_file_write.cpp

+    // Create WT_ITEM buffer and copy a string into it.
+    WT_ITEM *buf;
+    REQUIRE(__wt_scr_alloc(session->get_wt_session_impl(), 0, &buf) == 0);
+    REQUIRE(__wt_buf_initsize(session->get_wt_session_impl(), buf, DEFAULT_BLOCK_SIZE) == 0);


I thought things seemed a bit odd here, especially the call to expected_str.copy. I didn't anticipate the rabbit hole I would get myself into, but I'm starting to understand the issues this test has. We need to think about what we're testing here, and I'm still wrapping my head around that question. Our first test is trying to write a string to a file.

The more WT way of setting the buffer would be:

WT_BLOCK *block = create_block(session, cp);
REQUIRE(__wt_block_write_size(session->get_wt_session_impl(), block, &buf_memsize) == 0);
REQUIRE(__wt_buf_init(session->get_wt_session_impl(), &buf, buf_memsize) == 0);
REQUIRE(__wt_buf_set(session->get_wt_session_impl(), &buf, expected_str.c_str(), expected_str.length()) == 0);

The current implementation of the test doesn't respect the allocation size of the block manager and somehow gets it to return an offset of 5. That piqued my interest. The data itself is size 5, so why would the offset be 5? It should be either 0, if we wrote at the start of the block, or 512, if we wrote at the start of the next "chunk" in the block. So I updated the code to my implementation, but then I wrote at an offset of 512, which, while one of the expected values, was surprising. I probed a bit further and found that this line in this change is why I wrote at 512:

block->size = DEFAULT_BLOCK_SIZE;

Because we're using the in-memory file system, the file header information isn't being counted when we call __wt_block_open. This originates from here:

wiredtiger/src/block/block_open.c, line 255 in 6fd5c0c:

WT_ERR(__wt_filesize(session, block->fh, &block->size));

I tested a normal WT file being opened on disk, and this call sets block->size to 4096. But if we use the in-memory FS we start at size 0. That is weird and I don't fully understand it yet, but it explains why we need the size line you added. Still, this seems wrong, and it raises the question of what we are trying to test. Should the file have header info added?

I also noted that the first test case only appends to the block, and I wonder whether that is true of all the test cases. We also set the fit to best, but to actually test that we'd need to do some pretty complicated writes and deletions. Is that within the scope of this test?
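The three starting offsets discussed in this comment (0, 512 and 4096) can be reproduced with a deliberately simplified model of an append-only file, in which each new block is placed at the current file size. AppendOnlyFile is a hypothetical type written for this illustration; it ignores extent lists, headers and everything else the real block manager does.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical, simplified model: each append lands at the current file size.
struct AppendOnlyFile {
    std::int64_t size; // Current file size in bytes; the next append starts here.

    // Append nbytes and return the offset at which they were placed.
    std::int64_t
    append(std::int64_t nbytes)
    {
        assert(nbytes > 0);
        const std::int64_t offset = size;
        size += nbytes;
        return offset;
    }
};
```

In this model a file that starts at size 0 places its first block at offset 0, one whose size was set to 512 beforehand places it at 512, and one that already holds 4096 bytes places it at 4096.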

luke-pearson · 2024-09-11T20:03:43Z

test/unittest/tests/block/test_block_file_write.cpp

+{ 
+    // Test offset, size and checksum.
+    expected_offset += expected_size;
+    REQUIRE(offset == expected_offset);


The offset should also always be a multiple of the allocation size.
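A minimal sketch of the arithmetic behind that check, using hypothetical helper names and a 512-byte allocation size chosen to match the discussion above. It covers only the rounding and the modulo test, not any header bytes the block manager may add to a payload.

```cpp
#include <cassert>
#include <cstdint>

// Round nbytes up to the next multiple of allocsize.
inline std::uint64_t
round_up(std::uint64_t nbytes, std::uint64_t allocsize)
{
    assert(allocsize != 0);
    return ((nbytes + allocsize - 1) / allocsize) * allocsize;
}

// True when offset falls on an allocation-size boundary.
inline bool
is_multiple_of(std::uint64_t offset, std::uint64_t allocsize)
{
    assert(allocsize != 0);
    return offset % allocsize == 0;
}
```

In the test this would amount to one extra assertion next to the existing offset check, for example requiring that the returned offset modulo the block's allocation size is zero.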

jiechenbo added 13 commits August 29, 2024 04:29

Initial framework

4a76108

run s_all

440f64c

Create initial framework for block open

257503b

Added destructor for filesystem

30f355a

ran s_all

a13807c

Added free blocks

322528e

Apply lock fix

9f3871a

Address PR changes

f3b39cb

run s_all

30f1126

Adding const reference

4918e24

Write work

b995217

Iterated on write

1982471

Merge branch 'develop' into wt-13403-implement-flie-write-read-ut

1bccf78

jiechenbo added 2 commits September 11, 2024 02:08

Fixed up some comments

221c624

Update comments

644e05e

tod-johnson-mongodb reviewed Sep 11, 2024

luke-pearson reviewed Sep 11, 2024

tod-johnson-mongodb reviewed Sep 11, 2024

luke-pearson reviewed Sep 11, 2024

jiechenbo closed this Sep 12, 2024
