Don't clear metadata before transitioning chunk to owned state #1015

senderista · 2021-10-14T17:38:17Z

While addressing last-minute review feedback, I got confused and forgot why I had used chunk_manager_t::load() instead of chunk_manager_t::initialize() in memory_manager_t::allocate_used_chunk(). The reason was that chunk_manager_t::initialize() clears chunk metadata, which we can't safely do until we know we own the chunk. This resulted in a race where a thread might initially try to acquire a used chunk and then get preempted, losing the race to acquire the chunk to another concurrent thread. Then the first thread would wake up and clear the now-in-use chunk's metadata while the second thread was already using it for allocations! This would clear any allocation bits that the second thread had already set, and the missing allocation bits would eventually trigger an assert when those allocations were freed during GC.

The fix was simple: just use chunk_manager_t::load() to access the metadata before acquiring the chunk, as I originally did. The metadata will be cleared anyway when gaia::db::allocate_object() (the original caller of memory_manager_t::allocate_chunk()) calls chunk_manager_t::initialize(), which calls chunk_manager_metadata_t::clear().
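The race and the fix can be replayed deterministically in a toy model. The sketch below is an assumption-laden illustration, not the actual Gaia code: `toy_chunk_metadata`, `toy_chunk_state`, and `replay_interleaving()` are hypothetical names, and the two "threads" are simulated sequentially so the outcome does not depend on scheduling.

```cpp
#include <atomic>
#include <bitset>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical, simplified stand-in for chunk metadata (not the real layout).
enum class toy_chunk_state : uint32_t
{
    available,
    owned
};

struct toy_chunk_metadata
{
    std::atomic<toy_chunk_state> state{toy_chunk_state::available};
    std::bitset<64> allocation_bits;

    // Like clear() in this PR, this wipes allocation bits but not the state.
    void clear()
    {
        allocation_bits.reset();
    }

    // Ownership is taken with a single compare-and-swap on the state.
    bool try_acquire()
    {
        auto expected = toy_chunk_state::available;
        return state.compare_exchange_strong(expected, toy_chunk_state::owned);
    }
};

// Replays the interleaving from the description on one thread.
// Returns how many of thread B's allocation bits survive.
inline std::size_t replay_interleaving(bool clear_before_owning)
{
    toy_chunk_metadata chunk;

    // Thread A selects the chunk here and is preempted before acquiring it.

    // Thread B wins the race and records two allocations.
    bool b_owns = chunk.try_acquire();
    assert(b_owns);
    (void)b_owns;
    chunk.allocation_bits.set(0);
    chunk.allocation_bits.set(1);

    // Thread A resumes.
    if (clear_before_owning)
    {
        chunk.clear(); // Bug: A clears metadata it does not own.
    }
    if (chunk.try_acquire())
    {
        chunk.clear(); // Safe: reached only by the owner (not in this replay).
    }

    return chunk.allocation_bits.count();
}
```

In this model, `replay_interleaving(true)` loses both of thread B's allocation bits, while `replay_interleaving(false)` leaves them intact because thread A's acquisition fails and it never clears anything.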

Note that the above initialization hazard doesn't apply to memory_manager_t::allocate_unused_chunk(), but there's no reason to do things any differently, so I made the same change there.

The fact that I got confused about initialization responsibility is evidence that this code needs further refactoring; the existing interfaces have to be contorted to fit the new chunk lifecycle protocol and should be replaced by a new interface design that starts from the protocol itself.
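One possible shape for an interface that "starts from the protocol" is to make metadata mutation reachable only through a value that proves ownership. This is purely a sketch of that idea under my own assumptions (C++17, and the hypothetical names `toy_chunk`, `owner_token`, `clear_allocations()`, and `mark_allocated()`); it is not a design proposed in this PR.

```cpp
#include <atomic>
#include <bitset>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <optional>

// All names here are hypothetical illustrations, not existing interfaces.
enum class toy_chunk_state : uint32_t
{
    available,
    owned
};

class toy_chunk
{
public:
    // Proof of ownership: only toy_chunk::try_acquire() can construct one,
    // and it is the only way to mutate allocation metadata.
    class owner_token
    {
        friend class toy_chunk;

        explicit owner_token(toy_chunk& chunk)
            : m_chunk(chunk)
        {
        }

        toy_chunk& m_chunk;

    public:
        void clear_allocations()
        {
            m_chunk.m_bits.reset();
        }

        void mark_allocated(std::size_t slot)
        {
            assert(slot < 64);
            m_chunk.m_bits.set(slot);
        }
    };

    // Returns a token only to the caller that wins the state transition.
    std::optional<owner_token> try_acquire()
    {
        auto expected = toy_chunk_state::available;
        if (!m_state.compare_exchange_strong(expected, toy_chunk_state::owned))
        {
            return std::nullopt;
        }
        return owner_token(*this);
    }

    // Read-only access needs no ownership.
    std::size_t allocated_count() const
    {
        return m_bits.count();
    }

private:
    std::atomic<toy_chunk_state> m_state{toy_chunk_state::available};
    std::bitset<64> m_bits;
};
```

With this shape, a caller that loses the race gets an empty `std::optional` and has no way to call `clear_allocations()`, so the ordering mistake fixed in this PR would not compile rather than fail at runtime.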

LaurentiuCristofor · 2021-10-14T17:42:10Z

production/db/inc/memory_manager/memory_structures.inc

@@ -148,6 +148,7 @@ void chunk_manager_metadata_t::synchronize_allocation_metadata()

 void chunk_manager_metadata_t::clear()
 {
+    // NB: We cannot clear the chunk state and version!


I don't think the "NB" part is needed.

LaurentiuCristofor · 2021-10-14T17:49:14Z

production/db/memory_manager/src/memory_manager.cpp

@@ -153,7 +154,8 @@ chunk_offset_t memory_manager_t::allocate_used_chunk()

        auto available_chunk_offset = static_cast<chunk_offset_t>(found_index);
        chunk_manager_t chunk_manager;
-        chunk_manager.initialize(available_chunk_offset);
+        // NB: We cannot call initialize() here because we don't own the chunk yet!


BTW, I'm not sure you have any valid use for initialize() anymore; if you don't, you should just remove it. I asked why you were not using it because you had kept it, so I thought you still needed its semantics.

senderista · 2021-10-14

Yeah, I'm still calling it from gaia::db::allocate_object(), but as I said, this code is a mess and the control flow is quite convoluted. All I really need is to call chunk_manager_metadata_t::clear() somewhere, and only the chunk manager has access to chunk metadata, which is why I still need initialize().

Don't call chunk_manager_t::initialize() before transitioning chunk to owned state (commit 8873a44)

senderista requested review from LaurentiuCristofor, mihirj1993 and simone-gaia October 14, 2021 17:38

LaurentiuCristofor reviewed Oct 14, 2021

LaurentiuCristofor approved these changes Oct 14, 2021

senderista merged commit 67b393c into master Oct 14, 2021

senderista deleted the tobin/debug_double_free branch October 14, 2021 17:47

LaurentiuCristofor reviewed Oct 14, 2021
