bug(snapshot) : Do not preempt inside OnDbChange issue #829 #868

adiholden · 2023-02-22T13:17:24Z

Serializer encode db index of entries
Do not encode db index inside consume channel
Allow push serialized data to channel from main snapshot loop and from journal callback (push may preempt)

Signed-off-by: adi_holden <[email protected]>

dranikpg

Lets use the new helpers from helio EnterFiberAtomicSection and FiberAtomicGuard in the snapshotting loop that iterates over other snapshots to assert no suspensions are made

dranikpg · 2023-02-22T15:08:01Z

src/server/rdb_save.cc

@@ -681,6 +686,7 @@ error_code RdbSerializer::FlushToSink(io::Sink* s) {
  // interrupt point.
  RETURN_ON_ERR(s->Write(bytes));
  mem_buf_.ConsumeInput(bytes.size());
+  last_entry_db_index_ = kInvalidDbId;  // After every flash we should write the DB index again.


Lets add that we need to do this because the blobs in the channel are interleaved and multiple savers can correspond to a single writer (in case of single file rdb snapshot)

flash -> flush. (flash is 📸 ).

dranikpg · 2023-02-22T15:14:00Z

src/server/rdb_save.cc

+  // SELECTDB is serialized inside journal writer. In rdb loader when parsing journal blob
+  // the journal reader parses the entry db index, and we dont use the main flow of
+  // updating the current db index. Unless we change the flow in the loader, we must
+  // update last_entry_db_index_ to invalid so that the next snapshot entry to be serialized
+  // will update the db index.
+  last_entry_db_index_ = kInvalidDbId;


So the journal db numbering and rdb db numbering are fully independent, right? Then it seems like we don't have to reset last_entry_db_index_?

dranikpg · 2023-02-22T15:21:17Z

src/server/snapshot.cc

  // TODO: investigate why a single byte gets stuck and does not arrive to replica
  for (unsigned i = 10; i > 1; i--)
-    CHECK(!default_serializer_->SendFullSyncCut());
-  FlushDefaultBuffer(true);
+    CHECK(!serializer_->SendFullSyncCut());
+  PushSerializedToChannel(true);


Oh that reminded me we still have this issue 😅

OMG, maybe we do not? I suggest removing this loop and see if the issue remains.

We still have bug here

romange · 2023-02-24T20:07:14Z

src/server/rdb_save.cc

-                                             uint64_t expire_ms) {
+                                             uint64_t expire_ms, DbIndex dbid) {
+  SelectDb(dbid);
+  last_entry_db_index_ = dbid;


is SelectDb used anywhere else?
anyway, I think this line should be moved to SelectDb -line 261

true this is left overs from my implementation before some fix, now its not relevant. will move inside

romange · 2023-02-24T20:09:39Z

src/server/rdb_save.cc

@@ -701,17 +707,20 @@ io::Bytes RdbSerializer::PrepareFlush() {
  return mem_buf_.InputBuffer();
 }

-error_code RdbSerializer::WriteJournalEntries(absl::Span<const journal::Entry> entries) {
+error_code RdbSerializer::WriteJournalEntries(const journal::Entry& entry) {


now it becomes singular WriteJournalEntry

romange · 2023-02-24T20:10:33Z

src/server/snapshot.h

-  // Return if flushed.
-  bool FlushDefaultBuffer(bool force);
+  // Push serializer's internal buffer to channel.
+  // Push regradless of buffer size if force is true.


regradless -> regardless

Signed-off-by: adi_holden <[email protected]>

dranikpg · 2023-02-26T09:22:49Z

You didn't add the new FiberAtomicGuard, but we can do it separately when pulling new helio

bug(snapshot) : Do not preempt inside OnDbChange

2e964e2

Signed-off-by: adi_holden <[email protected]>

adiholden requested review from romange and dranikpg February 22, 2023 13:17

adiholden changed the title ~~bug(snapshot) : Do not preempt inside OnDbChange isuee #829~~ bug(snapshot) : Do not preempt inside OnDbChange issue #829 Feb 22, 2023

dranikpg reviewed Feb 22, 2023

View reviewed changes

romange reviewed Feb 24, 2023

View reviewed changes

PR fix

397f1c0

Signed-off-by: adi_holden <[email protected]>

adiholden requested review from romange and dranikpg February 26, 2023 08:45

PR fix

42b5bdf

Signed-off-by: adi_holden <[email protected]>

dranikpg approved these changes Feb 26, 2023

View reviewed changes

adiholden merged commit 4b44fb6 into main Feb 26, 2023

romange deleted the fix_829 branch March 1, 2023 15:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

bug(snapshot) : Do not preempt inside OnDbChange issue #829 #868

bug(snapshot) : Do not preempt inside OnDbChange issue #829 #868

adiholden commented Feb 22, 2023

dranikpg left a comment •

edited

Loading

dranikpg Feb 22, 2023 •

edited

Loading

romange Feb 24, 2023

dranikpg Feb 22, 2023

dranikpg Feb 22, 2023 •

edited

Loading

romange Feb 24, 2023

adiholden Feb 26, 2023

romange Feb 24, 2023

adiholden Feb 26, 2023

romange Feb 24, 2023

romange Feb 24, 2023

dranikpg commented Feb 26, 2023

bug(snapshot) : Do not preempt inside OnDbChange issue #829 #868

bug(snapshot) : Do not preempt inside OnDbChange issue #829 #868

Conversation

adiholden commented Feb 22, 2023

dranikpg left a comment • edited Loading

Choose a reason for hiding this comment

dranikpg Feb 22, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dranikpg Feb 22, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

dranikpg commented Feb 26, 2023

dranikpg left a comment •

edited

Loading

dranikpg Feb 22, 2023 •

edited

Loading

dranikpg Feb 22, 2023 •

edited

Loading