rfc: the state of pageserver tenant relocation #3868

LizardWizzard · 2023-03-22T17:21:35Z

SomeoneToIgnore

RFC clearly promotes the simplistic approach, but generally curious is it possible to make things work using the broker instead of some console commands, and have the simplistic approach implemented through that?

One idea: in the current model, we don't have multiple PS nodes with the tenant attach for a long time — the process is not normal and we use various calls to suppress the S3 writes, for example.

Since we still target the broker-based coordinated approach, can we use it now for the simplistic approach?
If PS nodes are able to detect through broker when a certain tenant gets attached on multiple nodes, they can stop background operations and S3 writes that may lead to discrepancies, and only start them when all but one PS nodes are left with this tenant attached.
That would save us multiple extra console calls for stopping the tasks and allow us to transition towards the coordination-based approach more gradually.

docs/rfcs/023-the-state-of-pageserver-tenant-relocation.md

problame

Very much in favor of the simplest approach, as the current plan is to make relocation a human-initiated "back-office" type operation.

Let's revisit east-west traffic between pageservers when we get to sharding.

docs/rfcs/023-the-state-of-pageserver-tenant-relocation.md

LizardWizzard · 2023-03-23T13:04:14Z

RFC clearly promotes the simplistic approach, but generally curious is it possible to make things work using the broker instead of some console commands, and have the simplistic approach implemented through that?

IMO this is the the distinction between simplistic approach and more complicated ones. How do you see it?

Theoretically we can move relocation logic to pageserver and let pageserver be the oracle who manages things, but it makes certain things more complicated, because 1) we can loose pageserver 3) changes need to be made in control plane database, so pageserver will need to go to it to change pageserver id, and to schedule availability check.

Actually I like the idea from the point of testing. We can mock availability check and call to change pageserver id in our local control plane. So step two can be to move in this direction. I think this is a direction that worth exploring. Thoughts?

LizardWizzard · 2023-03-23T14:32:43Z

Thinking about that a bit more, I think the main problem will be handling possible pageserver outages, so another pageserver can pick it up and finish the process. For console this problem is already solved with operations management.

How this can look like?

SomeoneToIgnore · 2023-03-23T15:49:03Z

I think, the relocation could be kept as simplistic as it gets, but the background process handling could be done better than handling HTTP requests — hoped that syncing over the broker is not much harder than doing calls from the console.

LizardWizzard · 2023-03-23T15:52:24Z

I think, the relocation could be kept as simplistic as it gets, but the background process handling could be done better than handling HTTP requests — hoped that syncing over the broker is not much harder than doing calls from the console.

I'm not sure how this can look like, can you elaborate? Maybe write down the possible sequence? (can be high level and imprecise)

docs/rfcs/023-the-state-of-pageserver-tenant-relocation.md

github-actions · 2023-05-10T14:41:07Z

992 tests run: 944 passed, 0 failed, 48 skipped (full report for `45ff311`)

Flaky tests

PostgreSQL 15 (release build)

✅ test_runner/regress/test_remote_storage.py::test_remote_storage_upload_queue_retries[local_fs]

_{The comment gets automatically updated with the latest test results ♻️}

LizardWizzard requested review from problame, koivunej, shanyp and SomeoneToIgnore March 22, 2023 17:21

SomeoneToIgnore reviewed Mar 23, 2023

View reviewed changes

docs/rfcs/023-the-state-of-pageserver-tenant-relocation.md Show resolved Hide resolved

docs/rfcs/023-the-state-of-pageserver-tenant-relocation.md Show resolved Hide resolved

problame reviewed Mar 23, 2023

View reviewed changes