Runner changes: will use existing tokio::runtime or create one for itself and race condition fix #162

conorbros · 2021-09-01T02:45:44Z

Shotover's Runner will use an existing runtime if it is running inside one (e.g. in async tests) or create one for itself (same behaviour as before).

The Runtime builder was moved to another method (Runner::runtime) and we check if we can get a handle on the current runtime, otherwise create one and return the Handle and Runtime objects. run_spawn, run_block and with_observability_interface use this method to get a handle to start their tasks from. ShotoverManager was modified to hold a handle and runtime object. We have to store the runtime inside the ShotoverManager so that it does not go out of scope during tests and shut down.

I was also able to update tokio from 1.6.1 to 1.11.0.

shotover-proxy/tests/helpers/mod.rs

shotover-proxy/tests/runner/runtime_int_tests.rs

shotover-proxy/src/runner.rs

shotover-proxy/tests/helpers/mod.rs

shotover-proxy/src/runner.rs

shotover-proxy/tests/runner/runtime_int_tests.rs

shotover-proxy/src/runner.rs

shotover-proxy/tests/helpers/mod.rs

If a shutdown message is sent before the receiver's waiting to receive that shutdown message are created shotover will panic on the send

rukai

Now that I can see the size of the race condition fix I can see that should have gone in a separate PR.
But the changes arent too complicated so I should be able to manage reviewing as is.

shotover-proxy/src/config/topology.rs

shotover-proxy/src/runner.rs

shotover-proxy/tests/helpers/mod.rs

shotover-proxy/src/runner.rs

rukai · 2021-09-03T02:35:21Z

shotover-proxy/src/server.rs

-        // Cannot receive a "lag error" as only one value is ever sent.
-        self.notify.recv().await.unwrap();
+        // check we didn't receive a shutdown message before the receiver was created
+        if !*self.notify.borrow() {


this looks like a race condition, but also this whole shutdown abstraction looks kind of weird.

This code is adapted from the tokio mini-redis project.

I was wondering why there was so many comments... hehe.
Ill take a look

This snippet demonstrates why we need the check:

#[tokio::main] async fn main() { let (tx, mut rx) = tokio::sync::watch::channel(false); tx.send(true).unwrap(); rx.changed().await.unwrap(); println!("borrow rx: {}", *rx.borrow()); let mut rx2 = rx.clone(); rx2.changed().await.unwrap(); //hangs here! println!("borrow rx2: {}", *rx2.borrow()); }

Alright, fair enough.

shotover-proxy/tests/runner/runner_int_tests.rs

rukai · 2021-09-03T05:13:09Z

shotover-proxy/src/server.rs

-        // Cannot receive a "lag error" as only one value is ever sent.
-        self.notify.recv().await.unwrap();
+        // check we didn't receive a shutdown message before the receiver was created
+        if !*self.notify.borrow() {


This snippet demonstrates why we need the check:

#[tokio::main] async fn main() { let (tx, mut rx) = tokio::sync::watch::channel(false); tx.send(true).unwrap(); rx.changed().await.unwrap(); println!("borrow rx: {}", *rx.borrow()); let mut rx2 = rx.clone(); rx2.changed().await.unwrap(); //hangs here! println!("borrow rx2: {}", *rx2.borrow()); }

Alright, fair enough.

shotover-proxy/tests/helpers/mod.rs

rukai · 2021-09-03T05:31:46Z

shotover-proxy/tests/helpers/mod.rs

-                .unwrap()
-                .unwrap();
+            self.trigger_shutdown_tx.send(true).unwrap();
+            tokio::task::block_in_place(move || {


Is this doing what we think it is doing?
I dont see anything that actually uses the join handle in anyway.
Consider this screenshot with the ; removed.
We can see that its actually returning the JoinHandle (so the join handle must not be being used in any way)

If the block_in_place happens to give us a nice error maybe we can just stick it somewhere to give us that error.
Or maybe we can do whatever detection it is doing and give an even better error! Dont forget to include #[tokio::test(flavor = "multi_thread")] on your test! or something

Yeah, I'm pretty sure this isn't doing anything. 😂

Strange. With it removed shotover doesn't shutdown cleanly so I was under the impression it was blocking it until the join handle task finishes

Yeah I think we need this instead. block_in_place lets us enter an async context and then we can use the handle to run the join handle future to completion.

tokio::task::block_in_place(move || { self.runtime_handle .block_on(self.join_handle.take().unwrap()) .unwrap() .unwrap(); });

Ah that works.

However I'm thinking we should use block_in_place in Runner to assert we got a multithread runtime.
This way any user of Runner gets this assertion.
Using block_in_place is discouraged so we should revert back to entering the runtime rather than using block_in_place twice.

XA21X · 2021-09-03T06:09:52Z

shotover-proxy/src/sources/mod.rs

@@ -62,20 +62,20 @@ impl SourcesConfig {
        &self,
        chain: &TransformChain,
        topics: &mut TopicHolder,
-        trigger_shutdown_tx: broadcast::Sender<()>,
+        trigger_shutdown_rx: watch::Receiver<bool>,


Would it make sense to use Shutdown instead of watch everywhere else? The "whole shutdown abstraction looks kind of weird" is encapsulated by the abstraction so it seems a bit of a waste to reimplement it? Or is it important to keep the separation between what can initiate shutdowns and what can listen for it?

Yes, I think using Shutdown instead of watch is a good idea, but lets do that in a follow up PR.
Otherwise we risk hitting more issues and further blowing up the size of this PR.

rukai

LGTM

conorbros added 2 commits August 31, 2021 19:05

Runner use existing runtime or creates one for itself

92783da

Update tests

eea7f55

conorbros linked an issue Sep 1, 2021 that may be closed by this pull request

Add optionally configurable runtime to enable async integration tests #153

Closed

rukai requested changes Sep 1, 2021

View reviewed changes

conorbros added 3 commits September 1, 2021 14:06

change ShotoverManager method back to function

7177e0b

Refactor Runner init functions

c71c5f3

document the use of futures::executor::block_on in ShotoverManager

969814d

rukai reviewed Sep 1, 2021

View reviewed changes

shotover-proxy/src/runner.rs Outdated Show resolved Hide resolved

rukai reviewed Sep 1, 2021

View reviewed changes

shotover-proxy/tests/helpers/mod.rs Outdated Show resolved Hide resolved

conorbros added 7 commits September 1, 2021 17:12

use null transform for integration test

17d8dee

rename variables in RunnerSpawned and ShotoverManager

3ea7a84

rename _g to _enter_guard

98a9c36

Rename test ("rust") workflow to test

7d758f5

Fix race condition in startup

2d363b0

If a shutdown message is sent before the receiver's waiting to receive that shutdown message are created shotover will panic on the send

Test the cassandra source early shutdown

7dce35a

Merge branch 'switching-to-watch' into int-test-async

92bf921

conorbros changed the title ~~Runner use existing tokio::runtime or create one for itself~~ Runner changes: will existing tokio::runtime or create one for itself and race condition fix Sep 2, 2021

rukai mentioned this pull request Sep 2, 2021

Implement TLS support for incoming redis connections #141

Merged

3 tasks

rukai requested changes Sep 3, 2021

View reviewed changes

shotover-proxy/src/config/topology.rs Outdated Show resolved Hide resolved

shotover-proxy/src/runner.rs Outdated Show resolved Hide resolved

shotover-proxy/tests/helpers/mod.rs Outdated Show resolved Hide resolved

conorbros added 5 commits September 3, 2021 10:17

ShotoverManager drop will panic if run in a single threaded runtime

12406dc

Keep receiver alive in run_block

4c25c1b

Make wait_for_socket_to_open private again

7506109

Switch shutdown_complete channel back to mpsc

4d8c70d

Clean up some noise

70a9e47

rukai reviewed Sep 3, 2021

View reviewed changes

shotover-proxy/src/runner.rs Outdated Show resolved Hide resolved

Remove the arc and clone receiver instead for trigger shutdown channel

b582ee6

rukai reviewed Sep 3, 2021

View reviewed changes

shotover-proxy/src/runner.rs Outdated Show resolved Hide resolved

rukai reviewed Sep 3, 2021

View reviewed changes

shotover-proxy/tests/helpers/mod.rs Outdated Show resolved Hide resolved

Remove the receiver from ShotoverManager

b6258de

rukai reviewed Sep 3, 2021

View reviewed changes

XA21X reviewed Sep 3, 2021

View reviewed changes

Review feedback

b30bc25

conorbros requested a review from rukai September 6, 2021 01:36

rukai approved these changes Sep 6, 2021

View reviewed changes

XA21X approved these changes Sep 6, 2021

View reviewed changes

conorbros mentioned this pull request Sep 6, 2021

Remove lua support #172

Merged

rukai merged commit 3d713b4 into shotover:main Sep 6, 2021

conorbros changed the title ~~Runner changes: will existing tokio::runtime or create one for itself and race condition fix~~ Runner changes: will use existing tokio::runtime or create one for itself and race condition fix Sep 6, 2021

conorbros deleted the int-test-async branch September 6, 2021 04:08

rukai mentioned this pull request Sep 7, 2021

Remove unsafe usages #173

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Runner changes: will use existing tokio::runtime or create one for itself and race condition fix #162

Runner changes: will use existing tokio::runtime or create one for itself and race condition fix #162

conorbros commented Sep 1, 2021

rukai left a comment

rukai Sep 3, 2021 •

edited

Loading

conorbros Sep 3, 2021 •

edited

Loading

rukai Sep 3, 2021

rukai Sep 3, 2021

rukai Sep 3, 2021

rukai Sep 3, 2021

XA21X Sep 3, 2021

conorbros Sep 3, 2021

conorbros Sep 3, 2021 •

edited

Loading

rukai Sep 3, 2021 •

edited

Loading

XA21X Sep 3, 2021

rukai Sep 3, 2021 •

edited

Loading

rukai left a comment

Runner changes: will use existing tokio::runtime or create one for itself and race condition fix #162

Runner changes: will use existing tokio::runtime or create one for itself and race condition fix #162

Conversation

conorbros commented Sep 1, 2021

rukai left a comment

Choose a reason for hiding this comment

rukai Sep 3, 2021 • edited Loading

Choose a reason for hiding this comment

conorbros Sep 3, 2021 • edited Loading

Choose a reason for hiding this comment

rukai Sep 3, 2021

Choose a reason for hiding this comment

rukai Sep 3, 2021

Choose a reason for hiding this comment

rukai Sep 3, 2021

Choose a reason for hiding this comment

rukai Sep 3, 2021

Choose a reason for hiding this comment

XA21X Sep 3, 2021

Choose a reason for hiding this comment

conorbros Sep 3, 2021

Choose a reason for hiding this comment

conorbros Sep 3, 2021 • edited Loading

Choose a reason for hiding this comment

rukai Sep 3, 2021 • edited Loading

Choose a reason for hiding this comment

XA21X Sep 3, 2021

Choose a reason for hiding this comment

rukai Sep 3, 2021 • edited Loading

Choose a reason for hiding this comment

rukai left a comment

Choose a reason for hiding this comment

rukai Sep 3, 2021 •

edited

Loading

conorbros Sep 3, 2021 •

edited

Loading

conorbros Sep 3, 2021 •

edited

Loading

rukai Sep 3, 2021 •

edited

Loading

rukai Sep 3, 2021 •

edited

Loading