Network deadlock because of mutex locking order #29347

Closed
ohitz opened this issue Oct 20, 2020 · 9 comments · Fixed by #29351
Labels: area: Networking · bug · priority: medium

Comments

ohitz (Contributor) commented Oct 20, 2020

Describe the bug

It is possible to deadlock the entire network stack with TCP. If this happens, the network is completely blocked, not even ICMP traffic works anymore.

The reason is that the send and receive paths each lock two mutexes in different order.

  • Receive path: z_work_q_main() -> process_rx_packet() -> net_rx() -> process_data() -> net_ipv4_input() -> net_conn_input() -> tcp_recv() -> tcp_in() (locks conn->lock) -> tcp_data_get() -> net_context_packet_received() (locks context->lock)
  • Send path: send() -> zsock_send() -> zsock_sendto() -> z_impl_zsock_sendto() -> sock_sendto_vmeth() -> zsock_sendto_ctx() -> net_context_send() (locks context->lock) -> context_sendto() -> net_tcp_queue_data() (locks conn->lock)

The deadlock happens when each path has acquired its first mutex and then tries to acquire the one already held by the other path. At that point both paths wait forever for each other to release their locks.
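
To make the ordering concrete, here is a minimal, self-contained sketch of the same AB-BA pattern using two Zephyr threads and two k_mutex objects. The names (lock_a, lock_b, rx_path, send_path) and the thread definitions are invented for illustration and are not the actual stack code:

```c
/*
 * Hypothetical reduction of the reported AB-BA deadlock: two threads take
 * the same two mutexes in opposite order, mirroring the RX and send paths
 * described above.
 */
#include <zephyr.h>

K_MUTEX_DEFINE(lock_a); /* stands in for conn->lock */
K_MUTEX_DEFINE(lock_b); /* stands in for context->lock */

static void rx_path(void *p1, void *p2, void *p3)
{
	ARG_UNUSED(p1); ARG_UNUSED(p2); ARG_UNUSED(p3);

	/* tcp_in() takes conn->lock first ... */
	k_mutex_lock(&lock_a, K_FOREVER);
	k_sleep(K_MSEC(10)); /* widen the race window, as in the repro */
	/* ... then net_context_packet_received() takes context->lock */
	k_mutex_lock(&lock_b, K_FOREVER);

	k_mutex_unlock(&lock_b);
	k_mutex_unlock(&lock_a);
}

static void send_path(void *p1, void *p2, void *p3)
{
	ARG_UNUSED(p1); ARG_UNUSED(p2); ARG_UNUSED(p3);

	/* net_context_send() takes context->lock first ... */
	k_mutex_lock(&lock_b, K_FOREVER);
	k_sleep(K_MSEC(10));
	/* ... then net_tcp_queue_data() wants conn->lock: both threads now
	 * wait on each other forever.
	 */
	k_mutex_lock(&lock_a, K_FOREVER);

	k_mutex_unlock(&lock_a);
	k_mutex_unlock(&lock_b);
}

K_THREAD_DEFINE(rx_tid, 1024, rx_path, NULL, NULL, NULL, 5, 0, 0);
K_THREAD_DEFINE(tx_tid, 1024, send_path, NULL, NULL, NULL, 5, 0, 0);
```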

To Reproduce

This problem is very sensitive to timing. It can be reliably reproduced by inserting a short sleep right before acquiring the mutex in net_tcp_queue_data(). With that in place, a small modification to the echo server in the samples can be used to demonstrate the problem.

Steps to reproduce the behavior:

  1. Patch the TCP stack and the echo server:
diff --git a/samples/net/sockets/echo_server/src/tcp.c b/samples/net/sockets/echo_server/src/tcp.c
index 0ff35a1852..034e2e7c7b 100644
--- a/samples/net/sockets/echo_server/src/tcp.c
+++ b/samples/net/sockets/echo_server/src/tcp.c
@@ -129,6 +129,8 @@ static void handle_data(void *ptr1, void *ptr2, void *ptr3)
 
        client = data->tcp.accepted[slot].sock;
 
+       sendall(client, "HELLO", sizeof("HELLO"));
+
        do {
                received = recv(client,
                        data->tcp.accepted[slot].recv_buffer + offset,
diff --git a/subsys/net/ip/tcp2.c b/subsys/net/ip/tcp2.c
index 36d6b191d1..ff7d64d5bd 100644
--- a/subsys/net/ip/tcp2.c
+++ b/subsys/net/ip/tcp2.c
@@ -1550,6 +1550,8 @@ int net_tcp_queue_data(struct net_context *context, struct net_pkt *pkt)
                return -ENOTCONN;
        }
 
+       k_sleep(K_MSEC(10));
+
        k_mutex_lock(&conn->lock, K_FOREVER);
 
        if (tcp_window_full(conn)) {
  2. Compile the echo server and flash it (tested on MIMXRT1060-EVK)
  3. Connect to the echo server, send data and close the connection right away: echo -n "test" | nc -v -N 192.0.2.1 4242

The stack is blocked, not even ping works anymore.

Expected behavior

The stack should not block.

Impact

This is a textbook lock-ordering deadlock, and it takes down the entire network stack.

Environment

ohitz added the bug label Oct 20, 2020
jukkar (Member) commented Oct 20, 2020

> The reason is that the send and receive paths each lock two mutexes in different order.
>
> * Receive path: `z_work_q_main()` -> `process_rx_packet()` -> `net_rx()` -> `process_data()` -> `net_ipv4_input()` -> `net_conn_input()` -> `tcp_recv()` -> `tcp_in()` **(locks conn->lock)** -> `tcp_data_get()` -> `net_context_packet_received()` **(locks context->lock)**
>
> * Send path: `send()` -> `zsock_send()` -> `zsock_sendto()` -> `z_impl_zsock_sendto()` -> `sock_sendto_vmeth()` -> `zsock_sendto_ctx()` -> `net_context_send()` **(locks context->lock)** -> `context_sendto()` -> `net_tcp_queue_data()` **(locks conn->lock)**

One thing which is a bit of a mystery here is that the RX thread in the receive path and the application thread are separate threads. So even if net_context_packet_received() calls the socket recv_cb() with the context locked, the application thread will just block when trying to take the context lock, in which case the RX thread should be able to continue running. Of course, if the RX thread is never able to run because of priorities, then the unlock will not happen.
Anyway, one easy "fix" here is to just release the context lock in net_context_packet_received() before calling recv_cb().
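
For illustration, the proposed pattern would look roughly like the sketch below. deliver_to_application and recv_cb_t are invented stand-ins; the real change in #29351 operates on the actual net_context receive callback in net_context_packet_received():

```c
#include <zephyr.h>
#include <net/net_context.h>
#include <net/net_pkt.h>

/* Stand-in for the real net_context receive callback type. */
typedef void (*recv_cb_t)(struct net_context *context, struct net_pkt *pkt,
			  void *user_data);

/* Sketch only: the essential shape of "release the lock before calling out". */
static void deliver_to_application(struct net_context *context,
				   struct net_pkt *pkt,
				   recv_cb_t cb, void *user_data)
{
	/* Drop context->lock before calling into the application so that a
	 * concurrent send() path (net_context_send()) can take it without
	 * waiting on the RX thread.
	 */
	k_mutex_unlock(&context->lock);

	cb(context, pkt, user_data);

	/* Re-acquire the lock before returning, since the caller still
	 * expects to hold it.
	 */
	k_mutex_lock(&context->lock, K_FOREVER);
}
```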

ohitz (Contributor, Author) commented Oct 20, 2020

@jukkar there is still something odd with your PR #29351. When the sleep is left in net_tcp_queue_data(), the echo server still locks the network stack as above, trying to lock the mutex in net_tcp_queue_data().

jukkar (Member) commented Oct 20, 2020

> @jukkar there is still something odd with your PR #29351. When the sleep is left in net_tcp_queue_data(), the echo server still locks the network stack as above, trying to lock the mutex in net_tcp_queue_data().

Ok, thanks for the info. I will try to replicate the issue.

jukkar added a commit to jukkar/zephyr that referenced this issue Oct 21, 2020
Release the context lock before passing data to the application
socket as that might cause deadlock if the application is run
before the RX thread and it starts to send data and if the RX
thread is never able to run (because of priorities etc).

Fixes zephyrproject-rtos#29347

Signed-off-by: Jukka Rissanen <[email protected]>
jukkar (Member) commented Oct 21, 2020

@ohitz thanks for the good analysis. I think the new version of the fix will solve the issue; at least I was no longer able to replicate the hang. In the fix, we do not pass data to the application with the TCP conn->lock held.

ohitz (Contributor, Author) commented Oct 21, 2020

Thanks @jukkar, I have tried the new fix. Unfortunately, there is a NULL pointer access now at subsys/net/ip/tcp2.c:1494 if the client closes the connection. The echo server doesn't crash, but you can see it if you insert an assertion which checks conn->context != NULL right there.
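
For reference, the kind of check being described might look like the sketch below. The helper name and includes are assumptions, and the exact spot in tcp2.c where conn->context is dereferenced may differ:

```c
#include <sys/__assert.h>
#include "tcp2_priv.h" /* struct tcp, when built as part of the stack */

/* Sketch only: assert that the TCP connection is still attached to a
 * net_context before it is dereferenced in the data-delivery path. If the
 * peer has already closed the connection, conn->context can be NULL here.
 */
static inline void assert_context_attached(struct tcp *conn)
{
	__ASSERT(conn->context != NULL,
		 "conn %p has no net_context (connection already closed?)",
		 conn);
}
```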

jukkar (Member) commented Oct 21, 2020

> Unfortunately, there is a NULL pointer access now at subsys/net/ip/tcp2.c:1494 if the client closes the connection.

Indeed, the connection handler disappears. This is an easy fix; I will send a new version in a few minutes.

jukkar (Member) commented Oct 21, 2020

I remember there were some commits in another TCP issue that made it possible to catch null pointer access violations on this hardware, and you seem to have that applied. Have you considered upstreaming that properly? It could be quite useful to have.

ohitz (Contributor, Author) commented Oct 21, 2020

I can confirm the fix now works properly, thanks @jukkar!

As for the MPU configuration to catch null pointer access violations, I'll check with my colleague.

armandciejak (Contributor) commented

@jukkar I have no time to clean up our patch right now, but I can publish it on GitHub if someone wants to take it over. It is not ready for a pull request yet since it is very specific to our target.

MaureenHelm pushed a commit that referenced this issue Oct 21, 2020
Release the context lock before passing data to the application
socket as that might cause deadlock if the application is run
before the RX thread and it starts to send data and if the RX
thread is never able to run (because of priorities etc).

Fixes #29347

Signed-off-by: Jukka Rissanen <[email protected]>