
Discussion: envoy internal connection #11725

Closed
lambdai opened this issue Jun 24, 2020 · 23 comments
Labels
area/http, stale (stalebot believes this issue/PR has not been touched recently)

Comments

@lambdai
Contributor

lambdai commented Jun 24, 2020

I was exploring dispatching h2 CONNECT to listeners of the same envoy process.
The mental topology is something like below.

curl -> H2 CONNECT -> ListenerA_HCM -> TCP -> ListenerB_TcpProxy.

Without any change, the listenerA_HCM -> TCP -> ListenerB_TcpProxy hop will be carried by a real TCP connection.
That is expensive in terms of

  1. Resource usage: a TCP 4-tuple and socket fd per connection, plus TIME_WAIT states and ephemeral port consumption.
  2. Latency: it takes an extra epoll cycle to relay the data.
  3. CPU usage: extra memory copies to and from the kernel.

From my point of view, where envoy's role is a sidecar, the tunnel HCM is pluggable while ListenerB_TcpProxy carries the business logic. The TcpProxy should need only minimal changes when ListenerA_HCM is switched on and off. I believe this deployment is significantly different from envoy as a central proxy.

To keep the change at the TcpProxy listener minimal, the reasonable approach is to create a non-socket-based connection that still provides the connection interface (read/write), so that

  1. The cluster used by the tunnel HCM can create such a virtual connection,
  2. The dispatcher creates both a VirtualClientConnection and a VirtualServerConnection,
  3. The TcpListener finds the matching filter chain and instantiates it for the VirtualServerConnection, optionally bypassing listener filters (see the sketch after this list).
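As a rough sketch of the idea (hypothetical names, not Envoy's actual Connection/SocketInterface types), the two virtual endpoints could simply share a pair of in-process buffers; the cluster side would hold the "client" endpoint and the listener side the "server" endpoint of the same pipe:

```cpp
#include <deque>
#include <memory>
#include <string>

// Shared state for one virtual connection: one buffer per direction.
struct VirtualPipe {
  std::deque<char> client_to_server;
  std::deque<char> server_to_client;
};

// One endpoint of the virtual connection. A real implementation would sit
// behind Envoy's Connection interface and also handle events, watermarks,
// close semantics, etc.; this only shows the read/write plumbing.
class VirtualEndpoint {
public:
  VirtualEndpoint(std::shared_ptr<VirtualPipe> pipe, bool is_client)
      : pipe_(std::move(pipe)), is_client_(is_client) {}

  // Append data to the peer-facing buffer. A real version would also wake
  // the peer, e.g. by scheduling a read event on its dispatcher.
  void write(const std::string& data) {
    auto& buf = is_client_ ? pipe_->client_to_server : pipe_->server_to_client;
    buf.insert(buf.end(), data.begin(), data.end());
  }

  // Drain whatever the peer has written so far.
  std::string read() {
    auto& buf = is_client_ ? pipe_->server_to_client : pipe_->client_to_server;
    std::string out(buf.begin(), buf.end());
    buf.clear();
    return out;
  }

private:
  std::shared_ptr<VirtualPipe> pipe_;
  bool is_client_;
};
```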

I think the major challenges include simulating delayed close, half close, and watermarks, supporting other transport sockets, etc. But not all of them are necessary at the very beginning.

This approach could be an optimization for any envoy cluster connecting to envoy itself.

Pros:

  1. Builds on top of the existing abstractions: listener, cluster, host, connection.
  2. The new functionality lives in extensions; the lifetime of the core components above is not impacted.
  3. Almost zero overhead for anyone who doesn't use it.
  4. Minimal control plane change.

Cons:

  1. The pipe-ish connection doesn't have the highest performance among the alternative solutions.
  2. The new simulated connection is not TCP feature-complete (though I doubt we need the whole feature set).

Alternative solutions
Cluster network filter, or HTTP upstream filter: the dispatch of connections is achieved by matching on clusters/routes instead of listeners. Migration from traditional listener filter chain matching to cluster matching is non-trivial, and it's a huge burden when switching the tunnel on and off.

CC @alyssawilk @PiotrSikora

@lambdai
Contributor Author

lambdai commented Jun 24, 2020

Looks like this is a specialization of #11618

@lambdai
Contributor Author

lambdai commented Jun 24, 2020

I looked at my experimental code. My major change is to strip the existing socket from Connection. Looks like I could use the socket interface instead.

@antoniovicente
Contributor

Would you get equivalent behavior if ListenerA_HCM were to select ListenerB_TcpProxy's upstream directly instead of going through some virtual connections? I guess that you need more than just listenerB's upstream since there may be some L4 filters that need to operate on the request, and the socket type for listenerB could be something like SSL, in which case you'd need to hook into https://wiki.openssl.org/index.php/BIO to perform SSL operations without an underlying real fd.

From past experience, mimicking an fd API without a real fd behind it is error-prone and challenging.
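For reference, OpenSSL's BIO pair is the usual way to drive TLS without a real fd: the SSL engine talks to one in-memory BIO while the application shuttles ciphertext through the other. A minimal sketch (error handling omitted, not tied to Envoy's transport socket code):

```cpp
#include <openssl/bio.h>
#include <openssl/err.h>
#include <openssl/ssl.h>

int main() {
  SSL_CTX* ctx = SSL_CTX_new(TLS_method());
  SSL* ssl = SSL_new(ctx);

  // Two connected in-memory BIOs: bytes written to one side become readable
  // on the other. No file descriptor is involved anywhere.
  BIO* internal_bio = nullptr;
  BIO* network_bio = nullptr;
  BIO_new_bio_pair(&internal_bio, 0 /* default size */, &network_bio, 0);

  // The SSL engine reads and writes the internal side of the pair.
  SSL_set_bio(ssl, internal_bio, internal_bio);
  SSL_set_connect_state(ssl);

  // Kick off the handshake; with no peer attached this stops and reports
  // SSL_ERROR_WANT_READ. The application would then move ciphertext between
  // network_bio (via BIO_read/BIO_write) and whatever transport it owns,
  // e.g. a virtual connection like the one discussed above.
  int rc = SSL_do_handshake(ssl);
  int err = SSL_get_error(ssl, rc);
  (void)err;  // expected: SSL_ERROR_WANT_READ

  SSL_free(ssl);  // also releases internal_bio
  BIO_free(network_bio);
  SSL_CTX_free(ctx);
  return 0;
}
```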

@antoniovicente
Contributor

I looked at my experimental code. My major change is to strip the existing socket from Connection. Looks like I could use the socket interface instead.

Which connection? You still need to go through the H2 codec on listenerA and there could even be multiple H2 CONNECTs active on the same client connection. Passing A's client connection to B's listener won't work.

@lambdai
Contributor Author

lambdai commented Jun 24, 2020

Would you get equivalent behavior if ListenerA_HCM were to select ListenerB_TcpProxy's upstream directly instead of going through some virtual connections? I guess that you need more than just listenerB's upstream since there may be some L4 filters that need to operate on the request, and the socket type for listenerB could be something like SSL, in which case you'd need to hook into https://wiki.openssl.org/index.php/BIO to perform SSL operations without an underlying real fd.

I agree with you that a 100% fd API is extremely hard, especially when listenerB has corner-case options.

But there are questions to answer first:

  1. Does listenerB need to handle both real TCP connections and virtual connections?
  2. Does listenerB need to implement all the TCP listener features?

In my scenario, both answers are NO.
I am inclined to introduce another UserspacePipeListener. It starts with a self-defined socket interface and buffer-destination/buffer-source transport sockets (which excludes SSL), but it can carry L4 filters, including non-terminating ones (e.g. authz) and terminating ones (e.g. tcp proxy).
I am fine with that because in istio the control plane owns both listenerA_HCM and listenerB_TcpProxy. If ListenerB has an exotic filter or uses a not-yet-implemented feature, listenerA_HCM can choose to go through real TCP connections instead.

@lambdai
Contributor Author

lambdai commented Jun 24, 2020

Regarding the SSL concern: it's fine with me if the control plane creates a listenerB1 terminating SSL connections and a listenerB2 terminating virtual connections.

@lambdai
Contributor Author

lambdai commented Jun 24, 2020

I looked at my experimental code. My major change is to strip the existing socket from Connection. Looks like I could use the socket interface instead.

Which connection? You still need to go through the H2 codec on listenerA and there could even be multiple H2 CONNECTs active on the same client connection. Passing A's client connection to B's listener won't work.

Oh, my. It looks like you are much more ambitious than my plan.
Original: client <-C1, connect stream-> HCM <-C2,rawtcp-> TcpProxy <-C3, rawtcp-> server
My plan: client <-C1, connect stream-> HCM <-C2,virtual-> TcpProxy <-C3, rawtcp -> server

My proposal affects only C2. The HCM still needs to relay the bytes to C2 from the CONNECT stream in C1. I am not going to relay the fd to C2 or C3.

In terms of the SocketInterface, C2 is a non-fd socket from HCM's point of view.

Does it make sense?

@antoniovicente
Contributor

antoniovicente commented Jun 25, 2020

I looked at my experimental code. My major change is to strip the existing socket from Connection. Looks like I could use the socket interface instead.

Which connection? You still need to go through the H2 codec on listenerA and there could even be multiple H2 CONNECTs active on the same client connection. Passing A's client connection to B's listener won't work.

Oh, my. It looks like you are much more ambitious than my plan.
Original: client <-C1, connect stream-> HCM <-C2,rawtcp-> TcpProxy <-C3, rawtcp-> server
My plan: client <-C1, connect stream-> HCM <-C2,virtual-> TcpProxy <-C3, rawtcp -> server

My proposal affects only C2. The HCM still needs to relay the bytes to C2 from the CONNECT stream in C1. I am not going to relay the fd to C2 or C3.

In terms of the SocketInterface, C2 is a non-fd socket from HCM's point of view.

Does it make sense?

I think the original picture is actually:
client <-C1, connect stream-> HCM <-upstream connection C2-> <-C3 listener connection-> TcpProxy <-C4, rawtcp-> server

Use of a socket pair or pipe for C2 and C3 seems like a good place to start in order to work around the local ephemeral port limits issue. Most things would continue to work in that case. Yielding to the event loop between writes to C2 and reads from C3 is actually a good thing; you want to allow other players to take a turn. I also think it's important to keep call stacks relatively shallow to reduce complexity.

Going beyond that, it depends on what layer you'd like to hook virtual connections into. Network::ConnectionImpl has several important responsibilities related to handling of high/low watermarks. IOHandle doesn't have a way to schedule events directly and needs to rely on Event::FileEvent or TransportSocketCallbacks::setReadBufferReady() to schedule resumption. Operations that get the local and remote socket address from the socket may also present some difficulties.

Our current non-Envoy based system uses virtual connections for a lot of stuff. I think the move to SSL BIO for virtual connections may be unavoidable. It is mostly a matter of getting it implemented.

@lambdai
Contributor Author

lambdai commented Jun 25, 2020

client <-C1, connect stream-> HCM <-upstream connection C2-> <-C3 listener connection-> TcpProxy <-C4, rawtcp-> server

Good point!

Use of a socket pair or pipe for C2 and C3 seems like a good place to start in order to work around the local ephemeral port limits issue.

Agree. An envoy connection wrapping a socket pair, plus "connection attributes" and "connect()", solves the ephemeral port problem.
It solves everything except the overhead of the byte copies and the fact that the data is relayed only on the next poll cycle.
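For illustration, a minimal socketpair(2) sketch of that idea: a connected AF_UNIX pair consumes no IP 4-tuple and no ephemeral port, and leaves no TIME_WAIT state, though the bytes still cross the kernel.

```cpp
#include <sys/socket.h>
#include <unistd.h>

#include <cstdio>
#include <cstring>

int main() {
  int fds[2];
  // fds[0] could back the upstream connection (C2) and fds[1] the
  // listener-accepted connection (C3); both are already "connected".
  if (socketpair(AF_UNIX, SOCK_STREAM, 0, fds) != 0) {
    perror("socketpair");
    return 1;
  }

  const char msg[] = "relayed from the CONNECT stream";
  write(fds[0], msg, sizeof(msg));  // HCM/upstream side writes...

  char buf[64] = {};
  read(fds[1], buf, sizeof(buf));   // ...TcpProxy/listener side reads.
  printf("received: %s\n", buf);

  close(fds[0]);
  close(fds[1]);
  return 0;
}
```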

@lambdai
Contributor Author

lambdai commented Jun 25, 2020

Going beyond that, it depends on what layer you'd like to hook virtual connections into. Network::ConnectionImpl has several important responsibilities related to handling of high/low watermarks. IOHandle doesn't have a way to schedule events directly and needs to rely on Event::FileEvent or TransportSocketCallbacks::setReadBufferReady() to schedule resumption. Operations that get the local and remote socket address from the socket may also present some difficulties.

I see the challenges there. I cannot even close my virtual connection at this moment :(

@lambdai
Contributor Author

lambdai commented Jun 30, 2020

I have an end-to-end demo showing that we can chain HCM and TcpProxy without going through an OS fd. It actually saves 2 fds; see C2 and C3 above.

https://github.com/lambdai/envoy-dai/tree/hostconn

@mattklein123
Member

One other comment: this issue feels very similar to what we have already done with the API listener, which is being used by Envoy Mobile. We have already discussed having a TCP variant of the API listener which would work with TCP proxy type setups. I wonder if that would help here? Basically maybe this is a special upstream/socket which actually loops back through an API listener interface? cc @junr03 @goaway

@lambdai
Contributor Author

lambdai commented Jul 14, 2020

We have already discussed having a TCP variant of the API listener which would work with TCP proxy type setups.

Great to see that looping back into a listener has a wider use case! To me the "tcp proxy type" is a little vague.

What I am proposing here is to hide the loop between the upstream cluster and the destination listener.

@mattklein123
Member

Sorry, my point is that you can have an API listener variant that injects raw TCP bytes vs. HTTP messages.

@lambdai
Contributor Author

lambdai commented Jul 15, 2020

I brainstormed the usage of ApiListener and it probably works if we

  1. Support listener update.
  2. Can name and reference the listener via the socket interface ("api://<api_listener_name>").
  3. Expand the API listener to be per worker thread.
  4. Allow attaching more than the terminating L4 filter (HCM, maybe TcpProxy).

It requires extra complexity added to SyntheticConnection.

I think we are heading toward a similar goal: obtaining a user-space connection not backed by an OS fd.

@chradcliffe
Contributor

The issue here refers to h2 CONNECT, but am I correct in thinking that this would also apply to HTTP/1.1 CONNECT?

A few months back I posted to the envoy-dev mailing list asking about the best approach to inspect traffic over HTTP/1.1 CONNECT -- in particular, how we would go about running the CONNECT-tunneled traffic through the HTTP stack. The solution that was suggested -- creating an "inner" HCM -- more or less works, but being able to feed the CONNECT tunnel back into a listener sounds like a much better solution, and it sounds like it would also give us the ability to process tunneled TLS.

@stale

stale bot commented Aug 24, 2020

This issue has been automatically marked as stale because it has not had activity in the last 30 days. It will be closed in the next 7 days unless it is tagged "help wanted" or other activity occurs. Thank you for your contributions.

@stale stale bot added the stale stalebot believes this issue/PR has not been touched recently label Aug 24, 2020
@lambdai
Contributor Author

lambdai commented Aug 27, 2020

Design doc updated to reflect the dev branch:
https://docs.google.com/document/d/1ok9oyMw-39lUXcO8ihhY2bL6b5gCoyK3-UyFoPEWCgo/edit?usp=sharing

The strawman branch mainly describes the byte stream after the connection is established.
The connection establishment is somewhat orthogonal.

@stale stale bot removed the stale stalebot believes this issue/PR has not been touched recently label Aug 27, 2020
@ggreenway
Contributor

A couple ideas on the topic:

  • This could combine nicely with multiple addresses per listener (Support multiple addresses per listener #11184)
  • If it's combined with multiple addresses, we could skip TLS for the local case (if the listener was configured for TLS).

@github-actions

github-actions bot commented Dec 9, 2020

This issue has been automatically marked as stale because it has not had activity in the last 30 days. It will be closed in the next 7 days unless it is tagged "help wanted" or "no stalebot" or other activity occurs. Thank you for your contributions.

@github-actions github-actions bot added the stale stalebot believes this issue/PR has not been touched recently label Dec 9, 2020
@github-actions

This issue has been automatically closed because it has not had activity in the last 37 days. If this issue is still valid, please ping a maintainer and ask them to label it as "help wanted" or "no stalebot". Thank you for your contributions.

@YaoZengzeng
Member

@lambdai Based on my research, this "internal connection" does not seem to be finished yet? Is there rough data on the performance improvement compared with the original way (connection through the kernel)? Thanks! :)

@liverbirdkte
Contributor

@lambdai I have a use case with two listeners listenerA_HCM and listenerB_HCM:
curl -> H2 CONNECT -> ListenerA_HCM -> TLS -> ListenerB_HCM -> Upstream server. listenerA_HCM is used for tunneling; ListenerB_HCM terminates the TLS connection and inspects the payload to do some security checks.

Do you have any plan to support this SSL use case? Thanks
