You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
gRPC-go allocates two buffers per transport: one for reads, and one for writes. They are allocated on transport creation in http_util.go. With their respective default sizes of 32KB and 64KB, those buffers can account for a significant part of grpc-go memory footprint. This is especially noticeable when there are a high number of mostly idle connections.
As an example, with 1000 connections open, those buffers consume 100MB of memory. We first noticed this problem server-side in our deployments when using the round robin balancer in services that have a lot of clients (in this use case, setting up some form of subsetting would help). Another scenario where this issue is prominent is for control planes with streaming, such as xDS management servers.
It is possible to disable or to tune the size of those buffers by setting ReadBufferSize/WriteBufferSize on the server side and WithReadBufferSize/WithWriteBufferSize on the client side, but there are performance implications. From reading the comments and code, those buffers are meant to batch calls to the transport, thereby reducing the number of system calls: the code dealing with frames reads and writes with size 9 bytes (HTTP2 frame header to get the frame size) immediately followed by a read or write of the size of the frame. This means that in most cases, when the socket becomes readable or there is data to be written, at least two syscalls can be batched.
From what I could read, the benchmarks present in this repository focus on latency and throughput, and it is not surprising that those buffers improve the results. Note also that the benchmarks override the buffer size to 128KB, both for servers and clients. This is a problem because some benchmark results may not reflect the reality of what most users are experiencing. The benchmarks do not account for fixed memory usage per connection.
Things I'm trying to get out of this issue are:
Gains to be had by changing the implementation without significant performance loss. In particular, it is unclear to me why the write buffer does not use bufio.Writer and instead has a custom implementation that allocates a buffer of twice the passed in size (2*32KB by default). This was introduced in this change: transport: refactor to reduce lock contention and improve performance #1962.
Make sure that benchmarks use the default values for buffer sizes -- I'm happy to propose the fix for this.
Opportunities for documenting how to tune those values based on use cases.
Explore whether there is value in adding read and write buffer sizes as a benchmark dimension, as well as measuring the cost of idle connections.
More creative ideas to improve memory usage when there are many mostly idle connections (e.g. dynamically tune buffer size based on activity). We are somewhat limited by the way IOs are done in go: we can't just allocate memory when a call to read is ready, since socket readiness and read are not separate (see net: add mechanism to wait for readability on a TCPConn golang/go#15735).
The text was updated successfully, but these errors were encountered:
@atollena : We discussed this issue today and we've decided to create issues for the list of things you've mentioned in here. That way, we can keep the discussions scoped in the respective issues/PRs.
gRPC-go allocates two buffers per transport: one for reads, and one for writes. They are allocated on transport creation in http_util.go. With their respective default sizes of 32KB and 64KB, those buffers can account for a significant part of grpc-go memory footprint. This is especially noticeable when there are a high number of mostly idle connections.
As an example, with 1000 connections open, those buffers consume 100MB of memory. We first noticed this problem server-side in our deployments when using the round robin balancer in services that have a lot of clients (in this use case, setting up some form of subsetting would help). Another scenario where this issue is prominent is for control planes with streaming, such as xDS management servers.
It is possible to disable or to tune the size of those buffers by setting ReadBufferSize/WriteBufferSize on the server side and WithReadBufferSize/WithWriteBufferSize on the client side, but there are performance implications. From reading the comments and code, those buffers are meant to batch calls to the transport, thereby reducing the number of system calls: the code dealing with frames reads and writes with size 9 bytes (HTTP2 frame header to get the frame size) immediately followed by a read or write of the size of the frame. This means that in most cases, when the socket becomes readable or there is data to be written, at least two syscalls can be batched.
From what I could read, the benchmarks present in this repository focus on latency and throughput, and it is not surprising that those buffers improve the results. Note also that the benchmarks override the buffer size to 128KB, both for servers and clients. This is a problem because some benchmark results may not reflect the reality of what most users are experiencing. The benchmarks do not account for fixed memory usage per connection.
Things I'm trying to get out of this issue are:
bufio.Writer
and instead has a custom implementation that allocates a buffer of twice the passed in size (2*32KB by default). This was introduced in this change: transport: refactor to reduce lock contention and improve performance #1962.The text was updated successfully, but these errors were encountered: