Support grpc keep alive server parameters #4402

anjmao · 2019-08-06T12:43:26Z

Is this a BUG REPORT or FEATURE REQUEST? (choose one): FEATURE REQUEST

NGINX Ingress controller version:

rancher/nginx-ingress-controller:0.21.0-rancher3

Kubernetes version (use kubectl version):

Client Version: version.Info{Major:"1", Minor:"11", GitVersion:"v1.11.2", GitCommit:"bb9ffb1654d4a729bb4cec18ff088eacc153c239", GitTreeState:"clean", BuildDate:"2018-08-08T16:31:10Z", GoVersion:"go1.10.3", Compiler:"gc", Platform:"darwin/amd64"}
Server Version: version.Info{Major:"1", Minor:"11", GitVersion:"v1.11.6", GitCommit:"b1d75deca493a24a2f87eb1efde1a569e52fc8d9", GitTreeState:"clean", BuildDate:"2018-12-16T04:30:10Z", GoVersion:"go1.10.3", Compiler:"gc", Platform:"linux/amd64"}

Environment:
AWS, Rancher

What happened:

I have a gRPC backend written in Go and a mobile client written in Swift that uses swift-grpc.
On the Go backend I have the following keepalive policy:

keepalivePolicy = keepalive.EnforcementPolicy{
	MinTime:             5 * time.Second, // If a client pings more than once every x duration, terminate the connection.
	PermitWithoutStream: false,           // false: client pings are not permitted when there are no active streams
}

keepaliveParams = keepalive.ServerParameters{
	MaxConnectionIdle:     1 * time.Hour,    // If a client is idle for given duration, send a GOAWAY.
	MaxConnectionAge:      1 * time.Hour,    // If any connection is alive for more than given duration, send a GOAWAY.
	MaxConnectionAgeGrace: 10 * time.Second, // Allow given duration for pending RPCs to complete before forcibly closing connections
	Time:                  10 * time.Second, // Ping the client if it is idle for given duration to ensure the connection is still active.
	Timeout:               5 * time.Second,  // Wait given duration for the ping ack before assuming the connection is dead.
}

NGINX Ingress is used to load balance and terminate TLS traffic.

apiVersion: extensions/v1beta1
kind: Ingress
metadata:
  annotations:
    nginx.ingress.kubernetes.io/backend-protocol: "GRPC"
    nginx.ingress.kubernetes.io/server-snippet: |
      grpc_read_timeout 600s;
      grpc_send_timeout 600s;
      client_body_timeout 600s;

My goal is a long-lived bidirectional streaming RPC so the client can receive incoming updates from the backend. Also, if the client is disconnected (say, its internet connection is disabled), I want the server to detect this as fast as possible (ideally within 10 seconds). Currently my gRPC server sends a keepalive ping every 10 seconds and the NGINX proxy acknowledges the ping, but NGINX itself does not ping the client.

What you expected to happen:
I expect the NGINX Ingress to have settings that allow configuring a gRPC keepalive policy, something like:

grpc_keepalive_time 10s;
grpc_keepalive_timeout 5s;

Or, even better, to just forward gRPC ping frames to the client.

I found a similar issue on the Envoy proxy, but it seems to be fixed now: envoyproxy/envoy#2086

thetruechar · 2019-10-25T08:35:18Z

nginx.ingress.kubernetes.io/server-snippet: |
  grpc_read_timeout 600s;
  grpc_send_timeout 600s;
  client_body_timeout 600s;

This saved my day, thank you!

fejta-bot · 2020-01-23T09:00:00Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle stale

fejta-bot · 2020-02-22T09:41:34Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/lifecycle rotten

fejta-bot · 2020-03-23T10:26:10Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

k8s-ci-robot · 2020-03-23T10:26:27Z

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-testing, kubernetes/test-infra and/or fejta.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

PI-Victor · 2021-02-11T23:40:53Z

I think I want to take a closer look at this, since I ran into the same issue.

/remove-lifecycle rotten
/assign

PI-Victor · 2021-02-11T23:43:13Z

/reopen

k8s-ci-robot · 2021-02-11T23:43:21Z

@PI-Victor: Reopened this issue.

In response to this:

/reopen

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

fejta-bot · 2021-05-13T00:31:07Z

Issues go stale after 90d of inactivity.
Mark the issue as fresh with /remove-lifecycle stale.
Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle stale

fejta-bot · 2021-06-12T01:00:57Z

Stale issues rot after 30d of inactivity.
Mark the issue as fresh with /remove-lifecycle rotten.
Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

Send feedback to sig-contributor-experience at kubernetes/community.
/lifecycle rotten

fejta-bot · 2021-07-12T01:09:57Z

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

k8s-ci-robot · 2021-07-12T01:10:03Z

@fejta-bot: Closing this issue.

In response to this:

Rotten issues close after 30d of inactivity.
Reopen the issue with /reopen.
Mark the issue as fresh with /remove-lifecycle rotten.

Send feedback to sig-contributor-experience at kubernetes/community.
/close

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.

nnewc · 2021-11-22T21:06:41Z

nginx.ingress.kubernetes.io/server-snippet: |
  grpc_read_timeout 600s;
  grpc_send_timeout 600s;
  client_body_timeout 600s;

Is there an alternative to this? Because of kubernetes/kubernetes#126811 server-snippets are disabled in my cluster (RKE2).

danielleiszen · 2022-12-19T16:33:13Z

I am not sure if this is related, but I terminate TLS for a gRPC upstream with NGINX Ingress. After 1 minute of inactivity the stream stops, regardless of the idle timeout, connection timeout, or any other timeout I specify on the client side when opening the stream. The gRPC server logs clearly show a 1-minute timeout that is NOT specified anywhere in the configuration or call options for the service.

Furthermore, I cannot reproduce the behaviour without the NGINX Ingress, so I suspect that NGINX interferes with the call options for the gRPC stream during TLS termination. My ingress annotations are as simple as:

  annotations:
    kubernetes.io/ingress.class: "nginx"
    nginx.ingress.kubernetes.io/ssl-redirect: "true"
    nginx.ingress.kubernetes.io/backend-protocol: "GRPC"

Any idea how to avoid this?
Thank you.

chenchengfa93 · 2022-12-22T07:25:00Z

Hi @danielleiszen, do you have any idea how to avoid this? I have run into the same problem.

chenchengfa93 · 2022-12-23T02:53:29Z

In my case, this was caused by client_body_timeout. gRPC keepalive only sends HTTP/2 PING frames, but client_body_timeout needs body data to reset its timer. We solved it by sending an empty message at an interval shorter than client_body_timeout.

danielleiszen · 2022-12-23T06:25:27Z

Hi @chenchengfa93,

I ended up doing something similar. I created a keep alive endpoint on my service that I call periodically from the client. The keep alive triggers a downstream communication and keeps the channel open. The client schedules the next keep alive call only when that downstream event arrives.

This seems to work.

anjmao mentioned this issue Aug 8, 2019

Server keep alive not closing client connection grpc/grpc-go#2955

Closed

aledbf added the kind/feature label Sep 2, 2019

k8s-ci-robot added the lifecycle/stale label Jan 23, 2020

k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label Feb 22, 2020

k8s-ci-robot closed this as completed Mar 23, 2020

k8s-ci-robot assigned PI-Victor Feb 11, 2021

k8s-ci-robot removed the lifecycle/rotten label Feb 11, 2021

k8s-ci-robot reopened this Feb 11, 2021

k8s-ci-robot added the lifecycle/stale label May 13, 2021

k8s-ci-robot added the lifecycle/rotten label and removed the lifecycle/stale label Jun 12, 2021

k8s-ci-robot closed this as completed Jul 12, 2021
