
client fails annoyingly with lots of log messages when server does not speak grpc #120

Closed
jellevandenhooff opened this issue Mar 17, 2015 · 19 comments · Fixed by #1340

@jellevandenhooff

I am running "go version go1.4.1 darwin/amd64". I accidentally pointed a grpc client at an address that didn't speak grpc. Afterwards, grpc printed a lot of messages in the log that were not helpful at best and distracting at worst. I would prefer grpc to a) not generate as many errors, perhaps with some back-off mechanism, and b) not print as many errors.

Specifically, my terminal filled with hundreds of lines of the form

2015/03/16 21:08:01 transport: http2Client.notifyError got notified that the client transport was broken unexpected EOF.
2015/03/16 21:08:01 transport: http2Client.notifyError got notified that the client transport was broken unexpected EOF.
2015/03/16 21:08:01 transport: http2Client.notifyError got notified that the client transport was broken unexpected EOF.
2015/03/16 21:08:01 transport: http2Client.notifyError got notified that the client transport was broken unexpected EOF.
2015/03/16 21:08:01 transport: http2Client.notifyError got notified that the client transport was broken unexpected EOF.

I tried sticking in a "c.failFast = true" in grpc.Invoke, but that did not help.

@iamqizhao
Contributor

I had another user request complaining that there were no log messages when the transport hits an error that triggers a reconnect. So I added this log, which now seems to be spamming you. :) Let me see if there is a way I can win in both worlds.



@jellevandenhooff
Author

Ah -- I guess the messages themselves might be only a symptom of the underlying problem. I don't know whether instantaneous reconnecting is the desired behavior, but if it is not, then adding some back-off would probably also make me happy :)

Thanks!

@iamqizhao
Contributor

Exponential back-off is already there (https://github.com/grpc/grpc-go/blob/master/clientconn.go#L164). You actually hit a different case: your connection succeeded, because that port is listening, but when you sent the first RPC it was rejected by the peer because the peer does not speak gRPC. Therefore you only ever hit the initial reconnect interval.
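
For reference, a minimal sketch of the kind of exponential back-off applied between failed connection attempts; the function name, constants, and jitter handling here are illustrative assumptions rather than grpc-go's exact implementation.

import (
    "math/rand"
    "time"
)

// backoffDelay grows the delay with each consecutive failed attempt,
// caps it, and adds jitter so clients do not reconnect in lockstep.
func backoffDelay(retries int) time.Duration {
    const factor, jitter = 1.6, 0.2
    baseDelay := float64(time.Second)      // initial reconnect interval
    maxDelay := float64(120 * time.Second) // cap on the delay
    d := baseDelay
    for i := 0; i < retries; i++ {
        d *= factor
        if d >= maxDelay {
            d = maxDelay
            break
        }
    }
    // Randomize by ±20% around the computed delay.
    d += d * jitter * (2*rand.Float64() - 1)
    return time.Duration(d)
}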

@jellevandenhooff
Author

How do you feel about moving the exponential back-off into clientConn so that it also backs off when a server is crashing or misbehaving? I could try it out and see what it looks like, if you like that idea.

@yinhm

yinhm commented Apr 13, 2015

I was wondering whether we could reconnect a little more aggressively. Having to restart the client most of the time when I restart the server doesn't feel right; it is killing my productivity.

Maybe reconnect when there are new RPC calls from the client? That may be a little aggressive, but my server wouldn't complain about it. What I want is something like ZeroMQ or nanomsg: you don't need to care whether the server went down, you just send messages.

@mauaht

mauaht commented Jan 7, 2016

Got the same high volume of error messages under the following scenario:

  • Set up a generic TCP listener which accepts and immediately closes connections
  • Send a request to the server using the gRPC hello world tutorial

It looks like there is an infinite loop in the Invoke function in /grpc/call.go as follows:

  • Invoke creates callInfo with callInfo.failFast=false at line 109
  • If sendRequest() in line 169 fails with transport.ConnectionError a subsequent continue jumps to loop start at line 143
  • If recvResponse() in line 181 fails with transport.ConnectionError a subsequent continue jumps to loop start at line 143
  • Loop does not exit because callInfo.failFast=false blocks error handling at line 150

Setting callInfo.failFast=true aborts the loop at line 143 and prevents the infinite loop.

In this case, where the server explicitly closes the connection because it does not recognise the gRPC protocol, the client should not attempt to retry.
There may be other transport.ConnectionError cases for which limited retries are appropriate, but unbounded retries are not a good idea.
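
Schematically, the loop described above has roughly this shape (paraphrased from the description, not the actual call.go source; sendRequest and recvResponse are hypothetical stand-ins for the real helpers):

import (
    "google.golang.org/grpc/transport"
)

// invokeLoop: with failFast=false, a transport.ConnectionError from either
// step restarts the loop immediately, with no back-off and no retry limit.
func invokeLoop(failFast bool, sendRequest, recvResponse func() error) error {
    for {
        if err := sendRequest(); err != nil {
            if _, ok := err.(transport.ConnectionError); ok && !failFast {
                continue // jump back to loop start
            }
            return err
        }
        if err := recvResponse(); err != nil {
            if _, ok := err.(transport.ConnectionError); ok && !failFast {
                continue // retry again immediately
            }
            return err
        }
        return nil
    }
}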

The following test code reproduces the above scenario

import (
    "fmt"
    "net"
    "testing"

    "golang.org/x/net/context"
    "google.golang.org/grpc"
    demo "google.golang.org/grpc/examples/helloworld/helloworld"
)

const hw_addr = "127.0.0.1:50000"

func TestGrpcReject(t *testing.T) {
    // Create a simple server that accepts and immediately closes connections.
    lis, err := net.Listen("tcp", hw_addr)
    if err != nil {
        t.Fatalf("Listener not started: %v", err)
    }

    go func() {
        for {
            conn, _ := lis.Accept()
            fmt.Printf("Server closing connection\n")
            conn.Close()
        }
    }()

    // Set up a client and send a request to the server.
    conn, err := grpc.Dial(hw_addr, grpc.WithInsecure())
    if err != nil {
        t.Fatalf("Dial failed: %v", err)
    }
    client := demo.NewGreeterClient(conn)
    client.SayHello(context.Background(), &demo.HelloRequest{Name: "World"})
}

@artushin

I also found this extremely annoying in a dev environment (I have multiple services connecting to a non-critical gRPC service, and they would cumulatively spit out over 10k log lines in 5 seconds if I turned that service off), so when in dev I just replace the grpclog logger with grpclog.SetLogger(log.New(ioutil.Discard, "", 0)).
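
For anyone else reaching for that workaround, a minimal sketch; a standard *log.Logger writing to ioutil.Discard satisfies the grpclog.Logger interface:

import (
    "io/ioutil"
    "log"

    "google.golang.org/grpc/grpclog"
)

func init() {
    // Route all gRPC-internal log output to ioutil.Discard while in dev.
    grpclog.SetLogger(log.New(ioutil.Discard, "", 0))
}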

@iamqizhao
Contributor

If you replace it with glogger (https://github.com/grpc/grpc-go/blob/master/grpclog/glogger/glogger.go), all the logs will go to some files instead of stderr unless you configure it explicitly.
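
Roughly, that suggestion amounts to a blank import; glogger's init() installs a glog-backed logger, so gRPC output goes wherever glog is configured to write (files by default) rather than straight to stderr:

import (
    // Registers the glog-backed logger via the package's init().
    _ "google.golang.org/grpc/grpclog/glogger"
)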

@gm42

gm42 commented Aug 2, 2016

I am experiencing the same issue and am using grpclog.SetLogger. However, as I see it the problem is that there is no context in the error: if I am using several different gRPC services, I can never tell from this log message which service specifically is failing.

Would it be possible to add to this log message more context e.g. the hostname or service name?

@c4milo
Contributor

c4milo commented Aug 11, 2016

It would be very nice if the underlying backoff mechanism covered this case as well.

@schmohlio

schmohlio commented Oct 16, 2016

Is there any update on this? This problem really floods logs when running on GCE. Setting the logger seems to provide an initial fix, but it feels wrong.

@menghanl
Contributor

We are working on improving the logging system, #922 is the first step.

@jellevandenhooff
Author

While I appreciate #922, the many log messages are a symptom of many failed connection attempts. I think it'd be worthwhile also reducing connection frequency, because even if we're not spewing logs, we're still hammering some poor unsuspecting service.

@c4milo
Contributor

c4milo commented Oct 17, 2016

I was going to mention @jellevandenhooff's observation too, I thought there was some sort of exponential backoff involved when reconnecting.

@menghanl
Contributor

Can you share more information about your program? For example, what error did you get, and what server were you connecting to? What we want to know is the root cause of the connection error.

We have a backoff mechanism for when the connection can't be established in the first place.
One possible situation where the flood can happen is when the connection is established successfully but then disconnects immediately afterwards. We should do something (back off, or even stop retrying) depending on the reason.
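
To illustrate why that second case defeats the dial back-off, a simplified sketch (the function, constants, and loop shape are assumptions for illustration, not grpc-go source): a successful connect resets the back-off state, so a listening but non-gRPC peer keeps being retried at the initial interval.

import (
    "net"
    "time"
)

// reconnectLoop: the delay grows only across consecutive failed dials; a dial
// that succeeds, even if the peer hangs up immediately, resets it.
func reconnectLoop(target string, handle func(net.Conn)) {
    delay := time.Second // illustrative initial interval
    for {
        conn, err := net.Dial("tcp", target)
        if err != nil {
            time.Sleep(delay)
            delay *= 2
            if delay > 2*time.Minute {
                delay = 2 * time.Minute
            }
            continue
        }
        delay = time.Second // success resets the back-off state
        handle(conn)        // returns almost at once if the peer closes the connection
    }
}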

@jellevandenhooff
Author

The root cause of the error is exactly as you described, and a backoff like the one you propose is the solution. My PR from last year sketched out an approach for implementing that, but with all the changes to grpc it doesn't quite merge anymore.

@menghanl
Contributor

The root cause of the error is exactly as you described

I assume by this you mean "the connection is established successfully, and disconnects immediately after that."
Then what is the reason for this happening in your case? (Is the server misbehaving, or is it a non-gRPC server?)

@c4milo
Contributor

c4milo commented Oct 18, 2016

@menghanl in my case the server did not have its certificate properly set up.

@jellevandenhooff
Author

closing in favor of #954
