TCP CLOSE_WAIT counts up to 500 and rpc error #8443
Comments
@jsosulska I don't think this is the same issue as #8435.
@jsosulska I've got the goroutine profile func (r *Raft) GetConfiguration() ConfigurationFuture {
It's fixed by raft 1.1.2.
Overview of the Issue
The TCP CLOSE_WAIT count increases from 0 to 500 within 2 hours, and the Java services log read-timeout errors when registering agent services or reading KV entries.
Netstat shows that the CLOSE_WAIT connections are on the server port (not the LAN, WAN, or HTTP port).
We have 5 servers, and the error occurred on the leader. A new leader is elected when the error begins, but the CLOSE_WAIT count continues to increase. After restarting the Consul server process, the error disappeared.
After the restart, the goroutine count was only 30% of the goroutine count before the restart.
Reproduction Steps
Steps to reproduce this issue, eg:
Consul info for both Client and Server
Consul version: 1.5.3
OS: Red Hat Linux 7.4
Possible Causes:
Consul RPC uses TCP without keepalive, and each connection is handled by its own goroutine.
After some network errors, the client has to reconnect to the server, but the server does not release the old connection (and its goroutine). As a result, the server has to handle more and more goroutines.