query: error DeadlineExceeded missing error details #4923
Comments
The error message says the cert is not valid, please check cert. the diff is |
Yes, I know what the error message is complaining about, but only because it showed the cert error, which is only displayed in Thanos v0.22.0 and below. That is my issue: Thanos v0.23.0 does NOT display the cert error.
Ran into a similar issue and had to temporarily install v0.22.0 to see the full error message.
I faced the same issue today: I forgot to add the store port, and I could only see the errors after rolling back to version 0.22.
Hello 👋 Looks like there was no activity on this issue for the last two months. |
Closing for now as promised, let us know if you need this to be reopened! 🤗 |
Still reproducible and needed. |
Is this also reproducible with the latest version? |
I have yet to ask our internal Thanos maintainers about whether the issue persists on the latest version or not. It looks like we've held our deploy back at I did some digging and found the following:
ahlex@galette:~/repos/thanos% git diff 2bf7e1ca06e60cd1dc722c97640b639b9d8ce704{~,} -- go.mod | rg -C 2 -w grpc
- google.golang.org/api v0.50.0
- google.golang.org/genproto v0.0.0-20210713002101-d411969a0d9a
- google.golang.org/grpc v1.39.0
+ google.golang.org/api v0.56.0
+ google.golang.org/genproto v0.0.0-20210903162649-d08c68adba83
+ google.golang.org/grpc v1.40.0
gopkg.in/alecthomas/kingpin.v2 v2.2.6
gopkg.in/fsnotify.v1 v1.4.7
--
+ github.com/prometheus/prometheus => github.com/prometheus/prometheus v1.8.2-0.20210914090109-37468d88dce8
github.com/sercand/kuberesolver => github.com/sercand/kuberesolver v2.4.0+incompatible
google.golang.org/grpc => google.golang.org/grpc v1.29.1
ahlex@galette:~/repos/grpc-go% git diff v1.39.0..v1.40.0 -- internal/status/status.go | rg -C 5 -F 'Error()'
type Error struct {
- e *spb.Status
+ s *Status
}
func (e *Error) Error() string {
- return fmt.Sprintf("rpc error: code = %s desc = %s", codes.Code(e.e.GetCode()), e.e.GetMessage())
+ return e.s.String()
}
// GRPCStatus returns the Status represented by se.
ahlex@galette:~/repos/grpc-go% git diff ba41bbac225e6e1a9b822fe636c40c3b7d977894{~,} -- internal/status/status.go | rg -C 5 -F 'Error()'
type Error struct {
- e *spb.Status
+ s *Status
}
func (e *Error) Error() string {
- return fmt.Sprintf("rpc error: code = %s desc = %s", codes.Code(e.e.GetCode()), e.e.GetMessage())
+ return e.s.String()
}
// GRPCStatus returns the Status represented by se.
To help answer:
Yes, I believe the latest Thanos can repro the issue, but I would like someone to also validate my digging. As for a solution forward, if y'all think my research is on the right track, I'm more than happy to open a PR into Thanks! |
I think that makes sense. The only thing I'm wondering is whether we should call something else to get the full info, but at a first look I could not find such a method. Perhaps we should indeed open an issue or PR there; the best use of time and effort would probably be to create an issue there, just to double-check we are on the right path. Anyhow, good spot and good debugging ;) Thank you.
Hello! So I spent some time trying to PR a fix, to find out that I was a bit off the mark. While I still think the problem is between grpc-go versions I did some more digging and found out I'm not proficient enough in all of the changes to make a definitive call, so I filed grpc/grpc-go#5342 to have the experts look at it. I did find some solid anchors to what I think needs to be bubbled up, and I've dropped those hints onto the upstream bug report. Sorry I wasn't more helpful, and hopefully we get a fix soon! |
All good, you helped a lot :) |
Just wanted to add this here, I ran into this same issue during an upgrade:
I'm not sure if others are experiencing this using the bitnami helm chart, but that is how I installed Thanos. The issue was that the querier parameter |
Hello 👋 Looks like there was no activity on this issue for the last two months. |
Still relevant! Problem persists in version 0.28.0. |
Hello 👋 Looks like there was no activity on this issue for the last two months. |
Still relevant, and rather annoying when trying to decipher why two Thanos components can't talk. |
Is this fixed? I get the same error message using Thanos 0.26.0, but only for one sidecar; the others work. How can I get more information about why communication cannot be established between components?
We got bitten by this too. In our case, the TLS version was not supported end-to-end, and because the endpointset logs didn't include the TLS error, it was hard to troubleshoot. To reproduce, you can take a working Thanos setup with Query and sidecar and set the
In the meantime, setting the following environment variables might help troubleshooting:
Thanos, Prometheus and Golang version used:
The Docker images:
Thanos: 0.23.0 and newer
go: go1.16.8
What happened:
When Query fails to connect to a store/sidecar, it no longer displays certificate issues as it did before. This makes debugging difficult.
What you expected to happen:
The error should display more information if the certificate is invalid, as it did in Thanos v0.22.0.
How to reproduce it (as minimally and precisely as possible):