//knative/serving/test/e2e:TestAutoscaleUpDownUp is super flaky #2351
Comments
/assign @dgerd @josephburnett Dan volunteered to help a bit here.
The autoscaling issue seems to be caused by requests hitting Terminating pods, in which case I think we should add back the prestop sleep to unblock PRs while we work on a better fix.
/assign @tcnghia
One other action item might be to create another E2E test that just serves variable traffic over time. That would help narrow in on the cause more quickly, I think. Up, down, and up is a lot to debug all at once.
@dgerd has an excellent way to repro this consistently by killing the revision's Pod. I think we should add that test.
Hi, is this problem still there? If yes, I'd like to try to dig into it as well.
Actually it's way better now: in the last 24 runs, it only failed once. I'm closing this issue, thanks everyone.
Expected Behavior
//knative/serving/test/e2e:TestAutoscaleUpDownUp flakiness is close to 0.
Actual Behavior
For the last 16 CI runs, TestAutoscaleUpDownUp failed 10 times, or ~60% of the time. Of these 10 failures, 9 were because ("got/wanted" numbers change for each failure)
https://gubernator.knative.dev/build/knative-prow/logs/ci-knative-serving-continuous/1057301584175697921
Steps to Reproduce the Problem
Additional Info