Testgrid for integration tests is broken #2097
Comments
The release-0.6 branch looks healthy: https://testgrid.k8s.io/sig-scheduling#pull-kueue-test-integration-release-0-6&width=20
The only difference in the presubmit configuration is that |
/assign @gabesaba |
@alculquicondor: GitHub didn't allow me to assign the following users: gabesaba. Note that only kubernetes-sigs members with read permissions, repo collaborators and people who have commented on this issue/PR can be assigned. Additionally, issues/PRs can only have 10 assignees at the same time.
Probably because the junit output is too large: there's an error message saying it is malformed because it is over 100MB. We could either bump this limit or reduce verbosity.
/assign @gabesaba |
Removing the -v ginkgo flag from the test-integration target shrank junit.xml from 204MB to 88MB. This would fix the issue, but we're still close to the limit. Next, I'm looking into any particularly spammy logs.
Much of the size can be attributed to a few tests. Below are the tests with output > 1MB (after HTML unescaping, so the actual size is larger)
|
Within those tests, attributing output to certain lines:
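One way to do this kind of attribution is to group log lines by the `file.go:line` caller they mention and sum the bytes per caller. The sketch below assumes each log line contains such a token somewhere (the exact log format is not shown in this thread); the regex and the name `bytes_by_caller` are illustrative.

```python
import re
from collections import Counter

# Matches a Go caller reference such as "scheduler/scheduler.go:128" (assumed format).
CALLER = re.compile(r"[\w./-]+\.go:\d+")


def bytes_by_caller(log_text: str) -> Counter:
    """Sum the bytes of log output per caller reference found on each line."""
    totals = Counter()
    for line in log_text.splitlines():
        match = CALLER.search(line)
        key = match.group(0) if match else "<no caller>"
        # +1 accounts for the newline that terminated the line.
        totals[key] += len(line.encode("utf-8")) + 1
    return totals
```

`bytes_by_caller(text).most_common(10)` then lists the ten call sites that contribute the most output.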
We're repeatedly reconciling unschedulable workloads without any backoff. Should there be a backoff here? I imagine that a backoff of even a fraction of a second would drastically reduce the logging output here.
The output went from 212MB to 15MB after changing scheduler.go:128 to 10ms.
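To see why even a small delay matters, here is a back-of-the-envelope model rather than Kueue's scheduler code. It counts how many scheduling attempts start within a time window when each attempt is followed by a fixed delay. The per-attempt cost of 100 microseconds is an invented figure for illustration only, and `attempts_in_window` is a hypothetical helper.

```python
def attempts_in_window(window_us: int, attempt_cost_us: int, delay_us: int) -> int:
    """Count attempts that start within window_us microseconds.

    Each attempt takes attempt_cost_us and is followed by a fixed delay_us
    before the next attempt starts. Integer microseconds avoid float drift.
    """
    if attempt_cost_us <= 0:
        raise ValueError("attempt_cost_us must be positive")
    if delay_us < 0:
        raise ValueError("delay_us must be non-negative")
    period_us = attempt_cost_us + delay_us
    # Ceiling division: attempts start at 0, period, 2*period, ... < window.
    return -(-window_us // period_us)


# One second of retrying a workload that stays unschedulable:
no_backoff = attempts_in_window(1_000_000, attempt_cost_us=100, delay_us=0)
with_10ms = attempts_in_window(1_000_000, attempt_cost_us=100, delay_us=10_000)
```

Under these assumed numbers the attempt count drops by a factor of about 100, and the log volume per attempt is unchanged, so the total output shrinks by roughly the same factor. The real ratio depends on the actual attempt cost.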
|
Uhm.... maybe we can use a |
What happened:
The testgrid shows an error for Overall and doesn't show the individual tests. Other testgrids (E2E, unit) look fine.
What you expected to happen:
A line for every test.
How to reproduce it (as minimally and precisely as possible):
https://testgrid.k8s.io/sig-scheduling#pull-kueue-test-integration-main&width=20
Anything else we need to know?:
We have lost history for the last time it worked.
Environment:
kubectl version:
git describe --tags --dirty --always:
cat /etc/os-release:
uname -a: