-
Notifications
You must be signed in to change notification settings - Fork 3.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix(test): wait enough time to Trigger Running Hook. Fixes: #12844 #12855
Conversation
Signed-off-by: shuangkun <[email protected]>
Signed-off-by: shuangkun <[email protected]>
Signed-off-by: shuangkun <[email protected]>
I saw there are many comment: // TODO: Temporarily comment out this assertion since it's flaky:
// The running hook is occasionally not triggered. Possibly because the step finishes too quickly
// while the controller did not get a chance to trigger this hook. This one we can sleep longer. Because it only verifies running hooks, unlike others that also verify other types of hooks. |
Signed-off-by: shuangkun <[email protected]>
Could you let us how you tested it to make sure it doesn't flake |
It is indeed not guaranteed, but now sleep 1s does often lead to a higher probability of failure of this test. In many other tests, the running hook check is skipped. If it is better for you to skip the task,I can skip it. |
I wonder if we can run https://argo-workflows.readthedocs.io/en/stable/running-locally/#running-e2e-tests-locally a few times to reproduce the original issue |
It is difficult to reproduce, but it may happen in the test after submitting the PR. Like TestStoppedWorkflow and TestTemplateExecutor, It is almost difficult to reproduce locally when running a test alone. It is easy to occur only after the PR is submitted and several people trigger the pipeline at the same time. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we can keep monitor and see if this change reduces flakiness
Yes, I think so. Thanks |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks! We should observe this change resolves the issue.
I wonder if we can run argo-workflows.readthedocs.io/en/stable/running-locally/#running-e2e-tests-locally a few times to reproduce the original issue
It might be hard to reproduce this issue by local testing, because this fakiness may be caused by poor spec of CI runner (2core, 7mem).
Ref: #11534
@agilgur5 Thanks for mentioning me :)
…12855) Signed-off-by: shuangkun <[email protected]> (cherry picked from commit daa5f7e)
…12844 (argoproj#12855) Signed-off-by: shuangkun <[email protected]>
…12844 (argoproj#12855) Signed-off-by: shuangkun <[email protected]>
sleep 1s is sometimes too short to trigger the running hook
Fixes #12844
Motivation
Modifications
Verification