Fixing test flakiness #2242

sleepy-owl · 2021-02-24T03:57:49Z

The test test_update_envs_env_update is flaky. It sometimes fails more than twice. The assertion on line 78 assert np.var(mean_rewards) > 0 fails. This PR addresses this issue.

To find a solution, I collected samples of np.var(mean_rewards) from several test executions and computed the tail distribution just to check how often can the value be 0.

Based on the collected samples, it seems there is ~12% chance that the test will fail. I suggest to run this test 3 times, then the probability of failure will be <1%.

I think refactoring this test using the statistical evaluation might be a good way to reduce the flakiness of this test.

Do you guys think this makes sense? Please let me know if this looks good or if you have any other suggestions. Also, here I assume there are no bugs in the code under test.

Also, I am curious to know why do you check if var > 0? Statistically, zero variance can happen sometimes. Is it better to have something like assert np.mean(mean_rewards) > 0?

ryanjulian · 2021-02-24T18:06:30Z

/ok-to-test

rlworkgroupbot · 2021-02-24T18:07:03Z

Command run output for 35fd206

rlworkgroupbot · 2021-02-24T22:24:36Z

Command run output for 35fd206

sleepy-owl · 2021-02-24T23:11:47Z

hi, it seems the checks failed due to some config issue?

ryanjulian · 2021-02-24T23:31:25Z

@ziyiwu9494 can you PTAL?

krzentner · 2021-03-04T20:01:14Z

/ok-to-test

rlworkgroupbot · 2021-03-04T20:01:48Z

Command run output for 35fd206

sleepy-owl · 2021-03-15T02:53:14Z

Hi all, thanks for approving the PR. Is there something blocking for merge?

avnishn · 2021-03-18T19:24:51Z

/ok-to-test

rlworkgroupbot · 2021-03-18T19:25:25Z

Command run output for e0f1501

Fixing test flakiness

35fd206

sleepy-owl requested a review from a team as a code owner February 24, 2021 03:57

sleepy-owl requested review from ryanjulian and removed request for a team February 24, 2021 03:57

mergify bot requested review from a team, yeukfu and ziyiwu9494 and removed request for a team February 24, 2021 03:58

krzentner approved these changes Mar 4, 2021

View reviewed changes

krzentner added the ready-to-merge label Mar 4, 2021

ziyiwu9494 approved these changes Mar 15, 2021

View reviewed changes

mergify bot requested a review from a team March 15, 2021 07:23

ziyiwu9494 approved these changes Mar 15, 2021

View reviewed changes

mergify bot requested a review from a team March 15, 2021 07:24

Merge branch 'master' into testfix

e0f1501

haydenshively approved these changes Mar 19, 2021

View reviewed changes

mergify bot requested a review from a team March 19, 2021 19:04

krzentner merged commit 90b6090 into rlworkgroup:master Apr 13, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing test flakiness #2242

Fixing test flakiness #2242

sleepy-owl commented Feb 24, 2021

ryanjulian commented Feb 24, 2021

rlworkgroupbot commented Feb 24, 2021

rlworkgroupbot commented Feb 24, 2021

sleepy-owl commented Feb 24, 2021

ryanjulian commented Feb 24, 2021

krzentner commented Mar 4, 2021

rlworkgroupbot commented Mar 4, 2021

sleepy-owl commented Mar 15, 2021

avnishn commented Mar 18, 2021

rlworkgroupbot commented Mar 18, 2021

Fixing test flakiness #2242

Fixing test flakiness #2242

Conversation

sleepy-owl commented Feb 24, 2021

ryanjulian commented Feb 24, 2021

rlworkgroupbot commented Feb 24, 2021

rlworkgroupbot commented Feb 24, 2021

sleepy-owl commented Feb 24, 2021

ryanjulian commented Feb 24, 2021

krzentner commented Mar 4, 2021

rlworkgroupbot commented Mar 4, 2021

sleepy-owl commented Mar 15, 2021

avnishn commented Mar 18, 2021

rlworkgroupbot commented Mar 18, 2021