run tests on Beaker with NFS cache #120

epwalsh · 2023-02-17T21:19:15Z

Changes proposed:

Sets up unit tests to run on Beaker via beaker-run-action (this is what we use to run GPU tests on Beaker for Tango).
Splits the tests into separate suites, with one CI job per suite. This reduces CI runtime from 10+ minutes to 4 minutes, and it could be reduced further by splitting off more suites. Note that suite A is the catch-all: anything not explicitly marked for another suite runs in A.

Once https://github.com/allenai/reconfig/issues/1166 is resolved, we should update the cache location.

epwalsh · 2023-02-18T00:30:31Z

tests/test_all_tasks.py

+# These tasks are known to fail for now due to an unreachable server.
+known_failures = {
+    "lambada_mt_en",
+    "lambada_mt_fr",
+    "lambada_mt_de",
+    "lambada_mt_it",
+    "lambada_mt_es",
+    "triviaqa",
+}


I had to xfail these tests because the server they rely on is apparently down. @dirkgr you might want to look into this more, maybe there's a new URL we should be using.
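
For readers unfamiliar with the pattern, here is a minimal sketch of one way a `known_failures` set can drive xfail marks in a parametrized pytest test. The names `all_task_names`, `task_params`, and `test_task_loads` are hypothetical and do not come from this PR; the actual test in `tests/test_all_tasks.py` may be wired differently.

```python
import pytest

# Copied from the diff above.
known_failures = {
    "lambada_mt_en",
    "lambada_mt_fr",
    "lambada_mt_de",
    "lambada_mt_it",
    "lambada_mt_es",
    "triviaqa",
}

# Hypothetical stand-in for however the real test enumerates tasks.
all_task_names = ["some_other_task", "lambada_mt_en", "triviaqa"]


def task_params(names):
    """Wrap each task name in pytest.param, adding xfail to known failures."""
    params = []
    for name in names:
        marks = []
        if name in known_failures:
            # Expected failure: the upstream server is unreachable.
            marks.append(pytest.mark.xfail(reason="upstream server unreachable"))
        params.append(pytest.param(name, marks=marks, id=name))
    return params


@pytest.mark.parametrize("task_name", task_params(all_task_names))
def test_task_loads(task_name):
    # Placeholder body; a real test would load and exercise the task here.
    assert isinstance(task_name, str)
```

Because the xfail marks are not strict here, a task that starts working again would be reported as XPASS instead of failing the run, which is a reasonable prompt to remove it from `known_failures`.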

dirkgr

Approved with question

dirkgr · 2023-02-18T00:56:58Z

.github/workflows/main.yml

+        test_suite:
+          - name: A
+            mark: "not suite_B and not suite_C and not suite_D"
+
+          - name: B
+            mark: "suite_B"
+
+          - name: C
+            mark: "suite_C"
+
+          - name: D
+            mark: "suite_D"


What is this for? More parallelism?

Yea, as I mentioned in the description I broke the tests up into arbitrary groups aka "suites".
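
To make the catch-all behavior concrete, here is a small model of which CI job picks up a test given its markers. It assumes each `mark` value in the matrix is passed to pytest as a `-m` expression, which this excerpt does not show; `jobs_for` is a hypothetical helper for illustration, not code from the PR.

```python
# Marker names copied from the workflow matrix above. Suite A has no marker
# of its own: its expression excludes the markers of every other suite.
SUITE_MARKS = {"B": "suite_B", "C": "suite_C", "D": "suite_D"}


def jobs_for(test_markers):
    """Return the suite jobs that would select a test carrying these markers."""
    selected = [name for name, mark in SUITE_MARKS.items() if mark in test_markers]
    # No suite marker at all means the test falls through to the catch-all.
    return selected or ["A"]


print(jobs_for(set()))         # unmarked test
print(jobs_for({"suite_C"}))   # test explicitly assigned to suite C
```

One consequence of this scheme is that adding a new suite means both adding a matrix entry and extending suite A's exclusion expression; otherwise tests in the new suite would run twice.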

dirkgr · 2023-02-18T00:58:36Z

tests/test_all_tasks.py

Hmm. Once they are cached, we are much more robust to such failures.

epwalsh added 10 commits February 17, 2023 13:18

run tests on Beaker

5dc4712

fix

379d478

break out test suites

b65d6cc

mypy fix

a4e0f3d

more fixes

02f8f6e

fix

fca70a8

no color, forked again

2e74f06

add color back

b66c2d2

xfail

06bdfb2

split up

ed0e427

epwalsh changed the title ~~run tests on Beaker~~ run tests on Beaker with NFS cache Feb 18, 2023

fix

5f46f5f

epwalsh marked this pull request as ready for review February 18, 2023 00:29

epwalsh commented Feb 18, 2023

epwalsh added 3 commits February 17, 2023 16:35

split up different

434d8f8

fix

1265688

fix description

7cce288

epwalsh requested a review from dirkgr February 18, 2023 00:45

dirkgr approved these changes Feb 18, 2023

Merge branch 'main' into fix-ci

6e0ac0f

dirkgr merged commit e8b671e into main Feb 20, 2023

dirkgr deleted the fix-ci branch February 20, 2023 19:41
