Speed up coverage tests #954

efaulhaber · 2021-10-28T11:15:37Z

Closes #948.

Comparison of the latest run on main and this PR:

	`main` test time	`main` total time	new test time without coverage	new test time with coverage	new total time
`tree_part1`	`36m 47s`	`42m 34s`	`19m 15s`	`19m 50s`	`46m 19s`
`tree_part2`	`55m 1s`	`1h 1m 48s`	`14m 42s`	`38m 57s`	`59m 34s`
`tree_part3`	`57m 15s`	`1h 2m 49s`	`12m 0s`	`38m 33s`	`56m 28s`
`tree_part4`	`53m 38s`	`1h 0m 29s`	`10m 33s`	`29m 41s`	`46m 34s`
`tree_part5`	`38m 11s`	`44m 5s`	`9m 40s`	`25m 40s`	`41m 30s`
`tree_part6`	`57m 7s`	`1h 2m 47s`	`10m 26s`	`20m 23s`	`37m 16s`
`structured`	`1h 43m 46s`	`1h 50m 1s`	`14m 38s`	`1h 1m 10s`	`1h 21m 31s`
`p4est_part1`	`42m 18s`	`49m 8s`	`14m 11s`	`19m 58s`	`41m 26s`
`p4est_part2`	`1h 58m 18s`	`2h 4m 48s`	`14m 56s`	`55m 36s`	`1h 17m 36s`
`unstructured_dgmulti`	`44m 30s`	`51m 6s`	`15m 4s`	`34m 15s`	`54m 47s`
`paper_self_gravitating _gas_dynamics`	`43m 10s`	`49m 48s`	`8m 38s`	`12m 20s`	`26m 48s`
`misc_part1`	`1h 30m 48s`	`1h 36m 25s`	`20m 9s`	`1h 39m 7s`	`2h 5m 2s`
`misc_part2`	`30m 38s`	`37m 30s`	`12m 47s`	`12m 38s`	`32m 18s`
`mpi` (Ubuntu)	`23m 38s`	`30m 37s`	`12m 19s`	`14m 19s`	`33m 17s`
`threaded` (Ubuntu)	`13m 58s`	`21m 16s`	`14m 0s`	`8m 46s`	`29m 14s`

Note that the misc_part2 timings are not from the latest CI run of this PR but from the one before that. The two runs before the latest one both took about 30 minutes, while the latest one took over one hour. I didn't change anything on this job since then. This job seems to vary greatly between runs for some reason. In this CI run on main, the misc_part2 job took over 2 hours, for example.

The greatest improvements can be seen in the jobs structured, p4est_part2, and paper_self_gravitating_gas_dynamics.
The job paper_self_gravitating_gas_dynamics contains some long-running tests that are now significantly faster without coverage and can be reduced to a few time steps when running with coverage. The jobs structured and p4est_part2 were the slowest, so I optimized a few tests by hand (for example, letting the polydeg=5 tests run with polydeg=3 instead in the coverage tests greatly reduced the test time, probably because the recompilation for polydeg=5 now disappears).

In general, this PR allows running more complicated non-coverage tests without greatly increasing the CI times. It also allows optimizing coverage times even more, as I did (roughly) for structured and p4est_part2.

codecov · 2021-10-28T13:17:57Z

Codecov Report

Merging #954 (d5d6cbb) into main (1d2fc8d) will decrease coverage by 0.00%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##             main     #954      +/-   ##
==========================================
- Coverage   93.79%   93.79%   -0.00%     
==========================================
  Files         284      284              
  Lines       20615    20616       +1     
==========================================
  Hits        19335    19335              
- Misses       1280     1281       +1

Flag	Coverage Δ
unittests	`93.79% <100.00%> (-<0.01%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
src/callbacks_step/glm_speed.jl	`73.33% <100.00%> (+0.92%)`	⬆️
...discretization/semidiscretization_euler_gravity.jl	`90.86% <0.00%> (-0.51%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 1d2fc8d...d5d6cbb. Read the comment docs.

efaulhaber · 2021-10-28T16:42:54Z

Preliminary results from my first run: The job p4est_part2 has been sped up from 1h 48m to 1h 18m.

While this is a significant speedup, it's still of the same order. However, this approach has two big advantages:

This approach is fail-fast. The tests without coverage are done within the first ~20 minutes. If this step fails, the job will fail and not run the coverage step. Note that after a failing test, coverage results haven't been reported before anyway.
We can easily let the simulations run for more time steps without significantly slowing down the pipeline.

examples/dgmulti_2d/elixir_euler_brown_minion_vortex.jl

test/test_tree_2d_mhd.jl

.github/workflows/ci.yml

efaulhaber · 2021-11-04T11:15:44Z

Here's my suggestion to make tests even faster (writing that here, so it doesn't get lost in the review comments):
Instead of writing

@test_trixi_include(joinpath(EXAMPLES_DIR, "elixir_advection_basic.jl"),
      l2   = [8.311947673061856e-6],
      linf = [6.627000273229378e-5],
      maxiters_coverage = 10)

to run this test with 10 iterations in the coverage run, we could do something like this:

@test_trixi_include(joinpath(EXAMPLES_DIR, "elixir_advection_basic.jl"),
      l2   = [8.311947673061856e-6],
      linf = [6.627000273229378e-5],
      coverage_kwargs = Dict(maxiters => 10, cells_per_dimension => (2, 2, 2), polydeg => 2))

If no coverage_kwargs are specified, or a Dict that doesn't contain maxiters, a maxiters = 1 is added by default.
The coverage_kwargs would then be ignored for normal tests, but in coverage tests, the corresponding kwargs are applied (overwritten). This would allow us to easily tweak more complicated tests to run with fewer cells and/or a lower polydeg, or even a shorter AMR interval to be able to use less iterations.

We could also do the same for convergence_test to run convergence tests with fewer cells for the first iteration and with fewer iterations in general.

ranocha · 2021-11-04T11:33:49Z

Sounds good at first 👍 We need to realize that some tests will be really heavy, e.g., if they need to good resolution etc. to run at all because of positivity issues or something like that. Thus, we might want an option to disable some tests for coverage

sloede · 2021-11-04T12:38:31Z

Sounds good to me too, at first glance!

🚲🏠ing:

I'd not use coverage_kwargs since you can also override regular variables. Thus I suggest to use something like either coverage_override or just coverage.
Unless there are technical/performance reasons against it, I like named tuples better than Dicts since they are easier to type and read IMHO. Compare
```
Dict(maxiters => 10, cells_per_dimension => (2, 2, 2), polydeg => 2)
```
vs
```
(maxiters = 10, cells_per_dimension = (2, 2, 2), polydeg = 2)
```

efaulhaber · 2021-11-06T14:27:57Z

There is now only one line uncovered that was covered in main. This one's weird:

This is `src/semidiscretization/semidiscretization_euler_gravity.jl`.

ranocha · 2021-11-06T14:35:16Z

That's okay - inlined function definitions are not reported as covered, see #841

ranocha

Great work, @efaulhaber 👍

Do you see additional potential to reduce the CI times of misc_part1? It looks like some of the tests like 3D plotting and time series callback take quite long with coverage.

test/test_tree_1d_euler.jl

test/test_tree_2d_advection.jl

test/test_tree_2d_euler.jl

test/test_tree_3d_euler.jl

.github/workflows/ci.yml

efaulhaber · 2021-11-06T15:15:31Z

Do you see additional potential to reduce the CI times of misc_part1? It looks like some of the tests like 3D plotting and time series callback take quite long with coverage.

I haven't looked into this, which means that it probably has a lot of potential for optimization.

efaulhaber · 2021-11-07T22:25:43Z

test/test_trixi.jl

+  local run_without_coverage = get_kwarg(args, :run_without_coverage, true)
+  local run_with_coverage    = get_kwarg(args, :run_with_coverage, true)


Any better name ideas for these guys?

Why are there two arguments? This seems like a recipe for confusion in case they are both false. Also, having two keywords increases the chance that someone makes a silent typo that is never found, e.g., when using run_without_coverge=false.

Aren't these kind of tests that we want to drop easier to discard at a higher level? For example, we could check whether coverage is turned on and avoid including the threaded tests if so. Similarly, we will probably need special cases for AD tests etc. Thus, I think we should probably remove these keyword arguments again and extend this PR or merge this PR (if @sloede approves) and make another PR to improve the handling of expensive misc_partx tests.

I agree. Merging the first part that is already looking OK and then addressing other issues in a second PR seems like a good approach!

I tried to avoid having some magic values (:both, coverage, ...) and to use binary switches instead. Anyway, I'm removing it again.

ranocha

CI / p4est_part2 - ubuntu-latest - x64 - pull_request (pull_request) Successful in 139m became expensive again?

efaulhaber · 2021-11-08T10:12:57Z

CI / p4est_part2 - ubuntu-latest - x64 - pull_request (pull_request) Successful in 139m became expensive again?

I think that's just the runner being terribly slow this time. The first elixir_advection_basic.jl, which usually takes ~300s due to compilation time, took about 40 minutes this time.

This reverts commit 063efdf.

efaulhaber · 2021-11-08T12:20:08Z

Okay, I have no idea why p4est_part2 is so slow again. I haven't changed anything since it was fast.

First draft of sped up coverage test

f3cc82b

efaulhaber added 3 commits October 28, 2021 16:16

Prevent GlmSpeedCallback from overriding maxiters

9cc808d

Run tests twice: Once with coverage and once without

148a8e2

Merge branch 'main' into speed-up-coverage-tests

dd8dc75

ranocha mentioned this pull request Oct 28, 2021

Add an idealized baroclinic instability test and two accessory elixirs #942

Merged

efaulhaber added 3 commits November 3, 2021 11:56

Add maxiters kwarg to all elixirs

33bf623

Merge branch 'main' into speed-up-coverage-tests

a55f82b

Fix tests

1507fee

efaulhaber commented Nov 3, 2021

View reviewed changes

examples/dgmulti_2d/elixir_euler_brown_minion_vortex.jl Outdated Show resolved Hide resolved

efaulhaber commented Nov 3, 2021

View reviewed changes

test/test_tree_2d_mhd.jl Outdated Show resolved Hide resolved

efaulhaber closed this Nov 3, 2021

efaulhaber reopened this Nov 3, 2021

efaulhaber closed this Nov 3, 2021

efaulhaber reopened this Nov 3, 2021

ranocha mentioned this pull request Nov 4, 2021

insert keyword argument maxiters into solve and Trixi.solve via trixi_include #963

Merged

ranocha and others added 3 commits November 4, 2021 09:36

Merge branch 'main' into speed-up-coverage-tests

48d299d

Revert elixirs

9459979

Merge branch 'main' into speed-up-coverage-tests

1278f75

ranocha requested changes Nov 4, 2021

View reviewed changes

.github/workflows/ci.yml Show resolved Hide resolved

efaulhaber and others added 6 commits November 4, 2021 16:11

Allow passing different kwargs for coverage override

35bb151

Update coverage_override for maxiters for the AMR tests

1af1895

Skip most convergence tests in coverage tests

457ec7e

Merge branch 'main' into speed-up-coverage-tests

6d67ed3

Fix errors

f72dd07

Replace eval

d2b95cc

efaulhaber added 8 commits November 4, 2021 21:19

Fix 87d906c

ab9ab65

Increase maxiters for AMR tests by 1

bf242a9

Let some coverage tests run longer

ca59aa5

Fix coverage

dbee951

Fix tests

66e7820

Merge branch 'main' into speed-up-coverage-tests

bcea324

Increase coverage

b5f4ba3

Speed up StructuredMesh and P4estMesh coverage tests

46c4f95

efaulhaber marked this pull request as ready for review November 6, 2021 13:52

efaulhaber requested a review from ranocha November 6, 2021 14:24

ranocha requested changes Nov 6, 2021

View reviewed changes

efaulhaber added 3 commits November 7, 2021 23:14

Add comments

f89f0e6

Don't run threaded tests twice

063efdf

Merge branch 'main' into speed-up-coverage-tests

cf6c8c0

efaulhaber commented Nov 7, 2021

View reviewed changes

efaulhaber requested a review from ranocha November 7, 2021 22:25

ranocha requested changes Nov 8, 2021

View reviewed changes

Revert "Don't run threaded tests twice"

d5d6cbb

This reverts commit 063efdf.

efaulhaber requested a review from ranocha November 8, 2021 10:13

ranocha approved these changes Nov 8, 2021

View reviewed changes

ranocha enabled auto-merge (squash) November 8, 2021 11:25

ranocha disabled auto-merge November 8, 2021 13:06

ranocha merged commit 416b107 into trixi-framework:main Nov 8, 2021

efaulhaber deleted the speed-up-coverage-tests branch November 8, 2021 13:08

efaulhaber mentioned this pull request Nov 8, 2021

Speed up coverage tests #970

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up coverage tests #954

Speed up coverage tests #954

efaulhaber commented Oct 28, 2021 •

edited

Loading

codecov bot commented Oct 28, 2021 •

edited

Loading

efaulhaber commented Oct 28, 2021

efaulhaber commented Nov 4, 2021 •

edited

Loading

ranocha commented Nov 4, 2021

sloede commented Nov 4, 2021

efaulhaber commented Nov 6, 2021 •

edited

Loading

ranocha commented Nov 6, 2021

ranocha left a comment

efaulhaber commented Nov 6, 2021 •

edited

Loading

efaulhaber Nov 7, 2021

sloede Nov 8, 2021

ranocha Nov 8, 2021

sloede Nov 8, 2021

efaulhaber Nov 8, 2021

ranocha left a comment

efaulhaber commented Nov 8, 2021

efaulhaber commented Nov 8, 2021

		local run_without_coverage = get_kwarg(args, :run_without_coverage, true)
		local run_with_coverage = get_kwarg(args, :run_with_coverage, true)

Speed up coverage tests #954

Speed up coverage tests #954

Conversation

efaulhaber commented Oct 28, 2021 • edited Loading

codecov bot commented Oct 28, 2021 • edited Loading

Codecov Report

efaulhaber commented Oct 28, 2021

efaulhaber commented Nov 4, 2021 • edited Loading

ranocha commented Nov 4, 2021

sloede commented Nov 4, 2021

efaulhaber commented Nov 6, 2021 • edited Loading

ranocha commented Nov 6, 2021

ranocha left a comment

Choose a reason for hiding this comment

efaulhaber commented Nov 6, 2021 • edited Loading

efaulhaber Nov 7, 2021

Choose a reason for hiding this comment

sloede Nov 8, 2021

Choose a reason for hiding this comment

ranocha Nov 8, 2021

Choose a reason for hiding this comment

sloede Nov 8, 2021

Choose a reason for hiding this comment

efaulhaber Nov 8, 2021

Choose a reason for hiding this comment

ranocha left a comment

Choose a reason for hiding this comment

efaulhaber commented Nov 8, 2021

efaulhaber commented Nov 8, 2021

efaulhaber commented Oct 28, 2021 •

edited

Loading

codecov bot commented Oct 28, 2021 •

edited

Loading

efaulhaber commented Nov 4, 2021 •

edited

Loading

efaulhaber commented Nov 6, 2021 •

edited

Loading

efaulhaber commented Nov 6, 2021 •

edited

Loading