Improve performance of surface integral #997

ranocha · 2021-11-24T09:05:49Z

When SIMD optimizations land in Trixi.jl, the other parts become relatively more expensive. This is a simple way to increase the performance of the surface integral by reducing the number of memory accesses and total floating point operations.

ranocha · 2021-11-24T12:31:02Z

Some tests failed previously. These were either sensitive to the CFL number or previously known to introduce issues in CI. For the former, I reduced the CFL number, ran the setup on main, and used the reported errors for CI. For the latter, I increased the tolerance a bit 🙈

codecov · 2021-11-25T05:36:19Z

Codecov Report

Merging #997 (38e974c) into main (d9a8666) will decrease coverage by 0.01%.
The diff coverage is 86.49%.

@@            Coverage Diff             @@
##             main     #997      +/-   ##
==========================================
- Coverage   93.66%   93.65%   -0.01%     
==========================================
  Files         287      287              
  Lines       20972    20982      +10     
==========================================
+ Hits        19643    19650       +7     
- Misses       1329     1332       +3

Flag	Coverage Δ
unittests	`93.65% <86.49%> (-0.01%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files	Coverage Δ
examples/p4est_3d_dgsem/elixir_euler_ec.jl	`100.00% <ø> (ø)`
src/equations/compressible_euler_2d.jl	`93.13% <0.00%> (-0.21%)`	⬇️
src/equations/compressible_euler_3d.jl	`93.22% <0.00%> (-0.17%)`	⬇️
src/equations/shallow_water_2d.jl	`89.08% <0.00%> (-0.34%)`	⬇️
src/solvers/dgsem_p4est/dg_2d.jl	`97.15% <100.00%> (+0.02%)`	⬆️
src/solvers/dgsem_p4est/dg_3d.jl	`97.78% <100.00%> (+0.01%)`	⬆️
src/solvers/dgsem_tree/dg_1d.jl	`95.19% <100.00%> (+0.05%)`	⬆️
src/solvers/dgsem_tree/dg_2d.jl	`97.04% <100.00%> (+0.01%)`	⬆️
src/solvers/dgsem_tree/dg_3d.jl	`97.76% <100.00%> (+0.01%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update d9a8666...38e974c. Read the comment docs.

ranocha · 2021-11-25T13:30:39Z

@sloede All tests pass - coverage is slightly reduced due to additional inlining and #841

sloede

LGTM. As a general note: Should we adopt the policy to @inline all functions that might be used in a performance critical hot kernel and that are applied pointwise? I see few downsides, and it would make it easier to decide when to use @inline and when not.

sloede · 2021-11-25T14:51:42Z

src/solvers/dgsem_p4est/dg_2d.jl

+  # Access the factors only once before beginning the loop to increase performance.
+  # We also use explicit assignments instead of `+=` to let `@muladd` turn these
+  # into FMAs (see comment at the top of the file).


Thanks, these comments are really helpful to understand why something was done the way it is, and also ensure that this won't get revised by an eager developer in the future.

ranocha · 2021-11-25T15:22:56Z

LGTM. As a general note: Should we adopt the policy to @inline all functions that might be used in a performance critical hot kernel and that are applied pointwise? I see few downsides, and it would make it easier to decide when to use @inline and when not.

Maybe, that's an option.

ranocha added 2 commits November 24, 2021 09:59

improve performance of 3D surface integral

da99b10

improve performance of 1D and 2D surface integral

284e32a

ranocha added the performance We are greedy label Nov 24, 2021

ranocha requested a review from sloede November 24, 2021 09:05

ranocha added 6 commits November 24, 2021 10:25

adapt condition number test in CI

cc10063

fix type instability by inlining boundary_condition_slip_wall

db44ad0

no FluxRotated(flux_ranocha)

9752692

adapt CFL of 2D elixir_euler_ec.jl with boundary_condition_slip_wall

e82baeb

increase test tolerance of 1D hyp. diff. tests

cf4ae12

2D "elixir_advection_restart.jl with waving flag mesh

38e974c

sloede approved these changes Nov 25, 2021

View reviewed changes

ranocha merged commit 34ed1e8 into main Nov 25, 2021

ranocha deleted the hr/surface_integral branch November 25, 2021 15:23

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve performance of surface integral #997

Improve performance of surface integral #997

ranocha commented Nov 24, 2021

ranocha commented Nov 24, 2021

codecov bot commented Nov 25, 2021 •

edited

Loading

ranocha commented Nov 25, 2021

sloede left a comment

sloede Nov 25, 2021

ranocha commented Nov 25, 2021

Improve performance of surface integral #997

Improve performance of surface integral #997

Conversation

ranocha commented Nov 24, 2021

ranocha commented Nov 24, 2021

codecov bot commented Nov 25, 2021 • edited Loading

Codecov Report

ranocha commented Nov 25, 2021

sloede left a comment

Choose a reason for hiding this comment

sloede Nov 25, 2021

Choose a reason for hiding this comment

ranocha commented Nov 25, 2021

codecov bot commented Nov 25, 2021 •

edited

Loading