
Update performance tips to be explicit about compilation time practices #37663

Closed
wants to merge 3 commits

Conversation

ChrisRackauckas
Member

One of the things that has come up recently is that some benchmarks will include compilation time because "it's how I run code", "it's a subjective thing", or "it's a personal choice". The purpose of this change is to make it very explicit that this is not a personal choice: it is, as per the manual, using the programming language incorrectly. This gets rid of any debate around the topic and makes it easy for anyone, say a reviewer, to quote the manual in a way that says that such issues are user-induced because the user is simply not programming or measuring in a style that mirrors common or recommended Julia usage.
@timholy
Member

timholy commented Sep 19, 2020

I like the idea, but this feels a bit heavy for what it's trying to convey, while also a bit disconnected from the rest of the performance tips page. What about incorporating this wisdom into the @time section? That section mentions discarding the first run, but perhaps we should just grab the bull by the horns and illustrate what we mean directly. (EDIT: ah, that section already includes a demo of the first-run timing; I missed it when I first scanned it.)

blah blah...

julia> using LinearAlgebra

julia> @time eigen([2 1; 1 4])
  0.019775 seconds (35.04 k allocations: 2.221 MiB)
Eigen{Float64, Float64, Matrix{Float64}, Vector{Float64}}
values:
2-element Vector{Float64}:
 1.585786437626905
 4.414213562373095
vectors:
2×2 Matrix{Float64}:
 -0.92388   0.382683
  0.382683  0.92388

If you're experienced with scientific computing, you probably realize that 20 milliseconds seems like a very long time to compute the eigenvalues of a 2×2 matrix. The key point to realize, though, is that this includes the time required to compile eigen; if we run it a second time in the same session,

julia> @time eigen([2 1; 1 4])
  0.000042 seconds (13 allocations: 1.703 KiB)
Eigen{Float64, Float64, Matrix{Float64}, Vector{Float64}}
values:
2-element Vector{Float64}:
 1.585786437626905
 4.414213562373095
vectors:
2×2 Matrix{Float64}:
 -0.92388   0.382683
  0.382683  0.92388

you can see it's much faster. Indeed, @time is not an accurate way to measure such short times; a better option is to use the BenchmarkTools package, which provides @btime (benchmark time) and yields an accurate result:

julia> using BenchmarkTools

julia> @btime eigen([2 1; 1 4])
  826.000 ns (13 allocations: 1.70 KiB)
Eigen{Float64, Float64, Matrix{Float64}, Vector{Float64}}
values:
2-element Vector{Float64}:
 1.585786437626905
 4.414213562373095
vectors:
2×2 Matrix{Float64}:
 -0.92388   0.382683
  0.382683  0.92388

Thus, less than a microsecond is required. You might be skeptical: if you try this, you'll notice that running @btime itself takes far longer than a microsecond. The explanation is that @btime obtains its accurate measurement by running the code many times and reporting the minimum observed time.

One consequence of the compile-time overhead is that it is not recommended to run performance-sensitive calculations with scripts from the command line: each run will be compiled freshly, and so you pay the cost of compilation each time you run it. Instead, start a Julia session and run the command repeatedly from within the same session, so that you only pay the compile-time cost once. You can use tools like Revise.jl or inline evaluation with VSCode or Juno to continue to evolve your code while keeping your session running.

@ChrisRackauckas
Member Author

ChrisRackauckas commented Sep 20, 2020

I kind of wanted it to be a bit heavy, making it explicit in the manual that it's not up for debate whether a compilation-inducing workflow is an appropriate way for people to use (and thus benchmark) quick code that is compile-time dominated: I think the documentation should specifically say you should only do that if you absolutely don't care about performance, and that it's not recommended. I think it guides users in the right direction and gives a very explicit statement that if someone is benchmarking code like that, they are directly contradicting the best practices as laid out in the manual, not just some best practices that most people on Discourse etc. seem to follow.

Incorporating it into the @time section was something I considered, but I think it's slightly separate from just timing. It's of course useful in timing to remove compilation time from the process, but workflows that reduce the amount of compilation are useful and recommended for cases beyond timing. Even when you aren't timing, I think it's good to recommend a style that is Revise-based, open REPL, or PackageCompiler, since otherwise I think the user will not get as great of an experience. So it overlaps with the timing section but I don't think it's quite the same thing.

Also, making it all about timing makes it look like a trick to make timings look better: I've seen people write it off and not time twice in benchmarks because they believe that's not representative of how running code actually works. If the manual is explicit that the recommended workflows are these styles that do not require recompilation, then it immediately follows that the appropriate way to time is to time twice to not time compilation.

@IanButterworth
Member

IanButterworth commented Sep 20, 2020

Would it be possible for @time to report any compilation time, perhaps as a percentage when it is greater than zero, similar to how GC time is reported?

Edit: Implemented in #37678
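(For context, the feature merged in #37678 makes @time surface this directly on Julia 1.6 and later. A hedged illustration of the kind of output it produces; the timings and allocation counts below are made up for illustration, not measured:)

```julia
julia> @time eigen([2 1; 1 4]);  # first call in a fresh session: dominated by compilation
  0.442023 seconds (1.29 M allocations: 78.9 MiB, 3.1% gc time, 99.4% compilation time)

julia> @time eigen([2 1; 1 4]);  # second call: the compilation percentage disappears
  0.000040 seconds (13 allocations: 1.703 KiB)
```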

@timholy
Member

timholy commented Sep 20, 2020

That would be useful, but I think what Chris is worried about is

$ time julia my_code.jl

i.e., Linux-time rather than Julia-@time.

@timholy left a comment

In that case, here are a few more specific comments. Even with those comments, it seems to approach its underlying message a bit indirectly. What about coming out and saying something along the lines of "this is not how Julia is intended to be used"? And then finish with your "If you must use Julia this way, and if the compile performance is an obstacle, use PackageCompiler.jl..."

I would also recommend using links to these tools rather than assuming people will find them from the name. Just omitting the .jl from a google search can lead to failure.

@@ -134,6 +134,23 @@ its algorithmic aspects (see [Pre-allocating outputs](@ref)).
For more serious benchmarking, consider the [BenchmarkTools.jl](https://github.com/JuliaCI/BenchmarkTools.jl)
package which among other things evaluates the function multiple times in order to reduce noise.

## [Avoid Compilation Time in Performance-Sensitive Calculations](@id compilation_time)

Due to Julia's JIT compilation, the first run of a new function or a new Julia environment will include the compilation

JIT has not yet been defined on this page


Due to Julia's JIT compilation, the first run of a new function or a new Julia environment will include the compilation
time of a function in its first call. Compilation time is determined by the code of a function and the input types and
is not dependent on the input values. Thus compile times for a given function are essentially constant with respect to

This sentence seems awkward; I think it's because the emphasis position of the sentence, the end, seems to veer off in a new direction.

@@ -134,6 +134,23 @@ its algorithmic aspects (see [Pre-allocating outputs](@ref)).
For more serious benchmarking, consider the [BenchmarkTools.jl](https://github.com/JuliaCI/BenchmarkTools.jl)
package which among other things evaluates the function multiple times in order to reduce noise.

## [Avoid Compilation Time in Performance-Sensitive Calculations](@id compilation_time)

In what context? If I have a computation that runs for 3 days that might be performance-sensitive, yet there's little reason not to launch it from the Linux prompt because compilation might take 20s.

is not dependent on the input values. Thus compile times for a given function are essentially constant with respect to
the runtime costs which vary with respect to runtime values like array size and required calculation tolerances.
This means that for large complex analyses compilation time is dwarfed by runtime. However, when inputs are simpler, like
in microbenchmarks or short calculations in a new REPL session, compilation time matters for performance.

it's more that it affects your measurement; in a sense your argument is that it doesn't matter (or rather that it is an orthogonal concept)


Recommended Julia practices for short calculations amortize this compilation cost over the lifetime of the program, using
tools like Revise.jl, within-module re-evaluation in IDEs like Juno and VS Code, or by keeping REPL sessions alive over
multiple analyses. It is inadvisable and not a recommended practice to repeatedly run short scripts directly from the

the first bits (Revise and the IDEs) are essentially ways to achieve "keeping REPL sessions alive," so I'd invert the order here and start with the goal and then mention tools that can help you achieve it.

"inadvisable and not a recommended" seems redundant

Recommended Julia practices for short calculations amortize this compilation cost over the lifetime of the program, using
tools like Revise.jl, within-module re-evaluation in IDEs like Juno and VS Code, or by keeping REPL sessions alive over
multiple analyses. It is inadvisable and not a recommended practice to repeatedly run short scripts directly from the
command line if compilation is a significant factor in the runtime and performance is necessary. Direct running of small

In what context is "performance necessary" for things that complete quickly? Maybe just say "when benchmarking"? Or if you have other cases in mind too, perhaps spell those out?

This means that for large complex analyses compilation time is dwarfed by runtime. However, when inputs are simpler, like
in microbenchmarks or short calculations in a new REPL session, compilation time matters for performance.

Recommended Julia practices for short calculations amortize this compilation cost over the lifetime of the program, using

Maybe save this sentence for the end: it's forward-looking (it tells you how to solve the problem), but then you go back and tell people not to do the bad thing. Preserving the temporal order might help clarify your meaning.

command line if compilation is a significant factor in the runtime and performance is necessary. Direct running of small
single scripts will not result in the highest performance and is thus not recommended in any context where performance is
required or measured. If such a usage is required, the recommended approach is to use PackageCompiler.jl to precompile
the functionality into the sysimage, giving a usage that is similar to shared libraries of other compiled languages like C.
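A minimal sketch of the PackageCompiler.jl workflow the draft text alludes to. The package name and sysimage path here are illustrative placeholders; check the PackageCompiler.jl documentation for the current API before relying on this:

```julia
using PackageCompiler

# Bake a package's compiled code into a custom system image, so that
# fresh `julia` processes no longer pay its compilation cost on startup:
create_sysimage([:Example]; sysimage_path="custom_sys.so")

# Then launch Julia against that image from the shell:
#   julia --sysimage custom_sys.so my_script.jl
```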

I'd put all your recommendations for fixing the issue in a block.

@KristofferC
Member

I like the idea but this feels a bit heavy for what it's trying to convey, while also a bit disconnected from the rest of the performance tips page

This was my initial reaction as well. Compilation time is already discussed in earlier sections, so I don't really see what this provides. And, to me, this comes off a bit like a "you're holding it wrong" type of comment that people will start linking as soon as someone complains about compilation time. The tone feels more suitable for a blog or a medium like that.

@DilumAluthge
Member

The tone feels more suitable for a blog or a medium like that.

If I understand correctly, @ChrisRackauckas specifically wants this to be in the manual so that when someone is running e.g. some kind of benchmark competition between languages, we can point them to the manual as an official/authoritative way of saying "your benchmarking claim is not correct."

@matthieugomez
Contributor

matthieugomez commented Sep 30, 2020

I don't think it is inherently wrong to complain about compilation time in certain packages (as long as users clearly identify it). For instance, packages with over-parameterized types can lead to large compilation times that do not disappear over time; I think this is why DataFrames decided not to be strongly typed in the end.

@ViralBShah
Member

One of the things we are really missing is a description of Julia's compilation stages in the manual: something we can really point users to, a mental model they can hold of what happens during precompilation, what happens during compilation, what the JIT does, what is included in a system image, and then the differences between a first and a second run.

I know this is not the point of this PR, but I think having this would greatly help a lot of users, and then the point about how to benchmark and so on can be included as well.

@ViralBShah ViralBShah added the docs This change adds or pertains to documentation label Feb 7, 2021
@DilumAluthge
Member

@ChrisRackauckas and @timholy What is the status of this PR, and what needs to be done?

It would be great to get something like this into the manual.

@DilumAluthge DilumAluthge requested a review from timholy February 21, 2021 21:43
@DilumAluthge
Member

Also @ChrisRackauckas could you rebase this on the latest master?

@timholy timholy removed their request for review February 21, 2021 22:46
@timholy
Member

timholy commented Feb 21, 2021

(I removed my name from requested reviewers because I don't think it's changed since this review.)

@stillyslalom
Contributor

My take on the wording for this PR, focused more on the new-user experience: I don't think plotting is 'performance-sensitive' in most cases, but plot latency is a bigger pain point than microbenchmarking.

Recoup the cost of compilation overhead by keeping a Julia session alive

When using simple functionality from large packages (including most plotting and data-processing libraries), it can take several seconds to load and compile functions from a given package, even though subsequent use might take fractions of a second. This is known as the "time to first plot", and it arises from the flexibility (and attendant complexity) of dynamic multiple dispatch, which prevents Julia from simply caching the results of compilation for re-use between sessions.

Several options are available to help avoid paying the cost of compilation more often than necessary. The easiest is to keep a Julia session open, typically using Revise.jl to automatically replace modified function definitions whenever you save your source files. If you need to repeatedly call a Julia script from the command line, try DaemonMode.jl, which uses a client/server model with a Julia process running in the background. For deployed applications, as well as 'static' workflows involving a fixed set of packages, you can use PackageCompiler.jl.
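As a concrete sketch of the DaemonMode.jl client/server pattern mentioned above. The exact command lines are assumptions based on the package's documented pattern; consult the DaemonMode.jl README for the current invocation:

```julia
# In one terminal, start a persistent Julia server process:
#   julia --startup-file=no -e 'using DaemonMode; serve()'
#
# In another terminal, send scripts to the warm server instead of
# launching a fresh Julia process each time:
#   julia --startup-file=no -e 'using DaemonMode; runargs()' myscript.jl
#
# Every run after the first reuses the methods already compiled inside
# the server process, so the per-invocation compilation cost disappears.
```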

@vtjnash
Member

vtjnash commented Apr 21, 2021

I don't think this is particularly necessary or productive to include in the performance tips. In an ideal world, people would benchmark what matters. But I don't think we can beneficially tell people they are wrong for measuring something else, so it remains up to readers to decide if a particular benchmark report is sensible.

@vtjnash vtjnash closed this Apr 21, 2021