CPU usage regression for ripgrep #48257

sophiajt · 2018-02-16T07:48:54Z

As reported on Twitter: https://twitter.com/jleedev/status/964324747277455360

Looks like ripgrep with 1.24 consumes far more cpu power to compile than it did with 1.23.

matthiaskrgr · 2018-02-16T11:06:31Z

Is there a way to run RUSTFLAGS="-Z time-llvm-passes -Z time-passes" on stable/beta to get some background data on this issue?

pietroalbini · 2018-02-16T11:36:08Z

@matthiaskrgr the RUSTC_BOOTSTRAP=1 environment variable should do the trick.

matthiaskrgr · 2018-02-16T12:45:27Z

Looks like it's the codegen-units that make the difference here. (note: this system has a i5-7200U dual core / 4 threads cpu).
The tweet suggests that the machine could be an RPI, how many cores do these things usually have?

RUSTC_BOOTSTRAP=1 RUSTFLAGS="-Z time-passes -C codegen-units=1"  cargo +1.23.0  build --release
real	1m31,136s
user	3m5,831s
sys	0m3,018s

RUSTC_BOOTSTRAP=1 RUSTFLAGS="-Z time-passes "  cargo +1.23.0  build --release
real	1m29,970s
user	3m1,953s
sys	0m2,963s

RUSTC_BOOTSTRAP=1 RUSTFLAGS="-Z time-passes -C codegen-units=1" cargo +stable  build --release 

real	1m27,657s
user	2m57,814s
sys	0m3,034s

RUSTC_BOOTSTRAP=1 RUSTFLAGS="-Z time-passes" cargo +stable  build --release 

real	1m34,053s
user	5m35,334s
sys	0m4,888s

ishitatsuyuki · 2018-02-16T13:15:41Z

Parallel codegen-units is not as efficient as the old serial approach, and is likely the cause.

nikomatsakis · 2018-02-22T15:23:57Z

Added to #47745

nikomatsakis · 2018-02-22T15:25:20Z

My bad, this a compile-time regression?

nikomatsakis · 2018-02-22T15:26:28Z

triage: P-medium

There doesn't seem to be a lot to do here, other than perhaps reconsidering our defaults?

cc @alexcrichton -- what is the current strategy for --release, anyway? (How many CGU etc?)

sfackler · 2018-02-22T17:32:40Z

Release builds default to 16 codegen units.

alexcrichton · 2018-02-24T20:21:03Z

I believe this is sort of "expected fallout" of enabling multiple codegen units by default in release builds. Given that the overall time didn't regress here this is likely just "business as usual"

sophiajt · 2018-02-25T20:44:51Z

@alexcrichton - I'd caution us against treating this as "business as usual". The Rust compiler is consuming more energy and yielding nothing in return. Since one of the core tenants of Rust is its efficiency, to be so much less efficient during a compilation feels like a step backward.

Like Aaron likes to say, we should find a way to "bend the curve" and avoid improving in some way only to noticeably regress in others.

alexcrichton · 2018-02-25T20:51:36Z

@jonathandturner I agree we should be cautious, and I agree broadly that we shouldn't bush off issues like this. That being said we aren't getting nothing in return for ThinLTO, rather we're seeing broad improvements across the board. Some crates regress a little but most see huge improvements with parallel compilation.

michaelwoerister · 2018-03-01T16:10:34Z

I'm less and less sure about ThinLTO. The code it produces is less performant than full LTO, so it's not the best choice for release artifacts. At the same time it compiles slower than -Ccodegen-units=N which is usually optimized enough for the "debug-opt" use case.

alexcrichton · 2018-03-01T16:56:46Z

@michaelwoerister it's true that ThinLTO + multiple codegen units for a crate isn't always as fast as one codegen unit for that crate, but I think we're really undervaluing the compile time wins from multiple codegen units and overvaluing microbenchmarks and a few percent performance differences.

For example I feel like multiple codegen units + ThinLTO is a fantastic default for rustc. Builds feel much faster nowadays on multicore machines as it's not sitting with one CPU pegged for multiple minutes on librustc. Stats of perf.rust-lang.org show that ThinLTO does indeed regress performance by ~5% but in the grand scheme of things that seems like a tiny (and barely perceptible) tradeoff for the compile time wins we're seeing.

I also feel like we write off the compile times of release builds too frequently as "oh it's fine if it takes a long time to build". In reality though I feel like release builds happen way more often than we necessarily perceive. For example all Rust CI builds are in release mode (as we publish artifacts) and we can't really get around that. I'd imagine that there are other projects where you're testing an optimized binary pretty frequently but still need to make changes.

pnkfelix · 2018-03-08T14:50:30Z

Tagging with T-compiler (something I was tempted to do last week but I'm not sure why I balked at that time).

Enselic · 2023-09-27T17:23:37Z

Triage: By now (5 years later) it seems unlikely that this issue will ever be acted on, so let's close as won't fix rather than keeping this open forever.

matthiaskrgr mentioned this issue Feb 16, 2018

option to print total crate build time rust-lang/cargo#5046

Closed

nagisa mentioned this issue Feb 22, 2018

codegen-units + ThinLTO is not as good as codegen-units = 1 #47745

Open

rust-highfive added the P-medium Medium priority label Feb 22, 2018

pnkfelix added the T-compiler Relevant to the compiler team, which will review and decide on the PR/issue. label Mar 8, 2018

Enselic closed this as not planned Won't fix, can't repro, duplicate, stale Sep 27, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CPU usage regression for ripgrep #48257

CPU usage regression for ripgrep #48257

sophiajt commented Feb 16, 2018

matthiaskrgr commented Feb 16, 2018

pietroalbini commented Feb 16, 2018

matthiaskrgr commented Feb 16, 2018 •

edited

Loading

ishitatsuyuki commented Feb 16, 2018

nikomatsakis commented Feb 22, 2018

nikomatsakis commented Feb 22, 2018

nikomatsakis commented Feb 22, 2018

sfackler commented Feb 22, 2018

alexcrichton commented Feb 24, 2018

sophiajt commented Feb 25, 2018

alexcrichton commented Feb 25, 2018

michaelwoerister commented Mar 1, 2018

alexcrichton commented Mar 1, 2018

pnkfelix commented Mar 8, 2018

Enselic commented Sep 27, 2023

CPU usage regression for ripgrep #48257

CPU usage regression for ripgrep #48257

Comments

sophiajt commented Feb 16, 2018

matthiaskrgr commented Feb 16, 2018

pietroalbini commented Feb 16, 2018

matthiaskrgr commented Feb 16, 2018 • edited Loading

ishitatsuyuki commented Feb 16, 2018

nikomatsakis commented Feb 22, 2018

nikomatsakis commented Feb 22, 2018

nikomatsakis commented Feb 22, 2018

sfackler commented Feb 22, 2018

alexcrichton commented Feb 24, 2018

sophiajt commented Feb 25, 2018

alexcrichton commented Feb 25, 2018

michaelwoerister commented Mar 1, 2018

alexcrichton commented Mar 1, 2018

pnkfelix commented Mar 8, 2018

Enselic commented Sep 27, 2023

matthiaskrgr commented Feb 16, 2018 •

edited

Loading