-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The binary size - performance tradeoff #69
Comments
FWIW: I think funky stuff like unrolling (which is really worthless on embedded architectures) is to be expected to be done at higher optimisation levels and I've seen all kind of funky size regressions. I'm always using (and recommending) |
|
NB: I highly doubt that. As soon as one uses some the initialisation code from some of the typical SDKs, the code will be well in the kBs already. To even stay in Rusts range you'll have to manually bang the memory mapped registers and write your own linker scripts. Case in point, this is the smallest possible binary for a
|
Could we get a RFC for something like |
@Emilgardis No need for an RFC, marking the function with the loop as |
@japaric also wrote in rust-lang/rust#49260:
If this is still the case with LLVM 6, this definitely wants to be investigated and fixed on the LLVM side. He also wrote:
This might be the cause for some amount of bloat due to unnecessary branches, but shouldn't |
Why not get the default flags changed instead? It'd be very annoying to put annotations in every source file just in case someone might accidentally not change the compiler flags... |
The inlining thresholds in LLVM are tailored for C, which produces functions with relatively compact IR, and likely aren't well suited for Rust. In our in-house language we had to raise them significantly to get decent reductions in code size. |
@whitequark I've never heard of that attribute, seems like it should work however. |
@jonas-schievink I can not confirm that it produces larger files with opt-level=s, at least not in general. This all has quite a bit of premature optimisation smell to it, same as with the |
There has been a small amount of discussion about unrolling atttributes:
rust-lang/rfcs#2219
…On Fri, Mar 23, 2018 at 11:29 AM, Daniel Egger ***@***.***> wrote:
@jonas-schievink <https://github.com/jonas-schievink> I can not confirm
that it produces larger files with opt-level=s, at least not in general.
This all has quite a bit of premature optimisation smell to it, same as
with the #[inline(always)] we had sprinkled all over the map...
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#69 (comment)>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAC3ny1RMzakWcAlwf4TUD8Up6h0FM_1ks5thRS9gaJpZM4S2_bO>
.
|
The choice to unroll or not should be at the LLVM layer. With the Linux HAL I could very well want loop unrolling if my x86_64 machine was using a driver over the SMBus.
Is it possible for the LLVM backends to automatically opt-out when appropriate?
… On Mar 23, 2018, at 11:23 AM, Alex Burka ***@***.***> wrote:
There has been a small amount of discussion about unrolling atttributes:
rust-lang/rfcs#2219
On Fri, Mar 23, 2018 at 11:29 AM, Daniel Egger ***@***.***>
wrote:
> @jonas-schievink <https://github.com/jonas-schievink> I can not confirm
> that it produces larger files with opt-level=s, at least not in general.
> This all has quite a bit of premature optimisation smell to it, same as
> with the #[inline(always)] we had sprinkled all over the map...
>
> —
> You are receiving this because you are subscribed to this thread.
> Reply to this email directly, view it on GitHub
> <#69 (comment)>,
> or mute the thread
> <https://github.com/notifications/unsubscribe-auth/AAC3ny1RMzakWcAlwf4TUD8Up6h0FM_1ks5thRS9gaJpZM4S2_bO>
> .
>
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub, or mute the thread.
|
This issue was moved to rust-embedded/book#11 |
As of the latest LLVM upgrade (4.0 -> 6.0 on 2018-02-11) LLVM seems to now perform loop unrolling more agressively; this increased the binary size of a minimal program that only zeroes .bss and initializes .data from 130 bytes .text (nightly-2018-02-10) to 1114 bytes (nightly-2018-03-20) when using opt-level=3 + LTO -- FWIW, I highly doubt the loop unrolling actually improves performance at all. Original report: rust-lang/rust#49260
This put us in a bad spot because by default we'll end with large optimized (
--release
) binaries -- I can already foresee future comparisons between C and Rust pointing out that the smallest embedded C program is only a hundred bytes in size whereas the smallest embedded Rust program is 1 KB.So we should make sure we clearly document why Rust programs are so large by default and how to make Rust programs small. Using opt-level=s + LTO on the minimal program mentioned above brings the size back to 130 bytes .text.
cc @jamesmunns ^ that should be included in the book
There are other possibilites to explore here: like having something like C's / clang's
#pragma nounroll
to prevent LLVM from optimizing loops marked with that attribute, but I doubt we'll get any of that into the 2018 edition release -- it's too late, I think.The text was updated successfully, but these errors were encountered: