-
Notifications
You must be signed in to change notification settings - Fork 13k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ignore big-endian targets in UI tests with raw bytes #102379
Conversation
(rust-highfive has picked a reviewer for you, use r? to override) |
This comment was marked as resolved.
This comment was marked as resolved.
This comment has been minimized.
This comment has been minimized.
Some `const` UI tests have raw bytes in their output that are sensitive to the target endianness. We could consider something like the directive `stderr-per-bitwidth` for endian, but we only ever test little-endian in CI, so that would likely bit rot. For now, just ignore big-endian.
9c6c133
to
beb8d89
Compare
Hm, I wish we could at least test that we're seeing roughly expected behavior -- in particular, I worry that we might actually be wrong on big-endian given the sometimes low-level byte manipulation const does, and if we hide a bunch of tests we'd not notice.
These tests seem like they could probably be cross-compiled, I guess, which might make that easier. r? mir-opt -- do we have tests checking for big endian non-brokenness at all? (even if they only run in downstream distro CIs for the most part) |
Possibly, but that's only easy if they can be written for |
☔ The latest upstream changes (presumably #102091) made this pull request unmergeable. Please resolve the merge conflicts. |
What's the diff that we are currently seeing, which messages are we even talking about? I don't know into which CI logs I have to look...
Not having seen the diffs, I can only guess, but I assume most of the problems are caused by error messages that dump the raw memory. I think it would be a bad idea to lie in that dump and to show anything but the actual byte-by-byte content. I don't think there is a way we can fix these -- we already have separate stderr for 32bit/64bit here, for the exact same reason. We might also have some tests that transmute arrays of |
FWIW, Miri runs its test suite on the mips64-unknown-linux-gnuabi64 target, and I don't recall a single issue. Endianess is only relevant in very few places in the interpreter -- basically, when loading a multi-byte scalar value from memory, and when storing a multi-byte scalar value to memory. Nothing else works on multiple bytes of memory at a time. https://github.com/rust-lang/miri-test-libstd runs the libcore and liballoc tests on a big-endian target, too (specifically mips-unknown-linux-gnu). And a bunch of ecosystem crates run their tests on mips64-unknown-linux-gnuabi64 with Miri to test their own big-endianess-compatibility. So I am reasonably confident that the interpreter works correctly for those targets. Of course it would be nice to have some in-tree tests, but if we include out-of-tree tests we are well-covered currently. Maybe we should run the mips64-unknown-linux-gnuabi64 Miri test suite on rustc CI? |
Here's an example log from Fedora's 1.64.0 on s390x: The differences I'm trying to address here are mostly under
where it clearly got the right values, just with different endianness. |
Yeah that is supposed to print the actual memory contents, byte for byte. It would be bad if that didn't match the actual endianess of the interpreted code. This is also entirely untyped. The only thing we can do here is adjust the tests to make the contents endianess-independent. But that might not always be possible when enum discriminant values are involved. We can also add stderr normalization filters. |
Uhm, so not sure what the best way to proceed here is. Normalization sounds enticing, but from the output it does not look like there is a trivial way to make it apply generically without significant effort, am I right? The best that could have with the current infrastructure are replacement rules such as these
which is
Just ignoring these tests solves some of the issues (1, 2, 3), but not the others (4). |
Idea: we add a flag to ignore stderr output on certain filters instead of ignoring the entire test. This way we can still check that the annotations match independently of endianess (or even bitsize if we want to) |
I like the diff-filter idea, and it can still be somewhat strict -- if there's a mismatch, but both the expected and actual lines match a given regex, consider it fine. I'll see if I can add a compiletest extension along these lines. |
I've now come up with an alternative solution to this problem that does not simply ignore the tests:
I'd appreciate any comments or review! |
Maybe I'm just naive, but why do we want this? It seems to me like the root problem is that we're trying to test a portable language using a non-portable output. If we want to test that the behavior of a Rust program is correct, can we print/dump the consts using their Debug impl when possible? Of course this can't necessarily work for all consts, so maybe we should have something like a flag that opts into Debug-dumping consts and we always use in tests? Trying to hack around the non-portability of dumping the bytes just feels like repeating the old mistake of accidentally writing endian-sensitive code, which we often advertise Miri as a solution to. |
The behavior of rustc const eval is endianess-dependent -- that is a feature, not a bug. And we want const-eval to sometimes dump the raw low-level bytes of a constant; that's not a bug, either. Of course we also want to test that this output looks the way it should. So I don't quite understand what you mean. |
This is where I'm lost. As far as I can tell, the linked PRs are following your suggestions:
and thus are not testing that the output looks the way it should. We are exposing endian-ness in the compiler's output, then instead of using that information that got exposed, we erase that information in the test suite. |
Yeah I said that would be a possibility, but not one that I like.^^ There are like 3 parallel threads of discussion here, see #104135 (comment) for a more recent proposal. |
Thanks! |
Is there an issue tracking adding some big-endian checking for CI? Right now the risk of regressions here is pretty high. |
#106046 concluded that we'll leave that to external CI for now. |
Some
const
UI tests have raw bytes in their output that are sensitiveto the target endianness. We could consider something like the directive
stderr-per-bitwidth
for endian, but we only ever test little-endian inCI, so that would likely bit rot. For now, just ignore big-endian.