cranelift: Make inline stackprobe unroll sequence valgrind compliant #7470
Conversation
Thanks!
```rust
// When manually unrolling, adjust the stack pointer and then write a zero
// to the stack at that offset. This generates something like
// `sub sp, sp, #1, lsl #12` followed by `stur wzr, [sp]`.
//
// We do this because valgrind expects us to never write beyond the stack
// pointer and its associated redzone.
// See: https://github.com/bytecodealliance/wasmtime/issues/7454
for _ in 0..probe_count {
    insts.extend(Self::gen_sp_reg_adjust(-(guard_size as i32)));

    insts.push(Self::gen_store_stack(
        StackAMode::SPOffset(0, I8),
        zero_reg(),
        I32,
    ));
}

// Restore the stack pointer to its original value.
insts.extend(Self::gen_sp_reg_adjust((guard_size * probe_count) as i32));
```
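For contrast, here is a rough sketch of what the unrolled probe presumably looked like before this change, reconstructed from the PR description rather than from the removed code: each probe stored directly below the live stack pointer, which is exactly the pattern valgrind flags.

```rust
// Hypothetical reconstruction of the old unroll (based on the PR
// description, not the exact removed code): store a zero at a growing
// negative offset below SP without moving SP first, e.g.
// `stur wzr, [sp, #-4096]`. Because these stores land beyond the stack
// pointer and its redzone, valgrind reports them as invalid writes.
for i in 0..probe_count {
    insts.push(Self::gen_store_stack(
        StackAMode::SPOffset(-(((i + 1) * guard_size) as i64), I8),
        zero_reg(),
        I32,
    ));
}
```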
Out of curiosity / unrelated to this PR: do you know why it is necessary to probe via storing to the stack rather than loading from the stack? It seems like either would accomplish the task AFAICT, and loads should be cheaper than stores.
From a global-cost point of view, stores might actually be better, I suspect: if the probe faults in a new page, a load would cause a read-only mapping to the global zero page to be created, and the first write would then cause another page fault (at least if the stack is an ordinary MAP_ANON region). Or at least, we've seen elsewhere that it's better for the first touch to be a write when TLB contention is high...
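To make that concrete, here is a minimal standalone sketch (assuming Linux and the `libc` crate; this example is illustrative and not from the PR) of the read-first versus write-first touch behavior on a fresh anonymous mapping:

```rust
use std::ptr;

fn main() {
    unsafe {
        // Map one fresh anonymous page, analogous to an untouched stack page.
        let page = libc::mmap(
            ptr::null_mut(),
            4096,
            libc::PROT_READ | libc::PROT_WRITE,
            libc::MAP_PRIVATE | libc::MAP_ANON,
            -1,
            0,
        ) as *mut u8;
        assert_ne!(page, libc::MAP_FAILED as *mut u8);

        // Read-first touch: fault #1 typically maps the page copy-on-write
        // to the kernel's shared zero page; the write below then takes
        // fault #2 to allocate a private writable page.
        let _ = ptr::read_volatile(page);
        ptr::write_volatile(page, 1);

        // Write-first touch on a second page takes a single fault that
        // allocates a writable page immediately, which is why probing
        // with stores can be cheaper overall.
        let page2 = libc::mmap(
            ptr::null_mut(),
            4096,
            libc::PROT_READ | libc::PROT_WRITE,
            libc::MAP_PRIVATE | libc::MAP_ANON,
            -1,
            0,
        ) as *mut u8;
        assert_ne!(page2, libc::MAP_FAILED as *mut u8);
        ptr::write_volatile(page2, 1);

        libc::munmap(page as *mut libc::c_void, 4096);
        libc::munmap(page2 as *mut libc::c_void, 4096);
    }
}
```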
Ah okay that makes sense. Thanks!
…ytecodealliance#7470)
* x64: Make inline probe sequence valgrind compliant
* aarch64: Move stackprobe implementation into separate functions
* aarch64: Make inline probe sequence valgrind compliant
* riscv64: Make inline probe sequence valgrind compliant
* riscv64: Avoid reloading probe amount when unrolling
👋 Hey,
This PR alters our stackprobe unroll sequences to move the stack pointer before writing to the stack. I don't think there was anything wrong with what we were doing before, but it makes valgrind really unhappy. (See #7454)
I've tested this with cg_clif on x86, and it now cleanly passes valgrind for the originally reported test case.
Fixes: #7454
prtest:full