Optimise line number iteration #276
Conversation
Before:

```
test bench_executing_line_number_programs      ... bench: 379,536 ns/iter (+/- 12,044)
test bench_parsing_line_number_program_opcodes ... bench: 271,593 ns/iter (+/- 9,285)
```

After:

```
test bench_executing_line_number_programs      ... bench: 246,375 ns/iter (+/- 11,192)
test bench_parsing_line_number_program_opcodes ... bench:  90,493 ns/iter (+/- 2,722)
```

Before:

```
test bench_executing_line_number_programs ... bench: 246,375 ns/iter (+/- 11,192)
```

After:

```
test bench_executing_line_number_programs ... bench: 173,505 ns/iter (+/- 4,188)
```

Before:

```
test bench_executing_line_number_programs ... bench: 173,505 ns/iter (+/- 4,188)
```

After:

```
test bench_executing_line_number_programs ... bench: 161,251 ns/iter (+/- 4,392)
```

Otherwise we get a lot of extra baggage even when it isn't used.

Before:

```
test bench_executing_line_number_programs ... bench: 161,251 ns/iter (+/- 4,392)
```

After:

```
test bench_executing_line_number_programs ... bench: 116,779 ns/iter (+/- 4,008)
```
Great stuff :)
```rust
        (op_index_with_advance / maximum_operations_per_instruction);
self.row.registers.op_index = op_index_with_advance % maximum_operations_per_instruction;
if maximum_operations_per_instruction == 1 {
```
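The special case in the snippet above can be sketched as a standalone function. This is not gimli's actual code; the variable names follow the DWARF line number program registers, and the function signature is illustrative only:

```rust
// Advance the `address` and `op_index` registers by `operation_advance`
// operations, as in the DWARF line number program state machine.
// When `max_ops` (maximum_operations_per_instruction) is 1 -- the common
// case on non-VLIW targets -- op_index is always 0 and the division and
// modulo can be skipped entirely.
fn advance_address(
    address: u64,
    op_index: u64,
    operation_advance: u64,
    max_ops: u8,
    min_inst_len: u8,
) -> (u64, u64) {
    if max_ops == 1 {
        (address + min_inst_len as u64 * operation_advance, 0)
    } else {
        let op_index_with_advance = op_index + operation_advance;
        let address = address
            + min_inst_len as u64 * (op_index_with_advance / max_ops as u64);
        let op_index = op_index_with_advance % max_ops as u64;
        (address, op_index)
    }
}
```

Both branches compute the same result when `max_ops == 1`; the fast path just avoids the integer division in the hot loop.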
Wow, a little surprised this is such a large speed up.
Agreed. It definitely helped in addr2line too though, so it's not just a synthetic benchmark.
```diff
@@ -503,7 +503,7 @@ pub enum Opcode<R: Reader> {
     UnknownStandard1(constants::DwLns, u64),

     /// An unknown standard opcode with multiple operands.
-    UnknownStandardN(constants::DwLns, Vec<u64>),
+    UnknownStandardN(constants::DwLns, R),
```
To be clear, the issue is the size of `Vec` bloating the decoded instruction size, not that we are hitting a lot of unknown standard opcodes that also have more than one operand, right?
If so, we could save another word (assuming this is still the widest variant) by boxing `R`, since `R` is usually two words with `EndianBuf`.
No, it's not the size of the `Vec`. The largest variant is currently `DefineFile` at 48 bytes, and `UnknownStandardN` was already only 32 before this change.
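The size argument is easy to check with `std::mem::size_of`: an enum is as wide as its widest variant plus the tag. A minimal sketch, assuming a 64-bit target and using illustrative stand-in types rather than gimli's real definitions:

```rust
use std::mem::size_of;

// Stand-in for gimli's constants::DwLns (a one-byte opcode constant).
#[allow(dead_code)]
struct DwLns(u8);

// Payload is a Vec<u64>: 3 words (ptr, capacity, length).
#[allow(dead_code)]
enum WithVec {
    UnknownStandard1(DwLns, u64),
    UnknownStandardN(DwLns, Vec<u64>),
}

// Payload is a borrowed slice, standing in for an EndianBuf-style
// reader: 2 words (ptr, length).
#[allow(dead_code)]
enum WithReader<'a> {
    UnknownStandard1(DwLns, u64),
    UnknownStandardN(DwLns, &'a [u8]),
}

fn main() {
    // On a 64-bit target the Vec-carrying enum is strictly larger,
    // since Vec<u64> is 24 bytes and &[u8] is 16.
    println!("with Vec:    {} bytes", size_of::<WithVec>());
    println!("with reader: {} bytes", size_of::<WithReader>());
}
```

The exact totals depend on how rustc lays out the tag and padding, but the ordering between the two enums holds on any 64-bit target.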
We're not hitting unknown standard opcodes.
I haven't investigated the assembly in great depth (there's too much), but there are a few inlined `Vec` functions that are no longer needed, and I guess this helps the optimiser. It also means there's no code to run to drop the `Opcode`.
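The "no code to run on drop" point can be checked directly with `std::mem::needs_drop`, using plain `Vec` and slice types as stand-ins for the old and new payloads:

```rust
use std::mem::needs_drop;

fn main() {
    // A Vec payload gives the enum drop glue: the heap buffer must be freed.
    assert!(needs_drop::<Vec<u64>>());
    // A borrowed-slice payload (like an EndianBuf-style reader) is plain
    // data, so nothing runs when the value goes out of scope.
    assert!(!needs_drop::<&[u8]>());
    println!("ok");
}
```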
Useful for addr2line. Adds another `inline(always)`, unfortunately.