Optimize EndianReader to use ptr/len #302
Conversation
Looks great! A couple nitpicks inline below. Thanks @philipc !
src/endian_reader.rs (outdated):

 impl<Endian, T> EndianReader<Endian, T>
 where
     Endian: Endianity,
-    T: Deref<Target = [u8]> + Clone + Debug,
+    T: CloneStableDeref<Target = [u8]> + Clone + Debug,
We don't need the `Clone` trait bound anymore for any of these, because `CloneStableDeref` has `Clone` as a supertrait.
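A minimal sketch of why the extra bound is redundant. The trait definitions here are hypothetical stand-ins for the `stable_deref_trait` crate's traits (the real ones are `unsafe` and carry safety contracts), but the supertrait relationship is the same: requiring `CloneStableDeref` already gives you `Clone`.

```rust
use std::fmt::Debug;
use std::ops::Deref;
use std::rc::Rc;

// Stand-ins for `stable_deref_trait::{StableDeref, CloneStableDeref}`.
trait StableDeref: Deref {}
trait CloneStableDeref: StableDeref + Clone {}

impl StableDeref for Rc<[u8]> {}
impl CloneStableDeref for Rc<[u8]> {}

// No `+ Clone` needed: the `CloneStableDeref` bound already implies it.
fn duplicate<T>(value: &T) -> T
where
    T: CloneStableDeref<Target = [u8]> + Debug,
{
    value.clone()
}

fn main() {
    let bytes: Rc<[u8]> = Rc::from(&[1u8, 2, 3][..]);
    let copy = duplicate(&bytes);
    assert_eq!(&*copy, &[1, 2, 3]);
}
```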
@@ -143,59 +149,77 @@
 // `self.endian`. Splitting the sub-range out from the endian lets us work
 // around this, making it so that only the `self.range` borrow is held active,
 // not all of `self`.
+//
+// This also serves to encapsulate the unsafe code concerning `CloneStableDeref`.
Maybe also comment about how we must keep a handle to `bytes` around, since the `ptr` is only valid while we still hold the `bytes` (e.g. because holding the `bytes` holds a refcount for us).
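The invariant being asked for can be sketched like this. The struct below is illustrative, not gimli's actual `SubRange`: the raw `ptr` points into the allocation that `bytes` owns, so the reader must hold its `bytes` handle (a refcount) for as long as `ptr` is used.

```rust
use std::rc::Rc;

// `bytes` is held purely to keep the allocation alive; it is never read
// directly, which is exactly the ownership pattern under discussion.
#[allow(dead_code)]
struct SubRange {
    bytes: Rc<[u8]>, // holding this keeps a refcount, so the allocation stays live
    ptr: *const u8,  // points into `bytes`; dangles if the last handle is dropped
    len: usize,
}

impl SubRange {
    fn new(bytes: Rc<[u8]>) -> SubRange {
        let ptr = bytes.as_ptr();
        let len = bytes.len();
        SubRange { bytes, ptr, len }
    }

    fn slice(&self) -> &[u8] {
        // Sound only because `self.bytes` still owns a refcount on the allocation.
        unsafe { std::slice::from_raw_parts(self.ptr, self.len) }
    }
}

fn main() {
    let range = SubRange::new(Rc::from(&b"DWARF"[..]));
    assert_eq!(range.slice(), b"DWARF");
}
```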
 r.range.end = r.range.start + idx.end;
 r.range.start += idx.start;
 assert!(r.range.start <= r.range.end);
 assert!(r.range.end <= self.range.end);
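A small model of the narrowing arithmetic above (the function name is illustrative): `idx` is interpreted relative to the current range, and the asserts reject any sub-range that would escape the parent's bounds.

```rust
use std::ops::Range;

// Mirror of the diff's arithmetic: translate a relative `idx` into an
// absolute range, then bounds-check it against the parent.
fn narrow(parent: &Range<usize>, idx: Range<usize>) -> Range<usize> {
    let end = parent.start + idx.end;
    let start = parent.start + idx.start;
    assert!(start <= end);
    assert!(end <= parent.end);
    start..end
}

fn main() {
    // Relative 5..15 inside absolute 10..30 yields absolute 15..25.
    assert_eq!(narrow(&(10..30), 5..15), 15..25);
}
```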
It is nice to have all this encapsulated.
Interesting that our benches didn't show any speed up but the larger program did. I wonder if our benches are small enough that the caches are hiding some of the indirection costs for us here.
This requires `stable_deref_trait::CloneStableDeref` for the bytes. If needed in future, we could relax that to `stable_deref_trait::StableDeref` by implementing a `Clone` for `SubRange` that recalculates the `ptr`. This doesn't show any significant change in the gimli benchmarks, but I have modified `addr2line` to use `Rc<Cow<[u8]>>`, and it shows about a 15% improvement for all operations there (both context creation and lookup).
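The relaxation mentioned above can be sketched as follows. This is an illustrative `SubRange`, not gimli's actual one: if we only had `StableDeref`, cloning the bytes would not be guaranteed to preserve the deref address, so a hand-written `Clone` re-derives `ptr` from the clone's own buffer instead of copying the raw pointer.

```rust
use std::rc::Rc;

struct SubRange {
    bytes: Rc<[u8]>,
    ptr: *const u8,
    len: usize,
}

impl SubRange {
    fn new(bytes: Rc<[u8]>) -> SubRange {
        let ptr = bytes.as_ptr();
        let len = bytes.len();
        SubRange { bytes, ptr, len }
    }

    fn slice(&self) -> &[u8] {
        // Sound because `self.bytes` keeps the allocation alive.
        unsafe { std::slice::from_raw_parts(self.ptr, self.len) }
    }
}

// Without `CloneStableDeref`, recompute `ptr` from the cloned handle rather
// than assuming the old pointer is valid for the clone.
impl Clone for SubRange {
    fn clone(&self) -> SubRange {
        let offset = self.ptr as usize - self.bytes.as_ptr() as usize;
        let bytes = Rc::clone(&self.bytes);
        let ptr = unsafe { bytes.as_ptr().add(offset) };
        SubRange { bytes, ptr, len: self.len }
    }
}

fn main() {
    let r = SubRange::new(Rc::from(&b"hello"[..]));
    assert_eq!(r.clone().slice(), b"hello");
}
```

(With `Rc` specifically the deref address is stable across clones anyway, which is why the PR can take the simpler `CloneStableDeref` route.)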
Force-pushed from 8fc936e to f4a43c2.
`perf` reports that a large chunk of the time in the benches is spent in a move and a `drop_in_place`, and this didn't change due to this PR. This PR did make the code smaller and remove some slice bounds checks, but they weren't being reported as hot. That move and drop aren't present in the …