elemIndexEnd non-optimally implemented in terms of findIndexEnd #278

archaephyrryx · 2020-08-30T20:24:23Z

The current implementations of the elemIndexEnd function in modules Data.ByteString and Data.ByteString.Lazy are uniformly defined as findIndexEnd . (==) (the Char8 version merely invokes the strict bytestring definition after converting from Char to Word8).

This seems counterintuitive when compared with elemIndex, which is more optimized than findIndex through the use of the memchr C FFI call to avoid costly byte-by-byte predicate testing.

There exists a GNU extension for string.h that defines an operation memrchr that performs a similar operation to memchr but returns the final occurrence of a byte rather than the first, which could be used when available. Even without such platform-specific optimizations, it should still be possible to either add a memrchr-like function to the cbits code.

Even without FFI calls, elemIndexEnd could use the same logic as findIndexEnd and perform a byte-by-byte direct equality test at least as efficiently as findIndexEnd . (==), but without the indirection.

The text was updated successfully, but these errors were encountered:

Bodigrim · 2020-09-05T10:23:14Z

The implementation of elemIndexEnd was recently discussed here: #155 (comment) It seems that thanks to inlining there is no performance penalty.

AFAIU Windows systems do not provide memrchr, so the only option to remain cross-platform is to implement memrchr in cbits. I do not mind against such approach, if supported by benchmarks.

Bodigrim added the performance label Aug 30, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

elemIndexEnd non-optimally implemented in terms of findIndexEnd #278

elemIndexEnd non-optimally implemented in terms of findIndexEnd #278

archaephyrryx commented Aug 30, 2020

Bodigrim commented Sep 5, 2020

elemIndexEnd non-optimally implemented in terms of findIndexEnd #278

elemIndexEnd non-optimally implemented in terms of findIndexEnd #278

Comments

archaephyrryx commented Aug 30, 2020

Bodigrim commented Sep 5, 2020