Fix reinterpret performance #28707

Keno · 2018-08-16T19:31:25Z

This fixes #25014 by making it more obvious what's going on to LLVM.
Instead of a memcpy loop, we use a new intrinsic that puts an actual
llvm.memcpy into the IR, which is enough for LLVM to fold everything
away. In the benchmark from #25014, we still see some regressions from
0.6, but that is because it needs to dereference through the pointers
in the reinterpret and reshape wrappers. In any real code, that
dereferencing should be loop-invariantly moved out of the inner loop.

vtjnash · 2018-08-16T20:15:17Z

Glad to see this works. Can we just transparently make this he implementation of ccall(:memcpy) (it looks like we don’t need the extra alignment argument right now)

tknopp · 2018-08-16T20:28:10Z

@Keno Does this make the alignment workaround

A = Mmap.mmap(fdio(fd), Array{UInt8,1}, prod(dims)*sizeof(T), offset)
reshape(reinterpret(T,A),dims)

as fast as

A = Mmap.mmap(fdio(fd), Array{T,dims}, dims, offset)

under 0.6? I needed to change this here
https://github.com/JuliaIO/HDF5.jl/blob/master/src/HDF5.jl#L1651
in order to make code work that failed in 0.7 due to the alignment issue.

Keno · 2018-08-16T21:46:48Z

Can we just transparently make this he implementation of ccall(:memcpy)

Ok, let's do it that way. We can always add the aligned version if we need it later.

Keno · 2018-08-16T21:47:44Z

Does this make the alignment workaround [...] fast

Probably

This fixes #25014 by making it more obvious what's going on to LLVM. Instead of a memcpy loop, we use a ccall to :memcpy and turn this into llvm.memcpy at the IR level, which is enough for LLVM to fold everything away. In the benchmark from #25014, we still see some regressions from 0.6, but that is because it needs to dereference through the pointers in the reinterpret and reshape wrappers. In any real code, that dereferencing should be loop-invariantly moved out of the inner loop.

StefanKarpinski · 2018-08-17T17:12:12Z

This seems eligible for 1.0.1, right?

Keno · 2018-08-17T17:16:25Z

Yes, already has the backport label.

Keno added performance Must go faster backport pending 1.0 labels Aug 16, 2018

Keno changed the title ~~Fix reinterpret performnace~~ Fix reinterpret performance Aug 16, 2018

Keno force-pushed the kf/reinterpretperf branch from 0dbc329 to 1fe5cbf Compare August 16, 2018 21:46

tknopp mentioned this pull request Aug 16, 2018

mmap with arbitrary offsets no more allowed #28424

Closed

Keno force-pushed the kf/reinterpretperf branch from 1fe5cbf to 93164b7 Compare August 17, 2018 01:21

Keno merged commit 777810b into master Aug 17, 2018

StefanKarpinski deleted the kf/reinterpretperf branch August 17, 2018 17:11

ExpandingMan mentioned this pull request Aug 18, 2018

performance on 0.6, reinterpret and safety ExpandingMan/Arrow.jl#6

Open

KristofferC mentioned this pull request Aug 19, 2018

Backports to 1.0.1 #28764

Merged

RalphAS mentioned this pull request Aug 31, 2018

Performance of ReinterpretArray, continued #28980

Closed

RalphAS mentioned this pull request Sep 7, 2018

Serious regression of warp! JuliaImages/ImageTransformations.jl#60

Closed

KristofferC removed the backport pending 1.0 label Sep 27, 2018

vtjnash mentioned this pull request Feb 14, 2019

Illegal instruction with ccall to :memcpy #31073

Closed

timholy mentioned this pull request Mar 24, 2019

kwargs are annoying in the debugger JuliaDebug/Debugger.jl#141

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix reinterpret performance #28707

Fix reinterpret performance #28707

Keno commented Aug 16, 2018

vtjnash commented Aug 16, 2018

tknopp commented Aug 16, 2018

Keno commented Aug 16, 2018

Keno commented Aug 16, 2018

StefanKarpinski commented Aug 17, 2018

Keno commented Aug 17, 2018

Fix reinterpret performance #28707

Fix reinterpret performance #28707

Conversation

Keno commented Aug 16, 2018

vtjnash commented Aug 16, 2018

tknopp commented Aug 16, 2018

Keno commented Aug 16, 2018

Keno commented Aug 16, 2018

StefanKarpinski commented Aug 17, 2018

Keno commented Aug 17, 2018