-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Minimal CUDA version for Lux Dense Layer #1442
Comments
Seems like it could stem from function sum_loss(x)
return sum(abs2, x)
end
begin
x = cu(rand(Float32, 10, 10))
dx = zero(x)
Enzyme.autodiff(Reverse, sum_loss, Active, Duplicated(x, dx))
end which is documented as non-functional, but how can we bypass it? We could write in the pullback form which prevents the reduction and that might work. DetailsJIT session error: Symbols not found: [ cuMemsetD8 ] |
Apparently the differentiation of |
@avik-pal with all of these issues can you upload the full error log? |
Added both the stacktraces with CUDA, cuDNN and Enzyme master |
Oh cool so now with master your last one (#1442 (comment)) now successfully differentiates and hits a GPUCompiler / LLVM.jl / LLJIT related question that I may need some assist/explanation from @vchuravy / @maleadt So quick tldr. The original code we AD has say a cublasgemm inside it. We take that code via gpucompiler and also inject a cudamemset call to it. then send it to our custom jit infra to deal compile/run (in 1.10+ thats now the lljit stuff @gbaraldi did) Ignore the nullptr stuff for now. The JIT now complains it can't find Any thoughts? |
Basically as soon as we fix this we seem to successfully differentiate through cublasgemm (since at least generating the llvm module of the derivative has now succeeded!) |
In these cases Enzyme clearly is not the originator of all these symbols:
|
Now hitting a type tree issue not deducing types for:
|
using Lux, Random, LuxCUDA, Enzyme
gdev = gpu_device()
rng = Random.default_rng()
model = Chain(Dense(2 => 3, tanh), Dense(3 => 2))
ps, st = Lux.setup(Random.default_rng(), model) |> gdev
x = rand(rng, Float32, 2, 10) |> gdev
y = first(model(x, ps, st))
function loss_function(y, model, ps, st, x)
y .= first(model(x, ps, st))
return
end
begin
y = zeros(Float32, 2, 10) |> gdev
dy = ones(Float32, 2, 10) |> gdev
dx = zeros(Float32, 2, 10) |> gdev
dps = Enzyme.make_zero(ps)
Enzyme.autodiff(Reverse, loss_function, Active, Duplicated(y, dy),
Const(model), Duplicated(ps, dps), Const(st), Duplicated(x, dx))
@show dx
@show dps
end I am now getting: 1-element ExceptionStack:
Return type `Nothing` not marked Const, but type is guaranteed to be constant
Stacktrace:
[1] error(s::String)
@ Base ./error.jl:35
[2] thunkbase(ctx::LLVM.Context, mi::Core.MethodInstance, ::Val{0x0000000000007bf5}, ::Type{Const{typeof(loss_function)}}, ::Type{Active}, tt::Type{Tuple{Duplicated{CuArray{Float32, 2, CUDA.DeviceMemory}}, Const{Chain{@NamedTuple{layer_1::Dense{typeof(tanh), Int64, Int64, Nothing, Nothing, Static.True}, layer_2::Dense{typeof(identity), Int64, Int64, Nothing, Nothing, Static.True}}, Nothing}}, Duplicated{@NamedTuple{layer_1::@NamedTuple{weight::CuArray{Float32, 2, CUDA.DeviceMemory}, bias::CuArray{Float32, 1, CUDA.DeviceMemory}}, layer_2::@NamedTuple{weight::CuArray{Float32, 2, CUDA.DeviceMemory}, bias::CuArray{Float32, 1, CUDA.DeviceMemory}}}}, Const{@NamedTuple{layer_1::@NamedTuple{}, layer_2::@NamedTuple{}}}, Duplicated{CuArray{Float32, 2, CUDA.DeviceMemory}}}}, ::Val{Enzyme.API.DEM_ReverseModeCombined}, ::Val{1}, ::Val{(false, false, false, false, false, false)}, ::Val{false}, ::Val{false}, ::Type{FFIABI}, ::Val{true})
@ Enzyme.Compiler /mnt/.julia/packages/Enzyme/TiboG/src/compiler.jl:7328
[3] #s2055#19000
@ /mnt/.julia/packages/Enzyme/TiboG/src/compiler.jl:7407 [inlined]
[4] var"#s2055#19000"(FA::Any, A::Any, TT::Any, Mode::Any, ModifiedBetween::Any, width::Any, ReturnPrimal::Any, ShadowInit::Any, World::Any, ABI::Any, ErrIfFuncWritten::Any, ::Any, ::Type, ::Type, ::Type, tt::Any, ::Type, ::Type, ::Type, ::Type, ::Type, ::Type, ::Any)
@ Enzyme.Compiler ./none:0
[5] (::Core.GeneratedFunctionStub)(::UInt64, ::LineNumberNode, ::Any, ::Vararg{Any})
@ Core ./boot.jl:602
[6] autodiff
@ /mnt/.julia/packages/Enzyme/TiboG/src/Enzyme.jl:315 [inlined]
[7] autodiff(::ReverseMode{false, FFIABI, false, false}, ::typeof(loss_function), ::Type{Active}, ::Duplicated{CuArray{Float32, 2, CUDA.DeviceMemory}}, ::Const{Chain{@NamedTuple{layer_1::Dense{typeof(tanh), Int64, Int64, Nothing, Nothing, Static.True}, layer_2::Dense{typeof(identity), Int64, Int64, Nothing, Nothing, Static.True}}, Nothing}}, ::Duplicated{@NamedTuple{layer_1::@NamedTuple{weight::CuArray{Float32, 2, CUDA.DeviceMemory}, bias::CuArray{Float32, 1, CUDA.DeviceMemory}}, layer_2::@NamedTuple{weight::CuArray{Float32, 2, CUDA.DeviceMemory}, bias::CuArray{Float32, 1, CUDA.DeviceMemory}}}}, ::Const{@NamedTuple{layer_1::@NamedTuple{}, layer_2::@NamedTuple{}}}, ::Duplicated{CuArray{Float32, 2, CUDA.DeviceMemory}})
@ Enzyme /mnt/.julia/packages/Enzyme/TiboG/src/Enzyme.jl:332
[8] top-level scope
@ REPL[19]:7
[9] top-level scope
@ none:1 |
Does loss_function return anything? This error message indicates enzyme
thinks it returns nothing
…On Mon, Sep 9, 2024 at 10:17 AM Avik Pal ***@***.***> wrote:
using Lux, Random, LuxCUDA, Enzyme
gdev = gpu_device()
rng = Random.default_rng()
model = Chain(Dense(2 => 3, tanh), Dense(3 => 2))
ps, st = Lux.setup(Random.default_rng(), model) |> gdev
x = rand(rng, Float32, 2, 10) |> gdev
y = first(model(x, ps, st))
function loss_function(y, model, ps, st, x)
y .= first(model(x, ps, st))
returnend
begin
y = zeros(Float32, 2, 10) |> gdev
dy = ones(Float32, 2, 10) |> gdev
dx = zeros(Float32, 2, 10) |> gdev
dps = Enzyme.make_zero(ps)
Enzyme.autodiff(Reverse, loss_function, Active, Duplicated(y, dy),
Const(model), Duplicated(ps, dps), Const(st), Duplicated(x, dx))
@show dx
@show dpsend
I am now getting:
1-element ExceptionStack:
Return type `Nothing` not marked Const, but type is guaranteed to be constant
Stacktrace:
[1] error(s::String)
@ Base ./error.jl:35
[2] thunkbase(ctx::LLVM.Context, mi::Core.MethodInstance, ::Val{0x0000000000007bf5}, ::Type{Const{typeof(loss_function)}}, ::Type{Active}, tt::Type{Tuple{Duplicated{CuArray{Float32, 2, CUDA.DeviceMemory}}, ***@***.***{layer_1::Dense{typeof(tanh), Int64, Int64, Nothing, Nothing, Static.True}, layer_2::Dense{typeof(identity), Int64, Int64, Nothing, Nothing, Static.True}}, Nothing}}, ***@***.******@***.***{weight::CuArray{Float32, 2, CUDA.DeviceMemory}, bias::CuArray{Float32, 1, CUDA.DeviceMemory}}, ***@***.***{weight::CuArray{Float32, 2, CUDA.DeviceMemory}, bias::CuArray{Float32, 1, CUDA.DeviceMemory}}}}, ***@***.******@***.***{}, ***@***.***{}}}, Duplicated{CuArray{Float32, 2, CUDA.DeviceMemory}}}}, ::Val{Enzyme.API.DEM_ReverseModeCombined}, ::Val{1}, ::Val{(false, false, false, false, false, false)}, ::Val{false}, ::Val{false}, ::Type{FFIABI}, ::Val{true})
@ Enzyme.Compiler /mnt/.julia/packages/Enzyme/TiboG/src/compiler.jl:7328
[3] #s2055#19000
@ /mnt/.julia/packages/Enzyme/TiboG/src/compiler.jl:7407 [inlined]
[4] var"#s2055#19000"(FA::Any, A::Any, TT::Any, Mode::Any, ModifiedBetween::Any, width::Any, ReturnPrimal::Any, ShadowInit::Any, World::Any, ABI::Any, ErrIfFuncWritten::Any, ::Any, ::Type, ::Type, ::Type, tt::Any, ::Type, ::Type, ::Type, ::Type, ::Type, ::Type, ::Any)
@ Enzyme.Compiler ./none:0
[5] (::Core.GeneratedFunctionStub)(::UInt64, ::LineNumberNode, ::Any, ::Vararg{Any})
@ Core ./boot.jl:602
[6] autodiff
@ /mnt/.julia/packages/Enzyme/TiboG/src/Enzyme.jl:315 [inlined]
[7] autodiff(::ReverseMode{false, FFIABI, false, false}, ::typeof(loss_function), ::Type{Active}, ::Duplicated{CuArray{Float32, 2, CUDA.DeviceMemory}}, ***@***.***{layer_1::Dense{typeof(tanh), Int64, Int64, Nothing, Nothing, Static.True}, layer_2::Dense{typeof(identity), Int64, Int64, Nothing, Nothing, Static.True}}, Nothing}}, ***@***.******@***.***{weight::CuArray{Float32, 2, CUDA.DeviceMemory}, bias::CuArray{Float32, 1, CUDA.DeviceMemory}}, ***@***.***{weight::CuArray{Float32, 2, CUDA.DeviceMemory}, bias::CuArray{Float32, 1, CUDA.DeviceMemory}}}}, ***@***.******@***.***{}, ***@***.***{}}}, ::Duplicated{CuArray{Float32, 2, CUDA.DeviceMemory}})
@ Enzyme /mnt/.julia/packages/Enzyme/TiboG/src/Enzyme.jl:332
[8] top-level scope
@ REPL[19]:7
[9] top-level scope
@ none:1
—
Reply to this email directly, view it on GitHub
<#1442 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAJTUXBQAXY3AOJLFSQCMJTZVWUWHAVCNFSM6AAAAABHUOUU4SVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDGMZYGI2TCMBRGY>
.
You are receiving this because you commented.Message ID: <EnzymeAD/Enzyme.
***@***.***>
|
Yeah nvm it should have been ERROR:
No augmented forward pass found for cublasLtMatmulDescCreate
at context: %133 = call i32 @cublasLtMatmulDescCreate(i64 %bitcast_coercion, i32 %unbox32, i32 0) #916 [ "jl_roots"({} addrspace(10)* %126) ], !dbg !1609
Stacktrace:
[1] macro expansion
@ /mnt/.julia/packages/CUDA/Tl08O/lib/utils/call.jl:218
[2] macro expansion
@ /mnt/.julia/packages/CUDA/Tl08O/lib/cublas/libcublasLt.jl:400
[3] #1158
@ /mnt/.julia/packages/CUDA/Tl08O/lib/utils/call.jl:35
[4] retry_reclaim
@ /mnt/.julia/packages/CUDA/Tl08O/src/memory.jl:434
[5] check
@ /mnt/.julia/packages/CUDA/Tl08O/lib/cublas/libcublas.jl:24
[6] cublasLtMatmulDescCreate
@ /mnt/.julia/packages/CUDA/Tl08O/lib/utils/call.jl:34
[7] cublaslt_matmul_fused!
@ /mnt/.julia/packages/LuxLib/xXJ6n/ext/LuxLibCUDAExt/cublaslt.jl:62
Stacktrace:
[1] macro expansion
@ /mnt/.julia/packages/CUDA/Tl08O/lib/utils/call.jl:218 [inlined]
[2] macro expansion
@ /mnt/.julia/packages/CUDA/Tl08O/lib/cublas/libcublasLt.jl:400 [inlined]
[3] #1158
@ /mnt/.julia/packages/CUDA/Tl08O/lib/utils/call.jl:35 [inlined]
[4] retry_reclaim
@ /mnt/.julia/packages/CUDA/Tl08O/src/memory.jl:434 [inlined]
[5] check
@ /mnt/.julia/packages/CUDA/Tl08O/lib/cublas/libcublas.jl:24 [inlined]
[6] cublasLtMatmulDescCreate
@ /mnt/.julia/packages/CUDA/Tl08O/lib/utils/call.jl:34 [inlined]
[7] cublaslt_matmul_fused!
@ /mnt/.julia/packages/LuxLib/xXJ6n/ext/LuxLibCUDAExt/cublaslt.jl:62
[8] cublaslt_matmul_fused!
@ /mnt/.julia/packages/LuxLib/xXJ6n/ext/LuxLibCUDAExt/cublaslt.jl:13 [inlined]
[9] cublasLt_fused_dense!
@ /mnt/.julia/packages/LuxLib/xXJ6n/ext/LuxLibCUDAExt/cublaslt.jl:195
[10] cublasLt_fused_dense!
@ /mnt/.julia/packages/LuxLib/xXJ6n/ext/LuxLibCUDAExt/cublaslt.jl:193 [inlined]
[11] matmuladd!
@ /mnt/.julia/packages/LuxLib/xXJ6n/src/impl/matmul.jl:64 [inlined]
[12] matmuladd
@ /mnt/.julia/packages/LuxLib/xXJ6n/src/impl/matmul.jl:23 [inlined]
[13] matmuladd
@ /mnt/.julia/packages/LuxLib/xXJ6n/src/impl/matmul.jl:7 [inlined]
[14] fused_dense
@ /mnt/.julia/packages/LuxLib/xXJ6n/src/impl/dense.jl:6 [inlined]
[15] fused_dense_bias_activation
@ /mnt/.julia/packages/LuxLib/xXJ6n/src/api/dense.jl:30 [inlined]
[16] Dense
@ /mnt/research/lux/Lux.jl/src/layers/basic.jl:344 [inlined]
[17] apply
@ /mnt/.julia/packages/LuxCore/3RH53/src/LuxCore.jl:155 [inlined]
[18] macro expansion
@ /mnt/research/lux/Lux.jl/src/layers/containers.jl:0 [inlined]
[19] applychain
@ /mnt/research/lux/Lux.jl/src/layers/containers.jl:482 [inlined]
[20] Chain
@ /mnt/research/lux/Lux.jl/src/layers/containers.jl:480 [inlined]
[21] loss_function
@ ./REPL[8]:2 [inlined]
[22] loss_function
@ ./REPL[8]:0 [inlined]
[23] diffejulia_loss_function_5796_inner_1wrap
@ ./REPL[8]:0
[24] macro expansion
@ /mnt/.julia/packages/Enzyme/TiboG/src/compiler.jl:7187 [inlined]
[25] enzyme_call
@ /mnt/.julia/packages/Enzyme/TiboG/src/compiler.jl:6794 [inlined]
[26] CombinedAdjointThunk
@ /mnt/.julia/packages/Enzyme/TiboG/src/compiler.jl:6671 [inlined]
[27] autodiff
@ /mnt/.julia/packages/Enzyme/TiboG/src/Enzyme.jl:320 [inlined]
[28] autodiff(::ReverseMode{false, FFIABI, false, false}, ::typeof(loss_function), ::Type{Const}, ::Duplicated{CuArray{…}}, ::Const{Chain{…}}, ::Duplicated{@NamedTuple{…}}, ::Const{@NamedTuple{…}}, ::Duplicated{CuArray{…}})
@ Enzyme /mnt/.julia/packages/Enzyme/TiboG/src/Enzyme.jl:332
[29] top-level scope
@ REPL[9]:7
[30] top-level scope
@ none:1
Some type information was truncated. Use `show(err)` to see complete types. I will reopen the LuxLib issue, I missed this dispatch. You can close this issue if you want. |
Fails with
and a very long LLVM dump.
Details
JIT session error: Symbols not found: [ cuMemsetD8 ]
JIT session error: Symbols not found: [ cuMemsetD8 ]
JIT session error: Symbols not found: [ cuMemsetD8 ]
JIT session error: Symbols not found: [ cuMemsetD8 ]
JIT session error: Symbols not found: [ cuMemsetD8 ]
JIT session error: Symbols not found: [ cuMemsetD8 ]
┌ Warning: The Pkg REPL mode is intended for interactive use only, and should not be used from scripts. It is recommended to use the functional API instead.
└ @ Pkg.REPLMode ~/.julia/juliaup/julia-1.10.3+0.x64.linux.gnu/share/julia/stdlib/v1.10/Pkg/src/REPLMode/REPLMode.jl:382
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10), {} addrspace(10)** inttoptr (i64 138527901362528 to {} addrspace(10)**) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10), {} addrspace(10)** inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
┌ Error: Found null pointer at
│
│ Stacktrace:
│ [1] get_repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:63
│ [2] repl_frontend_task
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:56
│ [3] default_scalar_indexing
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:75
│ [4] assertscalar
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:105
│ [5] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:48
│ [6] scalar_getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:34
│ [7] _getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:17
│ [8] getindex
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/indexing.jl:15
│ [9] macro expansion
│ @ ~/.julia/packages/GPUArraysCore/GMsgk/src/GPUArraysCore.jl:210
│ [10] #_mapreduce#42
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:71
│ [11] _mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:33
│ [12] mapreduce
│ @ ~/.julia/packages/GPUArrays/OKkAu/src/host/mapreduce.jl:28
│ [13] _sum
│ @ ./reducedim.jl:1015
│ [14] sum
│ @ ./reducedim.jl:1011
│ [15] loss_function
│ @ /mnt/research/ongoing/lux/enzyme_cuda.jl:9
│ arg = %active_repl.checked = load atomic {} addrspace(10)*, {} addrspace(10) inttoptr (i64 138527901362528 to {} addrspace(10)**) unordered, align 8, !dbg !439, !tbaa !440, !alias.scope !431, !noalias !434
└ @ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/absint.jl:121
ERROR: LoadError: LLVM error: Failed to materialize symbols: { (JuliaExternal, { libname_cuCtxSetCurrent_40759, libname_cuMemHostAlloc_38650, libname_cuDeviceCanAccessPeer_41778, libname_cuCtxSetCurrent_40231, libname_cuStreamSynchronize_40575, libname_cuCtxSetCurrent_41988, libname_cuStreamGetCaptureInfo_41984, libname_cuCtxGetId_39658, libname_cublasGetProperty_39361, libname_cuMemPoolCreate_40279, libname_cuCtxPopCurrent_v2_40732, libname_cuCtxSetCurrent_41463, libname_cuMemAllocFromPoolAsync_38536, libname_cuMemPoolCreate_41596, libname_cuStreamQuery_41973, libname_cuCtxGetId_37318, libname_cuCtxPushCurrent_v2_40729, libname_cuMemPoolTrimTo_39311, libname_cuMemPoolSetAttribute_40353, libname_cublasSsymm_v2_64_39307, libname_cuDeviceGetAttribute_40201, libname_cuMemPoolCreate_40364, libname_cudaRuntimeGetVersion_39775, libname_cuCtxGetCurrent_40830, libname_cuMemAlloc_v2_38538, libname_cuMemsetD32Async_39068, libname_cuMemsetD32Async_39556, libname_cuCtxSetCurrent_40756, libname_cuMemPoolTrimTo_38488, libname_cuDeviceGet_41449, libname_cuStreamQuery_40740, libname_cuStreamGetCaptureInfo_40752, libname_cuStreamSynchronize_37225, libname_cuDeviceGetName_41015, libname_cuCtxSynchronize_38500, libname_cuCtxGetDevice_41967, libname_cuStreamCreate_41741, libname_cuMemGetInfo_v2_38102, libname_cuMemPoolGetAttribute_39590, libname_cuCtxGetCurrent_37272, libname_cuCtxGetApiVersion_41478, libname_cuCtxGetId_41476, libname_cuDeviceSetMemPool_38018, libname_cuLaunchKernel_38608, libname_cuOccupancyMaxPotentialBlockSize_38792, libname_cublasSgemm_v2_64_39435, libname_cuStreamSynchronize_37220, libname_cuMemGetInfo_v2_40442, libname_cuStreamSynchronize_41807, libname_cuDeviceGet_37877, libname_cuMemAlloc_v2_38530, libname_cuMemcpyDtoHAsync_v2_38437, libname_cuDevicePrimaryCtxRetain_37310, libname_cuDeviceCanAccessPeer_38375, libname_cuCtxSynchronize_38494, libname_cublasGetProperty_39352, libname_cuCtxGetDevice_38330, libname_cuPointerGetAttribute_38405, libname_cuStreamCreate_38169, libname_cuStreamCreate_40509, libname_cuStreamSynchronize_40580, libname_cuCtxSetCurrent_41991, libname_cuMemPoolTrimTo_39526, libname_cuMemPoolCreate_38024, libname_cuDeviceGet_40217, libname_cuDeviceGetAttribute_41433, libname_cuCtxSetCurrent_38409, libname_cuCtxGetCurrent_40285, libname_cuMemPoolTrimTo_39511, libname_cuMemPoolTrimTo_39395, libname_cuStreamQuery_41972, libname_cuCtxGetCurrent_37945, libname_cublasSscal_v2_39494, libname_cuMemPoolTrimTo_39462, libname_cuDeviceGetName_37443, libname_cuDeviceGetCount_39613, libname_cublasSgemm_v2_39442, libname_cuCtxSetCurrent_37891, libname_cuMemAllocAsync_38528, libname_cuMemPoolSetAttribute_38013, libname_cuCtxPushCurrent_v2_41961, libname_cuMemcpyDtoDAsync_v2_38454, libname_cuMemcpyHtoDAsync_v2_38348, libname_cuDeviceGetCount_37287, libname_cublasSetStream_v2_39406, libname_cuMemsetD32Async_40788, libname_cuStreamQuery_38336, libname_cuMemGetInfo_v2_41674, libname_cuMemPoolSetAccess_41767, libname_cuStreamSynchronize_41812, libname_cuMemPoolTrimTo_39365, libname_cuCtxGetDevice_40735, libname_cuCtxSetCurrent_38412, libname_cuStreamGetCaptureInfo_38400, libname_cuDeviceGetAttribute_37861, libname_cuStreamQuery_38335, libname_cuDeviceSetMemPool_40358, libname_cuMemPoolTrimTo_39480, libname_cuMemPoolSetAccess_38364, libname_cuCtxGetId_40890, libname_cublasSscal_v2_64_39523, libname_cudaRuntimeGetVersion_41007, libname_cuCtxGetId_37904, libname_cuMemPoolGetAttribute_37264, libname_cuDeviceGetCount_40845, libname_cublasSetMathMode_39329, libname_cuCtxGetCurrent_41517, libname_cuDeviceGetName_39783, libname_cuDevicePrimaryCtxRetain_40882, libname_cublasSsymm_v2_39283, libname_cublasSgemm_v2_39457, libname_cublasSscal_v2_39507, diffejulia_loss_function_37136wrap, libname_cublasSsymm_v2_64_39261, libname_cuMemPoolCreate_37939, libname_cuCtxPopCurrent_v2_38327, libname_cuMemPoolSetAccess_40535, libname_cuMemPoolCreate_41511, libname_cudaRuntimeGetVersion_37435, libname_cublasSscal_v2_64_39490, libname_cuMemPoolGetAttribute_40822, libname_cuMemPoolSetAttribute_41585, libname_cuMemPoolTrimTo_39288, libname_cuStreamQuery_40741, libname_cublasSetStream_v2_39392, libname_cuCtxGetApiVersion_40246, libname_cublasSetStream_v2_39382, libname_cuCtxSetCurrent_40832, libname_cuCtxGetId_40244, libname_cuCtxSetCurrent_39600, libname_cuDeviceCanAccessPeer_40546, libname_cuMemcpyPeerAsync_38458, libname_cuCtxPopCurrent_v2_41964, libname_cuCtxGetApiVersion_37906, libname_cublasSetMathMode_39340, libname_cuCtxGetCurrent_39598, libname_cublasSsymm_v2_39268, libname_cublasSgemm_v2_64_39476, libname_cuCtxSetCurrent_37274, libname_cuLaunchCooperativeKernel_38605, libname_cuDevicePrimaryCtxRetain_39650, libname_cuDeviceSetMemPool_41590, libname_cuMemPoolTrimTo_39343, libname_cuCtxPushCurrent_v2_38324 }) }
Stacktrace:
[1] macro expansion
@ ~/.julia/packages/LLVM/bzSzE/src/executionengine/utils.jl:32 [inlined]
[2] lookup
@ ~/.julia/packages/LLVM/bzSzE/src/orc.jl:434 [inlined]
[3] lookup
@ ~/.julia/packages/LLVM/bzSzE/src/orc.jl:433 [inlined]
[4] lookup
@ ~/.julia/packages/Enzyme/NVk8T/src/compiler/orcv2.jl:255 [inlined]
[5] _link(job::GPUCompiler.CompilerJob{…}, ::Tuple{…})
@ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/compiler.jl:5751
[6] cached_compilation
@ ~/.julia/packages/Enzyme/NVk8T/src/compiler.jl:5811 [inlined]
[7] (::Enzyme.Compiler.var"#562#563"{…})(ctx::LLVM.Context)
@ Enzyme.Compiler ~/.julia/packages/Enzyme/NVk8T/src/compiler.jl:5876
[8] JuliaContext(f::Enzyme.Compiler.var"#562#563"{…}; kwargs::@kwargs{})
@ GPUCompiler ~/.julia/packages/GPUCompiler/kqxyC/src/driver.jl:52
[9] JuliaContext(f::Function)
@ GPUCompiler ~/.julia/packages/GPUCompiler/kqxyC/src/driver.jl:42
[10] #s2025#561
@ ~/.julia/packages/Enzyme/NVk8T/src/compiler.jl:5828 [inlined]
[11]
@ Enzyme.Compiler ./none:0
[12] (::Core.GeneratedFunctionStub)(::UInt64, ::LineNumberNode, ::Any, ::Vararg{Any})
@ Core ./boot.jl:602
[13] autodiff
@ ~/.julia/packages/Enzyme/NVk8T/src/Enzyme.jl:286 [inlined]
[14] autodiff(::ReverseMode{…}, ::typeof(loss_function), ::Type{…}, ::Const{…}, ::Duplicated{…}, ::Duplicated{…}, ::Duplicated{…})
@ Enzyme ~/.julia/packages/Enzyme/NVk8T/src/Enzyme.jl:303
[15] top-level scope
@ /mnt/research/ongoing/lux/enzyme_cuda.jl:17
Some type information was truncated. Use
show(err)
to see complete types.in expression starting at /mnt/research/ongoing/lux/enzyme_cuda.jl:12
The text was updated successfully, but these errors were encountered: