Wrong result with @avx and vector transpose #117

baggepinnen · 2020-05-20T07:43:28Z

I think @avx gets confused by the following code, and produces the wrong result.

function avx_problem(x::AbstractVector{T}, m) where T
    N = length(x)
    n = N-m+1
    ent = zeros(T, N)
    x = x'
    @avx for i = 1:N
        for j = 1:size(x,1)
            ent[i] += x[j,i]*log(x[j,i])
        end
    end
    ent
end

function no_avx_problem(x::AbstractVector{T}, m) where T
    N = length(x)
    n = N-m+1
    ent = zeros(T, N)
    x = x'
    for i = 1:N
        for j = 1:size(x,1)
            ent[i] += x[j,i]*log(x[j,i])
        end
    end
    ent
end

x = rand(12);
e1 = avx_problem(x, 4);
e2 = no_avx_problem(x, 4);

julia> e1 ≈ e2
false

julia> e1
12-element Array{Float64,1}:
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445
 -0.34455632464675445

julia> e2
12-element Array{Float64,1}:
 -0.34455632464675445
 -0.08578599139751969
 -0.17945447308544002
 -0.3347521300765356
 -0.3667635023973011
 -0.1186417051096091
 -0.3148391305379354
 -0.1238397483230935
 -0.35429232069371336
 -0.3645463562552793
 -0.3641348970531969
 -0.34369736669015577

The text was updated successfully, but these errors were encountered:

chriselrod · 2020-05-20T08:07:40Z

Ah, thanks. It isn't respecting the transpose and thinks it is still a 1d vector.

chriselrod · 2020-05-23T05:04:46Z

Locally, I now have:

julia> using LoopVectorization
[ Info: Precompiling LoopVectorization [bdcacae8-1622-11e9-2a5c-532679323890]

julia> function avx_problem(x::AbstractVector{T}, m) where T
           N = length(x)
           n = N-m+1
           ent = zeros(T, N)
           x = x'
           @avx for i = 1:N
               for j = 1:size(x,1)
                   ent[i] += x[j,i]*log(x[j,i])
               end
           end
           ent
       end
avx_problem (generic function with 1 method)

julia> function no_avx_problem(x::AbstractVector{T}, m) where T
           N = length(x)
           n = N-m+1
           ent = zeros(T, N)
           x = x'
           for i = 1:N
               for j = 1:size(x,1)
                   ent[i] += x[j,i]*log(x[j,i])
               end
           end
           ent
       end
no_avx_problem (generic function with 1 method)

julia> x = rand(12);

julia> e1 = avx_problem(x, 4);

julia> e2 = no_avx_problem(x, 4);

julia> e1'
1×12 LinearAlgebra.Adjoint{Float64,Array{Float64,1}}:
 -0.331849  -0.367094  -0.359068  -0.244346  -0.0461393  -0.323758  -0.297252  -0.329479  -0.309549  -0.367815  -0.307403  -0.275631

julia> e2'
1×12 LinearAlgebra.Adjoint{Float64,Array{Float64,1}}:
 -0.331849  -0.367094  -0.359068  -0.244346  -0.0461393  -0.323758  -0.297252  -0.329479  -0.309549  -0.367815  -0.307403  -0.275631

I'll try and get this branch merged and make new releases within the next few hours.

The tricky bit was that transposed vectors have unit stride in two dimensions:

julia> e2'[1], e2'[2], e2'[3]
(-0.3318487689615286, -0.3670936384601229, -0.35906795855837714)

julia> e2'[1,1], e2'[1,2], e2'[1,3]
(-0.3318487689615286, -0.3670936384601229, -0.35906795855837714)

I originally only supported the former, treating it like a 1-d vector, ignoring higher indexes.

I still need to add proper support for this in LoopVectorization. The cost modeling isn't aware of two axes potentially being contiguous.
The simplest way to handle this would probably be to do a check,
and drop the first index if there are multiple indices.

chriselrod · 2020-05-24T12:49:03Z

Now fixed on master and tested.

chriselrod added a commit that referenced this issue May 24, 2020

Add test for now fixed-issues #116 and #117.

0dd833c

chriselrod mentioned this issue May 24, 2020

Revamp lowering of code to improve code gen #118

Merged

chriselrod closed this as completed May 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wrong result with @avx and vector transpose #117

Wrong result with @avx and vector transpose #117

baggepinnen commented May 20, 2020 •

edited

Loading

chriselrod commented May 20, 2020

chriselrod commented May 23, 2020

chriselrod commented May 24, 2020

Wrong result with @avx and vector transpose #117

Wrong result with @avx and vector transpose #117

Comments

baggepinnen commented May 20, 2020 • edited Loading

chriselrod commented May 20, 2020

chriselrod commented May 23, 2020

chriselrod commented May 24, 2020

baggepinnen commented May 20, 2020 •

edited

Loading