The avo-generated code seems to load one argument twice:
    // func mulAvxTwo_1x1_64(matrix []byte, in [][]byte, out [][]byte, start int, n int)
    // Requires: AVX, AVX2, SSE2
    TEXT ·mulAvxTwo_1x1_64(SB), $0-88
        // Loading all tables to registers
        // Destination kept in GP registers
        // Full registers estimated 10 YMM used
        MOVQ    n+80(FP), AX
        MOVQ    matrix_base+0(FP), CX
        SHRQ    $0x06, AX
        TESTQ   AX, AX
        JZ      mulAvxTwo_1x1_64_end
        VMOVDQU (CX), Y0
        VMOVDQU 32(CX), Y1
        MOVQ    in_base+24(FP), CX
        MOVQ    (CX), CX
        MOVQ    out_base+48(FP), DX
        MOVQ    out_base+48(FP), DX // <-----
        MOVQ    (DX), DX
        MOVQ    start+72(FP), BX
Remove double load on 64 byte loop init
7e30574
Fixes #278
Remove double load on 64 byte loop init (#279)
ea0a106