Skip to content

Commit

Permalink
Improve accuracy of vecdot routines
Browse files Browse the repository at this point in the history
  • Loading branch information
cafaxo committed Jul 18, 2024
1 parent 3d07db9 commit 587d270
Show file tree
Hide file tree
Showing 9 changed files with 409 additions and 868 deletions.
7 changes: 1 addition & 6 deletions src/Llama2.jl
Original file line number Diff line number Diff line change
Expand Up @@ -14,12 +14,7 @@ export load_gguf_model, load_karpathy_model, encode, sample
export train

# quantization
include("quantization/utils.jl")
include("quantization/common.jl")
include("quantization/q4.jl")
include("quantization/q6.jl")
include("quantization/q8.jl")
include("quantization/vecdot.jl")
include("kquants.jl")

# inference
include("tokenizer.jl")
Expand Down
Loading

0 comments on commit 587d270

Please sign in to comment.