Skip to content

[Bugfix] Remove hardcoded head_size=256 for Deepseek v2 and v3 (#12… #298

[Bugfix] Remove hardcoded head_size=256 for Deepseek v2 and v3 (#12…

[Bugfix] Remove hardcoded head_size=256 for Deepseek v2 and v3 (#12… #298

Annotations

1 warning

mypy (3.11)

succeeded Jan 16, 2025 in 31s