Skip to content

[Bugfix] Remove hardcoded head_size=256 for Deepseek v2 and v3 (#12… #31

[Bugfix] Remove hardcoded head_size=256 for Deepseek v2 and v3 (#12…

[Bugfix] Remove hardcoded head_size=256 for Deepseek v2 and v3 (#12… #31