-
Notifications
You must be signed in to change notification settings - Fork 511
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
建议 支持 DeepSeek-R1 #2778
Comments
确实需要 |
Deepseek v3 和 r1 对资源的要求非常高。 我们会先支持 distill 模型。 |
This issue is stale because it has been open for 7 days with no activity. |
坐地等,有这些资源的公司也挺多 |
@qinxuye 建议优先支持DeepSeek-R1-GGUF。资源使用量下降不少。 |
https://unsloth.ai/blog/deepseekr1-dynamic This article explains how they shrink the size, which looks promising. I think it's worth trying. |
坐等啊, 需要多少资源, 列出来, 方便准备呀. |
我自己下载了gguf,但是不支持gpu推理吗,只有cpu在跑 |
Deepseek R1 使用vllm部署时需要至少2个节点,vllm原生提供了分布式部署的能力,但是目前看xinference的架构应该没有考虑到对vllm夸节点TP和PP的支持。估计短时间内很难支持vllm全量部署DeepSeek R1 了。 |
Feature request / 功能建议
建议 支持 DeepSeek-R1
Motivation / 动机
DeepSeek-R1 蒸馏小模型
Your contribution / 您的贡献
DeepSeek-R1 https://github.com/deepseek-ai/DeepSeek-R1
The text was updated successfully, but these errors were encountered: