Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Work On CPU #16

Open
ZepinLi opened this issue May 21, 2024 · 1 comment
Open

Work On CPU #16

ZepinLi opened this issue May 21, 2024 · 1 comment

Comments

@ZepinLi
Copy link

ZepinLi commented May 21, 2024

Hi Sequoia team,

Can this code framework fit in cpu devices? If so, how can we do it? Any insights?

Regards

@dreaming-panda
Copy link
Contributor

I do not think this can be used on CPU. CPU does not have such high FLOPS for speculative decoding, so even if the code can run on CPU, no speed up will be found, unless you talk about some very advanced CPUs (I have no ideas about them).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants