Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bloom model integration #276

Open
Miezhiko opened this issue Aug 16, 2022 · 4 comments
Open

Bloom model integration #276

Miezhiko opened this issue Aug 16, 2022 · 4 comments

Comments

@Miezhiko
Copy link

maybe

there it is https://huggingface.co/bigscience/bloom

@guillaume-be
Copy link
Owner

Hello @Miezhiko ,

Thank you for the suggestion - I will look into it in the next few weeks. I will probably target the smaller models for an initial implementation as I don't have access to a machine which can load the full size model at the moment.

@danforbes
Copy link

What is the status of this effort?

@guillaume-be
Copy link
Owner

Hello @danforbes ,
I haven't had the chance to look into this yet, but would have some capacity starting next week. Is that something you would like to look into?

@danforbes
Copy link

Honestly, I would love to, but I'm afraid I would need a lot of help and may not be the best person for the job - I'm not a terribly experienced Rust developer, nor do I really understand all that much about LLMs 😅 At this point I would mostly describe myself as an enthusiast that happens to have a fair amount of programming experience 🤓 I recently authored this PR on the rustformers/llama-rs repository, which was a bit of a basic refactor and a lot of copy-pasta 😅 It did get me wondering if there are opportunities for that project to have some type of interaction with this one, though 🤔 That PR also has a fair amount of code related to BLOOM inference, so it may be helpful in integrating BLOOM with this project. Let me know if you have any thoughts on the above.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants