Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Batch Queueing #761

Merged
Merged

Conversation

martindevans
Copy link
Member

Added a batch queue to BatchedExecutor, allowing an arbitrary amount of work to be queued up in the executor and run in batches. This is helpful when prompting with very large numbers of tokens (e.g. a 2000 character system prompt would automatically be queued up in 4x512 batches).

This is a pre-requisite for embeddings prompting (i.e. llava), which needs an entirely new type of item in the batch queue. Embeddings usually also come in large numbers (e.g. one llava image might be hundreds of items), so the ability to queue up lots of work is valuable!

…t of work to be queued up in the executor and run in batches. This is helpful when prompting with very large numbers of tokens (e.g. a 2000 character system prompt would automatically be queued up in 4x512 batches).

This is a pre-requisite for embeddings prompting (i.e. llava), which needs an entirely new type of item in the batch queue.
@martindevans martindevans merged commit be2c4fe into SciSharp:master May 29, 2024
6 checks passed
@martindevans martindevans deleted the batched_executor_queueing branch May 29, 2024 22:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant