Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

👓 feat: Vision Support for Assistants #2195

Merged
merged 5 commits into from
Mar 25, 2024
Merged

Conversation

danny-avila
Copy link
Owner

@danny-avila danny-avila commented Mar 25, 2024

Summary

Introduces vision support for Assistants.

This may end up being natively supported by OpenAI but wanted to introduce at least an alternative method, as will be done for retrieval.

Their docs state they intend to do this at some point:

Messages can contain text, images, or files. At the moment, user-created Messages cannot contain image files but we plan to add support for this in the future. source

In anticipation of this, I've name the tool image_vision as opposed to just vision

Todo's (in another PR)

  • Tooltips over Assistant features to better explain them
  • Azure Support (may or may not work at the moment depending on config, needs robust approach)
  • Allow uploading images both to Host and OpenAI (for Code Interpreter)
    • Still deliberating how this might work

Other changes

  • Fixed a React key error that only occurred when the server was shut down before an assistant run could be initialized (edge case)
  • Refactors some of the handling of necessary API calls per-initialization of Assistant run in parallel now

Change Type

  • New feature (non-breaking change which adds functionality)

Checklist

  • My code adheres to this project's style guidelines
  • I have performed a self-review of my own code
  • I have commented in any complex areas of my code
  • I have made pertinent documentation changes
  • My changes do not introduce new warnings
  • I have written tests demonstrating that my changes are effective or that my feature works
  • Local unit tests pass with my changes
  • Any changes dependent on mine have been merged and published in downstream modules.
  • New documents have been locally validated with mkdocs

@danny-avila danny-avila merged commit 798e876 into main Mar 25, 2024
3 checks passed
@danny-avila danny-avila deleted the vision-assistants branch March 25, 2024 03:43
jinzishuai pushed a commit to aitok-ai/LibreChat that referenced this pull request May 20, 2024
* refactor(assistants/chat): use promises to speed up initialization, initialize shared variables, include `attachedFileIds` to streamRunManager

* chore: additional typedefs

* fix(OpenAIClient): handle edge case where attachments promise is resolved

* feat: createVisionPrompt

* feat: Vision Support for Assistants
kenshinsamue pushed a commit to intelequia/LibreChat that referenced this pull request Sep 17, 2024
* refactor(assistants/chat): use promises to speed up initialization, initialize shared variables, include `attachedFileIds` to streamRunManager

* chore: additional typedefs

* fix(OpenAIClient): handle edge case where attachments promise is resolved

* feat: createVisionPrompt

* feat: Vision Support for Assistants
BertKiv pushed a commit to BertKiv/LibreChat that referenced this pull request Dec 10, 2024
* refactor(assistants/chat): use promises to speed up initialization, initialize shared variables, include `attachedFileIds` to streamRunManager

* chore: additional typedefs

* fix(OpenAIClient): handle edge case where attachments promise is resolved

* feat: createVisionPrompt

* feat: Vision Support for Assistants
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant