Image Input #134

Legerdo · 2024-03-29T07:04:26Z

Describe the feature

hello.
llamafile seems to have image input functions such as jpg/png/gif/bmp.

Example)
llamafile -ngl 9999 --temp 0
--image ~/Pictures/lemurs.jpg
-m llava-v1.5-7b-Q4_K.gguf
--mmproj llava-v1.5-7b-mmproj-Q4_0.gguf
-e -p '### User: What do you see?\n### Assistant: '
--no-display-prompt 2>/dev/null

Is it possible to implement this feature in the future?
Or is there some problem that makes it impossible?

amakropoulos · 2024-03-29T08:01:18Z

hi, thanks for the request!
that should be feasible.
how would you like to use it / see it inside Unity?

Legerdo · 2024-03-29T08:33:17Z

Eventually, I would like to add the ability to describe to the user what the NPC character's camera (eyes) sees.

I haven't tested this against the local vision model yet, so I don't know to what extent it's possible, but it would be interesting if it were!

amakropoulos · 2024-09-04T17:07:51Z

I implemented most of the functionality in this branch: feature/multimodal_models
and I afterwards figured out that multimodal support has been dropped from the llama.cpp server and not brought back for the last months: ggerganov/llama.cpp#8010 😞

Legerdo added the enhancement New feature or request label Mar 29, 2024

amakropoulos mentioned this issue Jul 11, 2024

Adding New Features to LLMUnity #149

Closed

amakropoulos added this to the v2.2.1 milestone Aug 27, 2024

amakropoulos added this to LLM for Unity Roadmap Aug 27, 2024

amakropoulos moved this to Todo in LLM for Unity Roadmap Aug 27, 2024

amakropoulos moved this from Todo to In Progress in LLM for Unity Roadmap Sep 4, 2024

amakropoulos moved this from In Progress to Blocked in LLM for Unity Roadmap Sep 4, 2024

amakropoulos removed this from the v2.2.1 milestone Sep 4, 2024

amakropoulos added the llama.cpp label Nov 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Image Input #134

Image Input #134

Legerdo commented Mar 29, 2024

amakropoulos commented Mar 29, 2024

Legerdo commented Mar 29, 2024

amakropoulos commented Sep 4, 2024

Image Input #134

Image Input #134

Comments

Legerdo commented Mar 29, 2024

Describe the feature

amakropoulos commented Mar 29, 2024

Legerdo commented Mar 29, 2024

amakropoulos commented Sep 4, 2024