txt+image as input generating image #144

Youngon · 2025-02-06T09:21:48Z

Could this model be used in such situation, user input a prompt along with a image, generate a result image.
for example:
prompt[generate a image suck like the input image] + image[IMG.png] -> generated image

MqLeet · 2025-02-08T09:26:51Z

I also wonder whether Janus-Pro-7B has this ability!

Youngon · 2025-02-08T09:47:50Z

I have changed the pipeline and can do "txt+img -> img" work, but the model doesn't work well, so looking up to the upgrade from janus.

MqLeet · 2025-02-08T09:49:54Z

I have changed the pipeline and can do "txt+img -> img" work, but the model doesn't work well, so looking up to the upgrade from janus.

Hi @Youngon , could you please share your modified "txt+img -> img" pipeline and corresponding preprocess codes? Thank you!🤗

Youngon · 2025-02-08T09:52:50Z

I checked my pipeline, if masking the image embeddings, it can generate image according to the prompt; while I add unstanding part, can successfully unstand the content of input image.

so the model doesn't have the abblity to do "txt + img -> img" task.

Youngon · 2025-02-08T11:57:27Z

I have changed the pipeline and can do "txt+img -> img" work, but the model doesn't work well, so looking up to the upgrade from janus.

Hi @Youngon , could you please share your modified "txt+img -> img" pipeline and corresponding preprocess codes? Thank you!🤗

it does not work at all, the model has no abblity to use input image generating new image.

MqLeet · 2025-02-08T13:13:22Z

alright, thanks for your time! 🥰

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

txt+image as input generating image #144

txt+image as input generating image #144

Youngon commented Feb 6, 2025 •

edited

Loading

MqLeet commented Feb 8, 2025

Youngon commented Feb 8, 2025

MqLeet commented Feb 8, 2025

Youngon commented Feb 8, 2025

Youngon commented Feb 8, 2025 •

edited

Loading

MqLeet commented Feb 8, 2025

txt+image as input generating image #144

txt+image as input generating image #144

Comments

Youngon commented Feb 6, 2025 • edited Loading

MqLeet commented Feb 8, 2025

Youngon commented Feb 8, 2025

MqLeet commented Feb 8, 2025

Youngon commented Feb 8, 2025

Youngon commented Feb 8, 2025 • edited Loading

MqLeet commented Feb 8, 2025

Youngon commented Feb 6, 2025 •

edited

Loading

Youngon commented Feb 8, 2025 •

edited

Loading