
txt+image as input generating image #144

Open
Youngon opened this issue Feb 6, 2025 · 6 comments


Youngon commented Feb 6, 2025

Could this model be used in a situation where the user provides a prompt together with an image, and the model generates a result image?
For example:
prompt [generate an image like the input image] + image [IMG.png] -> generated image


MqLeet commented Feb 8, 2025

I also wonder whether Janus-Pro-7B has this ability!


Youngon commented Feb 8, 2025

I modified the pipeline so that it can run "txt+img -> img", but the model doesn't perform well on this task, so I'm looking forward to an upgrade from Janus.


MqLeet commented Feb 8, 2025

> I modified the pipeline so that it can run "txt+img -> img", but the model doesn't perform well on this task, so I'm looking forward to an upgrade from Janus.

Hi @Youngon, could you please share your modified "txt+img -> img" pipeline and the corresponding preprocessing code? Thank you! 🤗


Youngon commented Feb 8, 2025

I checked my pipeline. If I mask the image embeddings, it generates an image according to the prompt; if I add the understanding part, it successfully understands the content of the input image.

So the model doesn't have the ability to do the "txt + img -> img" task.
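
To make the masking experiment above concrete, here is a minimal toy sketch. It is not Janus code. It assumes the modified pipeline concatenates prompt embeddings and input-image embeddings into one sequence before generation; the helper name `mask_image_embeddings`, the `is_image` flags, and the tiny vectors are all hypothetical. Zeroing the image positions leaves only the prompt as conditioning, which is consistent with the observation that generation then follows the prompt alone.

```python
from typing import List

Vector = List[float]


def mask_image_embeddings(sequence: List[Vector], is_image: List[bool]) -> List[Vector]:
    """Return a copy of `sequence` with every image position replaced by zeros.

    Hypothetical helper for illustration only; it is not part of the Janus API.
    """
    if len(sequence) != len(is_image):
        raise ValueError("sequence and is_image must have the same length")
    return [
        [0.0] * len(vec) if image_flag else list(vec)
        for vec, image_flag in zip(sequence, is_image)
    ]


# Toy input (assumed layout): two text-token embeddings, then two image-token embeddings.
text_part = [[0.1, 0.2], [0.3, 0.4]]
image_part = [[0.9, 0.8], [0.7, 0.6]]
sequence = text_part + image_part
is_image = [False, False, True, True]

masked = mask_image_embeddings(sequence, is_image)
print(masked)
```

With the mask applied, the text positions are unchanged and the image positions carry no information, so any image produced downstream can depend only on the prompt.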


Youngon commented Feb 8, 2025

> > I modified the pipeline so that it can run "txt+img -> img", but the model doesn't perform well on this task, so I'm looking forward to an upgrade from Janus.
>
> Hi @Youngon, could you please share your modified "txt+img -> img" pipeline and the corresponding preprocessing code? Thank you! 🤗

It does not work at all; the model has no ability to use an input image to generate a new image.


MqLeet commented Feb 8, 2025

alright, thanks for your time! 🥰
