-
Notifications
You must be signed in to change notification settings - Fork 2.1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
txt+image as input generating image #144
Comments
I also wonder whether Janus-Pro-7B has this ability! |
I have changed the pipeline and can do "txt+img -> img" work, but the model doesn't work well, so looking up to the upgrade from janus. |
Hi @Youngon , could you please share your modified "txt+img -> img" pipeline and corresponding preprocess codes? Thank you!🤗 |
I checked my pipeline, if masking the image embeddings, it can generate image according to the prompt; while I add unstanding part, can successfully unstand the content of input image. so the model doesn't have the abblity to do "txt + img -> img" task. |
it does not work at all, the model has no abblity to use input image generating new image. |
alright, thanks for your time! 🥰 |
Could this model be used in such situation, user input a prompt along with a image, generate a result image.
for example:
prompt[generate a image suck like the input image] + image[IMG.png] -> generated image
The text was updated successfully, but these errors were encountered: