-
Notifications
You must be signed in to change notification settings - Fork 349
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
SD3/3.5 issues with new releases #479
Comments
Git bisect result: 1c168d9 is the first bad commit
CMakeLists.txt | 6 -- @Green-Sky Do you have a clue what change could be the root cause? (my tests were with fa disabled) |
Damn. Yea I refactored the attention. However, the non-flash-attn code should be equivalent. See the changes in ggml_extend.hpp . |
I think it's the issue with |
Hm, I wonder what went wrong there. I tried to unify the attention, and it worked for every model I tested... |
@Green-Sky I don't exactly know what I'm doing, but I just managed to fix it on on my end by just setting diff --git a/vae.hpp b/vae.hpp
index 0c7d84f..622a1d9 100644
--- a/vae.hpp
+++ b/vae.hpp
@@ -104,7 +104,7 @@ public:
v = ggml_reshape_3d(ctx, v, c, h * w, n); // [N, h * w, in_channels]
// h_ = ggml_nn_attention(ctx, q, k, v, false); // [N, h * w, in_channels]
- h_ = ggml_nn_attention_ext(ctx, q, k, v, 1, nullptr, false, true, false);
+ h_ = ggml_nn_attention_ext(ctx, q, k, v, 1, nullptr, false, false, false); |
Oh, thats nice, did you test other models too?
I honestly dont remember. I will do more test later. |
Flux Lite and SDXL seem to work, so i'd assume everything should be fine. |
Also the black image with SD3 on Vulkan isn't a VAE issue. I tried with #454, and the previews were also black. I'll try to see if I can figure out what is going on here. |
Ok, that is kinda funny. The SD3 issue is actually related to another issue I already reported in llama.cpp (ggml-org/llama.cpp#10434), and ggml-org/llama.cpp#10437 fixed it. Patching soft_max.comp the same way fixes it. |
Since the recent update , SD 3.5 generated images look a bit burned out compared to images generated with the exact same parameters with the previous versions. This happens both with CPU and Vulkan backends (can't test the others). Looks like a VAE problem to me, since the image composition is almost exactly the same.
Also sd3 images are pitch black when using to the more recent version of GGML on Vulkan (on CPU it is just burned in like with sd3.5).
The text was updated successfully, but these errors were encountered: