[SYCL] - Run hang with undefined temporary symbol error on Windows 11 #596
Comments
cc @airMeng for awareness. |
@airMeng FYI, it's still not working on my B580. I tried SD1.4 too, and I'm getting a different error this time:
|
@aahouzi The '-fno-sycl-id-queries-fit-in-int' message appears when the requested resolution is too large. If you remove the -W 1024 -H 1024 arguments, it should get further |
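To illustrate why a larger resolution can trigger this message: by default the DPC++ compiler assumes that SYCL id queries fit in a 32-bit `int`, so a kernel range whose dimensions or linear size exceed `INT_MAX` is rejected unless the code is built with `-fno-sycl-id-queries-fit-in-int`. The following is only a rough sketch of that check. The helper name `range_fits_in_int` is hypothetical, and the thread does not say which kernel ranges the SYCL backend actually launches, so any dimensions passed to it are assumptions.

```cpp
#include <climits>
#include <cstdint>

// Hypothetical helper (not part of any library): returns true when each
// dimension of a 3D range and the product of all three fit in a signed int.
bool range_fits_in_int(std::uint64_t d0, std::uint64_t d1, std::uint64_t d2) {
    const std::uint64_t limit = static_cast<std::uint64_t>(INT_MAX);
    if (d0 > limit || d1 > limit || d2 > limit) {
        return false;
    }
    // Check the running product before multiplying so it cannot overflow.
    std::uint64_t prod = d0;
    if (d1 != 0 && prod > limit / d1) {
        return false;
    }
    prod *= d1;
    if (d2 != 0 && prod > limit / d2) {
        return false;
    }
    return true;
}
```

With illustrative values, a 1024 x 1024 x 1024 range (2^30 work-items) passes the check, while 1024 x 1024 x 2048 (2^31) does not.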
@HeyItsBATMAN Thanks for the info. With SD1.4 it runs, but the GPU is still not fully utilized: only small peaks appear continuously, even though the image size is just 256x256. How long does it take on your Linux-based B580? FYI, here is the perf on my GPU. I still think it's too slow given the model size and image size:
|
@aahouzi Using It should print a line like this:
You can see that everything is loaded into VRAM for me. What does it say for you? |
@HeyItsBATMAN I already checked that, and my model is fully loaded in VRAM:
Not sure if PR #330 enabling the SYCL backend was even tested on Windows. Also, I just noticed that there is still a TODO task in this PR to add support for large image inputs, which explains the '-fno-sycl-id-queries-fit-in-int' issue. |
@aahouzi On client dGPUs, the driver generally imposes a 4GB limit. I tested SD 1.4 and SD 3.x on an Arc770 (16GB) and hit the same memory allocation issue. Let me look into possible workarounds. |
You can try to allocate several times like llama.cpp does. |
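As a rough sketch of the "allocate several times" idea: instead of requesting one large buffer, the request can be split into pieces that each stay under the per-allocation limit. The helper below only plans the chunk sizes and does not touch the device. The name `plan_chunks` and the limit passed to it are hypothetical, and this is not taken from llama.cpp's implementation.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical helper: split a request of total_bytes into chunk sizes that
// are each at most max_chunk_bytes. Each chunk would then be allocated
// separately on the device.
std::vector<std::uint64_t> plan_chunks(std::uint64_t total_bytes,
                                       std::uint64_t max_chunk_bytes) {
    assert(max_chunk_bytes > 0);  // a zero limit would never make progress
    std::vector<std::uint64_t> chunks;
    while (total_bytes > 0) {
        const std::uint64_t n =
            total_bytes < max_chunk_bytes ? total_bytes : max_chunk_bytes;
        chunks.push_back(n);
        total_bytes -= n;
    }
    return chunks;
}
```

For example, `plan_chunks(10, 4)` yields chunks of 4, 4 and 2 bytes. In a real backend each planned chunk would map to its own device allocation, and tensors would have to be placed so that none of them straddles a chunk boundary.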
Type of issue
Hardware
GPU Driver version
32.0.101.6559
OS
Windows 11