[WebGPU] Fix unexpected device lost error when intentional dispose #17250

CharlieFRuan · 2024-08-06T20:39:19Z

This PR fixes an issue introduced in #17005. `deviceLostIsError` was introduced so that an intentional `dispose()` (and hence WebGPU's `destroy()`) does not cause the device lost callback to treat the loss as an error. However, we cannot set `deviceLostIsError` back to `true` immediately after calling `this.lib.dispose()` (which calls `device.destroy()`), because WebGPU is asynchronous: the device lost callback runs only after `dispose()` has returned. If the flag were already reset by then, the callback would report an error even though `Instance.dispose()` destroyed the device intentionally.
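The timing problem can be illustrated with a minimal sketch. It does not reproduce the TVMjs source: `FakeDevice`, `Owner`, `disposeBuggy`, `disposeFixed`, and `demo` are hypothetical names, and `FakeDevice` is only a stand-in for a WebGPU device whose `lost` promise resolves once `destroy()` is called. Only the `deviceLostIsError` flag name comes from this PR.

```typescript
// Hypothetical stand-in for a WebGPU device (not the real GPUDevice type).
type LostInfo = { reason: "destroyed" | "unknown" };

class FakeDevice {
  readonly lost: Promise<LostInfo>;
  private resolveLost: (info: LostInfo) => void = () => {};

  constructor() {
    this.lost = new Promise<LostInfo>((resolve) => {
      this.resolveLost = resolve;
    });
  }

  destroy(): void {
    // Resolving a promise never runs its callbacks synchronously;
    // they run later, after the current call stack has unwound.
    this.resolveLost({ reason: "destroyed" });
  }
}

// Hypothetical owner of the device, loosely modeled on the PR description.
class Owner {
  readonly device: FakeDevice;
  readonly reportedErrors: string[] = [];
  deviceLostIsError = true;

  constructor(device: FakeDevice) {
    this.device = device;
    this.device.lost.then((info) => {
      // The flag is read when the callback runs, not when destroy() is called.
      if (this.deviceLostIsError) {
        this.reportedErrors.push(`device lost: ${info.reason}`);
      }
    });
  }

  // Buggy variant: the flag is restored before the callback has had a chance to run.
  disposeBuggy(): void {
    this.deviceLostIsError = false;
    this.device.destroy();
    this.deviceLostIsError = true;
  }

  // Fixed variant: the flag stays false, so the later callback sees an intentional dispose.
  disposeFixed(): void {
    this.deviceLostIsError = false;
    this.device.destroy();
  }
}

async function demo(): Promise<{ buggyErrors: number; fixedErrors: number }> {
  const buggy = new Owner(new FakeDevice());
  const fixed = new Owner(new FakeDevice());
  buggy.disposeBuggy();
  fixed.disposeFixed();
  // Awaiting `lost` here resumes only after the callbacks registered in the constructors.
  await buggy.device.lost;
  await fixed.device.lost;
  return {
    buggyErrors: buggy.reportedErrors.length,
    fixedErrors: fixed.reportedErrors.length,
  };
}
```

In this sketch, `demo()` resolves to one reported error for the buggy variant and none for the fixed one. A related option, not necessarily what this PR does, is to inspect the `reason` field that WebGPU passes to the lost callback, which is `"destroyed"` when the device was destroyed deliberately.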

CharlieFRuan · 2024-08-07T01:43:54Z

@tvm-bot rerun

CharlieFRuan · 2024-08-07T06:31:27Z

@tvm-bot rerun

No breaking changes. The only diff is the following PR:

- mlc-ai/web-llm#525
  - This PR updates the engine `reload()` and `unload()` methods to allow users to abort an uncompleted `reload()` by either:
    - calling `unload()` any time before `reload()` completes
    - calling `reload()` again before the previous `reload()` completes
  - Besides, it fixes the previous issue where `device lost error` is raised unexpectedly when the user simply switches models

### TVMjs

- To support the above PR, TVMjs is updated and compiled at apache/tvm@1fcb620
- Difference:
  - Device lost error fix: apache/tvm#17250
  - Add AbortSignal to fetching APIs:
    - apache/tvm#17208
    - apache/tvm#17227
    - apache/tvm#17233

[WebGPU] Fix unexpected device lost error when intentional dispose

24ec320

CharlieFRuan mentioned this pull request Aug 6, 2024

[Engine] Allow manually aborting reload, fix unexpected deviceLostError mlc-ai/web-llm#525

Merged

tqchen approved these changes Aug 7, 2024

tqchen merged commit 1fcb620 into apache:main Aug 8, 2024
15 checks passed

CharlieFRuan mentioned this pull request Aug 8, 2024

[Version] Bump to version 0.2.54 mlc-ai/web-llm#530

Merged

ysh329 mentioned this pull request Oct 16, 2024

[Release] v0.18.0 Release Candidate Notes #17468

Closed
