[Accelerate model loading] Fix meta device and super low memory usage #1016
Conversation
The tests are currently failing on main.

Also, this PR renames cuda_with_minimal_gpu_usage to enable_sequential_cpu_offload, as it's a more fitting name, and disentangles enable_attention_slicing from cpu_offload.

Related original PR: #850

@piEsposito does this work for you?
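For context, a minimal sketch of what sequential CPU offload can look like when built on accelerate's cpu_offload hook. The submodule names (unet, text_encoder, vae) and the exact wiring are assumptions for illustration, not necessarily what this PR ships:

    # Sketch only: sequential CPU offload via accelerate's cpu_offload hook.
    # Assumes a pipeline exposing `unet`, `text_encoder`, and `vae` submodules.
    import torch
    from accelerate import cpu_offload

    def enable_sequential_cpu_offload(self):
        device = torch.device("cuda")
        # Each submodule stays in CPU RAM and is copied to the GPU only for
        # the duration of its own forward pass, then offloaded again.
        for module in [self.unet, self.text_encoder, self.vae]:
            if module is not None:
                cpu_offload(module, execution_device=device)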
@@ -487,71 +483,3 @@ def test_ddpm_ddim_equality_batched(self):

        # the values aren't exactly equal, but the images look the same visually
        assert np.abs(ddpm_images - ddim_images).max() < 1e-1

    @require_torch_gpu
    def test_stable_diffusion_accelerate_load_works(self):
this test doesn't do anything so let's delete it
The documentation is not available anymore as the PR was closed or merged.
thanks for fixing this, looks good to me!
@@ -119,14 +119,13 @@ def disable_attention_slicing(self):
        # set slice_size = `None` to disable `attention slicing`
        self.enable_attention_slicing(None)

-    def cuda_with_minimal_gpu_usage(self):
+    def enable_sequential_cpu_offload(self):
Great name choice!
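For readers following the rename, a hedged usage sketch of the new API (the attention-slicing call illustrates the disentangling mentioned in the description; exact arguments are illustrative):

    import torch
    from diffusers import StableDiffusionPipeline

    pipe = StableDiffusionPipeline.from_pretrained(
        "CompVis/stable-diffusion-v1-4", revision="fp16", torch_dtype=torch.float16
    )
    # Previously: pipe.cuda_with_minimal_gpu_usage()
    pipe.enable_sequential_cpu_offload()
    # Attention slicing is now an independent toggle:
    pipe.enable_attention_slicing()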
        pipeline_id = "CompVis/stable-diffusion-v1-4"

        start_time = time.time()
        pipeline_normal_load = StableDiffusionPipeline.from_pretrained(
            pipeline_id, revision="fp16", torch_dtype=torch.float16, use_auth_token=True
        )
        pipeline_normal_load.to(torch_device)
        normal_load_time = time.time() - start_time

        start_time = time.time()
        _ = StableDiffusionPipeline.from_pretrained(
            pipeline_id, revision="fp16", torch_dtype=torch.float16, use_auth_token=True, device_map="auto"
        )
        meta_device_load_time = time.time() - start_time

        assert 2 * meta_device_load_time < normal_load_time
Very cool!
@patrickvonplaten great naming choice, I love it!
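The speedup asserted in the test comes from meta-device initialization: with device_map="auto", accelerate first builds the model with parameters on the meta device (shapes and dtypes only, no allocation, no random init), then materializes real tensors straight from the checkpoint. A self-contained illustration, with an arbitrary layer size:

    import torch.nn as nn
    from accelerate import init_empty_weights

    # Under init_empty_weights, modules are created on the "meta" device:
    # no memory is allocated and no random initialization work is done.
    with init_empty_weights():
        big = nn.Linear(8192, 8192)

    print(big.weight.device)  # -> device(type='meta')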
…huggingface#1016) * [Accelerate model loading] Fix meta device and super low memory usage * better naming