
[Cog] some minor fixes and nits #9466

Merged
merged 8 commits into from
Sep 23, 2024
Conversation

@sayakpaul (Member) commented Sep 19, 2024

What does this PR do?

  • Cleans up check_inputs()
  • Moves the generator check in prepare_latents() at the beginning of the method so that we can error as early as possible.
  • Documentation nits.
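The early-exit generator check described above can be sketched as a small standalone helper. This is an illustrative sketch, not the exact diffusers code; the function name and message wording are assumptions:

```python
def check_generator(generator, batch_size):
    """Fail fast when a list of generators doesn't match the batch size.

    A minimal sketch of the check prepare_latents() performs; running it
    at the top of the method means we error before any tensors are allocated.
    """
    if isinstance(generator, list) and len(generator) != batch_size:
        raise ValueError(
            f"You have passed a list of generators of length {len(generator)}, "
            f"but requested an effective batch size of {batch_size}. "
            "Make sure the batch size matches the length of the generators."
        )
```

A single generator (or `None`) always passes; only a mismatched list raises.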

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@a-r-r-o-w (Member) left a comment

Oops 😬 So, we don't have a test that passes both prompt embeds and negative prompt embeds? I believe that would have caught this, no? It's surprising, and very lucky, that it was working for the normal prompt inputs so far.
Apologies for missing this on my part.

@sayakpaul sayakpaul removed the request for review from yiyixuxu September 19, 2024 04:37
@sayakpaul sayakpaul marked this pull request as draft September 19, 2024 04:37
@sayakpaul (Member, Author)

So, we don't have a test that passes both prompt embeds and negative prompt embeds? I believe that would have caught this, no?

I believe that could be added, but the test would be about checking that an error is thrown when both prompt and prompt_embeds are passed, along with other adjacent cases.
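Such a test could look roughly like the sketch below. The simplified check_inputs() here is a stand-in, not the actual pipeline method, which validates many more arguments:

```python
def check_inputs(prompt=None, prompt_embeds=None):
    # Simplified stand-in for the pipeline's check_inputs(); the real
    # method also validates heights, callback keys, negative prompts, etc.
    if prompt is not None and prompt_embeds is not None:
        raise ValueError(
            "Cannot forward both `prompt` and `prompt_embeds`. "
            "Please make sure to only forward one of the two."
        )


def test_raises_on_both_prompt_and_embeds():
    # Passing both inputs must be rejected.
    try:
        check_inputs(prompt="a cat", prompt_embeds=object())
    except ValueError:
        return  # expected
    raise AssertionError("check_inputs() should reject prompt + prompt_embeds")
```

Passing only one of the two should go through without raising.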

@sayakpaul (Member, Author)

Sorry, I have additional updates to make. Will let you all know here after that.

@sayakpaul sayakpaul changed the title [Cog V2V] fix positional arguments in check_inputs(). [Cog] some minor fixes and nits Sep 19, 2024
Comment on lines 580 to -591

height = height or self.transformer.config.sample_size * self.vae_scale_factor_spatial
width = width or self.transformer.config.sample_size * self.vae_scale_factor_spatial
@sayakpaul (Member, Author)

We don't need it because height and width are already at their default values.
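For context, this fallback pattern is a no-op once height and width have already been resolved upstream. A sketch of the pattern, with illustrative values that are not CogVideoX's actual config:

```python
class _Config:
    sample_size = 60  # illustrative value, not the real transformer config


vae_scale_factor_spatial = 8  # illustrative


def default_hw(height=None, width=None):
    # The `or` fallback only matters when the caller passes None; if
    # height/width were already defaulted earlier in __call__, repeating
    # it here changes nothing, so the lines can be removed.
    height = height or _Config.sample_size * vae_scale_factor_spatial
    width = width or _Config.sample_size * vae_scale_factor_spatial
    return height, width
```

With these example numbers, `default_hw()` resolves to (480, 480), while explicit values pass through untouched.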

Comment on lines +210 to +212
self.vae_scaling_factor_image = (
self.vae.config.scaling_factor if hasattr(self, "vae") and self.vae is not None else 0.7
)
@sayakpaul (Member, Author)

This is beneficial for scenarios where we want to run the pipeline without the VAE.

@sayakpaul sayakpaul requested a review from a-r-r-o-w September 21, 2024 01:38
@a-r-r-o-w (Member) left a comment

Looks good! Thanks for the fixes

@sayakpaul sayakpaul marked this pull request as ready for review September 21, 2024 03:29
@sayakpaul sayakpaul requested a review from yiyixuxu September 21, 2024 03:29
@@ -188,6 +188,9 @@ def __init__(
self.vae_scale_factor_temporal = (
self.vae.config.temporal_compression_ratio if hasattr(self, "vae") and self.vae is not None else 4
)
self.vae_scaling_factor_image = (
@sayakpaul (Member, Author)

Adding this so that the pipeline can operate without the VAE.

@yiyixuxu (Collaborator) left a comment

Thanks!

@sayakpaul sayakpaul merged commit ba5af5a into main Sep 23, 2024
18 checks passed
@sayakpaul sayakpaul deleted the correct-args-cog-v2v branch September 23, 2024 05:57
@FurkanGozukara commented Sep 27, 2024

Thank you for the amazing work.

I am using the code running here: https://huggingface.co/spaces/THUDM/CogVideoX-5B-Space

However, I am getting the errors below in both text-to-video and video-to-video. Could something in the pipeline have broken?

pipe = CogVideoXPipeline.from_pretrained("THUDM/CogVideoX-5b", torch_dtype=default_dtype).to("cpu")
pipe.scheduler = CogVideoXDPMScheduler.from_config(pipe.scheduler.config, timestep_spacing="trailing")
    else:
        pipe.to(device)
        pipe.transformer = transformer
        pipe.vae = vae
        pipe.text_encoder = text_encoder

        if use_cpu_offload:
            pipe.enable_sequential_cpu_offload()
        if use_slicing:
            pipe.vae.enable_slicing()
        if use_tiling:
            pipe.vae.enable_tiling()

        video_pt = pipe(
            prompt=prompt,
            num_videos_per_prompt=1,
            num_inference_steps=num_inference_steps,
            num_frames=49,
            use_dynamic_cfg=True,
            output_type="pt",
            guidance_scale=guidance_scale,
            generator=torch.Generator(device="cpu").manual_seed(seed),
        ).frames
        gc.collect()
    return (video_pt, seed)

[image attachment: error screenshot]

@sayakpaul (Member, Author)

Please open a separate issue with a minimal, fully reproducible code snippet. https://github.com/user-attachments/files/17161626/app.txt is quite long and not minimal, and the snippet you provided is incomplete.

@FurkanGozukara
@sayakpaul doesn't the current diffusers release have CogVideoXLoraLoaderMixin?

It can't be found; I installed the latest version, 0.30.3, on Windows.

@a-r-r-o-w (Member)

v0.30.3 only released the image-to-video and video-to-video pipelines. LoRA support will be added in a future release. For now, you can use the LoRA loader mixin by installing from the main branch.

leisuzz pushed a commit to leisuzz/diffusers that referenced this pull request Oct 11, 2024
* fix positional arguments in check_inputs().

* add video and latents to check_inputs().

* prep latents_in_channels.

* quality

* multiple fixes.

* fix
sayakpaul added a commit that referenced this pull request Dec 23, 2024
* fix positional arguments in check_inputs().

* add video and latents to check_inputs().

* prep latents_in_channels.

* quality

* multiple fixes.

* fix