[core] ConsisID #10140

SHYuanBest · 2024-12-06T08:55:10Z

What does this PR do?

Add support for ConsisID (#10100)

Paper: https://arxiv.org/abs/2411.17440
Project: https://pku-yuangroup.github.io/ConsisID
Code: https://github.com/PKU-YuanGroup/ConsisID
Demo: https://huggingface.co/spaces/BestWishYsh/ConsisID-preview-Space

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

SHYuanBest · 2024-12-06T09:00:07Z

@a-r-r-o-w Do we need to create a branch of huggingface: ConsisID, or I just use SHYuanBest: main?

a-r-r-o-w · 2024-12-06T09:02:01Z

SHYuanBest:main works. This is just a branch from your diffusers fork to HF diffusers library, so you are free to make any changes you'd like here. Looking forward to the ConsisID changes!

SHYuanBest · 2024-12-10T07:38:04Z

@a-r-r-o-w @HuggingFaceDocBuilderDev hi, I have add consisid to this branch, can you help us to reveiew the code? Is there anything else I missed?

import torch
from diffusers import ConsisIDPipeline
from diffusers.pipelines.consisid.consisid_utils import prepare_face_models, process_face_embeddings_infer
from diffusers.utils import export_to_video
from huggingface_hub import snapshot_download

snapshot_download(repo_id="BestWishYsh/ConsisID-preview", local_dir="BestWishYsh/ConsisID-preview")

face_helper_1, face_helper_2, face_clip_model, face_main_model, eva_transform_mean, eva_transform_std = prepare_face_models("BestWishYsh/ConsisID-preview", device="cuda", dtype=torch.bfloat16)

pipe = ConsisIDPipeline.from_pretrained("BestWishYsh/ConsisID-preview", torch_dtype=torch.bfloat16)
pipe.to("cuda")

prompt = "The video captures a boy walking along a city street, filmed in black and white on a classic 35mm camera. His expression is thoughtful, his brow slightly furrowed as if he's lost in contemplation. The film grain adds a textured, timeless quality to the image, evoking a sense of nostalgia. Around him, the cityscape is filled with vintage buildings, cobblestone sidewalks, and softly blurred figures passing by, their outlines faint and indistinct. Streetlights cast a gentle glow, while shadows play across the boy's path, adding depth to the scene. The lighting highlights the boy's subtle smile, hinting at a fleeting moment of curiosity. The overall cinematic atmosphere, complete with classic film still aesthetics and dramatic contrasts, gives the scene an evocative and introspective feel."
image = "https://github.com/PKU-YuanGroup/ConsisID/blob/main/asserts/example_images/2.png?raw=true"

id_cond, id_vit_hidden, image, face_kps = process_face_embeddings_infer(face_helper_1, face_clip_model, face_helper_2, eva_transform_mean, eva_transform_std, face_main_model, "cuda", torch.bfloat16, image, is_align_face=True)

video = pipe(image=image, prompt=prompt, use_dynamic_cfg=False, id_vit_hidden=id_vit_hidden, id_cond=id_cond, kps_cond=face_kps, generator=torch.Generator("cuda").manual_seed(42))
export_to_video(video.frames[0], "output.mp4", fps=8)

HuggingFaceDocBuilderDev · 2024-12-10T08:08:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

src/diffusers/pipelines/consisid/pipeline_consisid.py

Co-authored-by: hlky <[email protected]>

SHYuanBest · 2024-12-11T02:27:00Z

@a-r-r-o-w @hlky hi, what should I do next?

SHYuanBest · 2024-12-22T03:52:06Z

to do:

Make the test script very small and pass all (model, pipeline, lora).
Check if test_vae_tiling requires expected_max_diff==0.35.
Have a conversion script about nn.Sequential.
Merge https://huggingface.co/datasets/huggingface/documentation-images/discussions/406 and update the Doc links.

a-r-r-o-w · 2024-12-23T01:49:23Z

@SHYuanBest Great work on the changes! We will try and integrate this soon and target it for next diffusers release (we have one this week, which is why we've been very busy). On your end, I think we are mostly good with the changes, and just need to address some minor concerns for diffusers-side integration. I will let YiYi comment and do her review first and then we can tackle the remaining things

SHYuanBest · 2024-12-23T02:52:52Z

@a-r-r-o-w @yiyixuxu That's great, much thanks for your great support! Looking forward to merge.

a-r-r-o-w · 2025-01-04T21:50:12Z

Gentle ping @yiyixuxu

SHYuanBest · 2025-01-16T03:29:35Z

Could you help review the code and merge? Thanks @yiyixuxu

yiyixuxu

thanks!

SHYuanBest · 2025-01-17T05:38:19Z

@SHYuanBest Great work on the changes! We will try and integrate this soon and target it for next diffusers release (we have one this week, which is why we've been very busy). On your end, I think we are mostly good with the changes, and just need to address some minor concerns for diffusers-side integration. I will let YiYi comment and do her review first and then we can tackle the remaining things

@a-r-r-o-w Hi, it seem that yiyixuxu have approved the changes, could you help merge https://huggingface.co/datasets/huggingface/documentation-images/discussions/406 (so that i can update the doc links) and tackle the remaining things, thansk!

a-r-r-o-w · 2025-01-17T06:38:54Z

@SHYuanBest I've merged the doc PR just now :) Will do some last refactors after your changes and proceed to merge. I think it's okay to not have a conversion script in this specific case, so please don't worry about that

SHYuanBest · 2025-01-17T06:57:22Z

@a-r-r-o-w Thanks a lot! And have update the docs link.

a-r-r-o-w · 2025-01-18T11:42:29Z

@SHYuanBest Could you give the latest changes a look? It seems to be working for me locally as expected.

The major changes in refactor are:

Removed lora loader specific to ConsisID. We can re-use CogVideoX lora loader here because the underlying transformer architecture is same and there is a very low probability that users will train loras for other modeling components
Removed helper functions where not required

SHYuanBest · 2025-01-18T12:16:12Z

@a-r-r-o-w Thanks, I have looked the latest changs, it is good to me and the code can run nomally as expected.

Update __init__.py

0036376

SHYuanBest mentioned this pull request Dec 6, 2024

[training] CogVideoX-I2V LoRA #9482

Merged

SHYuanBest and others added 6 commits December 9, 2024 16:20

Merge branch 'huggingface:main' into main

940ec92

add consisid

c78cf01

update consisid

61c85f7

update consisid

12855b2

make style

787a69c

make_style

33d4291

hlky reviewed Dec 10, 2024

View reviewed changes

SHYuanBest and others added 7 commits December 10, 2024 16:32

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

455d68d

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

8f310c5

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

0f447a4

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

d348901

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

a35f92a

Co-authored-by: hlky <[email protected]>

Update src/diffusers/pipelines/consisid/pipeline_consisid.py

33f3acb

Co-authored-by: hlky <[email protected]>

add doc

6503a17

SHYuanBest requested a review from hlky December 10, 2024 09:04

SHYuanBest and others added 4 commits December 10, 2024 18:36

Merge branch 'main' into main

a24a4ee

Merge branch 'huggingface:main' into main

19d1fa3

make style

c13fb17

Rename consisid .md to consisid.md

61ad37b

hlky added 4 commits December 11, 2024 08:19

Update geodiff_molecule_conformation.ipynb

3a274ca

Update geodiff_molecule_conformation.ipynb

02c16ba

Update geodiff_molecule_conformation.ipynb

e76338e

Update demo.ipynb

a597713

Merge branch 'huggingface:main' into main

3b05257

SHYuanBest requested review from stevhliu and a-r-r-o-w December 22, 2024 04:26

SHYuanBest added 2 commits December 22, 2024 19:28

update

5813825

update

7734a29

SHYuanBest and others added 6 commits December 23, 2024 10:55

change expected_diff_max to 0.4

5fd9a81

Merge branch 'huggingface:main' into main

0937753

fix typo

cdc04bf

fix link

0af2f83

fix typo

e17aa82

Merge branch 'main' into main

3b17e2e

nitinmukesh mentioned this pull request Jan 2, 2025

Tracking these for completion newgenai79/newgenai#13

Open

a-r-r-o-w requested a review from yiyixuxu January 4, 2025 21:50

yiyixuxu approved these changes Jan 16, 2025

View reviewed changes

Merge branch 'huggingface:main' into main

71982b2

update docs

31c94a0

hlky approved these changes Jan 17, 2025

View reviewed changes

update

cca81bf

remove consisid lora tests

5348111

a-r-r-o-w merged commit 23b467c into huggingface:main Jan 19, 2025
11 of 12 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[core] ConsisID #10140

[core] ConsisID #10140

SHYuanBest commented Dec 6, 2024 •

edited

Loading

SHYuanBest commented Dec 6, 2024

a-r-r-o-w commented Dec 6, 2024

SHYuanBest commented Dec 10, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Dec 10, 2024

SHYuanBest commented Dec 11, 2024 •

edited

Loading

SHYuanBest commented Dec 22, 2024 •

edited

Loading

a-r-r-o-w commented Dec 23, 2024

SHYuanBest commented Dec 23, 2024

a-r-r-o-w commented Jan 4, 2025

SHYuanBest commented Jan 16, 2025

yiyixuxu left a comment

SHYuanBest commented Jan 17, 2025

a-r-r-o-w commented Jan 17, 2025

SHYuanBest commented Jan 17, 2025

a-r-r-o-w commented Jan 18, 2025

SHYuanBest commented Jan 18, 2025 •

edited

Loading

[core] ConsisID #10140

[core] ConsisID #10140

Conversation

SHYuanBest commented Dec 6, 2024 • edited Loading

What does this PR do?

Who can review?

SHYuanBest commented Dec 6, 2024

a-r-r-o-w commented Dec 6, 2024

SHYuanBest commented Dec 10, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Dec 10, 2024

SHYuanBest commented Dec 11, 2024 • edited Loading

SHYuanBest commented Dec 22, 2024 • edited Loading

a-r-r-o-w commented Dec 23, 2024

SHYuanBest commented Dec 23, 2024

a-r-r-o-w commented Jan 4, 2025

SHYuanBest commented Jan 16, 2025

yiyixuxu left a comment

Choose a reason for hiding this comment

SHYuanBest commented Jan 17, 2025

a-r-r-o-w commented Jan 17, 2025

SHYuanBest commented Jan 17, 2025

a-r-r-o-w commented Jan 18, 2025

SHYuanBest commented Jan 18, 2025 • edited Loading

SHYuanBest commented Dec 6, 2024 •

edited

Loading

SHYuanBest commented Dec 10, 2024 •

edited

Loading

SHYuanBest commented Dec 11, 2024 •

edited

Loading

SHYuanBest commented Dec 22, 2024 •

edited

Loading

SHYuanBest commented Jan 18, 2025 •

edited

Loading