Flux Model #2302
base: main
Conversation
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchtune/2302
Note: Links to docs will display an error until the docs builds have been completed.
❌ 3 New Failures, 6 Cancelled Jobs as of commit 9d90451 with merge base 90fd2d3.
NEW FAILURES - The following jobs have failed:
CANCELLED JOBS - The following jobs were cancelled. Please retry:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
I'm excited to see this PR! This is just a first pass, most of my comments are around the Flux model components. I don't have a ton to say about the noise prediction yet (I may just need to do more research/see it in action). Still need to look at the single/double stream blocks later, so will probably come back with a few more comments
@@ -0,0 +1,165 @@
# Copyright (c) Meta Platforms, Inc. and affiliates.
nit, but let's name this _utils.py just for consistency with other similar files.
        self.proj = nn.Linear(dim, dim)

    def forward(self, x: Tensor, pe: Tensor) -> Tensor:
        qkv = self.qkv(x)
Why do we need to do the QKV unpacking in forward? Seems to me like we should just separate Q, K, and V as part of the checkpoint loading
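To illustrate the suggestion, here's a rough sketch of splitting the fused QKV weight during checkpoint conversion instead of in forward (the key names qkv.weight / q_proj.weight etc. are placeholders, not the actual Flux checkpoint keys):

```python
import torch


def split_fused_qkv(state_dict: dict, hidden_size: int) -> dict:
    """Sketch: split each fused `qkv` weight into separate q/k/v projection weights."""
    converted = {}
    for key, value in state_dict.items():
        if key.endswith("qkv.weight"):
            # fused weight is [3 * hidden_size, hidden_size] -> three [hidden_size, hidden_size]
            q, k, v = torch.split(value, hidden_size, dim=0)
            prefix = key[: -len("qkv.weight")]
            converted[prefix + "q_proj.weight"] = q
            converted[prefix + "k_proj.weight"] = k
            converted[prefix + "v_proj.weight"] = v
        else:
            converted[key] = value
    return converted
```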
Separately, I may be missing something... is there a gap in our existing MultiHeadAttention that necessitates the use of these? I think we should support Q norm, K norm, RoPE, all of that there. But admittedly I'm kind of unfamiliar with some of the Flux details.
        in_channels: int,
        out_channels: int,
        vec_in_dim: int,
        context_in_dim: int,
        hidden_size: int,
        mlp_ratio: float,
        num_heads: int,
        depth: int,
        depth_single_blocks: int,
        axes_dim: list[int],
        theta: int,
        qkv_bias: bool,
        use_guidance: bool,
Any reason not to compose this from lower-level modules like we do in TransformerDecoder and elsewhere? E.g.
def __init__(
    self,
    pe_embedder: nn.Module,
    img_in: nn.Module,
    time_in: nn.Module,
    vector_in: nn.Module,
    guidance_in: nn.Module,
    txt_in: nn.Module,
    double_blocks: nn.ModuleList,
    single_blocks: nn.ModuleList,
    final_layer: nn.Module,
):
I think one could make a case that the blocks may need some constraints, but I think most other components are single-input, single-output. This would make it a lot easier for someone to swap out e.g. MLPEmbedder with something else.
        # running on sequences img
        img = self.img_in(img)
        vec = self.time_in(timestep_embedding(timesteps, 256))
Should timestep_embedding be parametrized as well?
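For example, something along these lines (just a sketch; the module name, the time_embed_dim argument, and passing timestep_embedding as a callable are all made up for illustration):

```python
from typing import Callable

from torch import Tensor, nn


class TimeConditioning(nn.Module):
    """Sketch: expose the hard-coded 256 as a constructor argument."""

    def __init__(
        self,
        time_in: nn.Module,
        timestep_embedding: Callable[[Tensor, int], Tensor],
        time_embed_dim: int = 256,
    ):
        super().__init__()
        self.time_in = time_in
        self.timestep_embedding = timestep_embedding
        self.time_embed_dim = time_embed_dim

    def forward(self, timesteps: Tensor) -> Tensor:
        # previously: self.time_in(timestep_embedding(timesteps, 256))
        return self.time_in(self.timestep_embedding(timesteps, self.time_embed_dim))
```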
        txt_ids: Tensor,
        timesteps: Tensor,
        y: Tensor,
        guidance: Tensor | None = None,
I think `|` is only available from Python 3.10 and we still support 3.9 for now. Would just stick with Union for the time being.
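i.e. something like:

```python
from typing import Optional

from torch import Tensor

# Python 3.9-compatible equivalent of `guidance: Tensor | None = None`
guidance: Optional[Tensor] = None
```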
        return self.out_layer(self.silu(self.in_layer(x)))


class RMSNorm(torch.nn.Module):
Same here.. we already have this, right?
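If it's equivalent, I believe the existing one can just be imported (assuming the signature hasn't changed):

```python
from torchtune.modules import RMSNorm

# reuse the existing implementation rather than redefining it here
norm = RMSNorm(dim=128)  # dim value is just an example
```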
    axes_dim=[16, 56, 56],
    theta=10_000,
    qkv_bias=True,
    use_guidance=False,
Noob q: the only diff between dev and schnell is that dev uses guidance and schnell doesn't?
    return model


def _replace_linear_with_lora(
I think we've talked about this one for a bit anyways, so great to see it finally materializing. Maybe let's put it in modules/peft/_utils.py or something?
One other thing we'll have to watch out for here: for NF4-quantized layers we also add state dict hooks to prevent higher memory usage on checkpoint save. It may be worth including that logic in this utility.
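For reference, a rough sketch of the kind of recursive replacement utility being discussed (the LoRALinear kwargs here are assumptions, and the NF4 state dict hooks mentioned above would still need to be registered on top of this):

```python
from torch import nn

from torchtune.modules.peft import LoRALinear


def _replace_linear_with_lora(
    module: nn.Module,
    rank: int,
    alpha: float,
) -> None:
    """Recursively swap every nn.Linear in `module` for a LoRALinear."""
    for name, child in module.named_children():
        if isinstance(child, nn.Linear):
            lora = LoRALinear(
                in_dim=child.in_features,
                out_dim=child.out_features,
                rank=rank,
                alpha=alpha,
                use_bias=child.bias is not None,
            )
            setattr(module, name, lora)
        else:
            _replace_linear_with_lora(child, rank, alpha)
```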
elif self._model_type == ModelType.FLUX:
    pass  # the torchtune Flux model state dict is identical to the huggingface one
Not a problem for this PR, but in this big if/elif/.../elif/else I think this (no weight conversion) should actually be the else, not Llama2 (for whatever reason)
PATCH_HEIGHT, PATCH_WIDTH = 2, 2
POSITION_DIM = 3
Noob q: these are constant in all cases?
Context
This adds the main Flux flow-matching model to TorchTune.
NOTE: @pbontrager had mostly finished an implementation of this model before going on leave, so this implementation is temporary to unblock the rest of the Flux PRs. I just copied the code from the official Flux repo with some minimal changes. We can replace it with @pbontrager's version when he returns.
More Flux PRs coming soon.
Changelog
Usage
Test plan
Manual testing:
Checklist:
pre-commit install
pytest tests
pytest tests -m integration_test