Karras UNet 1D + 3D #295

MaximilienLC · 2024-02-21T19:32:52Z

Would the adjustments to transform the 2D Karras Unet to 1D be the same to the adjustments made to the 2D "standard" UNet to create the 1D one?

Thanks!

lucidrains · 2024-02-21T19:43:16Z

@MaximilienLC yup, it is quite simple; just replace the mp conv2d with mp conv1d https://github.com/lucidrains/lumiere-pytorch/blob/main/lumiere_pytorch/mp_lumiere.py#L120 + some other adjustments

would you like me to just knock this out for you tomorrow morning?

lucidrains · 2024-02-21T19:44:02Z

@MaximilienLC what type of data are you working with?

MaximilienLC · 2024-02-21T19:53:39Z

I trust you more than I trust myself so I'll gladly take the offer hhh. Working with audio-like signals that cannot be easily converted to spectrograms. Thanks a lot!

lucidrains · 2024-02-21T20:01:48Z

oh yes audio duh

ok, I'll knock this out tomorrow morning!

lucidrains · 2024-02-21T21:45:27Z

@MaximilienLC gave this more thought while walking doggo, and i think you need 2d still (to mix adjacent frequency information). i'll make it so the unet2d, you can finely control whether to downsample only the height or the width (and customize the conv2d kernel height/width dimensions too). that should generalize the 1d case

lucidrains · 2024-02-21T21:56:19Z

oh sorry I misread, thought you said you were working with spectrogram. nevermind!

MaximilienLC · 2024-02-21T22:54:59Z

btw unrelated to my first question, but while I have you here hhh:

I'm planning to eventually transition to conditional modeling, following the concatenation scheme proposed in Palette & Image Super-Resolution via Iterative Refinement (add extra channels with your conditional info). I'm not super familiar with all the changes that need to be made yet but will eventually (is the concatenation all you need?) Do you think that your lib is flexible enough for me to add this conditioning feature (btw would be happy to propose a PR if I succeed)?

QuantPrincess · 2024-02-22T05:17:59Z

While we are on the topic... If I wanted to adjust the original Unet to support a 3d input would the adjustments roughly be similar in terms of changing the conv to 3d + some other minor adjustments?

lucidrains · 2024-02-22T14:30:03Z

@QuantPrincess yup, similar complexity. can throw that in there too before month's end. i'm planning on just copy pasting my own code and making a few changes lol

QuantPrincess · 2024-02-22T17:05:27Z

Amazing! Thank you.

lucidrains · 2024-02-22T17:21:23Z

@QuantPrincess what are you using it for? the latest video unets are all pretrained with images on conv2d with conv1d slipped in for time dimension during video training stage (unless if Sora changed all that, haven't read the technical paper yet)

lucidrains · 2024-02-22T18:27:15Z

@MaximilienLC knocked it out just now - let me know if this runs for you

lucidrains · 2024-02-22T18:27:53Z

@QuantPrincess i can get out a 3d version too, if you aren't doing video stuff. if you are, i'd recommend this

QuantPrincess · 2024-02-22T22:02:52Z

Thanks so much, really appreciate the recommendation. Not doing video stuff though, just some 3d imaging. I will also try to adjust your code base for 3d in mean time!

lucidrains · 2024-02-22T22:06:19Z

@QuantPrincess ohh got it, you working with CT / MRI segmentation? yea i can add 3d for you then tomorrow morning

QuantPrincess · 2024-02-22T22:18:51Z

Haha yes CTs! Thanks so much for your help. Your code base is really a pleasure to work with.

lucidrains · 2024-02-22T22:22:31Z

@QuantPrincess awesome! yea i'll get that done

brings back memories of grad school when i tried to segment kidneys in CT scans with gofai algorithms (our team used watershed segmentation, super sh**ty). oh how far things have come

MaximilienLC · 2024-02-23T13:46:03Z

@MaximilienLC knocked it out just now - let me know if this runs for you

Thanks @lucidrains, will do in the coming days and report back!

btw, you might have missed my latest question

I'm planning to eventually transition to conditional modeling, following the concatenation scheme proposed in Palette & Image Super-Resolution via Iterative Refinement (add extra channels with your conditional info). I'm not super familiar with all the changes that need to be made yet but will eventually (is the concatenation all you need?) Do you think that your lib is flexible enough for me to add this conditioning feature (btw would be happy to propose a PR if I succeed)?

lucidrains · 2024-02-23T13:48:38Z

@MaximilienLC unfortunately there is the possibility my open source journey comes to an end soon and i can't get around to that. PR is welcome!

QuantPrincess · 2024-02-23T16:38:28Z

Thank you for this! I will check out today and report back. Really appreciate your help. :)

lucidrains · 2024-02-23T16:43:22Z

@QuantPrincess hey! so i realized it still lacks a few features to be usable (factorized attention, and being able to specify downsamples in space vs time separately) CT slices will be a much smaller than the spatial dimensions

lucidrains · 2024-02-23T16:43:35Z

@QuantPrincess let me get those in there this weekend, but do feel free to try it out as it is in the meanwhile

…ing work #295

lucidrains · 2024-02-26T04:12:03Z

@QuantPrincess let me know if this is intuitive

with downsample_types, you can control at each stage whether to downsample image (spatial) or frame (slices) or all (both). you can also control how many MP resnet blocks are at each stage by passing in a tuple of integer into num_blocks_per_stage

lucidrains · 2024-02-26T04:13:07Z

@QuantPrincess i can get to the factorized attention mid next week and finish off this issue

QuantPrincess · 2024-02-26T22:39:34Z

Thank you! I will look at tonight and get back to you. I really can't express enough how awesome your code base is to work with!

Parskatt · 2024-02-28T14:29:01Z

I think there might be some implementation mistake in the 3DUnet, I'm getting exploding activations. I'll see if I can make a simple repro.

Parskatt · 2024-02-28T15:28:48Z

@lucidrains see #296

lucidrains · 2024-02-28T15:33:32Z

@Parskatt thank you Johan! you beat me to it

lucidrains · 2024-02-28T18:46:51Z

@QuantPrincess ok, with this flag, you can do attention across space and time separately (axial attention)

this is my last open source contribution until further notice, good luck!

QuantPrincess · 2024-02-29T19:59:02Z

Thanks so much @lucidrains ! Cant wait to test out your work on the axial attention. Best of luck on your next endeavors!

…ing work lucidrains#295

lucidrains added a commit that referenced this issue Feb 22, 2024

improvised karras unet 1d for @MaximilienLC for #295

b7755ea

lucidrains added a commit that referenced this issue Feb 22, 2024

improvised karras unet 1d for @MaximilienLC for #295

e39893b

lucidrains added a commit that referenced this issue Feb 23, 2024

add preliminary karras unet 3d for @QuantPrincess at #295

14d134c

lucidrains added a commit that referenced this issue Feb 23, 2024

add preliminary karras unet 3d for @QuantPrincess at #295

3a3ab1b

lucidrains changed the title ~~Karras UNet 1D question~~ Karras UNet 1D + 3D Feb 23, 2024

lucidrains added a commit that referenced this issue Feb 26, 2024

another modification to karras unet3d for @QuantPrincess medical imag…

16b74de

…ing work #295

lucidrains added a commit that referenced this issue Feb 28, 2024

complete #295

5f8aa9f

lucidrains added a commit that referenced this issue Feb 28, 2024

complete #295

d0c68fc

lucidrains closed this as completed Feb 28, 2024

WillyChap pushed a commit to WillyChap/denoising-diffusion-pytorch that referenced this issue Sep 27, 2024

improvised karras unet 1d for @MaximilienLC for lucidrains#295

b2340d3

WillyChap pushed a commit to WillyChap/denoising-diffusion-pytorch that referenced this issue Sep 27, 2024

add preliminary karras unet 3d for @QuantPrincess at lucidrains#295

cd85c6d

WillyChap pushed a commit to WillyChap/denoising-diffusion-pytorch that referenced this issue Sep 27, 2024

another modification to karras unet3d for @QuantPrincess medical imag…

318ac92

…ing work lucidrains#295

WillyChap pushed a commit to WillyChap/denoising-diffusion-pytorch that referenced this issue Sep 27, 2024

complete lucidrains#295

095e1a1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Karras UNet 1D + 3D #295

Karras UNet 1D + 3D #295

MaximilienLC commented Feb 21, 2024

lucidrains commented Feb 21, 2024 •

edited

Loading

lucidrains commented Feb 21, 2024

MaximilienLC commented Feb 21, 2024

lucidrains commented Feb 21, 2024

lucidrains commented Feb 21, 2024

lucidrains commented Feb 21, 2024 •

edited

Loading

MaximilienLC commented Feb 21, 2024 •

edited

Loading

QuantPrincess commented Feb 22, 2024

lucidrains commented Feb 22, 2024 •

edited

Loading

QuantPrincess commented Feb 22, 2024

lucidrains commented Feb 22, 2024 •

edited

Loading

lucidrains commented Feb 22, 2024

lucidrains commented Feb 22, 2024

QuantPrincess commented Feb 22, 2024 •

edited

Loading

lucidrains commented Feb 22, 2024 •

edited

Loading

QuantPrincess commented Feb 22, 2024

lucidrains commented Feb 22, 2024 •

edited

Loading

MaximilienLC commented Feb 23, 2024

lucidrains commented Feb 23, 2024

QuantPrincess commented Feb 23, 2024

lucidrains commented Feb 23, 2024

lucidrains commented Feb 23, 2024 •

edited

Loading

lucidrains commented Feb 26, 2024

lucidrains commented Feb 26, 2024

QuantPrincess commented Feb 26, 2024

Parskatt commented Feb 28, 2024

Parskatt commented Feb 28, 2024

lucidrains commented Feb 28, 2024

lucidrains commented Feb 28, 2024

QuantPrincess commented Feb 29, 2024

Karras UNet 1D + 3D #295

Karras UNet 1D + 3D #295

Comments

MaximilienLC commented Feb 21, 2024

lucidrains commented Feb 21, 2024 • edited Loading

lucidrains commented Feb 21, 2024

MaximilienLC commented Feb 21, 2024

lucidrains commented Feb 21, 2024

lucidrains commented Feb 21, 2024

lucidrains commented Feb 21, 2024 • edited Loading

MaximilienLC commented Feb 21, 2024 • edited Loading

QuantPrincess commented Feb 22, 2024

lucidrains commented Feb 22, 2024 • edited Loading

QuantPrincess commented Feb 22, 2024

lucidrains commented Feb 22, 2024 • edited Loading

lucidrains commented Feb 22, 2024

lucidrains commented Feb 22, 2024

QuantPrincess commented Feb 22, 2024 • edited Loading

lucidrains commented Feb 22, 2024 • edited Loading

QuantPrincess commented Feb 22, 2024

lucidrains commented Feb 22, 2024 • edited Loading

MaximilienLC commented Feb 23, 2024

lucidrains commented Feb 23, 2024

QuantPrincess commented Feb 23, 2024

lucidrains commented Feb 23, 2024

lucidrains commented Feb 23, 2024 • edited Loading

lucidrains commented Feb 26, 2024

lucidrains commented Feb 26, 2024

QuantPrincess commented Feb 26, 2024

Parskatt commented Feb 28, 2024

Parskatt commented Feb 28, 2024

lucidrains commented Feb 28, 2024

lucidrains commented Feb 28, 2024

QuantPrincess commented Feb 29, 2024

lucidrains commented Feb 21, 2024 •

edited

Loading

lucidrains commented Feb 21, 2024 •

edited

Loading

MaximilienLC commented Feb 21, 2024 •

edited

Loading

lucidrains commented Feb 22, 2024 •

edited

Loading

lucidrains commented Feb 22, 2024 •

edited

Loading

QuantPrincess commented Feb 22, 2024 •

edited

Loading

lucidrains commented Feb 22, 2024 •

edited

Loading

lucidrains commented Feb 22, 2024 •

edited

Loading

lucidrains commented Feb 23, 2024 •

edited

Loading