Convert from Depth Pro default 1536x1536 implementation to 1024x1024 float16 tensor CoreML packages #45

harism · 2024-10-16T07:53:38Z

*This PR is not meant to be merged currently*

Testing with converting Depth Pro to run on stock Apple MacBook M2 Neural Engine, and to do so convert the model to CoreML packages which can be executed as part of an application.

I can try to provide more suitable implementation if this is something that this project would like to have here on upstream too.

Videos where StyleGAN generates images for Depth Pro;
https://youtu.be/0728BHmhXFc & https://youtu.be/pteYTX9oWz0

juntaosun · 2024-10-16T10:02:29Z

Is there a piece of code missing ？

depth_pro.py forward ： encoder and decoder are not exported

encodings = self.encoder(x)
features, features_0 = self.decoder(encodings)

convert_to_coreml.py：forward

class Depth(nn.Module):
    def __init__(self, head: nn.Module, fov: nn.Module):
        super(Depth, self).__init__()
        self.head = head
        self.fov = fov

    def forward(self, inputs: torch.Tensor) -> torch.Tensor:
        x = inputs[0]

        # How to get this input ? （features  ，  features_0）
        features = inputs[1]
        features_0 = inputs[2]

Can you update the 1024 inference code ， Thanks

harism · 2024-10-16T11:27:08Z

Is there a piece of code missing ？

I'm not exactly sure actually, I've been running this directly in root path python3 convert_to_coreml.py only successfully to generate those CoreML program files for the whole model, after executing the mandatory checkpoint read script and installing dependencies manually. And haven't done much to literally integrate the changes to upstream code yet but rather imported upstream code into this Python script use only.

harism · 2024-10-16T12:50:53Z

Can you update the 1024 inference code ， Thanks

I updated the code so that convert_to_coreml.py execution runs with the example image example.jpg first, shows the 1024x1024 resulting depth map on GUI, then continues to create those CoreML packages.

Hope this helps to see how to continue from here with different sizing options and what not.

…nsor size CoreML packages

charlieforward9 · 2024-12-19T05:49:40Z

I am looking to bind DepthPro into my iOS flutter application. What is the state of this work to enabling that?

harism · 2024-12-19T08:47:13Z

@charlieforward9 for better optimization I'd recommend to look at DepthAnything very similar work to what DepthPro has, and there are some readymade .mlpackage files available on HuggingFace for it. Those DepthAnything packages are trained with smaller DiNOV2 model optimizing them much better than I've reached here with decreasing the resolution only. I did some DiNOV2 model change trying too but unfortunately wasn't able to reach anything too much working with this.

charlieforward9 · 2024-12-19T14:36:02Z

Since having the model available as a tflite file is sufficient for me, I opened #79 and plan to run the conversion soon.

Are there any considerations I should make you'd like to warn me of? This is my first time doing this.

This was referenced Oct 16, 2024

How to port it to iOS? #3

Open

Reduced model for realtime applications? #24

Open

harism force-pushed the main branch from 9ba2417 to 6a19ba7 Compare October 16, 2024 12:47

harism changed the title ~~Convert from Depth Pro default 1536x1536 size to 1024x1024 float16 tensor CoreML programs~~ Convert from Depth Pro default 1536x1536 implementation to 1024x1024 float16 tensor CoreML programs Oct 16, 2024

harism force-pushed the main branch 2 times, most recently from b8dbdc5 to 499c3e8 Compare October 17, 2024 15:05

harism changed the title ~~Convert from Depth Pro default 1536x1536 implementation to 1024x1024 float16 tensor CoreML programs~~ Convert from Depth Pro default 1536x1536 implementation to 1024x1024 float16 tensor CoreML packages Oct 17, 2024

harism force-pushed the main branch 2 times, most recently from aa18879 to d32719f Compare October 23, 2024 20:35

Convert from Depth Pro default 1536x1536 implementation to smaller te…

dc7d196

…nsor size CoreML packages

harism force-pushed the main branch from d32719f to dc7d196 Compare October 27, 2024 16:28

leon0514 mentioned this pull request Dec 27, 2024

Lower resolution for faster inference? leon0514/ml-depth-pro-trt10#1

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convert from Depth Pro default 1536x1536 implementation to 1024x1024 float16 tensor CoreML packages #45

Convert from Depth Pro default 1536x1536 implementation to 1024x1024 float16 tensor CoreML packages #45

harism commented Oct 16, 2024 •

edited

Loading

juntaosun commented Oct 16, 2024 •

edited

Loading

harism commented Oct 16, 2024 •

edited

Loading

harism commented Oct 16, 2024 •

edited

Loading

charlieforward9 commented Dec 19, 2024

harism commented Dec 19, 2024 •

edited

Loading

charlieforward9 commented Dec 19, 2024

Convert from Depth Pro default 1536x1536 implementation to 1024x1024 float16 tensor CoreML packages #45

Are you sure you want to change the base?

Convert from Depth Pro default 1536x1536 implementation to 1024x1024 float16 tensor CoreML packages #45

Conversation

harism commented Oct 16, 2024 • edited Loading

juntaosun commented Oct 16, 2024 • edited Loading

harism commented Oct 16, 2024 • edited Loading

harism commented Oct 16, 2024 • edited Loading

charlieforward9 commented Dec 19, 2024

harism commented Dec 19, 2024 • edited Loading

charlieforward9 commented Dec 19, 2024

harism commented Oct 16, 2024 •

edited

Loading

juntaosun commented Oct 16, 2024 •

edited

Loading

harism commented Oct 16, 2024 •

edited

Loading

harism commented Oct 16, 2024 •

edited

Loading

harism commented Dec 19, 2024 •

edited

Loading