[Llava] Add csv image loading in C++ runner #5380

digantdesai · 2024-09-15T05:28:53Z

Rationale - We don't want to depend on Torch when building for Android.

This adds two things:
(1) For AoT, the Python image_util optionally generates a .csv from a jpg, in addition to the .pt.

(2) A runtime runner flag that hints that the provided image is a csv. If it is set, the runner parses the csv and feeds it to the model.

This is very naive and obviously fragile. I added some checks in Python.
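
To make the runner-side parsing concrete, here is a minimal sketch of loading such a csv without any Torch dependency. It is not the code from this PR: the function name `parse_image_csv`, its signature, and the file layout (channels * height * width integers in [0, 255], comma separated, possibly spread over several lines, in CHW order) are all assumptions made for illustration.

```cpp
#include <cctype>
#include <cstddef>
#include <cstdint>
#include <exception>
#include <sstream>
#include <string>
#include <vector>

// Hypothetical helper, not the PR's implementation.
// Assumes `text` holds channels * height * width integers in [0, 255],
// separated by commas and newlines, in CHW order.
// Returns true and fills `out` on success; returns false on any bad cell
// or if the number of values does not match the expected shape.
bool parse_image_csv(const std::string& text,
                     std::size_t channels,
                     std::size_t height,
                     std::size_t width,
                     std::vector<std::uint8_t>& out) {
  const std::size_t expected = channels * height * width;
  std::vector<std::uint8_t> pixels;
  pixels.reserve(expected);

  std::istringstream lines(text);
  std::string line;
  while (std::getline(lines, line)) {
    std::istringstream cells(line);
    std::string cell;
    while (std::getline(cells, cell, ',')) {
      std::size_t used = 0;
      int value = 0;
      try {
        value = std::stoi(cell, &used);  // throws on empty or non-numeric cells
      } catch (const std::exception&) {
        return false;
      }
      // Anything left after the number must be whitespace (e.g. a stray '\r').
      for (std::size_t i = used; i < cell.size(); ++i) {
        if (!std::isspace(static_cast<unsigned char>(cell[i]))) {
          return false;
        }
      }
      // Pixel values must fit in one byte.
      if (value < 0 || value > 255) {
        return false;
      }
      pixels.push_back(static_cast<std::uint8_t>(value));
    }
  }

  // Reject files whose element count does not match the requested shape.
  if (pixels.size() != expected) {
    return false;
  }
  out.swap(pixels);
  return true;
}
```

A caller would read the file into a string, pass the image dimensions it expects, and hand the resulting byte buffer to the model only when the function returns true. Checking the element count and value range on the C++ side mirrors the kind of checks mentioned for the Python side, since a plain-text format carries no shape or dtype information of its own.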

Tested a few ways:

On M1, with torch, loaded both the .pt and the .csv generated from the same jpg. The LLM produces the same text for both.
On Android, without torch, loaded the .csv. It also produces similar text.

pytorch-bot · 2024-09-15T05:28:56Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/5380

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

High MacOS queue

✅ No Failures

As of commit 6c1b6b2 with merge base 444480b:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-09-16T18:09:13Z

@digantdesai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

swolchok · 2024-09-17T20:50:00Z

CSV seems like an unfortunate choice of image format. Why not PNG (or any other image format -- load the image and then blast it into a Tensor)?


facebook-github-bot · 2024-09-18T14:14:24Z

@digantdesai has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

digantdesai requested a review from larryliu0820 September 15, 2024 05:28

facebook-github-bot added the CLA Signed label Sep 15, 2024

digantdesai force-pushed the llava_csv_image branch 2 times, most recently from 6b66a90 to 0944ca3 Compare September 15, 2024 06:20

digantdesai force-pushed the llava_csv_image branch from 0944ca3 to 0874123 Compare September 18, 2024 06:20

digantdesai force-pushed the llava_csv_image branch from 0874123 to 6c1b6b2 Compare September 18, 2024 14:10

larryliu0820 approved these changes Sep 18, 2024
