Enable duckdb extension to build/run on CUDA-enabled pytorch #273
Conversation
  list(APPEND available_contents Torch)
  endif()
endif()

FetchContent_MakeAvailable(${available_contents})

-set(CMAKE_CXX_STANDARD 20)
+set(CMAKE_CXX_STANDARD 17)
why downgrade CXX standard here?
nvcc doesn't support C++20
So if this is set to 20, CMake complains that the CUDA toolchain doesn't support "CUDA20" (the C++20 dialect for CUDA).
if(LANCE_BUILD_CUDA)
  FetchContent_Declare(
    Torch
    URL https://download.pytorch.org/libtorch/cu117/libtorch-cxx11-abi-shared-with-deps-1.13.0%2Bcu117.zip
Is there a place to pin the pytorch version? Or should we mention it in the README or somewhere?
Btw, do we want to support more than one CUDA version?
We can pin pytorch in lance, but we can't control whether users pip install something different. Since 1.13 just got released, we should be OK for a little while. We could also check the version when doing import lance and print out a warning or something?
OK, actually there's no good place to pin it; pytorch is not currently a dependency. I'll add a note in the README.
@@ -38,7 +38,8 @@ ModelCatalog* ModelCatalog::Get() {
 }

 bool ModelCatalog::Load(const std::string& name, const std::string& uri) {
-  if (models_.contains(name)) {
+  // nvcc doesn't support C++20 yet so can't use std::map::contains()
Would it have any impact on including this in the lance codebase?
Are you thinking we're going to want to merge this with the main lance codebase at some point?
I was thinking we might provide a native reader/writer extension from the duckdb extension, similar to the parquet extension. But we can leave that for later discussion, of course.
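For reference, a minimal sketch of the C++17-compatible lookup implied by the diff above (the map's value type here is a placeholder, not the PR's actual type):

```cpp
#include <map>
#include <string>

// Hypothetical stand-in for ModelCatalog's model map.
std::map<std::string, std::string> models_;

bool HasModel(const std::string& name) {
  // std::map::contains() is C++20-only; find() works in the C++17
  // mode that nvcc requires.
  return models_.find(name) != models_.end();
}
```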
for (int i = 0; i < softmax.size(0); i++) {
  values.emplace_back(::duckdb::Value::FLOAT(*softmax[i].data_ptr<float>()));
  auto fmat = ReadImageFromDuckDBValue(args.data[1].GetValue(i));
  if(fmat.has_value()) {
this format seems off
Added a space after the if and ran reformat code. No other changes were made by Gateway?
module_.eval();
auto params = module_.named_parameters();
for (const auto &item : params) {
what is this?
copypasta
}

torch::Tensor RunInference(cv::Mat fmat);
Should this be const cv::Mat& to avoid one copy, or do you expect to move a Mat here?
Also, should the interface be designed against torch::Tensor instead of cv::Mat?
Ah yes, good catch.
On the interface: this is a private member, so I wasn't thinking about changing the interface. Maybe it makes sense to call this ImageToTensor or something, and have it do everything except the call to forward.
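A hedged sketch of that refactor, assuming the image arrives as a float (CV_32F) cv::Mat; ImageToTensor's name and its preprocessing body are illustrative, not the PR's actual code:

```cpp
#include <opencv2/core.hpp>
#include <torch/script.h>

class ResnetModel {
 public:
  torch::Tensor RunInference(const cv::Mat& fmat) {  // const ref avoids a copy
    auto input = ImageToTensor(fmat);
    // forward() on a TorchScript module takes a vector of IValues.
    return module_.forward({input}).toTensor();
  }

 private:
  // Everything except the forward() call: wrap the pixel buffer and
  // convert NHWC -> NCHW. contiguous() materializes a copy, so the
  // tensor no longer aliases the Mat's buffer.
  torch::Tensor ImageToTensor(const cv::Mat& fmat) {
    auto tensor = torch::from_blob(
        fmat.data, {1, fmat.rows, fmat.cols, fmat.channels()},
        torch::kFloat32);
    return tensor.permute({0, 3, 1, 2}).contiguous();
  }

  torch::jit::script::Module module_;
};
```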
def create_model(db, model_path, device):
    db.execute(f"CALL create_pytorch_model('resnet', '{str(model_path)}', '{device}');")
Doesn't need to be addressed in this PR, but in the long term, should the third parameter (conf/params) be a map<string, ANY>?
Or should we use a PRAGMA to customize behavior?
I guess that depends on whether there are more model-specific settings.
A few settings come to mind, e.g., batch size, number of worker processes, and other parameters that a DataLoader can support?

create_pytorch('name', '/path/to/model.pth', 'cuda')
OR
create_pytorch('name', '/path/to/model.pth', 'cpu')

Inputs are automatically moved to the corresponding device.
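A minimal sketch of that device handling, assuming the third argument is parsed into a torch::Device (function names here are illustrative):

```cpp
#include <torch/script.h>

#include <string>

// "cpu" and "cuda" are both valid torch::Device strings.
torch::jit::script::Module LoadModel(const std::string& path,
                                     const std::string& device_str) {
  torch::Device device(device_str);
  auto module = torch::jit::load(path, device);  // load weights onto the device
  module.eval();
  return module;
}

torch::Tensor Predict(torch::jit::script::Module& module, torch::Tensor input,
                      const torch::Device& device) {
  // Move the input to the model's device so callers don't have to care,
  // then bring the result back to the CPU for duckdb.
  return module.forward({input.to(device)}).toTensor().to(torch::kCPU);
}
```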