Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Duckdb] Zero copy Arrow Array/RecordBatch to duckdb Vector/DataChunk conversion #348

Open
4 tasks
eddyxu opened this issue Dec 4, 2022 · 0 comments
Open
4 tasks
Labels
arrow Apache Arrow related issues duckdb help wanted Extra attention is needed rust Rust related tasks

Comments

@eddyxu
Copy link
Contributor

eddyxu commented Dec 4, 2022

Problem Statement

#347 converts arrow array / RecordBatch reading from the LanceDataset to duckdb corresponding data structures via heavy data copy.

We need to implement zero-copy data transfer between arrow and duckdb to make the extension production ready.

  • Primitive Arrays to duckdb::Vector
  • ::arrow::StructArray
  • ::arrow::ListArray
  • ::arrow::DictionaryArray
@eddyxu eddyxu added help wanted Extra attention is needed c++ C++ issues duckdb arrow Apache Arrow related issues labels Dec 4, 2022
@eddyxu eddyxu added rust Rust related tasks and removed c++ C++ issues labels Feb 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
arrow Apache Arrow related issues duckdb help wanted Extra attention is needed rust Rust related tasks
Projects
None yet
Development

No branches or pull requests

1 participant