Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a new column from already computed arrow Table / Arrow #587

Closed
eddyxu opened this issue Feb 15, 2023 · 0 comments · Fixed by #824
Closed

Add a new column from already computed arrow Table / Arrow #587

eddyxu opened this issue Feb 15, 2023 · 0 comments · Fixed by #824
Assignees
Labels
arrow Apache Arrow related issues rust Rust related tasks

Comments

@eddyxu
Copy link
Contributor

eddyxu commented Feb 15, 2023

Problem Statement

Bring back the functionality added in #320 in Lance

    def merge(self,
      right: pyarrow.Table,
      left_on: str,
      right_on: str,
      metadata: Optional[Dict[str, str]] = None,
    )

To use it,

import lance

dataset = lance.dataset("/tmp/dataset")
new_table = pyarrow.Table.from_pandas(df)
dataset.merge(new_table, left_on="id", right_on="id")

Once the merge is successfully done, a new version of dataset is generated.

References:

@eddyxu eddyxu assigned eddyxu and unassigned gsilvestrin Apr 10, 2023
@eddyxu eddyxu added arrow Apache Arrow related issues rust Rust related tasks labels Apr 10, 2023
@wjones127 wjones127 assigned wjones127 and unassigned eddyxu May 22, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
arrow Apache Arrow related issues rust Rust related tasks
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants