-
Notifications
You must be signed in to change notification settings - Fork 433
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DeltaTable.to_pyarrow_dataset()
fails for tables containing map types
#713
Comments
DeltaTable.to_pyarrow_dataset()
fails for tables containing map types
I have a draft PR #712 which fixes the first issue and I'm investigating the second issue. |
Ok both of these PRs have merged but map types are not quite working fully.
|
Hmm I just learned about an option in PyArrow Docs: https://arrow.apache.org/docs/python/generated/pyarrow.parquet.ParquetWriter.html#pyarrow.parquet.ParquetWriter |
FYI I fixed the upstream casting issue. I will be available in PyArrow 10.0.0, which will be released in the next couple weeks. |
I've given it a test using Pyarrow 10.0.0 and everything seems to be working. Thanks everyone who contributed to fixing this especially @wjones127 |
Environment
Delta-rs version: 0.5.8
Binding: Python
Environment:
Bug
What happened:
When using
DeltaTable.to_pyarrow_dataset()
for a table containing map types it crashes with:What you expected to happen:
Open the table without error.
How to reproduce it:
Use
to_pyarrow_dataset()
on any table containing map types.I created a test that catches this.
More details:
I think there are 2 reasons why this doesn't work currently:
ArrowException: C Data interface error: The datatype ""+m"" is still not supported in Rust implementation
from this line. I'm unsure if this support is available in the latest version of rust arrow.The text was updated successfully, but these errors were encountered: