-
Notifications
You must be signed in to change notification settings - Fork 847
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"out of order projection is not supported" after Fix Parquet Arrow Schema Inference #1701
Comments
This is expected behaviour, DataFusion currently passed out of order projection to the parquet reader despite the ArrayReader having never supported it. We now complain explicitly that the reader does not support this, but imo this is better than silently ignoring it? |
What happened if the reader was passed out of order projections? Was data ignored? |
It silently ignored the out-of-order projection, i.e.
Would pass. To add to the mess
Would not pass, as it sort of understood column projection - although it was completely broken for nested types |
Do we need to make any changes to DataFusion? |
Not beyond updating it to use the new ProjectionMask within ParquetExec, so changing from |
Ok, Maybe I can find time to do so as part of preparing the arrow 15 release |
I'd be happy to help if you run into any roadblocks, it should be straightforward |
PR here - apache/datafusion#2631 |
Describe the bug
After #1682 from @tustvold some tests in datafusion begin to fail with
Note this code is not included in 14.0.0
To Reproduce
See the reproduction instructions on apache/datafusion#2530
Expected behavior
tests should pass
Additional context
Add any other context about the problem here.
The text was updated successfully, but these errors were encountered: