-
Notifications
You must be signed in to change notification settings - Fork 847
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support multi diskRanges for ChunkReader #1955
Comments
Why not just call get_read for each page instead of for the entire column chunk? There is no requirement for get_read to delimit column chunks, after all the same trait is used to read the footer, etc... Somewhat related, but something to keep in mind is how this will all work with |
make sense.
😂 For now i only check page filter in Is there any need to support in @tustvold How expert thinks 😊 |
It seems use IOx ObjectStore will only support asyn reader? could you show me the code example of IOX integrate with arrow-rs |
They reuse a lot of logic, however, the logic that differs concerns the IO for fetching pages. So support for this would need to be explicitly added.
Currently IOx fetches the entire file to memory and does not perform IO to object storage directly. This was partly driven by the limited support for more sophisticated predicate pushdown, and the fact IO was not a dominating factor for our query workloads. That being said, apache/datafusion#2677 switches DataFusion to using the async interface directly, and apache/datafusion#2504 has more about how I envisage this fitting with the rayon-based scheduler longer-term. Any feedback would be most welcome 😄 |
Wow! Wonderful work! changing with each passing day 😂 i will catch up 😊 |
Is your feature request related to a problem or challenge? Please describe what you are trying to do.
related to #1775
When i implement page index skipping #1792 , i found
it assume read whole column chunk bytes array, but when facing like
read
col1
page1 and page3 we need skip other pageswe should pass two offsets
Describe the solution you'd like
pass multi strart and length
Describe alternatives you've considered
A clear and concise description of any alternative solutions or features you've considered.
Additional context
Add any other context or screenshots about the feature request here.
The text was updated successfully, but these errors were encountered: