Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Python] Add arrow::ArrayStatistics bindings #45457

Closed
kou opened this issue Feb 7, 2025 · 1 comment
Closed

[Python] Add arrow::ArrayStatistics bindings #45457

kou opened this issue Feb 7, 2025 · 1 comment

Comments

@kou
Copy link
Member

kou commented Feb 7, 2025

Describe the enhancement requested

We can get it by arrow::Array::statistics(). If we can get it, we can access to statistics in Parquet from read Arrow data.

Component(s)

Python

kou added a commit to kou/arrow that referenced this issue Feb 17, 2025
It's the bindings of `arrow::ArrayStatistics`. You can get it by
`pyarrow.Array.statistics()`.
kou added a commit to kou/arrow that referenced this issue Feb 25, 2025
It's the bindings of `arrow::ArrayStatistics`. You can get it by
`pyarrow.Array.statistics()`.
kou added a commit that referenced this issue Feb 25, 2025
### Rationale for this change

Apache Arrow C++ can attach statistics read from Apache Parquet data to `arrow::Array`. If we have the bindings of the feature in Python, Python users can also use attached statistics. 

### What changes are included in this PR?

* Add `pyarrow.ArrayStatistics`
* Add `pyarrow.Array.statistics()`.

### Are these changes tested?

Yes.

### Are there any user-facing changes?

Yes.
* GitHub Issue: #45457

Lead-authored-by: Sutou Kouhei <[email protected]>
Co-authored-by: Sutou Kouhei <[email protected]>
Co-authored-by: Antoine Pitrou <[email protected]>
Signed-off-by: Sutou Kouhei <[email protected]>
@kou kou added this to the 20.0.0 milestone Feb 25, 2025
@kou
Copy link
Member Author

kou commented Feb 25, 2025

Issue resolved by pull request 45550
#45550

@kou kou closed this as completed Feb 25, 2025
arashandishgar pushed a commit to arashandishgar/arrow that referenced this issue Feb 25, 2025
### Rationale for this change

Apache Arrow C++ can attach statistics read from Apache Parquet data to `arrow::Array`. If we have the bindings of the feature in Python, Python users can also use attached statistics. 

### What changes are included in this PR?

* Add `pyarrow.ArrayStatistics`
* Add `pyarrow.Array.statistics()`.

### Are these changes tested?

Yes.

### Are there any user-facing changes?

Yes.
* GitHub Issue: apache#45457

Lead-authored-by: Sutou Kouhei <[email protected]>
Co-authored-by: Sutou Kouhei <[email protected]>
Co-authored-by: Antoine Pitrou <[email protected]>
Signed-off-by: Sutou Kouhei <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant