Add release notes for 0.241 #15143

caithagoras · 2020-09-09T06:26:07Z

Missing Release Notes

Daniel Ohayon

Enums support #1: base types and operators #14728 Enums support Add simple HashAggregation #1: base types and operators (Merged by: Rongrong Zhong)

George Wang

Add Oracle connector support #15014 Add Oracle connector support (Merged by: James Sun)

Vic Zhang

Fix precision definition in classification_precision function #15075 Fix precision definition in classification_precision function (Merged by: Maria Basmanova)

Extracted Release Notes

Add geometry_nearest_points function #14923 (Author: James Gill): Add geometry_nearest_points function
- Add :func:geometry_nearest_points to find nearest points of a pair of geometries.
Push dereferences into table scan for parquet tables #14955 (Author: Venki Korukanti): Push dereferences into table scan for parquet tables
- This change adds planner side support for pushing dereferences into Parquet table scan. Pushing deferences into table scan enables efficient scans as only the required nested column is read when required independent of the other projected nested columns in the same base column. Currently this functionality is behind a configuration variable hive.enable-parquet-dereference-pushdown.
Add warning message for UNION queries without ALL/DISTINCT keyword #14983 (Author: prithvip): Add warning message for UNION queries without ALL/DISTINCT keyword
- Added new warning message for UNION queries without ALL/DISTINCT keyword.
Add support for druid data ingestion #14995 (Author: Beinan Wang): Add support for druid data ingestion
- Add support for data ingestion.
Add Hive procedure to sync table partitions #15013 (Author: Vivek Bharathan): Add Hive procedure to sync table partitions
- Add procedure system.sync_partition_metadata() to synchronize the partitions in the metastore with the partitions that are physically on the file system.
Implement PrestoS3FileSystem#listFiles for direct recursive listings #15024 (Author: James Petty): Implement PrestoS3FileSystem#listFiles for direct recursive listings
- Add support for direct recursive file listings in PrestoS3FileSystem.
Implement JDBC ResultSet.getStatement #15027 (Author: Adam J. Shook): Implement JDBC ResultSet.getStatement
- Implemented ResultSet getStatement.
Revert "Upgrade ZSTD version" #15028 (Author: Rohit Jain): Revert "Upgrade ZSTD version"
- Downgrade ZSTD JNI compressor version to resolve the frequent excessive GC events introduced in version 0.238.
Separate operator and stage statistics from query statistics #15040 (Author: Mayank Garg): Separate operator and stage statistics from query statistics
- Add StageStatistics and OperatorStatistics to QueryCompletedEvent and remove stage and operator statistics from QueryStatistics.
Add documentation for Presto authorization #15042 (Author: Mayank Garg): Add documentation for Presto authorization
- Implement REST endpoint authorization in Presto. See :doc:/security/authorization.
Improve error handling during determinism analysis #15056 (Author: Leiqing Cai): Improve error handling during determinism analysis
- Fix an issue during determinism analysis where queries with LIMIT clause are not identified as non-deterministic when a rerun of the control query fails.
Support non-hive types in hive views #15065 (Author: Rebecca Schlussel): Support non-hive types in hive views
- Add support for non-hive types to hive views. This support had been removed in 0.233. If a view uses an unsupported type for any columns ,we will store only a single dummy column for that view in the metastore.
Dynamic filtering integration with Aria #15077 (Author: Ke Wang): Dynamic filtering integration with Aria
- Add dynamic filtering and bucket pruning support for inner join and semi join. The feature avoids full table scan on probe side for broadcast join or colocated join when the build side is small. Set config experimental.enable-dynamic-filtering to True to enable the feature. Configs experimental.dynamic-filtering-max-per-driver-row-count and experimental.dynamic-filtering-max-per-driver-size are available to tune the size on the build side join key space. Currently, only Hive connector can benefit from the feature.
Add memory tracking for OrcRecordReader #15078 (Author: Ying Su): Add memory tracking for OrcRecordReader
- Added memory tracking for OrcRecordReader.
Fix memory counting for SliceDictionaryBatchStreamReader #15099 (Author: Ying Su): Fix memory counting for SliceDictionaryBatchStreamReader
- Fixed a memory counting bug in SliceDictionaryBatchStreamReader.
Fix incorrect intersection between two envelopes #15104 (Author: James Gill): Fix incorrect intersection between two envelopes
- Fix bug when two envelopes intersect at a point for :func:ST_Intersection.
Support multiple control and test clusters in Verifier #15113 (Author: Leiqing Cai): Support multiple control and test clusters in Verifier
- Add support to allow multiple control and test clusters.

All Commits

61d5b87 Partial Aggregation Pushdown for ORC/Parquet (Vivek Bharathan)
731ea6b Fix incorrect intersection between two envelopes (James Gill)
7b07513 Add support to bounded varchar to BenchmarkSelectiveStreamReaders (Ying Su)
a8dce28 Support multiple columns in BenchmarkSelectiveStreamReaders (Ying Su)
921393d Allow custom filter rate for BenchmarkSelectiveStreamReaders (Ying Su)
bc44dab Add verification method to BenchmarkSelectiveStreamReaders (Ying Su)
cf49137 Refactor BenchmarkSelectiveStreamReaders (Ying Su)
6c8ad65 ParquetWriters cleanup (Zhenxiao Luo)
eadfd1b Add ConnectorMetadataUpdateHandle resolver (Nikhil Collooru)
f02c93d Add ConnectorMetadataUpdater to handle worker's metadata updates (Nikhil Collooru)
e6995f4 Rename PageSinkProperties to PageSinkContext (Nikhil Collooru)
ae0bdf8 Remove getFileStatus on openFile with Alluxio Cache (Bin Fan)
f74a84e Dynamic filtering integration with hive filter pushdown (Ke Wang)
76442b3 Support multiple control and test clusters in Verifier (Leiqing Cai)
2663a66 Make equals() and hashCode() consistent in TableHandle (Shixuan Fan)
344a3f8 Add check that dynamic filtering is not enabled with grouped execution (Ke Wang)
6b6b143 Rename dynamicFilterSupplier in ScanFilterAndProjectOperator (Ke Wang)
e548af5 Wrap OrcZstdDecompressor with airlift Decompressor (Vic Zhang)
b213954 Add zstd compression for PAGEFILE (Vic Zhang)
287c471 Add debug logging when running Presto queries in Verifier (Leiqing Cai)
eed4ae1 Make SimpleHttpResponseHandler generic (Tim Meehan)
5805f74 Make RequestErrorTracker generic (Tim Meehan)
f2c7097 Add tests for session propery ignore_unreadable_partition (Naveen007)
8ff1be5 Add Session property to ignore non-readable hive partitions (Naveen007)
3d4931b Add testMemoryTracking in TestSelectiveOrcReader (Ying Su)
696ffca Fix memory tracking for some SelectiveStreamReader's (Ying Su)
d375646 Add memory tracking for OrcSelectiveRecordReader (Ying Su)
9aaa218 Do not write buffered data when task is aborted (Vic Zhang)
1ac2744 Add ExecutionFailureInfo to BasicQueryInfo (Tim Meehan)
5d6ec74 Remove QueryInfo from DispatchManager (Tim Meehan)
4375cd2 Fix formatting in TestHiveIntegrationSmokeTest (Rebecca Schlussel)
256fb82 Suport non-hive types for Hive views (Rebecca Schlussel)
a458ecb Track cache objects sizes in CachingOrcDataSource (Ying Su)
91a1c79 Create OrcPageSource using the OrcDataSource from OrcReader (Ying Su)
b552b63 Fix memory counting for SliceDictionaryBatchStreamReader (Ying Su)
21c9c62 Update Presto on Spark splits assignment test (Vic Zhang)
fe13c96 Add config property for min spark partition count (Vic Zhang)
dcf54c5 Add more error message for Presto-on-Spark (Wenlei Xie)
99e67bc Add enum operators (Daniel Ohayon)
3649366 Support enum literals in queries (Daniel Ohayon)
f99842b Support type bound in TypeVariableConstraint (Daniel Ohayon)
7ced455 Add long and varchar enum types (Daniel Ohayon)
9bde821 Add Hive procedure to sync table partitions (Vivek)
a703fc4 Adds support for microsecond timestamp precision (Dmitry Borovsky)
da8303b Categorize the AccessControlException as user error (Venki Korukanti)
9b61fd7 Fix TestOrcMapNullKey (Masha Basmanova)
738bb9c Improve logging of queued queries (Tim Meehan)
fbe8766 Fix PrestoSparkQueryStatusInfo deserialization (Andrii Rosa)
1a8987f Adds option to read null map keys from orc file (Dmitry Borovsky)
9d92f0e Fix page splitter creating large dictionary page list (James Sun)
d8a12ee Add test cases for making temp dir under both exist and non-exist folder (Beinan Wang)
3f34a61 Create temporary root folder when it does not exist and add unit test (Beinan Wang)
c0ae304 fix CTAS failures when using viewfs (Beinan Wang)
6923691 Allow to store query results into a file in Presto on Spark (Andrii Rosa)
6fa3632 Restructure PrestoSparkQueryInfo to resemble QueryResults (Andrii Rosa)
2222b94 Expose more session parameters in PrestoSparkRunner (Andrii Rosa)
f8793a7 Improve error handling during determinism analysis (Leiqing Cai)
b18ccbe Fix precision definition in classification_precision function (Vic Zhang)
b7d5539 Clean up TrackedQuery (Tim Meehan)
86914fb Add max Spark input partition count for auto tune (Wenlei Xie)
f16f757 Recognize Presto-on-Spark error code in verifier (Wenlei Xie)
39872c4 Move QueryNotAdequatelyPushedDownErrorCode to PinotErrorCode (Xiang Fu)
ad31ac0 Add dynamic filter canonicalization in UnaliasSymbolReferences (Ke Wang)
0cfa23f Refactor WarningCollector to spi module enabling connectors to pass warnings back to the engine (Naveen007)
95725f6 Implement PrestoS3FileSystem#listFiles for direct recursive listings (James Petty)
e7af71b Add geometry_nearest_points function (James Gill)
f25aa0c Fix multi-join dynamic filtering (Ke Wang)
f4fd786 Fix for Pinot queries where order by column is pruned in projection (Dharak Kharod)
4778753 Separate operator and stage statistics from query statistics (Mayank Garg)
4066bd3 Invoke runtime plan checker in SqlQueryScheduler (Peizhen Guo)
9b29a74 Refactor RuntimeReorderJoin use PropertyDerivation (Peizhen Guo)
7dd650c Add documentation for Presto authorization (Mayank Garg)
c02e90e Add error code for Presto-on-Spark (Wenlei Xie)
ad1ab8b Remove OriginalExpression from PropertyDerivations (James Sun)
d60c948 Remove OriginalExpression from StreamPropertyDerivations (James Sun)
5de7bca Remove OriginalExpression from PushProjectionThroughUnion (James Sun)
0bd0854 Remove OriginalExpression from PushProjectionThroughExchange (James Sun)
49a2f19 Add Oracle connector (George Wang)
c79b714 Fix TABLESAMPLE SYSTEM for Presto on Spark (Andrii Rosa)
36b78d2 Fix pruning unreferenced variables when WindowNode is skipped in PruneUnreferencedOutputs (Venki Korukanti)
7e7be19 Enable dereference pushdown in more testcases (Zhenxiao Luo)
d07fbc5 Implement JDBC ResultSet.getStatement (Adam J. Shook)
8338c0b Improve DynamicFiltersChecker to catch unsupported dynamic filters (Ke Wang)
08e260c Cleanup nested dynamic filters in RemoveUnsupportedDynamicFilters (Ke Wang)
19cfbbc Handle null analysis for try_cast (James Sun)
5dc949e Fix NullabilityAnalyzer for RowExpression (James Sun)
a1b15d6 Remove unused NullabilityAnalyzer for Expression (James Sun)
f6fa7c8 Catch and throw storage connection error properly (Nikhil Collooru)
99c0d2a Use SYNTHESIZED type to represent pushed down Subfield in HiveColumnHandle (Venki Korukanti)
7d4a892 RowGroup pruning using the filter on pushed down subfield (Venki Korukanti)
04ed142 Fix the dereference validity check in PushdownDeferences rule (Venki Korukanti)
a444c04 Update Parquet reader to read pushed down dereference columns (Venki Korukanti)
009eb3f Pushdown dereferences into table scan for parquet tables (Venki Korukanti)
871dbf3 Add an option to control Parquet dereferance pushdown (Venki Korukanti)
9f867d4 Revert "Upgrade ZSTD version" (Rohit Jain)
172a552 Improve shuffle statistics collection (Andrii Rosa)
c8d9198 Implement batching of tiny rows for Presto on Spark (Andrii Rosa)
d2964ba Use offset instead of size in PrestoSparkRowBatch (Andrii Rosa)
de09645 Implement hdfs input source for druid ingestion (beinan)
8fb458d Implement druid ingestion by CTAS (beinan)
d29764e Implement sending ingestion task to druid (beinan)
572a4cb Write druid page data to gzip files (beinan)
84408b8 Add ingestion storage path to DruidConfig (beinan)
d5928f1 Add druid table insert/ingestion skeleton code (beinan)
27e922f Add warning message for UNION queries without ALL/DISTINCT keyword (prithvip)
c6c34d6 Modify regex for compatibility with both mysql, mariadb (Amit Sadaphule)
0cd9fb1 Dynamic bucket pruning on workers (Ke Wang)

yingsu00 · 2020-09-14T21:20:29Z

@kewang1024 @fgwang7w @beinan @zhenxiao I make changes to your commit messages. Could you please take a look to see if they're accurate?

presto-docs/src/main/sphinx/release/release-0.241.rst

ajaygeorge · 2020-09-14T21:32:28Z

presto-docs/src/main/sphinx/release/release-0.241.rst

+
+Hive Changes
+____________
+* Fix several memory counting bugs in ``OrcRecordReader`` and ``StreamReader``'s.


counting -> accounting reads better?

What is the trailing 's, remove?

What is the trailing 's, remove?

StreamReader is an interface. There're several implementations and the 's is plural form. I used plural form because the fixes were on several StreamReaders. Let's consult @aweisberg who is native speaker and runs the blog. Ariel, which way is better according to your opinion?

I removed the ' and kept the s. So now it will look like StreamReaders

presto-docs/src/main/sphinx/release/release-0.241.rst

caithagoras · 2020-09-14T21:46:54Z

presto-docs/src/main/sphinx/release/release-0.241.rst

+
+Hive Changes
+____________
+* Fix several memory counting bugs in ``OrcRecordReader`` and ``StreamReader``'s.


What is the trailing 's, remove?

presto-docs/src/main/sphinx/release/release-0.241.rst

caithagoras · 2020-09-14T21:51:15Z

presto-docs/src/main/sphinx/release/release-0.241.rst

+General Changes
+_______________
+* Fix incorrect results from function classification_precision() introduced in release 0.239
+* Add dynamic filtering and bucket pruning support for inner join and semi join, which can avoid full table scan on the probe side of broadcast joins or collocated joins when the build side is small. This can be enabled with the ``experimental.enable-dynamic-filtering`` configuration property and ``enable_dynamic_filtering`` system session property. The build side join key space size can be tuned by the ``experimental.dynamic-filtering-max-per-driver-row-count`` and ``experimental.dynamic-filtering-max-per-driver-size`` configuration properties and ``dynamic_filtering_max_per_driver_row_count`` and ``dynamic_filtering_max_per_driver_size`` session properties. Currently, only Hive connector can benefit from the feature.


this item is a performance improve, so the line should starts with "Improve".

Improve performance of broadcast and collocated join by supporting dynamic filtering and bucket pruning for inner and semi joins.

the item is way too long. Remove every after "The build side join key ...", and add a link to the PR or the issue instead. Something like

(:pr:`12345`)

Improve performance of broadcast and collocated join by supporting dynamic filtering and bucket pruning for inner and semi joins.

@caithagoras thanks for the rewriting. But this PR's primary goal is not improving the join's performance but the scan performance on the probe side. So I changed it to Improve performance for queries with broadcast or collocated joins by adding dynamic filtering and bucket pruning support.

beinan · 2020-09-16T06:16:01Z

@kewang1024 @fgwang7w @beinan @zhenxiao I make changes to your commit messages. Could you please take a look to see if they're accurate?

commit message looks good to me, thanks!

fgwang7w · 2020-09-16T06:29:37Z

@kewang1024 @fgwang7w @beinan @zhenxiao I make changes to your commit messages. Could you please take a look to see if they're accurate?

Msg LGTM, thanks!

yingsu00 · 2020-09-24T05:44:25Z

Thanks @ajaygeorge @caithagoras for reviewing. I have addressed the comments. Will you be able to take another look?

caithagoras · 2020-09-24T18:19:41Z

LGTM % sphinx compilation error

Warning, treated as error:
/Users/leiqing/fb/presto/presto-docs/src/main/sphinx/release/release-0.241.rst:23:Inline literal start-string without end-string.
make: *** [html] Error 2

ajaygeorge

LGTM

yingsu00 force-pushed the release-notes-0.241 branch 3 times, most recently from 4e20b72 to 5c87163 Compare September 11, 2020 22:12

yingsu00 self-assigned this Sep 14, 2020

yingsu00 requested a review from mbasmanova September 14, 2020 21:25

caithagoras requested review from tdcmeehan and mayankgarg1990 September 14, 2020 21:26

yingsu00 requested review from wenleix, zhenxiao, kewang1024, fgwang7w, rschlussel and mayankgarg1990 and removed request for mayankgarg1990 and tdcmeehan September 14, 2020 21:27

ajaygeorge reviewed Sep 14, 2020

View reviewed changes

presto-docs/src/main/sphinx/release/release-0.241.rst Outdated Show resolved Hide resolved

ajaygeorge reviewed Sep 14, 2020

View reviewed changes

yingsu00 requested review from rongrong and tdcmeehan September 14, 2020 21:35

caithagoras commented Sep 14, 2020

View reviewed changes

yingsu00 requested a review from aweisberg September 24, 2020 05:10

yingsu00 force-pushed the release-notes-0.241 branch from 5c87163 to 219721c Compare September 24, 2020 05:41

yingsu00 force-pushed the release-notes-0.241 branch from 219721c to 94696f0 Compare September 24, 2020 08:33

ajaygeorge approved these changes Sep 24, 2020

View reviewed changes

Add release notes for 0.241

e184c66

yingsu00 force-pushed the release-notes-0.241 branch from 94696f0 to e184c66 Compare September 24, 2020 19:41

caithagoras merged commit 8614ca5 into prestodb:master Sep 24, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add release notes for 0.241 #15143

Add release notes for 0.241 #15143

caithagoras commented Sep 9, 2020 •

edited by yingsu00

Loading

yingsu00 commented Sep 14, 2020

ajaygeorge Sep 14, 2020

caithagoras Sep 14, 2020

yingsu00 Sep 24, 2020

yingsu00 Sep 24, 2020

caithagoras Sep 14, 2020

caithagoras Sep 14, 2020

yingsu00 Sep 24, 2020

beinan commented Sep 16, 2020

fgwang7w commented Sep 16, 2020

yingsu00 commented Sep 24, 2020

caithagoras commented Sep 24, 2020

ajaygeorge left a comment

Add release notes for 0.241 #15143

Add release notes for 0.241 #15143

Conversation

caithagoras commented Sep 9, 2020 • edited by yingsu00 Loading

Missing Release Notes

Daniel Ohayon

George Wang

Vic Zhang

Extracted Release Notes

All Commits

yingsu00 commented Sep 14, 2020

ajaygeorge Sep 14, 2020

Choose a reason for hiding this comment

caithagoras Sep 14, 2020

Choose a reason for hiding this comment

yingsu00 Sep 24, 2020

Choose a reason for hiding this comment

yingsu00 Sep 24, 2020

Choose a reason for hiding this comment

caithagoras Sep 14, 2020

Choose a reason for hiding this comment

caithagoras Sep 14, 2020

Choose a reason for hiding this comment

yingsu00 Sep 24, 2020

Choose a reason for hiding this comment

beinan commented Sep 16, 2020

fgwang7w commented Sep 16, 2020

yingsu00 commented Sep 24, 2020

caithagoras commented Sep 24, 2020

ajaygeorge left a comment

Choose a reason for hiding this comment

caithagoras commented Sep 9, 2020 •

edited by yingsu00

Loading