Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add release notes for 0.241 #15143

Merged
merged 1 commit into from
Sep 24, 2020
Merged

Conversation

caithagoras
Copy link
Contributor

@caithagoras caithagoras commented Sep 9, 2020

Missing Release Notes

Daniel Ohayon

George Wang

Vic Zhang

Extracted Release Notes

All Commits

  • 61d5b87 Partial Aggregation Pushdown for ORC/Parquet (Vivek Bharathan)
  • 731ea6b Fix incorrect intersection between two envelopes (James Gill)
  • 7b07513 Add support to bounded varchar to BenchmarkSelectiveStreamReaders (Ying Su)
  • a8dce28 Support multiple columns in BenchmarkSelectiveStreamReaders (Ying Su)
  • 921393d Allow custom filter rate for BenchmarkSelectiveStreamReaders (Ying Su)
  • bc44dab Add verification method to BenchmarkSelectiveStreamReaders (Ying Su)
  • cf49137 Refactor BenchmarkSelectiveStreamReaders (Ying Su)
  • 6c8ad65 ParquetWriters cleanup (Zhenxiao Luo)
  • eadfd1b Add ConnectorMetadataUpdateHandle resolver (Nikhil Collooru)
  • f02c93d Add ConnectorMetadataUpdater to handle worker's metadata updates (Nikhil Collooru)
  • e6995f4 Rename PageSinkProperties to PageSinkContext (Nikhil Collooru)
  • ae0bdf8 Remove getFileStatus on openFile with Alluxio Cache (Bin Fan)
  • f74a84e Dynamic filtering integration with hive filter pushdown (Ke Wang)
  • 76442b3 Support multiple control and test clusters in Verifier (Leiqing Cai)
  • 2663a66 Make equals() and hashCode() consistent in TableHandle (Shixuan Fan)
  • 344a3f8 Add check that dynamic filtering is not enabled with grouped execution (Ke Wang)
  • 6b6b143 Rename dynamicFilterSupplier in ScanFilterAndProjectOperator (Ke Wang)
  • e548af5 Wrap OrcZstdDecompressor with airlift Decompressor (Vic Zhang)
  • b213954 Add zstd compression for PAGEFILE (Vic Zhang)
  • 287c471 Add debug logging when running Presto queries in Verifier (Leiqing Cai)
  • eed4ae1 Make SimpleHttpResponseHandler generic (Tim Meehan)
  • 5805f74 Make RequestErrorTracker generic (Tim Meehan)
  • f2c7097 Add tests for session propery ignore_unreadable_partition (Naveen007)
  • 8ff1be5 Add Session property to ignore non-readable hive partitions (Naveen007)
  • 3d4931b Add testMemoryTracking in TestSelectiveOrcReader (Ying Su)
  • 696ffca Fix memory tracking for some SelectiveStreamReader's (Ying Su)
  • d375646 Add memory tracking for OrcSelectiveRecordReader (Ying Su)
  • 9aaa218 Do not write buffered data when task is aborted (Vic Zhang)
  • 1ac2744 Add ExecutionFailureInfo to BasicQueryInfo (Tim Meehan)
  • 5d6ec74 Remove QueryInfo from DispatchManager (Tim Meehan)
  • 4375cd2 Fix formatting in TestHiveIntegrationSmokeTest (Rebecca Schlussel)
  • 256fb82 Suport non-hive types for Hive views (Rebecca Schlussel)
  • a458ecb Track cache objects sizes in CachingOrcDataSource (Ying Su)
  • 91a1c79 Create OrcPageSource using the OrcDataSource from OrcReader (Ying Su)
  • b552b63 Fix memory counting for SliceDictionaryBatchStreamReader (Ying Su)
  • 21c9c62 Update Presto on Spark splits assignment test (Vic Zhang)
  • fe13c96 Add config property for min spark partition count (Vic Zhang)
  • dcf54c5 Add more error message for Presto-on-Spark (Wenlei Xie)
  • 99e67bc Add enum operators (Daniel Ohayon)
  • 3649366 Support enum literals in queries (Daniel Ohayon)
  • f99842b Support type bound in TypeVariableConstraint (Daniel Ohayon)
  • 7ced455 Add long and varchar enum types (Daniel Ohayon)
  • 9bde821 Add Hive procedure to sync table partitions (Vivek)
  • a703fc4 Adds support for microsecond timestamp precision (Dmitry Borovsky)
  • da8303b Categorize the AccessControlException as user error (Venki Korukanti)
  • 9b61fd7 Fix TestOrcMapNullKey (Masha Basmanova)
  • 738bb9c Improve logging of queued queries (Tim Meehan)
  • fbe8766 Fix PrestoSparkQueryStatusInfo deserialization (Andrii Rosa)
  • 1a8987f Adds option to read null map keys from orc file (Dmitry Borovsky)
  • 9d92f0e Fix page splitter creating large dictionary page list (James Sun)
  • d8a12ee Add test cases for making temp dir under both exist and non-exist folder (Beinan Wang)
  • 3f34a61 Create temporary root folder when it does not exist and add unit test (Beinan Wang)
  • c0ae304 fix CTAS failures when using viewfs (Beinan Wang)
  • 6923691 Allow to store query results into a file in Presto on Spark (Andrii Rosa)
  • 6fa3632 Restructure PrestoSparkQueryInfo to resemble QueryResults (Andrii Rosa)
  • 2222b94 Expose more session parameters in PrestoSparkRunner (Andrii Rosa)
  • f8793a7 Improve error handling during determinism analysis (Leiqing Cai)
  • b18ccbe Fix precision definition in classification_precision function (Vic Zhang)
  • b7d5539 Clean up TrackedQuery (Tim Meehan)
  • 86914fb Add max Spark input partition count for auto tune (Wenlei Xie)
  • f16f757 Recognize Presto-on-Spark error code in verifier (Wenlei Xie)
  • 39872c4 Move QueryNotAdequatelyPushedDownErrorCode to PinotErrorCode (Xiang Fu)
  • ad31ac0 Add dynamic filter canonicalization in UnaliasSymbolReferences (Ke Wang)
  • 0cfa23f Refactor WarningCollector to spi module enabling connectors to pass warnings back to the engine (Naveen007)
  • 95725f6 Implement PrestoS3FileSystem#listFiles for direct recursive listings (James Petty)
  • e7af71b Add geometry_nearest_points function (James Gill)
  • f25aa0c Fix multi-join dynamic filtering (Ke Wang)
  • f4fd786 Fix for Pinot queries where order by column is pruned in projection (Dharak Kharod)
  • 4778753 Separate operator and stage statistics from query statistics (Mayank Garg)
  • 4066bd3 Invoke runtime plan checker in SqlQueryScheduler (Peizhen Guo)
  • 9b29a74 Refactor RuntimeReorderJoin use PropertyDerivation (Peizhen Guo)
  • 7dd650c Add documentation for Presto authorization (Mayank Garg)
  • c02e90e Add error code for Presto-on-Spark (Wenlei Xie)
  • ad1ab8b Remove OriginalExpression from PropertyDerivations (James Sun)
  • d60c948 Remove OriginalExpression from StreamPropertyDerivations (James Sun)
  • 5de7bca Remove OriginalExpression from PushProjectionThroughUnion (James Sun)
  • 0bd0854 Remove OriginalExpression from PushProjectionThroughExchange (James Sun)
  • 49a2f19 Add Oracle connector (George Wang)
  • c79b714 Fix TABLESAMPLE SYSTEM for Presto on Spark (Andrii Rosa)
  • 36b78d2 Fix pruning unreferenced variables when WindowNode is skipped in PruneUnreferencedOutputs (Venki Korukanti)
  • 7e7be19 Enable dereference pushdown in more testcases (Zhenxiao Luo)
  • d07fbc5 Implement JDBC ResultSet.getStatement (Adam J. Shook)
  • 8338c0b Improve DynamicFiltersChecker to catch unsupported dynamic filters (Ke Wang)
  • 08e260c Cleanup nested dynamic filters in RemoveUnsupportedDynamicFilters (Ke Wang)
  • 19cfbbc Handle null analysis for try_cast (James Sun)
  • 5dc949e Fix NullabilityAnalyzer for RowExpression (James Sun)
  • a1b15d6 Remove unused NullabilityAnalyzer for Expression (James Sun)
  • f6fa7c8 Catch and throw storage connection error properly (Nikhil Collooru)
  • 99c0d2a Use SYNTHESIZED type to represent pushed down Subfield in HiveColumnHandle (Venki Korukanti)
  • 7d4a892 RowGroup pruning using the filter on pushed down subfield (Venki Korukanti)
  • 04ed142 Fix the dereference validity check in PushdownDeferences rule (Venki Korukanti)
  • a444c04 Update Parquet reader to read pushed down dereference columns (Venki Korukanti)
  • 009eb3f Pushdown dereferences into table scan for parquet tables (Venki Korukanti)
  • 871dbf3 Add an option to control Parquet dereferance pushdown (Venki Korukanti)
  • 9f867d4 Revert "Upgrade ZSTD version" (Rohit Jain)
  • 172a552 Improve shuffle statistics collection (Andrii Rosa)
  • c8d9198 Implement batching of tiny rows for Presto on Spark (Andrii Rosa)
  • d2964ba Use offset instead of size in PrestoSparkRowBatch (Andrii Rosa)
  • de09645 Implement hdfs input source for druid ingestion (beinan)
  • 8fb458d Implement druid ingestion by CTAS (beinan)
  • d29764e Implement sending ingestion task to druid (beinan)
  • 572a4cb Write druid page data to gzip files (beinan)
  • 84408b8 Add ingestion storage path to DruidConfig (beinan)
  • d5928f1 Add druid table insert/ingestion skeleton code (beinan)
  • 27e922f Add warning message for UNION queries without ALL/DISTINCT keyword (prithvip)
  • c6c34d6 Modify regex for compatibility with both mysql, mariadb (Amit Sadaphule)
  • 0cd9fb1 Dynamic bucket pruning on workers (Ke Wang)

@yingsu00 yingsu00 force-pushed the release-notes-0.241 branch 3 times, most recently from 4e20b72 to 5c87163 Compare September 11, 2020 22:12
@yingsu00 yingsu00 self-assigned this Sep 14, 2020
@yingsu00
Copy link
Contributor

@kewang1024 @fgwang7w @beinan @zhenxiao I make changes to your commit messages. Could you please take a look to see if they're accurate?


Hive Changes
____________
* Fix several memory counting bugs in ``OrcRecordReader`` and ``StreamReader``'s.
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

counting -> accounting reads better?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the trailing 's, remove?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the trailing 's, remove?

StreamReader is an interface. There're several implementations and the 's is plural form. I used plural form because the fixes were on several StreamReaders. Let's consult @aweisberg who is native speaker and runs the blog. Ariel, which way is better according to your opinion?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I removed the ' and kept the s. So now it will look like StreamReaders

presto-docs/src/main/sphinx/release/release-0.241.rst Outdated Show resolved Hide resolved
presto-docs/src/main/sphinx/release/release-0.241.rst Outdated Show resolved Hide resolved

Hive Changes
____________
* Fix several memory counting bugs in ``OrcRecordReader`` and ``StreamReader``'s.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is the trailing 's, remove?

presto-docs/src/main/sphinx/release/release-0.241.rst Outdated Show resolved Hide resolved
General Changes
_______________
* Fix incorrect results from function classification_precision() introduced in release 0.239
* Add dynamic filtering and bucket pruning support for inner join and semi join, which can avoid full table scan on the probe side of broadcast joins or collocated joins when the build side is small. This can be enabled with the ``experimental.enable-dynamic-filtering`` configuration property and ``enable_dynamic_filtering`` system session property. The build side join key space size can be tuned by the ``experimental.dynamic-filtering-max-per-driver-row-count`` and ``experimental.dynamic-filtering-max-per-driver-size`` configuration properties and ``dynamic_filtering_max_per_driver_row_count`` and ``dynamic_filtering_max_per_driver_size`` session properties. Currently, only Hive connector can benefit from the feature.
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  1. this item is a performance improve, so the line should starts with "Improve".
Improve performance of broadcast and collocated join by supporting dynamic filtering and bucket pruning for inner and semi joins.
  1. the item is way too long. Remove every after "The build side join key ...", and add a link to the PR or the issue instead. Something like
(:pr:`12345`)

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Improve performance of broadcast and collocated join by supporting dynamic filtering and bucket pruning for inner and semi joins.

@caithagoras thanks for the rewriting. But this PR's primary goal is not improving the join's performance but the scan performance on the probe side. So I changed it to Improve performance for queries with broadcast or collocated joins by adding dynamic filtering and bucket pruning support.

@beinan
Copy link
Member

beinan commented Sep 16, 2020

@kewang1024 @fgwang7w @beinan @zhenxiao I make changes to your commit messages. Could you please take a look to see if they're accurate?

commit message looks good to me, thanks!

@fgwang7w
Copy link
Member

@kewang1024 @fgwang7w @beinan @zhenxiao I make changes to your commit messages. Could you please take a look to see if they're accurate?

Msg LGTM, thanks!

@yingsu00
Copy link
Contributor

Thanks @ajaygeorge @caithagoras for reviewing. I have addressed the comments. Will you be able to take another look?

@caithagoras
Copy link
Contributor Author

LGTM % sphinx compilation error

Warning, treated as error:
/Users/leiqing/fb/presto/presto-docs/src/main/sphinx/release/release-0.241.rst:23:Inline literal start-string without end-string.
make: *** [html] Error 2

Copy link
Contributor

@ajaygeorge ajaygeorge left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@caithagoras caithagoras merged commit 8614ca5 into prestodb:master Sep 24, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants