[SPARK-21335] [DOC] doc changes for disallowed un-aliased subquery use case #21647
Conversation
@viirya @cloud-fan, please kindly help to review. Thanks.
It has been quite a while. Anyway, this document change looks fine to me.
We don't need the
LGTM except the title issue pointed out by @viirya |
okay, changed the PR title. Thanks. @cloud-fan @viirya |
ok to test |
docs/sql-programming-guide.md
Outdated
@@ -2017,6 +2017,7 @@ working with timestamps in `pandas_udf`s to get the best performance, see
- Literal values used in SQL operations are converted to DECIMAL with the exact precision and scale needed by them.
- The configuration `spark.sql.decimalOperations.allowPrecisionLoss` has been introduced. It defaults to `true`, which means the new behavior described here; if set to `false`, Spark uses previous rules, ie. it doesn't adjust the needed scale to represent the values and it returns NULL if an exact representation of the value is not possible.
- In PySpark, `df.replace` does not allow to omit `value` when `to_replace` is not a dictionary. Previously, `value` could be omitted in the other cases and had `None` by default, which is counterintuitive and error-prone.
- Un-aliased subquery is supported by Spark SQL for a long time. Its semantic was not well defined and had confusing behaviors. Since Spark 2.3, we invalid a weird use case: `SELECT v.i from (SELECT i FROM v)`. Now this query will throw analysis exception because users should not be able to use the qualifier inside a subquery. See [SPARK-20690](https://issues.apache.org/jira/browse/SPARK-20690) and [SPARK-21335](https://issues.apache.org/jira/browse/SPARK-21335) for details.
Not a big deal but please consider:
Un-aliased subquery is supported by Spark SQL for a long time. Its semantic was not well defined and had confusing behaviors. Since Spark 2.3, we invalid a weird use case: `SELECT v.i from (SELECT i FROM v)`
->
Un-aliased subquery's semantic has not been well defined with confusing behaviors. Since Spark 2.3, we invalidate such confusing cases, for example, `SELECT v.i from (SELECT i FROM v)`.
Also consider:
Now this query will throw analysis exception because users should not be able to use the qualifier inside a subquery.
->
The cases throw an analysis exception now because users should not be able to use the qualifier inside a subquery.
for details. -> for more details.
Test build #92370 has finished for PR 21647 at commit
@HyukjinKwon updated, thanks.
Test build #92371 has finished for PR 21647 at commit
thanks, merging to master!
What changes were proposed in this pull request?
Document the behavior change for the un-aliased subquery use case, to address the last question in PR #18559:
#18559 (comment)
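To make the documented behavior concrete, here is a minimal sketch (Scala, spark-shell style, assuming a Spark 2.3+ session named `spark`). The view name `v`, column `i`, and alias `t` are illustrative, mirroring the example in the doc text; the exact exception message is not reproduced here.

```scala
// Set up a temp view matching the doc example (names `v` / `i` are illustrative).
spark.range(3).selectExpr("id AS i").createOrReplaceTempView("v")

// Still supported: an un-aliased subquery, as long as the outer query
// does not reference the inner view's qualifier.
spark.sql("SELECT i FROM (SELECT i FROM v)").show()

// Disallowed since Spark 2.3: the qualifier `v` should not be visible
// outside the subquery, so this now fails analysis.
// spark.sql("SELECT v.i FROM (SELECT i FROM v)")   // throws AnalysisException

// Equivalent query with an explicit subquery alias.
spark.sql("SELECT t.i FROM (SELECT i FROM v) t").show()
```

Aliasing the subquery explicitly (as in the last query) is the straightforward way to keep a qualifier available in the outer query.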
How was this patch tested?
This is a documentation-only change, so it does not affect tests.