[SPARK-22775][SQL] move dictionary related APIs from ColumnVector to WritableColumnVector #19970

cloud-fan · 2017-12-13T16:41:05Z

What changes were proposed in this pull request?

These dictionary-related APIs are specific to WritableColumnVector and should not be in ColumnVector, which will be made public soon.
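
As a rough illustration of the split described above, the sketch below uses simplified, hypothetical stand-in classes (`SketchColumnVector` and `SketchWritableColumnVector`), not Spark's actual classes or signatures. It assumes an int-only vector and an `int[]` dictionary. The point it shows is that the read-only base type exposes no dictionary methods, while the writable subclass owns them and still decodes through the dictionary on reads.

```java
import java.util.Arrays;

// Hypothetical, simplified stand-ins for illustration only; not Spark's real classes.
abstract class SketchColumnVector {
    // Read-only surface: nothing here mentions dictionaries.
    public abstract int getInt(int rowId);
}

class SketchWritableColumnVector extends SketchColumnVector {
    private final int[] data;
    private int[] dictionary;     // decoded values, indexed by dictionary id (assumed int-only)
    private int[] dictionaryIds;  // per-row dictionary ids, allocated lazily

    SketchWritableColumnVector(int capacity) {
        this.data = new int[capacity];
    }

    public void putInt(int rowId, int value) {
        data[rowId] = value;
    }

    // Dictionary-related methods live only on the writable subclass.
    public void setDictionary(int[] dictionary) {
        this.dictionary = dictionary;
    }

    public boolean hasDictionary() {
        return dictionary != null;
    }

    // Allocates or grows the id array so that it can hold at least `capacity` rows.
    public int[] reserveDictionaryIds(int capacity) {
        if (dictionaryIds == null) {
            dictionaryIds = new int[capacity];
        } else if (dictionaryIds.length < capacity) {
            dictionaryIds = Arrays.copyOf(dictionaryIds, capacity);
        }
        return dictionaryIds;
    }

    // Only meaningful after reserveDictionaryIds() has been called.
    public int getDictId(int rowId) {
        return dictionaryIds[rowId];
    }

    @Override
    public int getInt(int rowId) {
        // With a dictionary set, the stored id is decoded; otherwise the raw value is returned.
        return hasDictionary() ? dictionary[dictionaryIds[rowId]] : data[rowId];
    }
}
```

A caller holding only a `SketchColumnVector` reference can read decoded values through `getInt` but cannot reach `setDictionary` or `getDictId` without a downcast, which is the kind of separation that matters once the read-only type becomes public.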

How was this patch tested?

existing tests

cloud-fan · 2017-12-13T16:41:29Z

cc @kiszk @ueshin

SparkQA · 2017-12-13T18:19:12Z

Test build #84871 has finished for PR 19970 at commit 103dca3.

This patch fails Spark unit tests.
This patch merges cleanly.
This patch adds no public classes.

gatorsmile · 2017-12-13T21:45:59Z

retest this please

SparkQA · 2017-12-14T00:24:47Z

Test build #84881 has finished for PR 19970 at commit 103dca3.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

ueshin

We can simplify reserveDictionaryIds() at WritableColumnVector.java#L659 as well.

Otherwise, LGTM.

ueshin · 2017-12-14T02:56:01Z

sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/ColumnVector.java

   */
  public final ColumnarRow getStruct(int rowId) {
    return new ColumnarRow(this, rowId);
  }

  /**
-   * Returns a utility object to get structs.
-   * provided to keep API compatibility with InternalRow for code generation
+   * A special version of {@link #getShort(int)}, which is only used as an adapter for Spark codegen


getStruct(int) instead of getShort(int)?

SparkQA · 2017-12-14T06:15:01Z

Test build #84891 has finished for PR 19970 at commit c38f58e.

This patch fails PySpark unit tests.
This patch merges cleanly.
This patch adds no public classes.

cloud-fan · 2017-12-14T07:06:35Z

retest this please

kiszk · 2017-12-14T08:02:51Z

sql/core/src/main/java/org/apache/spark/sql/execution/vectorized/WritableColumnVector.java

@@ -105,6 +107,57 @@ private void throwUnsupportedException(int requiredCapacity, Throwable cause) {
  @Override
  public boolean anyNullsSet() { return anyNullsSet; }

+  /**
+   * Returns the dictionary Id for rowId.
+   * This should only be called when the ColumnVector is dictionaryIds.


ColumnVector -> WritableColumnVector

when dictionaryIds has WritableColumnVector?

kiszk · 2017-12-14T08:05:26Z

LGTM except for a few comments on wording

SparkQA · 2017-12-14T11:19:51Z

Test build #84903 has finished for PR 19970 at commit 1de8de9.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

cloud-fan · 2017-12-14T11:34:53Z

thanks, merging to master!

move dictionary related APIs from ColumnVector to WritableColumnVector

103dca3

ueshin reviewed Dec 14, 2017

address comments

c38f58e

kiszk reviewed Dec 14, 2017

address comments

1de8de9

asfgit closed this in 7d8e2ca Dec 14, 2017
