
[SPARK-13293] [SQL] generate Expand #11177

Closed
wants to merge 4 commits into from

Conversation

@davies (Contributor) commented Feb 12, 2016

Expand suffers from creating an UnsafeRow from the same input multiple times; with codegen, it only needs to copy some of the columns.

After this, we see a 3X improvement (from 43 seconds to 13 seconds) on a TPCDS query (Q67) that has eight columns in the Rollup.

Ideally, we could mask some of the columns based on the bitmask; I'll leave that for the future, because aggregation (~50 ns) is currently much slower than just copying the variables (1-2 ns).
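
To make the idea concrete, here is a minimal, self-contained Scala sketch (not Spark's codegen API; the Row class, the expand function, and the column names are hypothetical, modeled on the cube("k1", "k2") example later in this thread). Shared expressions are evaluated once per input row; each projection then only copies or nulls out columns and sets the grouping id.

object ExpandSketch {
  // Hypothetical output row of Expand for cube("k1", "k2"): the original id,
  // the two (nullable) grouping columns, and the grouping id.
  final case class Row(id: Long, k1: Option[Long], k2: Option[Long], gid: Int)

  def expand(id: Long): Seq[Row] = {
    // Shared sub-expressions are computed once, outside the loop.
    val k1 = id % 1000L
    val k2 = id & 256L
    // Each projection only decides which columns to keep or null out
    // (the cheap "copy some of the columns" part) and sets the grouping id.
    (0 until 4).map {
      case 0 => Row(id, Some(k1), Some(k2), 0)
      case 1 => Row(id, Some(k1), None,     1)
      case 2 => Row(id, None,     Some(k2), 2)
      case 3 => Row(id, None,     None,     3)
    }
  }

  def main(args: Array[String]): Unit =
    expand(1234L).foreach(println)
}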

@rxin (Contributor) commented Feb 12, 2016

As always, can you paste the generated code? :)

@SparkQA commented Feb 12, 2016

Test build #51156 has finished for PR 11177 at commit 22ceda9.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

}
}

// In order to prevent code exploration, we can't call `consume()` many times, so we call
Contributor:

what do you mean by "code exploration"?

Contributor:

btw any perf degradation from not unrolling the loop?

Contributor (Author):

The loop plus copying two variables should only take about 1-2 nanoseconds, so there should be no regression.

But if we don't have the loop here, the generated code could easily grow larger than 8K, and that could be a regression (slower than without codegen).
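
As a rough back-of-the-envelope illustration (the per-consume size is an assumption, not a measurement): a cube over n grouping columns produces 2^n projections, so unrolling would paste the downstream consume() body once per projection and quickly cross the ~8K size threshold past which HotSpot stops JIT-compiling a method.

object UnrollSizeSketch {
  def main(args: Array[String]): Unit = {
    val bytesPerConsume = 1000      // hypothetical bytecode size of one inlined consume() body
    val hugeMethodLimit = 8000      // HotSpot's default HugeMethodLimit (bytecodes)
    for (n <- 1 to 8) {
      val projections = 1 << n      // cube over n columns => 2^n projections
      val unrolledSize = projections * bytesPerConsume
      println(s"n=$n projections=$projections unrolled~$unrolledSize bytes, " +
        s"over JIT limit: ${unrolledSize > hugeMethodLimit}")
    }
  }
}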

@davies (Contributor, Author) commented Feb 12, 2016

The relevant part of the generated code for

sqlContext.range(N).selectExpr("id", "id % 1000 as k1", "id & 256 as k2")
        .cube("k1", "k2").sum("id").collect()
/* 075 */       /* (input[0, bigint] % 1000) */
/* 076 */       boolean project_isNull1 = false;
/* 077 */       long project_value1 = -1L;
/* 078 */       if (false || 1000L == 0) {
/* 079 */         project_isNull1 = true;
/* 080 */       } else {
/* 081 */         if (false) {
/* 082 */           project_isNull1 = true;
/* 083 */         } else {
/* 084 */           project_value1 = (long)(range_value % 1000L);
/* 085 */         }
/* 086 */       }
/* 087 */       /* (input[0, bigint] & 256) */
/* 088 */       long project_value4 = -1L;
/* 089 */       project_value4 = range_value & 256L;
/* 090 */       /* (input[0, bigint] % 1000) */
/* 091 */       boolean project_isNull7 = false;
/* 092 */       long project_value7 = -1L;
/* 093 */       if (false || 1000L == 0) {
/* 094 */         project_isNull7 = true;
/* 095 */       } else {
/* 096 */         if (false) {
/* 097 */           project_isNull7 = true;
/* 098 */         } else {
/* 099 */           project_value7 = (long)(range_value % 1000L);
/* 100 */         }
/* 101 */       }
/* 102 */       /* (input[0, bigint] & 256) */
/* 103 */       long project_value10 = -1L;
/* 104 */       project_value10 = range_value & 256L;
/* 105 */
/* 106 */       boolean expand_isNull3 = true;
/* 107 */       long expand_value3 = -1L;
/* 108 */
/* 109 */       boolean expand_isNull4 = true;
/* 110 */       long expand_value4 = -1L;
/* 111 */
/* 112 */       boolean expand_isNull5 = true;
/* 113 */       int expand_value5 = -1;
/* 114 */       for (int expand_i = 0; expand_i < 4; expand_i ++) {
/* 115 */         switch (expand_i) {
/* 116 */         case 0:
/* 117 */           expand_isNull3 = project_isNull7;
/* 118 */           expand_value3 = project_value7;
/* 119 */
/* 120 */           expand_isNull4 = false;
/* 121 */           expand_value4 = project_value10;
/* 122 */
/* 123 */           expand_isNull5 = false;
/* 124 */           expand_value5 = 0;
/* 125 */           break;
/* 126 */
/* 127 */         case 1:
/* 128 */           expand_isNull3 = project_isNull7;
/* 129 */           expand_value3 = project_value7;
/* 130 */
/* 131 */           /* null */
/* 132 */           final long expand_value10 = -1L;
/* 133 */           expand_isNull4 = true;
/* 134 */           expand_value4 = expand_value10;
/* 135 */
/* 136 */           expand_isNull5 = false;
/* 137 */           expand_value5 = 1;
/* 138 */           break;
/* 139 */
/* 140 */         case 2:
/* 141 */           /* null */
/* 142 */           final long expand_value12 = -1L;
/* 143 */           expand_isNull3 = true;
/* 144 */           expand_value3 = expand_value12;
/* 145 */
/* 146 */           expand_isNull4 = false;
/* 147 */           expand_value4 = project_value10;
/* 148 */
/* 149 */           expand_isNull5 = false;
/* 150 */           expand_value5 = 2;
/* 151 */           break;
/* 152 */
/* 153 */         case 3:
/* 154 */           /* null */
/* 155 */           final long expand_value15 = -1L;
/* 156 */           expand_isNull3 = true;
/* 157 */           expand_value3 = expand_value15;
/* 158 */
/* 159 */           /* null */
/* 160 */           final long expand_value16 = -1L;
/* 161 */           expand_isNull4 = true;
/* 162 */           expand_value4 = expand_value16;
/* 163 */
/* 164 */           expand_isNull5 = false;
/* 165 */           expand_value5 = 3;
/* 166 */           break;
/* 167 */         }
/* 168 */         expand_metricValue.add(1);
/* 169 */
/* 170 */         // generate grouping key
/* 171 */         agg_rowWriter.zeroOutNullBytes();
/* 172 */
/* 173 */         if (expand_isNull3) {
/* 174 */           agg_rowWriter.setNullAt(0);
/* 175 */         } else {
/* 176 */           agg_rowWriter.write(0, expand_value3);
/* 177 */         }
/* 178 */
/* 179 */         if (expand_isNull4) {
/* 180 */           agg_rowWriter.setNullAt(1);
/* 181 */         } else {
/* 182 */           agg_rowWriter.write(1, expand_value4);
/* 183 */         }
/* 184 */
/* 185 */         if (expand_isNull5) {
/* 186 */           agg_rowWriter.setNullAt(2);
/* 187 */         } else {
/* 188 */           agg_rowWriter.write(2, expand_value5);
/* 189 */         }
/* 190 */         /* hash(input[0, bigint],input[1, bigint],input[2, int],42) */
/* 191 */         int agg_value3 = 42;
/* 192 */
/* 193 */         if (!expand_isNull3) {
/* 194 */           agg_value3 = org.apache.spark.unsafe.hash.Murmur3_x86_32.hashLong(expand_value3, agg_value3);
/* 195 */         }
/* 196 */
/* 197 */         if (!expand_isNull4) {
/* 198 */           agg_value3 = org.apache.spark.unsafe.hash.Murmur3_x86_32.hashLong(expand_value4, agg_value3);
/* 199 */         }
/* 200 */
/* 201 */         agg_value3 = org.apache.spark.unsafe.hash.Murmur3_x86_32.hashInt(expand_value5, agg_value3);
/* 202 */         UnsafeRow agg_aggBuffer = null;
/* 203 */         if (true) {
/* 204 */           // try to get the buffer from hash map
/* 205 */           agg_aggBuffer = agg_hashMap.getAggregationBufferFromUnsafeRow(agg_result, agg_value3);
/* 206 */         }
/* 207 */         if (agg_aggBuffer == null) {
/* 208 */           if (agg_sorter == null) {
/* 209 */             agg_sorter = agg_hashMap.destructAndCreateExternalSorter();
/* 210 */           } else {
/* 211 */             agg_sorter.merge(agg_hashMap.destructAndCreateExternalSorter());
/* 212 */           }
/* 213 */
/* 214 */           // the hash map had be spilled, it should have enough memory now,
/* 215 */           // try  to allocate buffer again.
/* 216 */           agg_aggBuffer = agg_hashMap.getAggregationBufferFromUnsafeRow(agg_result, agg_value3);
/* 217 */           if (agg_aggBuffer == null) {
/* 218 */             // failed to allocate the first page
/* 219 */             throw new OutOfMemoryError("No enough memory for aggregation");
/* 220 */           }
/* 221 */         }

@davies (Contributor, Author) commented Feb 12, 2016

@rxin I've posted the generated code and added more comments.

override def doConsume(ctx: CodegenContext, input: Seq[ExprCode]): String = {
  // Some columns have the same expression in all the projections, so collect the unique
  // expressions.
  val columnUniqueExpressions: IndexedSeq[Set[Expression]] = output.indices.map { i =>
Contributor:

for this one, can we explain what the indexes are, and what the expressions are?
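
For illustration (the string stand-ins and projection contents below are assumptions based on the cube("k1", "k2") example above, not the real Expression objects): the reviewed line computes, for each output column position, the set of distinct expressions that the projections place there; a column whose set is a singleton can be evaluated once outside the switch, while the others are copied per projection.

object ColumnUniqueExpressionsSketch {
  // Hypothetical, string-based stand-ins for the four projections of cube("k1", "k2"):
  // each inner Seq lists one projection's expressions, positionally aligned with the output.
  val projections: Seq[Seq[String]] = Seq(
    Seq("k1",   "k2",   "0"),
    Seq("k1",   "null", "1"),
    Seq("null", "k2",   "2"),
    Seq("null", "null", "3"))

  // Mirrors the reviewed line: for output column index i, the distinct expressions
  // that appear at position i across all projections.
  val columnUniqueExpressions: IndexedSeq[Set[String]] =
    projections.head.indices.map(i => projections.map(_(i)).toSet)

  def main(args: Array[String]): Unit =
    columnUniqueExpressions.zipWithIndex.foreach { case (exprs, i) =>
      println(s"column $i -> $exprs")   // column 2 (the grouping id) differs in every projection
    }
}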

@SparkQA commented Feb 12, 2016

Test build #51205 has finished for PR 11177 at commit e1fd87d.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@SparkQA commented Feb 13, 2016

Test build #51219 has finished for PR 11177 at commit ff2b5a4.

  • This patch passes all tests.
  • This patch merges cleanly.
  • This patch adds no public classes.

@@ -17,11 +17,15 @@

package org.apache.spark.sql.execution

import scala.collection.immutable.IndexedSeq
Contributor:

this is no longer necessary. can you remove it in some other pr you have?

@rxin (Contributor) commented Feb 13, 2016

LGTM. Merging in master.

@asfgit closed this in 2228f07 on Feb 13, 2016