[SPARK-12078][Core]Fix ByteBuffer.limit misuse #10076
Conversation
```diff
@@ -253,7 +253,7 @@ private[spark] class Executor(
     val directResult = new DirectTaskResult(valueBytes, accumUpdates, task.metrics.orNull)
     val serializedDirectResult = ser.serialize(directResult)
-    val resultSize = serializedDirectResult.limit
+    val resultSize = serializedDirectResult.remaining()
```
You're right that there's an implicit assumption in some of this code that the buffer's position is 0 on return and that the entire buffer is filled with valid data. Do we have a situation where the position is not 0, though, but is correctly at the start of the data? At least, this looks like it handles that situation, but it sounds unusual. Equally, if that's an issue, are we sure the entire buffer has valid data through the end? That assumption is still present here: that the end of the data is the end of the buffer.
> Do we have a situation where the position is not 0, though, but is correctly at the start of the data?

If a `ByteBuffer` is from Netty, the position could be a non-zero value.
> Equally, if that's an issue, are we sure the entire buffer has valid data through the end? That assumption is still present here: that the end of the data is the end of the buffer.

The `ByteBuffer` may contain more data internally, but the user should only read the part between `position` and `limit`. I think that's defined in the `ByteBuffer`/`Buffer` javadoc.
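As a minimal sketch of that contract (assuming a plain heap buffer; the sizes here are arbitrary), writing into a buffer and then flipping it for reading leaves spare capacity beyond `limit` that a reader must not treat as valid data:

```scala
import java.nio.ByteBuffer

// Write 6 bytes into a 10-byte buffer, then flip it for reading.
val buf = ByteBuffer.allocate(10)
buf.put(Array[Byte](1, 2, 3, 4, 5, 6))
buf.flip()

// The backing storage is still 10 bytes, but only the region between
// position (0) and limit (6) holds valid data.
assert(buf.capacity() == 10)
assert(buf.limit() == 6)
assert(buf.remaining() == 6)
```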
I found I was wrong about the position of a `ByteBuffer` from Netty. Netty will call `ByteBuffer.slice` to reset the position to 0 before returning it: https://github.com/netty/netty/blob/0f9492c9affc528c766f9677952412564d4a3f6d/buffer/src/main/java/io/netty/buffer/PooledHeapByteBuf.java#L269

I think we don't need this patch.
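To illustrate why slicing sidesteps the problem (a sketch using a plain heap buffer in place of Netty's pooled one), `slice()` produces a view whose position is 0 and whose limit equals the remaining bytes of the original, so `limit` and `remaining` agree on the slice:

```scala
import java.nio.ByteBuffer

val backing = ByteBuffer.allocate(10)
backing.position(4) // pretend the first 4 bytes were already consumed

// slice() starts a new view at the current position: its position is 0
// and its limit equals the original buffer's remaining bytes.
val sliced = backing.slice()
assert(sliced.position() == 0)
assert(sliced.limit() == 6)
assert(sliced.limit() == sliced.remaining())
```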
Test build #46995 has finished for PR 10076 at commit
```diff
@@ -51,14 +51,14 @@ private[spark] class TaskResultGetter(sparkEnv: SparkEnv, scheduler: TaskSchedul
     try {
       val (result, size) = serializer.get().deserialize[TaskResult[_]](serializedData) match {
         case directResult: DirectTaskResult[_] =>
-          if (!taskSetManager.canFetchMoreResults(serializedData.limit())) {
+          if (!taskSetManager.canFetchMoreResults(serializedData.remaining())) {
```
This looks like it makes a unit test fail. I think you may have to check the size before the deserializer consumes the byte buffer?
This is overall looking good but we probably have to comb through these a little more to think through the implications.
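A sketch of the hazard being pointed out (hypothetical buffer contents, not Spark's actual serializer): deserializing from a `ByteBuffer` advances its position, so `remaining()` measured afterwards reports 0 rather than the result size:

```scala
import java.nio.ByteBuffer

val serializedData = ByteBuffer.wrap(Array[Byte](1, 2, 3, 4))
val size = serializedData.remaining() // capture the size first: 4

// A deserializer reads the buffer, advancing its position to the limit...
while (serializedData.hasRemaining) serializedData.get()

// ...so measuring afterwards would report 0, not the payload size.
assert(serializedData.remaining() == 0)
assert(size == 4)
```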
I want to put this one on hold until #10083 gets merged.
Test build #47281 has finished for PR 10076 at commit
Test build #47291 has finished for PR 10076 at commit
`ByteBuffer.limit` is not the remaining size of a `ByteBuffer`. `ByteBuffer.limit` is equal to `ByteBuffer.remaining` only if `ByteBuffer.position` is 0. I just went through the code and replaced misused `limit` with `remaining`.
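A minimal sketch of the distinction the description relies on (arbitrary byte values, assumed for illustration): once the position is non-zero, `limit()` overstates the readable size while `remaining()` reports it exactly:

```scala
import java.nio.ByteBuffer

val buf = ByteBuffer.wrap(Array[Byte](0, 0, 7, 8, 9))
buf.position(2) // skip a 2-byte prefix

// limit() still reports 5 (the end of the buffer), while remaining()
// reports the 3 readable bytes; the two agree only when position is 0.
assert(buf.limit() == 5)
assert(buf.remaining() == 3)
assert(buf.remaining() == buf.limit() - buf.position())
```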