[SPARK-7795] [Core] Speed up task scheduling in standalone mode by reusing serializer #6323

coolfrood · 2015-05-21T17:28:46Z

My experiments with scheduling very short tasks in standalone cluster mode showed that a significant amount of time was being spent scheduling the tasks (>500ms for 256 tasks). Most of that time went into creating a new serializer instance for each task. Reusing a single serializer instance, instead of creating a new one for each task, brought the scheduling time down to 8ms.
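
To make the described change concrete, here is a minimal sketch of the two patterns. It is not the code from this PR: `CountingSerializer`, `SerializerInstance`, and `LaunchTasksSketch` are hypothetical stand-ins, and tasks are plain strings. It only counts how many instances each approach creates.

```scala
import java.nio.ByteBuffer
import java.nio.charset.StandardCharsets

// Hypothetical stand-in (not Spark's class): an instance that serializes values.
trait SerializerInstance {
  def serialize(value: String): ByteBuffer
}

// Hypothetical stand-in for a serializer factory. It counts newInstance()
// calls, which is the step the PR description identifies as expensive.
class CountingSerializer {
  var instancesCreated = 0

  def newInstance(): SerializerInstance = {
    instancesCreated += 1
    new SerializerInstance {
      def serialize(value: String): ByteBuffer =
        ByteBuffer.wrap(value.getBytes(StandardCharsets.UTF_8))
    }
  }
}

object LaunchTasksSketch {
  // Before: a fresh serializer instance is created for every task.
  def launchPerTask(serializer: CountingSerializer, tasks: Seq[String]): Seq[ByteBuffer] =
    tasks.map(task => serializer.newInstance().serialize(task))

  // After: one instance is created up front and reused for every task.
  def launchReused(serializer: CountingSerializer, tasks: Seq[String]): Seq[ByteBuffer] = {
    val ser = serializer.newInstance()
    tasks.map(task => ser.serialize(task))
  }
}
```

With 256 tasks, `launchPerTask` calls `newInstance()` 256 times and `launchReused` calls it once. The sketch says nothing about timing, which depends on how costly instance creation is for the configured serializer.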

srowen · 2015-05-21T17:31:28Z

@coolfrood have a look at https://cwiki.apache.org/confluence/display/SPARK/Contributing+to+Spark -- you missed a step or two. It's not 100% clear you can reuse the serializer but it is a good idea to float.

coolfrood · 2015-05-21T17:36:11Z

Argh. Sorry. I'll create a JIRA for this and update the PR.

JoshRosen · 2015-05-21T17:58:25Z

Serializer instances aren't thread-safe, but it looks like this single-threaded use should be fine. I implemented some similar optimizations in #5606 and am in the process of fixing a rare corner-case related to serializer re-use with uncommon Kryo configurations (#6293). Based on discussion in those patches, this change looks good to me.

We might actually be able to go a bit further and lift the serializer instance into a field of CoarseGrainedSchedulerBackend itself. That would be a bit riskier in terms of code review burden, since we'd have to double-check that CoarseGrainedSchedulerBackend is only ever used from a single thread (I think that this is actually the case, but would be good to double-check).
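
The single-thread requirement can be sketched as follows. This is an illustration under assumptions, not Spark's implementation: `RecordingSerializer` and `SingleThreadedBackend` are hypothetical names, and a single-threaded executor stands in for the actor's message loop. The serializer is held as a field, and every call that touches it runs on the same thread.

```scala
import java.util.concurrent.{Executors, Future, TimeUnit}

// Hypothetical stand-in for a non-thread-safe serializer instance. It records
// the name of every thread that calls it so single-threaded use can be checked.
class RecordingSerializer {
  private val callers = scala.collection.mutable.Set.empty[String]

  def serialize(value: String): Array[Byte] = {
    callers += Thread.currentThread().getName
    value.getBytes("UTF-8")
  }

  def callerThreads: Set[String] = callers.toSet
}

// Hypothetical backend: the serializer is a field, and all work that uses it
// is funneled through one single-threaded executor (standing in for the
// actor's receive loop).
class SingleThreadedBackend {
  val ser = new RecordingSerializer
  private val loop = Executors.newSingleThreadExecutor()

  // Enqueue a batch of tasks; serialization happens on the loop thread only.
  def submitOffer(tasks: Seq[String]): Future[_] =
    loop.submit(new Runnable {
      def run(): Unit = tasks.foreach(t => ser.serialize(t))
    })

  def shutdown(): Unit = {
    loop.shutdown()
    loop.awaitTermination(10, TimeUnit.SECONDS)
  }
}
```

If the executor were swapped for a multi-threaded pool, `callerThreads` could contain more than one name, which is exactly the situation where a shared serializer instance would no longer be safe.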

JoshRosen · 2015-05-21T17:58:27Z

Jenkins, this is ok to test.

coolfrood · 2015-05-21T18:11:23Z

@JoshRosen: it looks like launchTasks() is only called from makeOffers(), which in turn is only called from the Actor's receive, so it should be safe to move the serializer into a member field. Also, both launchTasks() and makeOffers() can be made private. What do you think?

JoshRosen · 2015-05-21T18:23:22Z

Yep, looks like both launchTasks() and makeOffers() can be private. In general, I prefer to minimize the visibility of fields / methods as much as possible, so let's make these private.

My PR made a similar change to CoarseGrainedExecutorBackend. Over there, I added a comment to explain that the shared serializer code would need to be revisited if we ever switched to a parallel / non-thread-safe RPC endpoint for the actor: https://github.com/apache/spark/pull/5606/files#diff-79391110e9f26657e415aa169a004998R51. It would be good to make a similar comment here so that we don't forget to update this if DriverEndpoint is ever parallelized.

SparkQA · 2015-05-21T19:02:52Z

Test build #33263 has finished for PR 6323 at commit 0b8ca93.

This patch fails Scala style tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2015-05-21T19:49:58Z

Test build #33259 has finished for PR 6323 at commit fe530cd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

SparkQA · 2015-05-21T21:01:23Z

Test build #33266 has finished for PR 6323 at commit bd4a5dd.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

JoshRosen · 2015-05-21T21:04:55Z

This looks good to me. I'm going to loop in @zsxwing or @rxin for another pair of eyes / final sanity-check.

zsxwing · 2015-05-21T21:45:13Z

LGTM

JoshRosen · 2015-05-22T17:46:16Z

Actually, one minor comment: ser currently has wider visibility than is strictly necessary: it's only used inside of the DriverEndpoint inner class, so it should be a field of that class rather than a field of CoarseGrainedSchedulerBackend.
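
A rough sketch of the narrower scoping suggested here, using hypothetical names (`BackendSketch`, `DriverEndpointSketch`, `SerializerInstanceSketch`) rather than the real Spark classes: the serializer and the helper methods are private to the inner endpoint class, and only `receive` is visible from outside.

```scala
// Hypothetical stand-in for a serializer instance.
class SerializerInstanceSketch {
  def serialize(value: String): Array[Byte] = value.getBytes("UTF-8")
}

class BackendSketch {

  class DriverEndpointSketch {
    // Private to the endpoint, the only code that uses it. Sharing one
    // instance assumes messages are handled one at a time; this would need
    // revisiting if the endpoint were ever made to process messages in parallel.
    private val ser = new SerializerInstanceSketch

    // Only called from makeOffers, so it can be private.
    private def launchTasks(tasks: Seq[String]): Seq[Array[Byte]] =
      tasks.map(task => ser.serialize(task))

    // Only called from receive, so it can be private as well.
    private def makeOffers(tasks: Seq[String]): Seq[Array[Byte]] =
      launchTasks(tasks)

    // The single entry point; in this sketch a message is just a task list.
    def receive(message: Seq[String]): Seq[Array[Byte]] = makeOffers(message)
  }

  val driverEndpoint = new DriverEndpointSketch
}
```

Code outside `DriverEndpointSketch` cannot reach `ser`, `launchTasks`, or `makeOffers`, so the single-threaded-use assumption only has to be checked in one place.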

SparkQA · 2015-05-22T19:57:51Z

Test build #33348 has finished for PR 6323 at commit 12d8c9e.

This patch passes all tests.
This patch merges cleanly.
This patch adds no public classes.

JoshRosen · 2015-05-23T05:02:44Z

LGTM, so I'm going to merge this to master. Although I think this is probably safe for 1.4, I don't want to risk conflicting with any attempts to cut another RC tonight, so I'm not going to pick it there for now. Thanks!

…using serializer

My experiments with scheduling very short tasks in standalone cluster mode indicated that a significant amount of time was being spent in scheduling the tasks (>500ms for 256 tasks). I found that most of the time was being spent in creating a new instance of serializer for each task. Changing this to just one serializer brought down the scheduling time to 8ms.

Author: Akshat Aranya <[email protected]>

Closes apache#6323 from coolfrood/master and squashes the following commits:

12d8c9e [Akshat Aranya] Reduce visibility of serializer
bd4a5dd [Akshat Aranya] Style fix
0b8ca93 [Akshat Aranya] Incorporate review comments
fe530cd [Akshat Aranya] Speed up task scheduling in standalone mode by reusing serializer instead of creating a new one for each task.

Speed up task scheduling in standalone mode by reusing serializer instead of creating a new one for each task.

fe530cd

coolfrood changed the title ~~Speed up task scheduling in standalone mode by reusing serializer~~ [SPARK-7795] [Core] Speed up task scheduling in standalone mode by reusing serializer May 21, 2015

Incorporate review comments

0b8ca93

Style fix

bd4a5dd

Reduce visibility of serializer

12d8c9e

asfgit closed this in a163574 May 23, 2015
