SPARK-25299: Add rest of shuffle writer benchmarks #507

yifeih · 2019-03-05T20:47:10Z

No description provided.


mccheah · 2019-03-14T20:42:48Z

@yifeih can you target this to spark-25299? The downstream PR merged.

mccheah · 2019-03-14T20:45:54Z

core/src/test/scala/org/apache/spark/shuffle/sort/BypassMergeSortShuffleWriterBenchmark.scala

+      output = output)
+
+    addBenchmarkCase(benchmark, "without transferTo") { timer =>
+      val shuffleWriter = getWriter(false)


We're not closing the writer in this case.

I'm taking a closer look at this and wonder if we can model this similar to how JUnit models Parameterized Tests. Parameterized tests are useful when you have a configuration matrix that you want to try. In this case, the configuration matrix might be:

Dataset size: small vs. large

TransferTo: enabled vs. disabled

Writer type: unsafe, sort, bypass-merge-sort

Can we perhaps encode this explicitly? I think something like this, though this is by no means the most elegant possible - there might be something better here:

    sealed trait WriterType
    case object UnsafeWriterType extends WriterType
    case object SortWriterType extends WriterType
    case object BypassMergeSortWriterType extends WriterType

    def doBenchmark(writerType: WriterType, transferTo: Boolean, useLargeDataset: Boolean): Unit = {
      val benchmarkName = s"shuffle writer, type $writerType, transferTo: $transferTo, datasetSize: ${if (useLargeDataset) "large" else "small"}"
      val writer = writerType match {
        case UnsafeWriterType => new UnsafeShuffleWriter(... transferTo)
        case SortWriterType => new SortShuffleWriter(... transferTo)
        case BypassMergeSortWriterType => new BypassMerge... (... transferTo)
        case _ => throw...
      }
      val datasetSize = if (useLargeDataset) ... else ...
      // Do the benchmark stuff given your objects
    }

    // I think there's a much more Scala fluent esque way to do this by the way - look into product APIs?
    Seq(UnsafeWriterType, SortWriterType, BypassMergeSortWriterType).foreach { writerType =>
      Seq(true, false).foreach { transferTo =>
        Seq(true, false).foreach { useLargeDataset =>
          doBenchmark(writerType, transferTo, useLargeDataset)
        }
      }
    }

Hmmm, I think the problem comes when the setup is slightly different. For example, the different writers need different mocks and objects set up because they're different in nature. But I think we can make this a little more parameterized, for example by passing in the writer, the size, and the number of spill files to assert on.

It's a tricky balance because we don't want to make this a giant switch-case statement either. If the writer bootstrap is highly dependent on the type of writer it is, I would say the parameters can be just the size and enabling transferTo vs. not, and then we have separate methods each for bootstrapping their own writer against those parameters.

I don't have a great sense of what's best here; we're not using something like JUnit which has a canonical framework for this, so whatever we build has to come from first principles. I'm open to ideas, so feel free to propose something and we can evaluate it together.
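As a rough illustration of the "product" idea mentioned above, the configuration matrix can be built with a for-comprehension instead of nested loops. This is only a sketch: `BenchmarkConfig` and `BenchmarkMatrix` are hypothetical names, the writer types are plain stand-ins rather than Spark classes, and it deliberately ignores the concern raised in the reply that each writer needs different mocks and setup.

```scala
// Hypothetical stand-ins; none of these types come from Spark itself.
sealed trait WriterType
case object UnsafeWriterType extends WriterType
case object SortWriterType extends WriterType
case object BypassMergeSortWriterType extends WriterType

// One cell of the configuration matrix (hypothetical helper).
final case class BenchmarkConfig(
    writerType: WriterType,
    transferTo: Boolean,
    useLargeDataset: Boolean) {
  def name: String = {
    val size = if (useLargeDataset) "large" else "small"
    s"shuffle writer, type $writerType, transferTo: $transferTo, datasetSize: $size"
  }
}

object BenchmarkMatrix {
  val writerTypes: Seq[WriterType] =
    Seq(UnsafeWriterType, SortWriterType, BypassMergeSortWriterType)

  // The for-comprehension yields the Cartesian product of the three axes.
  val configs: Seq[BenchmarkConfig] = for {
    writerType <- writerTypes
    transferTo <- Seq(true, false)
    useLargeDataset <- Seq(true, false)
  } yield BenchmarkConfig(writerType, transferTo, useLargeDataset)

  def main(args: Array[String]): Unit = {
    // A real benchmark would build a writer per config; this only prints the names.
    configs.foreach(config => println(config.name))
  }
}
```

Keeping the matrix as data makes it easy to filter out combinations that do not apply to a given writer before running anything.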

mccheah · 2019-03-15T20:48:14Z

core/src/test/scala/org/apache/spark/shuffle/sort/UnsafeShuffleWriterBenchmark.scala

+  private val DATA_SIZE_LARGE =
+    PackedRecordPointer.MAXIMUM_PAGE_SIZE_BYTES/2/DEFAULT_DATA_STRING_SIZE
+
+  def getWriter(transferTo: Boolean): UnsafeShuffleWriter[String, String] = {


One idea I just thought of: why not make newWriter(transferTo: Boolean) an abstract method in ShuffleWriterBenchmarkBase, then pass transferTo to addBenchmarkCase, which removes the need to pass the writer supplier? The superclass can just call that abstract method in addBenchmarkCase. Thoughts?

The transferTo flag only applies to BypassMergeSortShuffleWriter and UnsafeShuffleWriter, but the SortShuffleWriter tests have other parameters, none of which are transferTo.
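For reference, the abstract-method idea could look roughly like the sketch below. Every name here (`FakeWriter`, `WriterBenchmarkSketch`, `runCase`, `CountingWriter`, `CountingBenchmark`) is hypothetical and stands in for the real Spark classes, which are not reproduced. It only covers the transferTo-driven writers; as noted in the reply, SortShuffleWriter is parameterized differently and would not fit this exact shape.

```scala
// Hypothetical stand-in for a shuffle writer; not Spark's ShuffleWriter.
trait FakeWriter {
  def write(records: Iterator[(String, String)]): Unit
  def stop(): Unit
}

abstract class WriterBenchmarkSketch {
  // Each concrete benchmark decides how to build its own writer.
  protected def newWriter(transferTo: Boolean): FakeWriter

  // The base class owns the lifecycle: create, write, and always stop the writer.
  def runCase(name: String, size: Int, transferTo: Boolean): Long = {
    val writer = newWriter(transferTo)
    val records = Iterator.tabulate(size)(i => (s"key$i", s"value$i"))
    val start = System.nanoTime()
    try {
      writer.write(records)
    } finally {
      writer.stop()
    }
    val elapsedNanos = System.nanoTime() - start
    println(s"$name: $elapsedNanos ns")
    elapsedNanos
  }
}

// Records what happened so the sketch can be checked without any I/O.
class CountingWriter(val transferTo: Boolean) extends FakeWriter {
  var written = 0
  var stopped = false
  override def write(records: Iterator[(String, String)]): Unit = written += records.size
  override def stop(): Unit = stopped = true
}

object CountingBenchmark extends WriterBenchmarkSketch {
  var lastWriter: Option[CountingWriter] = None
  override protected def newWriter(transferTo: Boolean): FakeWriter = {
    val writer = new CountingWriter(transferTo)
    lastWriter = Some(writer)
    writer
  }
}
```

One side effect of this shape is that the base class can stop the writer in a `finally` block, so an individual benchmark case cannot forget to close it.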

mccheah · 2019-03-15T22:10:48Z

dev/run-spark-25299-benchmarks.sh

@@ -50,12 +50,16 @@ done

 echo "Running SPARK-25299 benchmarks"

+SPARK_GENERATE_BENCHMARK_FILES=1 ./build/sbt "sql/test:runMain org.apache.spark.shuffle.sort.BypassMergeSortShuffleWriterBenchmark"


Not for right now, but if this list grows beyond size 5, let's make a text file with all the classes that we need to benchmark, and then for-loop over it.

mccheah · 2019-03-15T22:12:58Z

core/src/test/scala/org/apache/spark/shuffle/sort/ShuffleWriterBenchmarkBase.scala

@@ -121,10 +121,23 @@ abstract class ShuffleWriterBenchmarkBase extends BenchmarkBase {
      blockManager)
  }

-  def addBenchmarkCase(benchmark: Benchmark, name: String)(func: Benchmark.Timer => Unit): Unit = {
+  def addBenchmarkCase(benchmark: Benchmark, name: String, size: Int,
+                       writerSupplier: () => ShuffleWriter[String, String],


Nit: Start args on this line, then 1 arg per line, with 4-space indentation from def.
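One reading of this nit, shown on a hypothetical signature: the argument list starts on the line after `def`, with one argument per line indented 4 spaces relative to `def`. `BenchmarkStub`, `WriterStub`, and `StyleSketch` are placeholder names so the snippet compiles on its own; the real signature in the diff is truncated and is not reconstructed here.

```scala
// Hypothetical stand-ins so the snippet is self-contained.
final class BenchmarkStub { var cases: List[String] = Nil }
trait WriterStub

object StyleSketch {
  // Arguments start on the line after `def`, one per line, indented 4 spaces from `def`.
  def addBenchmarkCase(
      benchmark: BenchmarkStub,
      name: String,
      size: Int,
      writerSupplier: () => WriterStub): Unit = {
    writerSupplier() // build the writer; a real case would time its write here
    benchmark.cases = s"$name ($size records)" :: benchmark.cases
  }
}
```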

mccheah

Just a style change and then we're good to merge here.

svc-spark-25299

================================================================================================
BypassMergeSortShuffleWriter write
================================================================================================

Java HotSpot(TM) 64-Bit Server VM 1.8.0_131-b11 on Linux 4.15.0-1014-gcp
Intel(R) Xeon(R) CPU @ 2.30GHz
BypassMergeSortShuffleWrite without spill:  Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
small dataset without disk spill                      2              3           2          0.5        2048.2       1.0X

Java HotSpot(TM) 64-Bit Server VM 1.8.0_131-b11 on Linux 4.15.0-1014-gcp
Intel(R) Xeon(R) CPU @ 2.30GHz
BypassMergeSortShuffleWrite with spill:   Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
without transferTo                                 7369           7565         121          0.9        1098.1       1.0X
with transferTo                                    7515           7568          33          0.9        1119.8       1.0X

================================================================================================
SortShuffleWriter writer
================================================================================================

Java HotSpot(TM) 64-Bit Server VM 1.8.0_131-b11 on Linux 4.15.0-1014-gcp
Intel(R) Xeon(R) CPU @ 2.30GHz
SortShuffleWriter without spills:         Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
small dataset without spills                         10             16           5          0.1       10192.2       1.0X

Java HotSpot(TM) 64-Bit Server VM 1.8.0_131-b11 on Linux 4.15.0-1014-gcp
Intel(R) Xeon(R) CPU @ 2.30GHz
SortShuffleWriter with spills:            Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
no map side combine                               14008          14157         133          0.5        2087.3       1.0X
with map side aggregation                         13852          13946          75          0.5        2064.1       1.0X
with map side sort                                13797          13984         142          0.5        2055.9       1.0X

================================================================================================
UnsafeShuffleWriter write
================================================================================================

Java HotSpot(TM) 64-Bit Server VM 1.8.0_131-b11 on Linux 4.15.0-1014-gcp
Intel(R) Xeon(R) CPU @ 2.30GHz
UnsafeShuffleWriter without spills:       Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
small dataset without spills                         20             23           3          0.1       19926.9       1.0X

Java HotSpot(TM) 64-Bit Server VM 1.8.0_131-b11 on Linux 4.15.0-1014-gcp
Intel(R) Xeon(R) CPU @ 2.30GHz
UnsafeShuffleWriter with spills:          Best Time(ms)   Avg Time(ms)   Stdev(ms)    Rate(M/s)   Per Row(ns)   Relative
------------------------------------------------------------------------------------------------------------------------
without transferTo                                15742          15791          40          0.9        1172.8       1.0X
with transferTo                                   15888          15994          67          0.8        1183.7       1.0X

yifeih added 30 commits February 22, 2019 17:42

add initial bypass merge sort shuffle writer benchmarks

c7abec6

dd unsafe shuffle writer benchmarks

22ef648

changes in bypassmergesort benchmarks

4084e27

cleanup

fb8266d

add circle script

89104e2

add this branch for testing

b90b381

fix circle attempt 1

5e13dd8

checkout code

845e645

add some caches?

a68f459

why is it not pull caches...

757f6fe

save as artifact instead of publishing

0bcd5d9

mkdir

26c01ec

typo

0d7a036

try uploading artifacts again

3fc5331

try print per iteration to avoid circle erroring out on idle

8c33701

blah (#495)

9546397

make a PR comment

d72ba73

actually delete files

1859805

run benchmarks on test build branch

c20f0be

oops forgot to enable upload

444d46a

add sort shuffle writer benchmarks

2322933

add stdev

da0d91c

cleanup sort a bit

e590917

fix stdev text

cbfdb99

fix sort shuffle

cbe38c6

initial code for read side

acdda71

format

fd7a7c5

use times and sample stdev

d82618b

add assert for at least one iteration

610ea1d

cleanup shuffle write to use fewer mocks and single base interface

295d7f3

yifeih added 4 commits March 7, 2019 15:46

try uploading benchmarks (#498)

fa1b96c

Merge branch 'yh/add-benchmarks-and-ci' into yh/add-benchmarks-and-ci-two-writers

b38abb0

don't upload results yet

96c66c9

only upload results when merging into the feature branch

37cef1f

yifeih mentioned this pull request Mar 11, 2019

SPARK-25299: add CI infrastructure and SortShuffleWriterBenchmark #498

Merged

yifeih added 10 commits March 12, 2019 15:35

lock down machine image

459e1b5

don't write input data to disk

4cabdbd

Merge branch 'yh/add-benchmarks-and-ci' into yh/add-benchmarks-and-ci-two-writers

9225bb7

dont write input data to disk

a3b0ee5

run benchmark test

47d2dcf

stop creating file cleanup threads for every block manager

c78e491

use alphanumeric again

f28b75c

use a new random everytime

a85acf4

close the writers -__________-

f26ab40

Merge branch 'yh/add-benchmarks-and-ci' into yh/add-benchmarks-and-ci-two-writers

e5481b4

yifeih changed the base branch from yh/add-benchmarks-and-ci to spark-25299 March 14, 2019 20:47

Merge branch 'spark-25299' into yh/add-benchmarks-and-ci-two-writers

6151cab

mccheah requested changes Mar 14, 2019


refactor

da1c2d0

mccheah reviewed Mar 15, 2019


style

350eb6e

mccheah approved these changes Mar 18, 2019


yifeih removed the do not merge label Mar 18, 2019

bulldozer-bot bot merged commit ddc9905 into spark-25299 Mar 18, 2019

bulldozer-bot bot deleted the yh/add-benchmarks-and-ci-two-writers branch March 18, 2019 19:06

svc-spark-25299 reviewed Mar 18, 2019
