add error section in report and the rest queries #9150

wjxiz1992 · 2023-08-31T10:50:27Z

Close #8816 and #9118
Close #9165 and #9166
This PR adds in

the query content, an error field in the json report file to see what exceptions are thrown during the query run.
Not only the exceptions that a failed query throws, but also the exceptions that a Spark task failed at first but succeeded in retry. ([FEA] add error section in test report json file #9118)
the rest queries for Scale Test.([FEA] Write the rest of the scale test queries #8816 )
a new input option --dry: will only print the generated queries and its physical plan(.exlpain())
add correct candidate columns for key group 2

This PR is still in testing with different scales, but it contains lots of queries, and it requires a lot of review for all of them since they are parsed according to my personal understanding. Put it up for early feedbacks.

put one generated queries for reviewers to help easy check the query correctness or if the queries are expected:
queries.txt

Signed-off-by: Allen Xu <[email protected]>

andygrove · 2023-08-31T13:49:47Z

...ation_tests/src/main/scala/com/nvidia/spark/rapids/tests/scaletest/TaskFailureListener.scala

@@ -0,0 +1,24 @@
+package com.nvidia.spark.rapids.tests.scaletest


Missing license header

andygrove · 2023-08-31T13:50:01Z

integration_tests/src/main/scala/com/nvidia/spark/rapids/tests/scaletest/Utils.scala

@@ -0,0 +1,13 @@
+package com.nvidia.spark.rapids.tests.scaletest


Missing license header

wjxiz1992 · 2023-09-01T08:38:41Z

Found one blocking bug: #9165.
And another bug is not blocking but better to fix in this PR as well: #9166

convert it to draft until I resolve the 2 bugs above.

Signed-off-by: Allen Xu <[email protected]>

wjxiz1992 · 2023-09-06T01:46:16Z

build

firestarman · 2023-09-06T01:48:38Z

build

wjxiz1992 · 2023-09-06T07:26:54Z

Hi @revans2 I've added all queries into the test suite, please help take a look when you have time. I've also put a query content list queries.txt in the attachment in the PR description.
Let me know if the queries are not generated as expected.

revans2

In general it looks good, but I want to spend some time to test it myself. Just to be sure that the scale_factor/etc look correct and that the queries tax the plugin in ways we expect it to.

revans2 · 2023-09-06T18:29:38Z

integration_tests/src/main/scala/com/nvidia/spark/rapids/tests/scaletest/QuerySpecs.scala

@@ -61,14 +243,14 @@ class QuerySpecs(config: Config, spark: SparkSession) {
    }
  }

-  def getCandidateQueries(): Map[String, TestQuery] = {
+  def getCandidateQueries(): mutable.LinkedHashMap[String, TestQuery] = {


why are the maps mutable? Do we plan on changing them after they are constructed?

I was looking for Map that can reserve insertion order and found LinkedHashMap first. I just found ListMap can also do this, updated.

revans2 · 2023-09-06T18:33:54Z

integration_tests/src/main/scala/com/nvidia/spark/rapids/tests/scaletest/QuerySpecs.scala

+        config.iterations,
+        config.timeout,
+        "No obvious build side inner equi-join. (Shuffle partitions should be set to 10)",
+        shufflePartitions = 10),


Why 10 shuffle partitions? The original proposal was ceil(scale_factor/50). Not that this is wrong, just curious.

Oh I was refering to the queries in #8816. Now updated to what is set in test plan doc.

Signed-off-by: Allen Xu <[email protected]>

wjxiz1992 · 2023-09-07T06:59:43Z

build

wjxiz1992 · 2023-09-07T09:16:06Z

build

wjxiz1992 · 2023-09-07T09:21:47Z

In general it looks good, but I want to spend some time to test it myself. Just to be sure that the scale_factor/etc look correct and that the queries tax the plugin in ways we expect it to.

I can build pipelines to test this, any test preference? According to the test plan doc, I should start to generate scale_factor=100, complexity=300 data. I don't have enough disk space in my local PC so I can do this on our internal cluster.

revans2 · 2023-09-07T13:22:30Z

@wjxiz1992 lets check this in and then we can work on figuring out if we need to tweak things. We need to setup a pipeline anyways, so lets get started with that as a separate piece.

add error section in report and the rest queries

65799f5

Signed-off-by: Allen Xu <[email protected]>

wjxiz1992 added test Only impacts tests data gen scale test labels Aug 31, 2023

wjxiz1992 self-assigned this Aug 31, 2023

wjxiz1992 requested a review from revans2 August 31, 2023 11:15

remove strip call from string for java compatibility

e9a25e2

Signed-off-by: Allen Xu <[email protected]>

andygrove reviewed Aug 31, 2023

View reviewed changes

wjxiz1992 marked this pull request as draft September 1, 2023 08:39

wjxiz1992 added 3 commits September 1, 2023 19:18

fix some failed queries

980befe

Signed-off-by: Allen Xu <[email protected]>

Fix all queries errors

b41d75f

Signed-off-by: Allen Xu <[email protected]>

resolve premerge

942dc8a

Signed-off-by: Allen Xu <[email protected]>

wjxiz1992 marked this pull request as ready for review September 4, 2023 10:07

revans2 previously approved these changes Sep 6, 2023

View reviewed changes

resolve nits

21a1fda

Signed-off-by: Allen Xu <[email protected]>

wjxiz1992 dismissed revans2’s stale review via 21a1fda September 7, 2023 04:26

revans2 approved these changes Sep 7, 2023

View reviewed changes

revans2 merged commit e35e194 into NVIDIA:branch-23.10 Sep 7, 2023
27 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add error section in report and the rest queries #9150

add error section in report and the rest queries #9150

wjxiz1992 commented Aug 31, 2023 •

edited

Loading

andygrove Aug 31, 2023

wjxiz1992 Sep 5, 2023

andygrove Aug 31, 2023

wjxiz1992 Sep 5, 2023

wjxiz1992 commented Sep 1, 2023 •

edited

Loading

wjxiz1992 commented Sep 6, 2023

firestarman commented Sep 6, 2023

wjxiz1992 commented Sep 6, 2023

revans2 left a comment

revans2 Sep 6, 2023

wjxiz1992 Sep 7, 2023

revans2 Sep 6, 2023

wjxiz1992 Sep 7, 2023

wjxiz1992 commented Sep 7, 2023

wjxiz1992 commented Sep 7, 2023

wjxiz1992 commented Sep 7, 2023

revans2 commented Sep 7, 2023

		@@ -0,0 +1,24 @@
		package com.nvidia.spark.rapids.tests.scaletest

		@@ -0,0 +1,13 @@
		package com.nvidia.spark.rapids.tests.scaletest

add error section in report and the rest queries #9150

add error section in report and the rest queries #9150

Conversation

wjxiz1992 commented Aug 31, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wjxiz1992 commented Sep 1, 2023 • edited Loading

wjxiz1992 commented Sep 6, 2023

firestarman commented Sep 6, 2023

wjxiz1992 commented Sep 6, 2023

revans2 left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

wjxiz1992 commented Sep 7, 2023

wjxiz1992 commented Sep 7, 2023

wjxiz1992 commented Sep 7, 2023

revans2 commented Sep 7, 2023

wjxiz1992 commented Aug 31, 2023 •

edited

Loading

wjxiz1992 commented Sep 1, 2023 •

edited

Loading