[GLUTEN-5341] Fix some Spark 3.5 UTs #5445
Conversation
Run Gluten Clickhouse CI
@@ -88,6 +87,8 @@ class VeloxHashJoinSuite extends VeloxWholeStageTransformerSuite {
    val wholeStages = plan.collect { case wst: WholeStageTransformer => wst }
    if (SparkShimLoader.getSparkVersion.startsWith("3.2.")) {
      assert(wholeStages.length == 1)
    } else if (SparkShimLoader.getSparkVersion.startsWith("3.5.")) {
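For context, the new branch presumably asserts the larger stage count for Spark 3.5. A minimal sketch of the version-gated check, where the count 5 is inferred from the discussion below rather than shown in the diff itself:

```scala
// Sketch only: the expected count for the 3.5 branch (5) comes from the
// review discussion ("4 exchanges so 5 stages"), not from the visible diff.
val sparkVersion = SparkShimLoader.getSparkVersion
if (sparkVersion.startsWith("3.2.")) {
  assert(wholeStages.length == 1)
} else if (sparkVersion.startsWith("3.5.")) {
  // Spark 3.5 produces two extra exchanges for this query,
  // splitting the plan into more whole-stage regions.
  assert(wholeStages.length == 5)
}
```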
Why did it increase to 5 in 3.5? I was also debugging this and saw two more exchanges appearing in 3.5. Shall we investigate why there are more exchanges?
The physical plan in Spark 3.5 seems to have changed:
*(9) Project [l_partkey#200L]
+- *(9) SortMergeJoin [l_suppkey#201L], [ps_suppkey#156L], Inner
:- *(6) Sort [l_suppkey#201L ASC NULLS FIRST], false, 0
: +- Exchange hashpartitioning(l_suppkey#201L, 5), ENSURE_REQUIREMENTS, [plan_id=300]
: +- *(5) Project [l_partkey#200L, l_suppkey#201L]
: +- *(5) SortMergeJoin [l_partkey#200L], [p_partkey#123L], Inner
: :- *(2) Sort [l_partkey#200L ASC NULLS FIRST], false, 0
: : +- Exchange hashpartitioning(l_partkey#200L, 5), ENSURE_REQUIREMENTS, [plan_id=283]
: : +- *(1) Filter (isnotnull(l_partkey#200L) AND isnotnull(l_suppkey#201L))
: : +- *(1) ColumnarToRow
: : +- BatchScan parquet file:/root/workspace/apache_1/backends-velox/target/scala-2.12/test-classes/tpch-data-parquet-velox/lineitem[l_partkey#200L, l_suppkey#201L] ParquetScan DataFilters: [isnotnull(l_partkey#200L), isnotnull(l_suppkey#201L)], Format: parquet, Location: InMemoryFileIndex(1 paths)[file:/root/workspace/apache_1/backends-velox/target/scala-2.12/test-cl..., PartitionFilters: [], PushedAggregation: [], PushedFilters: [IsNotNull(l_partkey), IsNotNull(l_suppkey)], PushedGroupBy: [], ReadSchema: struct<l_partkey:bigint,l_suppkey:bigint> RuntimeFilters: []
: +- *(4) Sort [p_partkey#123L ASC NULLS FIRST], false, 0
: +- Exchange hashpartitioning(p_partkey#123L, 5), ENSURE_REQUIREMENTS, [plan_id=292]
: +- *(3) Filter isnotnull(p_partkey#123L)
: +- *(3) ColumnarToRow
: +- BatchScan parquet file:/root/workspace/apache_1/backends-velox/target/scala-2.12/test-classes/tpch-data-parquet-velox/part[p_partkey#123L] ParquetScan DataFilters: [isnotnull(p_partkey#123L)], Format: parquet, Location: InMemoryFileIndex(1 paths)[file:/root/workspace/apache_1/backends-velox/target/scala-2.12/test-cl..., PartitionFilters: [], PushedAggregation: [], PushedFilters: [IsNotNull(p_partkey)], PushedGroupBy: [], ReadSchema: struct<p_partkey:bigint> RuntimeFilters: []
+- *(8) Sort [ps_suppkey#156L ASC NULLS FIRST], false, 0
+- Exchange hashpartitioning(ps_suppkey#156L, 5), ENSURE_REQUIREMENTS, [plan_id=309]
+- *(7) Filter isnotnull(ps_suppkey#156L)
+- *(7) ColumnarToRow
+- BatchScan parquet file:/root/workspace/apache_1/backends-velox/target/scala-2.12/test-classes/tpch-data-parquet-velox/partsupp[ps_suppkey#156L] ParquetScan DataFilters: [isnotnull(ps_suppkey#156L)], Format: parquet, Location: InMemoryFileIndex(1 paths)[file:/root/workspace/apache_1/backends-velox/target/scala-2.12/test-cl..., PartitionFilters: [], PushedAggregation: [], PushedFilters: [IsNotNull(ps_suppkey)], PushedGroupBy: [], ReadSchema: struct<ps_suppkey:bigint> RuntimeFilters: []
My understanding is that the plan has 4 exchanges, so 5 whole stages.
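One way to sanity-check this count is to collect the exchange nodes from the executed plan. A minimal sketch, assuming a `DataFrame` named `df` built from the same three-way join (`ShuffleExchangeExec` is Spark's physical shuffle-exchange node):

```scala
import org.apache.spark.sql.execution.exchange.ShuffleExchangeExec

// Count the shuffle exchanges in the physical plan; N exchanges split
// the plan into N + 1 whole-stage regions for a linear pipeline like this.
val plan = df.queryExecution.executedPlan
val exchanges = plan.collect { case e: ShuffleExchangeExec => e }
println(s"exchanges = ${exchanges.length}")  // 4 in the plan above
```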
Yes, in the 3.4 plan I saw only 2 exchanges; there was no exchange after the part and lineitem table scans for their join. It seems some regression in 3.5 results in 2 more exchanges.
@ayushi-agarwal Do you want to dig further into the plan change? If so, you can open a dedicated ticket for it. It may be related to the data set or some configurations. Let's merge this PR first.
Sure, I will create a ticket to investigate further. Thanks @yma11
===== Performance report for TPCH SF2000 with Velox backend, for reference only ====
enable VeloxCacheSuite, VeloxHashJoinSuite
What changes were proposed in this pull request?
Fix some Spark 3.5 UTs.
How was this patch tested?
CI