You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When run ssb-q4.2 with scale 100T and enable columnar shuffle writes, we found that shuffle write byte added up of all stages increase as the number of partitions increases. However, when disable gluten, the growth trend of vanilla spark is not so obvious.
FelixYBW
changed the title
Columnar shuffle write byte increase as the number of partitions increases
[VL] Columnar shuffle write byte increase as the number of partitions increases
Oct 28, 2024
Did you enable sort based shuffle? Hash shuffle has this issue.
Is it a bug and will be fixed in the future? May I know the reason why the disk is rising?
I didn't see any config to force enable sort based shuffle to avoid hash shuffle. Could I just decrease the value of spark.gluten.sql.columnar.shuffle.sort.columns.threshold
Backend
VL (Velox)
Bug description
When run ssb-q4.2 with scale 100T and enable columnar shuffle writes, we found that shuffle write byte added up of all stages increase as the number of partitions increases. However, when disable gluten, the growth trend of vanilla spark is not so obvious.
The following table shows the shuffle write bytes sum by all stages.
Spark version
None
Spark configurations
No response
System information
No response
Relevant logs
No response
The text was updated successfully, but these errors were encountered: