Tutorial by Examples

Controlling Spark SQL Shuffle Partitions

In Apache Spark while doing shuffle operations like join and cogroup a lot of data gets transferred across network. Now, to control the number of partitions over which shuffle happens can be controlled by configurations given in Spark SQL. That configuration is as follows: spark.sql.shuffle.partiti...

apache-spark • Configuration: Apache Spark SQL

Page 1 of 1

Advertise with us
Contact us
Cookie Policy
Privacy Policy

Get monthly updates about new articles, cheatsheets, and tricks.